Fault Injector

14,000,000 Leading Edge Experts on the ideXlab platform

Scan Science and Technology

Contact Leading Edge Experts & Companies

Scan Science and Technology

Contact Leading Edge Experts & Companies

The Experts below are selected from a list of 1236 Experts worldwide ranked by ideXlab platform

J. Sosnowski - One of the best experts on this subject based on the ideXlab platform.

  • Software Implementation of Explicit DMC Algorithm with Improved Dependability
    Novel Algorithms and Techniques In Telecommunications Automation and Industrial Electronics, 2020
    Co-Authors: P. Gawkowski, Maciej Ławryńczuk, Piotr M. Marusak, Piotr Tatjewski, J. Sosnowski
    Abstract:

    The paper presents an approach to improve the dependability of software implementation of the explicit DMC (Dynamic Matrix Control) Model Predictive Control (MPC) algorithm. The investigated DMC algorithm is implemented for a control system of a rectification column - a process with strong cross-couplings and significant time delays. The control plant has two manipulated inputs and two outputs. The Fault sensitivity of the proposed implementation is verified in experiments with a software implemented Fault Injector. The experimental results prove the efficiency of proposed software improvements.

  • ECC - On improving dependability of the numerical GPC algorithm
    2009 European Control Conference (ECC), 2009
    Co-Authors: P. Gawkowski, Piotr M. Marusak, Piotr Tatjewski, Maciej Lawrynczuk, J. Sosnowski
    Abstract:

    The paper studies the dependability of software implementation of the numerical Generalized Predictive Control (GPC) Model Predictive Control (MPC) algorithm. The algorithm is implemented for a control system of a multivariable chemical reactor - a process with strong cross-couplings. Fault sensitivity of the proposed implementations is verified in experiments with a software implemented Fault Injector. Experimental methodology of disturbing the software implementation of the control algorithm is presented. The impact of Faults on the quality of the controller performance is analysed and techniques improving the dependability of the algorithm implementation are proposed. The experimental results prove the efficiency of the software improvements.

  • On improving dependability of the numerical GPC algorithm
    2009 European Control Conference (ECC), 2009
    Co-Authors: P. Gawkowski, Maciej Ławryńczuk, Piotr M. Marusak, Piotr Tatjewski, J. Sosnowski
    Abstract:

    The paper studies the dependability of software implementation of the numerical Generalized Predictive Control (GPC) Model Predictive Control (MPC) algorithm. The algorithm is implemented for a control system of a multivariable chemical reactor - a process with strong cross-couplings. Fault sensitivity of the proposed implementations is verified in experiments with a software implemented Fault Injector. Experimental methodology of disturbing the software implementation of the control algorithm is presented. The impact of Faults on the quality of the controller performance is analysed and techniques improving the dependability of the algorithm implementation are proposed. The experimental results prove the efficiency of the software improvements.

  • PRDC - Using software implemented Fault inserter in dependability analysis
    2002 Pacific Rim International Symposium on Dependable Computing 2002. Proceedings., 2002
    Co-Authors: P. Gawkowski, J. Sosnowski
    Abstract:

    We investigate program susceptibility to hardware Faults in Win32 environment. For this purpose we use the software implemented Fault Injector FITS. We analyze natural Fault resistivity of COTS systems and the effectiveness of various software techniques improving system dependability. The problems of experiment tuning and result interpretation are discussed in context of a wide spectrum of applications.

  • HLDVT - Experimental validation of Fault detection and Fault tolerance mechanisms
    Seventh IEEE International High-Level Design Validation and Test Workshop 2002., 2002
    Co-Authors: P. Gawkowski, J. Sosnowski
    Abstract:

    The paper deals with the problem of validating the effectiveness of hardware and software mechanisms decreasing system susceptibility to hardware Faults. The validation process is based on the use of software implemented Fault Injector (FITS). The performed analysis concentrates on tuning the profile of Faults and experiment set-ups. The presented simulation results are explained in context of the considered applications.

Ramesh Chandra - One of the best experts on this subject based on the ideXlab platform.

  • A global-state-triggered Fault Injector for distributed system evaluation
    IEEE Transactions on Parallel and Distributed Systems, 2004
    Co-Authors: Ramesh Chandra, R M Lefever, K.r. Joshi, M. Cukier, W.h. Sanders
    Abstract:

    Validation of the dependability of distributed systems via Fault injection is gaining importance because distributed systems are being increasingly used in environments with high dependability requirements. The fact that distributed systems can fail in subtle ways that depend on the state of multiple parts of the system suggests that a global-state-based Fault injection mechanism should be used to validate them. However, global-state-based Fault injection is challenging since it is very difficult in practice to maintain the global state of a distributed system at runtime with minimal intrusion into the system execution. We present Loki, a global-state-based Fault Injector, which has been designed with the goals of low intrusion, high precision, and high flexibility. Loki achieves these goals by utilizing the ideas of partial view of global state, optimistic synchronization, and offline analysis. In Loki, Faults are injected based on a partial, view of the global state of the system, and a post-runtime analysis is performed to place events and injections into a single global timeline and to discard experiments with incorrect Fault injections. Finally, the experiments with correct Fault injections are used to estimate user-specified performance and dependability measures. A flexible measure language has been designed that facilitates the specification of a wide range of measures.

  • dynamic node management and measure estimation in a state driven Fault Injector
    Symposium on Reliable Distributed Systems, 2000
    Co-Authors: Ramesh Chandra, Michel Cukier, R M Lefever, William H Sanders
    Abstract:

    Validation of distributed systems using Fault injection is difficult because of their inherent complexity, lack of a global clock, and lack of an easily accessible notion of a global state. To address these challenges, the Loki Fault Injector injects Faults based on a partial view of the global state of a distributed system, and performs a post-runtime analysis using an off-line clock synchronization algorithm to determine whether the Faults were properly injected. In this paper, we first describe enhanced runtime architecture for the Loki Fault Injector and then present a new method for obtaining measures in Loki. The enhanced runtime allows dynamic entry and exit of nodes in the system. It also offers more efficient multicast of notification messages and more efficient communication between state machines on the same host, and is more scalable than the previous runtime. We then detail a new and flexible method for obtaining a wide range of performance and dependability measures in Loki.

  • SRDS - Dynamic node management and measure estimation in a state-driven Fault Injector
    Proceedings 19th IEEE Symposium on Reliable Distributed Systems SRDS-2000, 2000
    Co-Authors: Ramesh Chandra, Michel Cukier, R M Lefever, William H Sanders
    Abstract:

    Validation of distributed systems using Fault injection is difficult because of their inherent complexity, lack of a global clock, and lack of an easily accessible notion of a global state. To address these challenges, the Loki Fault Injector injects Faults based on a partial view of the global state of a distributed system, and performs a post-runtime analysis using an off-line clock synchronization algorithm to determine whether the Faults were properly injected. In this paper, we first describe enhanced runtime architecture for the Loki Fault Injector and then present a new method for obtaining measures in Loki. The enhanced runtime allows dynamic entry and exit of nodes in the system. It also offers more efficient multicast of notification messages and more efficient communication between state machines on the same host, and is more scalable than the previous runtime. We then detail a new and flexible method for obtaining a wide range of performance and dependability measures in Loki.

  • Loki: a state-driven Fault Injector for distributed systems
    Proceeding International Conference on Dependable Systems and Networks. DSN 2000, 2000
    Co-Authors: Ramesh Chandra, R M Lefever, M. Cukier, W.h. Sanders
    Abstract:

    Distributed applications can fail in subtle ways that depend on the state of multiple parts of a system. This complicates the validation of such systems via Fault injection, since it suggests that Faults should be injected based on the global state of the system. In Loki, Fault injection is performed based on a partial view of the global state of a distributed system, i.e. Faults injected in one node of the system can depend on the state of other nodes. Once Faults are injected, a post-runtime analysis, using off-line clock synchronization, is used to place events and injections on a single global timeline and to determine whether the intended Faults were properly injected. Finally, experiments containing successful Fault injections are used to estimate the specified measures. In addition to briefly reviewing the concepts behind Loki and its organization, we detail Loki's user interface. In particular, we describe the graphical user interfaces for specifying state machines and Faults, for executing a campaign and for verifying whether the Faults were properly injected.

  • DSN - Loki: a state-driven Fault Injector for distributed systems
    Proceeding International Conference on Dependable Systems and Networks. DSN 2000, 2000
    Co-Authors: Ramesh Chandra, Michel Cukier, R M Lefever, William H Sanders
    Abstract:

    Distributed applications can fail in subtle ways that depend on the state of multiple parts of a system. This complicates the validation of such systems via Fault injection, since it suggests that Faults should be injected based on the global state of the system. In Loki, Fault injection is performed based on a partial view of the global state of a distributed system, i.e. Faults injected in one node of the system can depend on the state of other nodes. Once Faults are injected, a post-runtime analysis, using off-line clock synchronization, is used to place events and injections on a single global timeline and to determine whether the intended Faults were properly injected. Finally, experiments containing successful Fault injections are used to estimate the specified measures. In addition to briefly reviewing the concepts behind Loki and its organization, we detail Loki's user interface. In particular, we describe the graphical user interfaces for specifying state machines and Faults, for executing a campaign and for verifying whether the Faults were properly injected.

J.j. Beahan - One of the best experts on this subject based on the ideXlab platform.

  • A software-implemented Fault injection methodology for design and validation of system Fault tolerance
    2001 International Conference on Dependable Systems and Networks, 2001
    Co-Authors: R.r. Some, G. Khanoyan, L. Callum, A. Agrawal, J.j. Beahan
    Abstract:

    Presents our experience in developing a methodology and tool at the Jet Propulsion Laboratory (JPL) for software-implemented Fault injection (SWIFI) into a parallel-processing supercomputer which is being designed for use in next-generation space exploration missions. The Fault Injector uses software-based strategies to emulate the effects of radiation-induced transients occurring in the system hardware components. JPL's SWIFI tool set, which is called JIFI (JPL's Implementation of a Fault Injector), is being used in conjunction with an appropriate system Fault model to evaluate candidate hardware and software Fault tolerance architectures, to determine the sensitivity of applications to Faults, and to measure the effectiveness of Fault detection, isolation and recovery strategies. JIFI has been validated to inject Faults into user-specified CPU registers and memory regions with a uniform random distribution in location and time. Together with verifiers, classifiers and run scripts, JIFI enables massive Fault injection campaigns and statistical data analysis.

  • DSN - A software-implemented Fault injection methodology for design and validation of system Fault tolerance
    Proceedings International Conference on Dependable Systems and Networks, 2001
    Co-Authors: R.r. Some, G. Khanoyan, L. Callum, A. Agrawal, J.j. Beahan
    Abstract:

    Presents our experience in developing a methodology and tool at the Jet Propulsion Laboratory (JPL) for software-implemented Fault injection (SWIFI) into a parallel-processing supercomputer which is being designed for use in next-generation space exploration missions. The Fault Injector uses software-based strategies to emulate the effects of radiation-induced transients occurring in the system hardware components. JPL's SWIFI tool set, which is called JIFI (JPL's Implementation of a Fault Injector), is being used in conjunction with an appropriate system Fault model to evaluate candidate hardware and software Fault tolerance architectures, to determine the sensitivity of applications to Faults, and to measure the effectiveness of Fault detection, isolation and recovery strategies. JIFI has been validated to inject Faults into user-specified CPU registers and memory regions with a uniform random distribution in location and time. Together with verifiers, classifiers and run scripts, JIFI enables massive Fault injection campaigns and statistical data analysis.

R M Lefever - One of the best experts on this subject based on the ideXlab platform.

  • A global-state-triggered Fault Injector for distributed system evaluation
    IEEE Transactions on Parallel and Distributed Systems, 2004
    Co-Authors: Ramesh Chandra, R M Lefever, K.r. Joshi, M. Cukier, W.h. Sanders
    Abstract:

    Validation of the dependability of distributed systems via Fault injection is gaining importance because distributed systems are being increasingly used in environments with high dependability requirements. The fact that distributed systems can fail in subtle ways that depend on the state of multiple parts of the system suggests that a global-state-based Fault injection mechanism should be used to validate them. However, global-state-based Fault injection is challenging since it is very difficult in practice to maintain the global state of a distributed system at runtime with minimal intrusion into the system execution. We present Loki, a global-state-based Fault Injector, which has been designed with the goals of low intrusion, high precision, and high flexibility. Loki achieves these goals by utilizing the ideas of partial view of global state, optimistic synchronization, and offline analysis. In Loki, Faults are injected based on a partial, view of the global state of the system, and a post-runtime analysis is performed to place events and injections into a single global timeline and to discard experiments with incorrect Fault injections. Finally, the experiments with correct Fault injections are used to estimate user-specified performance and dependability measures. A flexible measure language has been designed that facilitates the specification of a wide range of measures.

  • dynamic node management and measure estimation in a state driven Fault Injector
    Symposium on Reliable Distributed Systems, 2000
    Co-Authors: Ramesh Chandra, Michel Cukier, R M Lefever, William H Sanders
    Abstract:

    Validation of distributed systems using Fault injection is difficult because of their inherent complexity, lack of a global clock, and lack of an easily accessible notion of a global state. To address these challenges, the Loki Fault Injector injects Faults based on a partial view of the global state of a distributed system, and performs a post-runtime analysis using an off-line clock synchronization algorithm to determine whether the Faults were properly injected. In this paper, we first describe enhanced runtime architecture for the Loki Fault Injector and then present a new method for obtaining measures in Loki. The enhanced runtime allows dynamic entry and exit of nodes in the system. It also offers more efficient multicast of notification messages and more efficient communication between state machines on the same host, and is more scalable than the previous runtime. We then detail a new and flexible method for obtaining a wide range of performance and dependability measures in Loki.

  • SRDS - Dynamic node management and measure estimation in a state-driven Fault Injector
    Proceedings 19th IEEE Symposium on Reliable Distributed Systems SRDS-2000, 2000
    Co-Authors: Ramesh Chandra, Michel Cukier, R M Lefever, William H Sanders
    Abstract:

    Validation of distributed systems using Fault injection is difficult because of their inherent complexity, lack of a global clock, and lack of an easily accessible notion of a global state. To address these challenges, the Loki Fault Injector injects Faults based on a partial view of the global state of a distributed system, and performs a post-runtime analysis using an off-line clock synchronization algorithm to determine whether the Faults were properly injected. In this paper, we first describe enhanced runtime architecture for the Loki Fault Injector and then present a new method for obtaining measures in Loki. The enhanced runtime allows dynamic entry and exit of nodes in the system. It also offers more efficient multicast of notification messages and more efficient communication between state machines on the same host, and is more scalable than the previous runtime. We then detail a new and flexible method for obtaining a wide range of performance and dependability measures in Loki.

  • Loki: a state-driven Fault Injector for distributed systems
    Proceeding International Conference on Dependable Systems and Networks. DSN 2000, 2000
    Co-Authors: Ramesh Chandra, R M Lefever, M. Cukier, W.h. Sanders
    Abstract:

    Distributed applications can fail in subtle ways that depend on the state of multiple parts of a system. This complicates the validation of such systems via Fault injection, since it suggests that Faults should be injected based on the global state of the system. In Loki, Fault injection is performed based on a partial view of the global state of a distributed system, i.e. Faults injected in one node of the system can depend on the state of other nodes. Once Faults are injected, a post-runtime analysis, using off-line clock synchronization, is used to place events and injections on a single global timeline and to determine whether the intended Faults were properly injected. Finally, experiments containing successful Fault injections are used to estimate the specified measures. In addition to briefly reviewing the concepts behind Loki and its organization, we detail Loki's user interface. In particular, we describe the graphical user interfaces for specifying state machines and Faults, for executing a campaign and for verifying whether the Faults were properly injected.

  • DSN - Loki: a state-driven Fault Injector for distributed systems
    Proceeding International Conference on Dependable Systems and Networks. DSN 2000, 2000
    Co-Authors: Ramesh Chandra, Michel Cukier, R M Lefever, William H Sanders
    Abstract:

    Distributed applications can fail in subtle ways that depend on the state of multiple parts of a system. This complicates the validation of such systems via Fault injection, since it suggests that Faults should be injected based on the global state of the system. In Loki, Fault injection is performed based on a partial view of the global state of a distributed system, i.e. Faults injected in one node of the system can depend on the state of other nodes. Once Faults are injected, a post-runtime analysis, using off-line clock synchronization, is used to place events and injections on a single global timeline and to determine whether the intended Faults were properly injected. Finally, experiments containing successful Fault injections are used to estimate the specified measures. In addition to briefly reviewing the concepts behind Loki and its organization, we detail Loki's user interface. In particular, we describe the graphical user interfaces for specifying state machines and Faults, for executing a campaign and for verifying whether the Faults were properly injected.

W.h. Sanders - One of the best experts on this subject based on the ideXlab platform.

  • A global-state-triggered Fault Injector for distributed system evaluation
    IEEE Transactions on Parallel and Distributed Systems, 2004
    Co-Authors: Ramesh Chandra, R M Lefever, K.r. Joshi, M. Cukier, W.h. Sanders
    Abstract:

    Validation of the dependability of distributed systems via Fault injection is gaining importance because distributed systems are being increasingly used in environments with high dependability requirements. The fact that distributed systems can fail in subtle ways that depend on the state of multiple parts of the system suggests that a global-state-based Fault injection mechanism should be used to validate them. However, global-state-based Fault injection is challenging since it is very difficult in practice to maintain the global state of a distributed system at runtime with minimal intrusion into the system execution. We present Loki, a global-state-based Fault Injector, which has been designed with the goals of low intrusion, high precision, and high flexibility. Loki achieves these goals by utilizing the ideas of partial view of global state, optimistic synchronization, and offline analysis. In Loki, Faults are injected based on a partial, view of the global state of the system, and a post-runtime analysis is performed to place events and injections into a single global timeline and to discard experiments with incorrect Fault injections. Finally, the experiments with correct Fault injections are used to estimate user-specified performance and dependability measures. A flexible measure language has been designed that facilitates the specification of a wide range of measures.

  • Loki: a state-driven Fault Injector for distributed systems
    Proceeding International Conference on Dependable Systems and Networks. DSN 2000, 2000
    Co-Authors: Ramesh Chandra, R M Lefever, M. Cukier, W.h. Sanders
    Abstract:

    Distributed applications can fail in subtle ways that depend on the state of multiple parts of a system. This complicates the validation of such systems via Fault injection, since it suggests that Faults should be injected based on the global state of the system. In Loki, Fault injection is performed based on a partial view of the global state of a distributed system, i.e. Faults injected in one node of the system can depend on the state of other nodes. Once Faults are injected, a post-runtime analysis, using off-line clock synchronization, is used to place events and injections on a single global timeline and to determine whether the intended Faults were properly injected. Finally, experiments containing successful Fault injections are used to estimate the specified measures. In addition to briefly reviewing the concepts behind Loki and its organization, we detail Loki's user interface. In particular, we describe the graphical user interfaces for specifying state machines and Faults, for executing a campaign and for verifying whether the Faults were properly injected.