Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/317795.317812acmconferencesArticle/Chapter ViewAbstractPublication PagesmetricsConference Proceedingsconference-collections
Article
Free Access

The effect of system workload on error latency: an experimental study

Authors Info & Claims
Published:01 August 1985Publication History

ABSTRACT

In this paper, a methodology for determining and characterizing error latency is developed. The method is based on real workload data, gathered by an experiment instrumented on a VAX 11/780 during the normal workload cycle of the installation. This is the first attempt at jointly studying error latency and workload variations in a full production system. Distributions of error latency were generated by simulating the occurrence of faults under varying workload conditions. A family of error latency distributions so generated illustrate that error latency is not so much a function of when in time a fault occurred but rather a function of the workload that followed the failure. The study finds that the mean error latency varies by a 1 to 8 (hours) ratio between high and low workloads. The method is general and can be applied to any system.

References

  1. Butner 80.S. E. Butner and R. K. Iyer, "A Statistical Study of Reliability and System Load at SLAC," Diges~, Tenth Inferncaiona~ Symposium on Fcmlt Tolerant Computing, Kyoto, Japan, Oct 1980.Google ScholarGoogle Scholar
  2. Castillo 80.X. Castillo and D. P. Siewiorek, "A Performance Reliabtlity Model for Computing Systems," Digesz, Tenth Internation~ Symposium on Faul.t Tolerant Computing, Kyoto, Japan, Oct 1980.Google ScholarGoogle Scholar
  3. Castillo 81.X. CastiUo and D. P. Siewlorek, "Workload, Performance and Reliability of Digital Computing Systems," Digest, ~2eventh International Symposium on Fault-Tolerant Computing, Portland, Maine, June 1981, pp. 84-89.Google ScholarGoogle Scholar
  4. Chillarege 85.R. Chtllarege and R. K. Iyer, "An Experimental Study of Error Latency and System Workload." CSG Technical Report, Univ. of Illinois, Urbana, IL.Google ScholarGoogle Scholar
  5. Cortes 84.M. L. Cortes and R. K. Iyer, "Device Failures and System Activity: A Thermal Effects Model," "Digest, Fourteenth lnter. Symposium on Fault- Tolerant Computing," Orlando, Florida June 1984.Google ScholarGoogle Scholar
  6. DEC 80a.Digital Equipment Corporation, VAX Hardware Handboott, DEC 1980.Google ScholarGoogle Scholar
  7. DEC 80b.Digital Equipment Corporation, VAX Architecture Handbook, DEC 1980.Google ScholarGoogle Scholar
  8. DEC 80c.Digital Equipment Corporation, KA780 Field Maintenxmze Print Set, DEC 1980.Google ScholarGoogle Scholar
  9. Gunther 80.N. L. Gunther and W. C, Carter, "Remarks on the Prob. of detecting faults," Digest lOth International SympoMum on F_m_d_t-Tolerance Comput- /rig, Kyoto, Japan, Oct 1980.Google ScholarGoogle Scholar
  10. Iyer 82a.R. K. Iyer, S, E, Burner and E. J. McCIuskey, "A Statistical Fallure/Imad Relationship; Results of a Multi-Computer Study," IFP~ Transactions on Computers, July 1982.Google ScholarGoogle Scholar
  11. Iyer 82b.R. K. Iyer and D. J. Rossetti, "A Statistical Load Dependency of CPU Errors at SLAC," Digest, 12th International Symposium on Fault Tolerant Comtncing, Santa Monica, California, June 1982.Google ScholarGoogle Scholar
  12. Iyer 83.R. K. Iyer and D. J. Rossetti, "Permanent CPU Errors and System Activity: Measurement and Modellng", Digest, Real-Time Systems Symtx~iurn, Arlington, Virginia, Dec 1983.Google ScholarGoogle Scholar
  13. Lala 83.J. H. Lala, "Fault Detection, Isolation and Reconfiguration ff FIMP: Methods and Experimental Results", Firth Dig. Avionics Syst. Conf., 1983.Google ScholarGoogle Scholar
  14. McGough 81.J. G. McGough and F. L. Swern, "Measurement of Fault Latency in a Digital Avionic Mint Processor," NASA Contractor Report 3462, Oct 1981.Google ScholarGoogle Scholar
  15. McGough 83.J. G. McGough and F. L. Swern, "Measurement of Fault Latency in a Digital Avionic Mini Processor Part H," NASA Contractor Report 3651, Jan 1983.Google ScholarGoogle Scholar
  16. Rossetti 81.D. J. Rossetti and R. K. lyer, "A Software System for Reliability and Workload Analysis," CRC Tech Rpt 81-18, Center for Reliable Computing, Computer Systems Laboratory, Stanford Univ, Stanford, C.A., Dec 198 I.Google ScholarGoogle Scholar
  17. Shedletsky 73.J. J. Schedletsky and E. J. McCluskey, "The Error Latency of a Fault in a Combinational ClrcuR," Digest FTCS-3, june 1973.Google ScholarGoogle Scholar
  18. TEK84.Tektronix, User's Manual 9 I DW l For VAX/UNIX 4.1bsd Releasel, 1984, Tektronix, Oregon, USA.Google ScholarGoogle Scholar

Index Terms

  1. The effect of system workload on error latency: an experimental study

            Recommendations

            Comments

            Login options

            Check if you have access through your login credentials or your institution to get full access on this article.

            Sign in
            • Published in

              cover image ACM Conferences
              SIGMETRICS '85: Proceedings of the 1985 ACM SIGMETRICS conference on Measurement and modeling of computer systems
              August 1985
              203 pages
              ISBN:0897911695
              DOI:10.1145/317795

              Copyright © 1985 ACM

              Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

              Publisher

              Association for Computing Machinery

              New York, NY, United States

              Publication History

              • Published: 1 August 1985

              Permissions

              Request permissions about this article.

              Request Permissions

              Check for updates

              Qualifiers

              • Article

              Acceptance Rates

              Overall Acceptance Rate459of2,691submissions,17%

            PDF Format

            View or Download as a PDF file.

            PDF

            eReader

            View online with eReader.

            eReader