96
Views
0
CrossRef citations to date
0
Altmetric
Research Articles

Probabilistic modeling of hardware and software interactions for system reliability assessment

ORCID Icon, , &

References

  • Bain, L. 2017. Statistical analysis of reliability and life-testing models: Theory and methods. New York, NY: Marcel Dekker.
  • Banks, D. L. 2005. Foundations of risk analysis: A knowledge and decision-oriented perspective. Terje Aven. Journal of the American Statistical Association 100 (470):703–4. doi: 10.1198/jasa.2005.s16.
  • Bengtsson, H., H. Corrada Bravo, R. Gentleman, O. Hossjer, H. Jaffee, D. Jiang, and P. Langfelder. 2019. “Package ‘matrixStats’”.
  • Bussieck, M. R., and A. Meeraus. 2004. General algebraic modeling system (GAMS). In Modeling languages in mathematical optimization, 137–57. Boston, MA: Springer.
  • Chandrasekar, P., and T. Sangeetha. 2014. Smart shopping cart with automatic billing system through RFID and ZigBee. In International Conference on Information Communication and Embedded Systems (ICICES2014), IEEE, 1–4.
  • Corp, G. D. 2019. “Stochastic Programming.” Accessed October 16, 2019. https://www.gams.com/latest/docs/UG_EMP_SP.html.
  • Davila Frias, A., N. Yodo, and O. P. Yadav. 2019. Mixed-degradation profiles assessment of critical components in cyber-physical systems. In 2019 Annual Reliability and Maintainability Symposium (RAMS), January 28–31, 2019, 1–6.
  • Ebeling, C. E. 2010. An introduction to reliability and mainainability engineering. Long Grove: Waveland Press, Inc.,
  • El-Sayed, N., and B. Schroeder. 2013. Reading between the lines of failure logs: Understanding how HPC systems fail. In 2013 43rd annual IEEE/IFIP international conference on dependable systems and networks (DSN), 2013: IEEE, pp. 1–12. doi: 10.1109/DSN.2013.6575356.
  • Feng, E., J. Zheng, and C. Liu. 2014. An integrated reliability model of hardware-software system. In 2014 10th International Conference on Reliability, Maintainability and Safety (ICRMS), 6–8 Aug, 577–80.
  • Friedman, M. A., P. Tran, and P. L. Goddard. 1992. Reliability techniques for combined hardware and software systems. Rome, NY: Rome Lab.
  • Goel, A. L., and K. Okumoto. 1979. Time-dependent error-detection rate model for software reliability and other performance measures. IEEE Transactions on Reliability R-28 (3):206–11. doi: 10.1109/TR.1979.5220566.
  • Gray, J. 1990. A census of Tandem system availability between 1985 and 1990. IEEE Transactions on Reliability 39 (4):409–18. doi: 10.1109/24.58719.
  • Hecht, H., and M. Hecht. 1986. Software reliability in the system context. IEEE Transactions on Software Engineering SE-12 (1):51–8. doi: 10.1109/TSE.1986.6312919.
  • Hewlett Packard Enterprise Development LP. 2020. “Support communication - customer bulletin-Document ID: a00097382en_us.” Accessed May 25, 2020. https://support.hpe.com/hpesc/public/docDisplay?docLocale=en_US&docId=a00097382en_us.
  • Hogg, R. V., J. W. McKean, and A. T. Craig. 2013. Introduction to mathematical statistics. 7th ed. Boston, MA: Pearson Education.
  • IEEE. 1990. Standard glossary of software engineering terminology (IEEE Std. 610.12-1990). IEEE New York: EUA.
  • Iyer, R. K., and P. Velardi. 1985. Hardware-related software errors: Measurement and analysis. IEEE Transactions on Software Engineering SE-11 (2):223–31. doi: 10.1109/TSE.1985.232198.
  • Iyer, R. K., D. J. Rossetti, and M.-C. Hsueh. 1986. Measurement and modeling of computer reliability as affected by system activity. ACM Transactions on Computer Systems 4 (3):214–37. doi: 10.1145/6420.6422.
  • Jeske, D. R., and X. Zhang. 2005. Some successful approaches to software reliability modeling in industry. Journal of Systems and Software 74 (1):85–99. doi: 10.1016/j.jss.2003.10.024.
  • Kadav, A., M. J. Renzelmann, and M. M. Swift. 2009. Tolerating hardware device failures in software. In Proceedings of the ACM SIGOPS 22nd symposium on Operating systems principles, 59–72. doi: 10.1145/1629575.1629582.
  • Koc, H., S. S. Shaik, and P. P. Madupu. 2019. Reliability modeling and analysis for cyber physical systems. In 2019 IEEE 9th Annual Computing and Communication Workshop and Conference (CCWC), 2019: IEEE, 0448–0451. doi: 10.1109/CCWC.2019.8666606.
  • Kumar, S., and K. Li. 2002. Performance impact of using ESP to implement VMMC firmware. In Proceedings of the 1st Workshop on novel uses of system area networks (SAN-1). Cambridge, MA: Citeseer.
  • Li, M.-L., P. Ramachandran, S. K. Sahoo, S. V. Adve, V. S. Adve, and Y. Zhou. 2008. Understanding the propagation of hard errors to software and implications for resilient system design. ACM SIGPLAN Notices 43 (3):265–76. doi: 10.1145/1353536.1346315.
  • Lin, T.-T., and D. P. Siewiorek. 1990. Error log analysis: Statistical modeling and heuristic trend analysis. IEEE Transactions on Reliability 39 (4):419–32. doi: 10.1109/24.58720.
  • Los Alamos National Laboratory. 2005. Ultrascale Systems Research Center (USRC) data sources. https://usrc.lanl.gov/data-sources.php.
  • Lyu, M. R. 1997. Handbook of software reliability engineering. Los Alamitos, CA: IEEE Computer Society Press.
  • Mersmann, O., H. Trautmann, D. Steuer, B. Bornkamp, and M. O. Mersmann. 2018. “Package ‘truncnorm’,” Technical report. https://github. com/olafmersmann/truncnorm.
  • Ohba, M., and S. Yamada. 1984. S-shaped software reliability growth models. In International colloquium on reliability and maintainability (ICRMS), 4 th, Tregastel, France, 430–6.
  • Park, J., H.-J. Kim, J.-H. Shin, and J. Baik. 2012. An embedded software reliability model with consideration of hardware related software failures. In 2012 IEEE Sixth International Conference on Software Security and Reliability, 2012: IEEE, 207–14.
  • Pham, H. 2000. Software reliability. Piscataway: Springer.
  • Pham, H. 2003. Recent studies in software reliability engineering. In Handbook of reliability engineering, 285–302. London: Springer.
  • Pham, H. 2006. System software reliability. Piscataway: Springer.
  • Ranade, D. M. 2003. Shared data clusters: Scaleable, manageable, and highly available systems (Veritas Series). John Wiley & Sons.
  • Rausand, M., and A. Høyland. 2003. System reliability theory: Models, statistical methods, and applications. Vol. 396. John Wiley & Sons.
  • Ross, S. M. 2014. Introduction to probability models. Los Angeles, CA: Academic press.
  • Roy, D. S., C. Murthy, and D. K. Mohanta. 2015. Reliability analysis of phasor measurement unit incorporating hardware and software interaction failures. IET Generation, Transmission & Distribution 9 (2):164–71. doi: 10.1049/iet-gtd.2014.0115.
  • Roy, D. S., S. Verma, C. Murthy, and D. K. Mohanta. 2016. Reliability assessment of cyber-physical systems: A hardware–software interaction perspective. In Cyber-physical systems: a computational perspective, 249–60. Boca Raton, FL: Taylor & Francis Group, LLC.
  • Salvendy, G. 2001. Handbook of industrial engineering: Technology and operations management. New York, NY: John Wiley & Sons.
  • Schroeder, B., and G. A. Gibson. 2007. Disk failures in the real world: What does an MTTF of 1, 000, 000 hours mean to you? FAST 7 (1):1–16.
  • Schroeder, B., and G. Gibson. 2010. A large-scale study of failures in high-performance computing systems. IEEE Transactions on Dependable and Secure Computing 7 (4):337–50. doi: 10.1109/TDSC.2009.4.
  • Shapiro, A., and A. Nemirovski. 2005. On complexity of stochastic programming problems. In Continuous optimization, 111–46. Boston, MA: Springer.
  • Siddesh, G. M., G. C. Deka, K. G. Srinivasa, and L. M. Patnaik. 2015. Cyber-physical systems: A computational perspective. Boca Raton, FL: CRC Press.
  • Song, K., I. H. Chang, and H. Pham. 2017. A software reliability model with a Weibull fault detection rate function subject to operating environments. Applied Sciences 7 (10):983. doi: 10.3390/app7100983.
  • Tang, D., and R. K. Iyer. 1992. Analysis of the VAX/VMS error logs in multicomputer environments-a case study of software dependability. In [1992] Proceedings Third International Symposium on Software Reliability Engineering, IEEE, 216–26.
  • Teng, X., H. Pham, and D. R. Jeske. 2006. Reliability modeling of hardware and software interactions, and its applications. IEEE Transactions on Reliability 55 (4):571–7. doi: 10.1109/TR.2006.884589.
  • Welke, S. R., B. W. Johnson, and J. H. Aylor. 1995. Reliability modeling of hardware/software systems. IEEE Transactions on Reliability 44 (3):413–8. doi: 10.1109/24.406575.
  • Zhang, T., M. Xie, L. C. Tang, and S. H. Ng. 2005. Reliability and modeling of systems integrated with firmware and hardware. International Journal of Reliability, Quality and Safety Engineering 12 (3):227–39. doi: 10.1142/S021853930500180X.
  • Zio, E. 2007. An introduction to the basics of reliability and risk analysis. Vol. 13. World scientific.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.