Article: Improving Availability of Multicore Real-Time Systems Suffering Both Permanent and Transient Faults

Title: Improving Availability of Multicore Real-Time Systems Suffering Both Permanent and Transient Faults
Authors: Zhou, Junlong; Hu, Xiaobo Sharon; Ma, Yue; Sun, Jin; Wei, Tongquan; Hu, Shiyan
Keywords: lifetime reliability; multicore real-time systems; soft-error reliability; System availability
Issue Date: 2019
Citation: IEEE Transactions on Computers, 2019, v. 68, n. 12, p. 1785-1801
Abstract: CMOS scaling has greatly increased concerns for both lifetime reliability due to permanent faults and soft-error reliability due to transient faults. Most existing works only focus on one of the two reliability concerns, but often times techniques used to increase one type of reliability may adversely impact the other type. A few efforts do consider both types of reliability together and use two different metrics to quantify the two types of reliability. However, for many systems, the user's concern is to maximize system availability by improving the mean time to failure (MTTF), regardless of whether the failure is caused by permanent or transient faults. Addressing this concern requires a uniform metric to measure the effect due to both types of faults. This paper introduces a novel analytical expression for calculating the MTTF due to transient faults. Using this new formula and an existing method to evaluate system MTTF, we tackle the problem of maximizing availability for multicore real-time systems with consideration of permanent and transient faults. A framework is proposed to solve the system availability maximization problem. Experimental results on a hardware board and simulation results of synthetic tasks show that our scheme significantly improves system MTTF (and hence availability) compared with existing techniques.
Persistent Identifier: http://hdl.handle.net/10722/336229
ISSN: 0018-9340
2023 Impact Factor: 3.6
2023 SCImago Journal Rankings: 1.307
ISI Accession Number ID: WOS:000498734600007

 

DC Field | Value | Language
dc.contributor.author | Zhou, Junlong | -
dc.contributor.author | Hu, Xiaobo Sharon | -
dc.contributor.author | Ma, Yue | -
dc.contributor.author | Sun, Jin | -
dc.contributor.author | Wei, Tongquan | -
dc.contributor.author | Hu, Shiyan | -
dc.date.accessioned | 2024-01-15T08:24:40Z | -
dc.date.available | 2024-01-15T08:24:40Z | -
dc.date.issued | 2019 | -
dc.identifier.citation | IEEE Transactions on Computers, 2019, v. 68, n. 12, p. 1785-1801 | -
dc.identifier.issn | 0018-9340 | -
dc.identifier.uri | http://hdl.handle.net/10722/336229 | -
dc.description.abstract | CMOS scaling has greatly increased concerns for both lifetime reliability due to permanent faults and soft-error reliability due to transient faults. Most existing works only focus on one of the two reliability concerns, but often times techniques used to increase one type of reliability may adversely impact the other type. A few efforts do consider both types of reliability together and use two different metrics to quantify the two types of reliability. However, for many systems, the user's concern is to maximize system availability by improving the mean time to failure (MTTF), regardless of whether the failure is caused by permanent or transient faults. Addressing this concern requires a uniform metric to measure the effect due to both types of faults. This paper introduces a novel analytical expression for calculating the MTTF due to transient faults. Using this new formula and an existing method to evaluate system MTTF, we tackle the problem of maximizing availability for multicore real-time systems with consideration of permanent and transient faults. A framework is proposed to solve the system availability maximization problem. Experimental results on a hardware board and simulation results of synthetic tasks show that our scheme significantly improves system MTTF (and hence availability) compared with existing techniques. | -
dc.language | eng | -
dc.relation.ispartof | IEEE Transactions on Computers | -
dc.subject | lifetime reliability | -
dc.subject | multicore real-time systems | -
dc.subject | soft-error reliability | -
dc.subject | System availability | -
dc.title | Improving Availability of Multicore Real-Time Systems Suffering Both Permanent and Transient Faults | -
dc.type | Article | -
dc.description.nature | link_to_subscribed_fulltext | -
dc.identifier.doi | 10.1109/TC.2019.2935042 | -
dc.identifier.scopus | eid_2-s2.0-85074408085 | -
dc.identifier.volume | 68 | -
dc.identifier.issue | 12 | -
dc.identifier.spage | 1785 | -
dc.identifier.epage | 1801 | -
dc.identifier.eissn | 1557-9956 | -
dc.identifier.isi | WOS:000498734600007 | -
