Accendo Reliability

Your Reliability Engineering Professional Development Site

  • Home
  • About
    • Contributors
    • About Us
    • Colophon
    • Survey
  • Reliability.fm
    • Speaking Of Reliability
    • Rooted in Reliability: The Plant Performance Podcast
    • Quality during Design
    • CMMSradio
    • Way of the Quality Warrior
    • Critical Talks
    • Asset Performance
    • Dare to Know
    • Maintenance Disrupted
    • Metal Conversations
    • The Leadership Connection
    • Practical Reliability Podcast
    • Reliability Hero
    • Reliability Matters
    • Reliability it Matters
    • Maintenance Mavericks Podcast
    • Women in Maintenance
    • Accendo Reliability Webinar Series
  • Articles
    • CRE Preparation Notes
    • NoMTBF
    • on Leadership & Career
      • Advanced Engineering Culture
      • ASQR&R
      • Engineering Leadership
      • Managing in the 2000s
      • Product Development and Process Improvement
    • on Maintenance Reliability
      • Aasan Asset Management
      • AI & Predictive Maintenance
      • Asset Management in the Mining Industry
      • CMMS and Maintenance Management
      • CMMS and Reliability
      • Conscious Asset
      • EAM & CMMS
      • Everyday RCM
      • History of Maintenance Management
      • Life Cycle Asset Management
      • Maintenance and Reliability
      • Maintenance Management
      • Plant Maintenance
      • Process Plant Reliability Engineering
      • RCM Blitz®
      • ReliabilityXperience
      • Rob’s Reliability Project
      • The Intelligent Transformer Blog
      • The People Side of Maintenance
      • The Reliability Mindset
    • on Product Reliability
      • Accelerated Reliability
      • Achieving the Benefits of Reliability
      • Apex Ridge
      • Breaking Bad for Reliability
      • Field Reliability Data Analysis
      • Metals Engineering and Product Reliability
      • Musings on Reliability and Maintenance Topics
      • Product Validation
      • Reliability by Design
      • Reliability Competence
      • Reliability Engineering Insights
      • Reliability in Emerging Technology
      • Reliability Knowledge
    • on Risk & Safety
      • CERM® Risk Insights
      • Equipment Risk and Reliability in Downhole Applications
      • Operational Risk Process Safety
    • on Systems Thinking
      • The RCA
      • Communicating with FINESSE
    • on Tools & Techniques
      • Big Data & Analytics
      • Experimental Design for NPD
      • Innovative Thinking in Reliability and Durability
      • Inside and Beyond HALT
      • Inside FMEA
      • Institute of Quality & Reliability
      • Integral Concepts
      • Learning from Failures
      • Progress in Field Reliability?
      • R for Engineering
      • Reliability Engineering Using Python
      • Reliability Reflections
      • Statistical Methods for Failure-Time Data
      • Testing 1 2 3
      • The Hardware Product Develoment Lifecycle
      • The Manufacturing Academy
  • eBooks
  • Resources
    • Special Offers
    • Accendo Authors
    • FMEA Resources
    • Glossary
    • Feed Forward Publications
    • Openings
    • Books
    • Webinar Sources
    • Journals
    • Higher Education
    • Podcasts
  • Courses
    • Your Courses
    • 14 Ways to Acquire Reliability Engineering Knowledge
    • Live Courses
      • Introduction to Reliability Engineering & Accelerated Testings Course Landing Page
      • Advanced Accelerated Testing Course Landing Page
    • Integral Concepts Courses
      • Reliability Analysis Methods Course Landing Page
      • Applied Reliability Analysis Course Landing Page
      • Statistics, Hypothesis Testing, & Regression Modeling Course Landing Page
      • Measurement System Assessment Course Landing Page
      • SPC & Process Capability Course Landing Page
      • Design of Experiments Course Landing Page
    • The Manufacturing Academy Courses
      • An Introduction to Reliability Engineering
      • Reliability Engineering Statistics
      • An Introduction to Quality Engineering
      • Quality Engineering Statistics
      • FMEA in Practice
      • Process Capability Analysis course
      • Root Cause Analysis and the 8D Corrective Action Process course
      • Return on Investment online course
    • Industrial Metallurgist Courses
    • FMEA courses Powered by The Luminous Group
      • FMEA Introduction
      • AIAG & VDA FMEA Methodology
    • Barringer Process Reliability Introduction
      • Barringer Process Reliability Introduction Course Landing Page
    • Fault Tree Analysis (FTA)
    • Foundations of RCM online course
    • Reliability Engineering for Heavy Industry
    • How to be an Online Student
    • Quondam Courses
  • Webinars
    • Upcoming Live Events
    • Accendo Reliability Webinar Series
  • Calendar
    • Call for Papers Listing
    • Upcoming Webinars
    • Webinar Calendar
  • Login
    • Member Home
Home » Articles » Sample Size and Duration and MTBF

by Fred Schenkelberg Leave a Comment

Sample Size and Duration and MTBF

Sample Size and Duration and MTBF

Sample Size and Duration and MTBF

If you have been a reliability engineer for a week or more, or worked with a reliability engineer for a day or more, someone has asked about testing planning. The conversation may have started with “How many samples and how long will the test take?”

You have heard the sample size question.

What I continue to hear in these discussions is the mistaken idea that adding another sample extends the effective time the testing represents in normal use. If I have a 1,000-hour test and add another unit, that does not mean the results represent reliability for an additional 1,000 hours of use time.

The Legacy of the Exponential Distribution

The problem stems from the exponential distribution, where the chance of failure each hour for each unit is the same. There is no change to the hazard rate over time; therefore, accumulating more individual hours provides additional information about how the system will behave over an hour of use.

This is rarely if ever is true. Hazard rates change as different failure mechanisms evolve, as materials settle or wear, as damage accumulates, or environmental contains change. If you check, you will find that assuming a content failure rate is invalid for your product or system.

Yet, you know that.

Then why do so many, when planning a reliability test, continue to rely on the total time of testing (number of units in test times the number of hours under test)?

  • It’s in the textbooks describing how to plan a life test
  • It’s what we have always done
  • It’s in XYZ standard
  • It’s what the customer requested

Maybe it’s time to step back and consider what you are trying to accomplish with the life test?

Is the Goal Reliability of MTBF?

In general terms, we conduct testing to learn something. For product life testing, we generally want to know what will fail or when it will fail. We want to understand if the current design and assembly process will create items that will work as expected for as long as expected within our customers’ use conditions. Basically, how many should survive for 2 years (for example).

Learning what will fail is often an exploratory or discovery-style testing. HALT or STRIFE, or HAST, or margin testing, or similar test is the deliberate work to reveal failure mechanisms. Life testing, accelerated life testing, ALT, duration testing, endurance testing, or similar tends to focus on will the units survive long enough.

The HALT approach involves finding failure mechanisms and designing them out of the product, thus improving the reliability performance of the product. The ALT approach involves understanding the time to failure behavior of failure mechanisms in order to characterize the reliability performance of the product.

Or, are you just after estimating MTBF?

Realize that MTBF is not reliability. It is neither a suitable nor complete description of your product’s probability of survival over a duration in your customer’s set of use conditions. It’s just a representation of the average time to failure, or the inverse of the average failure rate. Not very informative when trying to identify or eliminate failure mechanisms or make a decision on reliability performance.

If you only want MTBF (and I’m not able to dissuade you from this folly 😉 then you can use the idea that adding another unit to your test improves your ability to estimate MTBF. If you are assuming the constant hazard rate, then you are fine focusing only on MTBF.

The results of MTBF-based testing will provide some information on the actual reliability performance of your product. If you run your test for 1,000 hours, then you will learn about the reliability performance of the first 1,000 hours of you product’s use. If you run more samples, you will improve your understanding of the performance over those 1,000 hours.

In the vast majority of cases where the assumption of a constant hazard rate is invalid, running 10 units for 1,000 hours does not imply any meaningful information about the reliability performance out to 10,000 hours. The MTBF value may say you have achieved a 10,000 MTBF, yet you have really only learned something about the initial 1,000-hour operation for your product.

Be Prepared for Proper Test Planning

In summary, when you hear this concept of test planning based on a constant hazard rate assumption, help your team understand the error. Help them develop and conduct reliability testing that actually provides meaningful information. Help them create results they can use to make decisions.

Filed Under: Articles, NoMTBF

About Fred Schenkelberg

I am the reliability expert at FMS Reliability, a reliability engineering and management consulting firm I founded in 2004. I left Hewlett Packard (HP)’s Reliability Team, where I helped create a culture of reliability across the corporation, to assist other organizations.

« Importance of Failure Repeatability

Comments

  1. Kelvin Montero says

    November 27, 2016 at 8:49 AM

    Great article… and yes I’ve been in aforementioned environment as well.

    Reply
    • Fred Schenkelberg says

      November 27, 2016 at 8:52 AM

      thanks for the note and kind words. How did it all turn out in your environment? How did you handle the sample size quetsions?

      Cheers,

      Fred

      Reply

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

The NoMTBF logo

Devoted to the eradication of the misuse of MTBF.

Photo of Fred SchenkelbergArticles by Fred Schenkelberg and guest authors

in the NoMTBF article series

Recent Posts

  • Sample Size and Duration and MTBF
  • Importance of Failure Repeatability
  • Measuring the Success of Your Risk Management Program
  • 31 Best Practice Maintenance Management Tips
  • Hardware Product Realization in the Age of AI

Join Accendo

Receive information and updates about articles and many other resources offered by Accendo Reliability by becoming a member.

It’s free and only takes a minute.

Join Today

© 2025 FMS Reliability · Privacy Policy · Terms of Service · Cookies Policy

Book the Course with John
  Ask a question or send along a comment. Please login to view and use the contact form.
This site uses cookies to give you a better experience, analyze site traffic, and gain insight to products or offers that may interest you. By continuing, you consent to the use of cookies. Learn how we use cookies, how they work, and how to set your browser preferences by reading our Cookies Policy.