Accendo Reliability

Your Reliability Engineering Professional Development Site

  • Home
  • About
    • Contributors
    • About Us
    • Colophon
    • Survey
  • Reliability.fm
    • Speaking Of Reliability
    • Rooted in Reliability: The Plant Performance Podcast
    • Quality during Design
    • CMMSradio
    • Way of the Quality Warrior
    • Critical Talks
    • Asset Performance
    • Dare to Know
    • Maintenance Disrupted
    • Metal Conversations
    • The Leadership Connection
    • Practical Reliability Podcast
    • Reliability Matters
    • Reliability it Matters
    • Maintenance Mavericks Podcast
    • Women in Maintenance
    • Accendo Reliability Webinar Series
  • Articles
    • CRE Preparation Notes
    • NoMTBF
    • on Leadership & Career
      • Advanced Engineering Culture
      • ASQR&R
      • Engineering Leadership
      • Managing in the 2000s
      • Product Development and Process Improvement
    • on Maintenance Reliability
      • Aasan Asset Management
      • AI & Predictive Maintenance
      • Asset Management in the Mining Industry
      • CMMS and Maintenance Management
      • CMMS and Reliability
      • Conscious Asset
      • EAM & CMMS
      • Everyday RCM
      • History of Maintenance Management
      • Life Cycle Asset Management
      • Maintenance and Reliability
      • Maintenance Management
      • Plant Maintenance
      • Process Plant Reliability Engineering
      • RCM Blitz®
      • ReliabilityXperience
      • Rob’s Reliability Project
      • The Intelligent Transformer Blog
      • The People Side of Maintenance
      • The Reliability Mindset
    • on Product Reliability
      • Accelerated Reliability
      • Achieving the Benefits of Reliability
      • Apex Ridge
      • Field Reliability Data Analysis
      • Metals Engineering and Product Reliability
      • Musings on Reliability and Maintenance Topics
      • Product Validation
      • Reliability by Design
      • Reliability Competence
      • Reliability Engineering Insights
      • Reliability in Emerging Technology
      • Reliability Knowledge
    • on Risk & Safety
      • CERM® Risk Insights
      • Equipment Risk and Reliability in Downhole Applications
      • Operational Risk Process Safety
    • on Systems Thinking
      • Communicating with FINESSE
      • The RCA
    • on Tools & Techniques
      • Big Data & Analytics
      • Experimental Design for NPD
      • Innovative Thinking in Reliability and Durability
      • Inside and Beyond HALT
      • Inside FMEA
      • Institute of Quality & Reliability
      • Integral Concepts
      • Learning from Failures
      • Progress in Field Reliability?
      • R for Engineering
      • Reliability Engineering Using Python
      • Reliability Reflections
      • Statistical Methods for Failure-Time Data
      • Testing 1 2 3
      • The Hardware Product Develoment Lifecycle
      • The Manufacturing Academy
  • eBooks
  • Resources
    • Accendo Authors
    • FMEA Resources
    • Glossary
    • Feed Forward Publications
    • Openings
    • Books
    • Webinar Sources
    • Journals
    • Higher Education
    • Podcasts
  • Courses
    • Your Courses
    • 14 Ways to Acquire Reliability Engineering Knowledge
    • Live Courses
      • Introduction to Reliability Engineering & Accelerated Testings Course Landing Page
      • Advanced Accelerated Testing Course Landing Page
    • Integral Concepts Courses
      • Reliability Analysis Methods Course Landing Page
      • Applied Reliability Analysis Course Landing Page
      • Statistics, Hypothesis Testing, & Regression Modeling Course Landing Page
      • Measurement System Assessment Course Landing Page
      • SPC & Process Capability Course Landing Page
      • Design of Experiments Course Landing Page
    • The Manufacturing Academy Courses
      • An Introduction to Reliability Engineering
      • Reliability Engineering Statistics
      • An Introduction to Quality Engineering
      • Quality Engineering Statistics
      • FMEA in Practice
      • Process Capability Analysis course
      • Root Cause Analysis and the 8D Corrective Action Process course
      • Return on Investment online course
    • Industrial Metallurgist Courses
    • FMEA courses Powered by The Luminous Group
      • FMEA Introduction
      • AIAG & VDA FMEA Methodology
    • Barringer Process Reliability Introduction
      • Barringer Process Reliability Introduction Course Landing Page
    • Fault Tree Analysis (FTA)
    • Foundations of RCM online course
    • Reliability Engineering for Heavy Industry
    • How to be an Online Student
    • Quondam Courses
  • Webinars
    • Upcoming Live Events
    • Accendo Reliability Webinar Series
  • Calendar
    • Call for Papers Listing
    • Upcoming Webinars
    • Webinar Calendar
  • Login
    • Member Home
Home » Articles » on Maintenance Reliability » Maintenance Management » Machines only Stop Working after their Parts Fail 

by Mike Sondalini Leave a Comment

Machines only Stop Working after their Parts Fail 

Machines only Stop Working after their Parts Fail 

 The level of care given to an equipment item depends directly on the size of business risk if it fails. Reduce the chance of its failure and you increase its reliability. It becomes vital to know the full range of risks each part within an equipment item will suffer. The right asset management strategy starts by knowing which parts are at risk and it is completed when the necessary actions to prevent all failures are in use. Asset Management that does not identify all the risks to equipment parts when selecting equipment maintenance and operating strategies will get you doing the wrong work.

Keywords: maintenance strategy selection, failure prevention, asset management strategy

 How do you get high equipment reliability, low maintenance costs and world class production uptime? We know those outcomes are the result of applying the right practices across the asset life cycle and that you need to design your business systems to intentionally deliver them. If you want world class operational performance you cannot have it by accident or luck. To have highly reliable production plant, machines and equipment it is necessary to ensure that they do not fail, and are not failed, from their operating circumstances. 

Figure 1 shows a typical electric motor drive end bearing and housing assembly. From the drawing we can construct the flow chart of how it works. The diagram for the drive end bearing and housing is shown in Figure 1. The flow chart makes it clear that the motor is made of a chain of parts strung together. All machines are designed and built in the same way—parts put together in long series. This is an arrangement at high risk of ruin and it explains why high reliability is so difficult to get— break a part, any part along the chain, and the whole machine fails.

A drawing of an electric motor drive end bearing - illustrating that the motor is modeled by a series system.


 Figure 1 – Every Machine is a Long Chain of Parts 

Reliability engineering analysis tells us that the failure rate of machines is the sum of the failure rate of the parts. Figure 2 represents a 3-part machine and shows how the machine failure rate is the addition of its individual parts’ failure rate.

Drawing of the bathtub curve for the machine as a whole, and broken down by various components.
Figure 2 – A Machine’s Failure Fate is the Sum of Its Parts Failure Rates 

Figure 3 shows the three parts failure history and the effect it has on the machine downtime. First a part fails and then the machine stops. The secret to world class reliability success is in the last sentence—prevent a machine’s parts from failing and you will create a highly reliable machine. The machine failure rate has the naming convention ROCOF—Rate of Occurrence of Failure.

Plots of rate of occurrence of failure over time and the rate of failure for various parts
Figure 3 – A Machine Fails because its Parts First Fail 

When the working parts are not failed the machine continues to operate. When the working parts fail the machine is stopped. The message to take-away is simple and clear—if you want to create a reliable machine then stop its working parts from failing. A working part is a component, including lubricant, which must be present for the equipment to function properly and faithfully meet its duty. Most reasons machine parts fail are indicated for the ‘bathtub’ curve of Figure 4. Every time the ‘chain’ of parts is broken (any part, anyhow, anytime) a machine dies. Mostly parts fail because poor business processes do not prevent materials-of-construction over-stress and/or stop human error. Plant and equipment failures are no accident; your machines are sent to their deaths1.

Figure 4 – Parts are failed for only a Few Reasons, though each has Many Causes

Select Operating and Maintenance Strategy that Prevent the Risk of Parts Failure 

A highly reliable machine needs strategies and practices across its life-cycle that reduces the chance of its parts failing in operation. When you keep working parts in top condition you make machines reliable. This makes the aim of your operating and maintenance strategy twofold: 

1. First create the right conditions inside your machinery and equipment for their working parts to naturally have long, reliable and productive lives by removing all risks of over-stress. 

2. Secondly, proactively keep the conditions of working parts to within their ideal operating envelope by putting in-place the right reliability practices and preventing human error. 

Where these two requirements cannot be achieved failure and maintenance result. 

Intentionally create the exact situations that make the working parts in your machines live for a very long time. You will see a huge growth in production plant availability when you introduce the causes of reliability. Figure 5 shows the very few activities to focus on to stop equipment failures. You make machines highly reliable by using your business management systems, maintenance strategies and operating practices to deliver sure, failure-free operation. Teach your people exactly how to cause reliability, document it precisely and refresh that education regularly.

reliability strategies depend on the component failure over time pattern.
Figure 5 – We Know Exactly How to Control and Prevent Equipment Failure

The effect will be like that shown in Figure 6 where machine failure rates fall as fewer and fewer parts are put at risk of failure.

Figure 6 – Prevent Parts from Failing and You Create Equipment Reliability

Identifying Parts at Risk 

To identify the parts at risk of failure in your equipment you only need the parts drawings and Bill of Materials (BOM) like those shown in Figures 7 and 8.

Figure 7 – Exploded Parts Drawing

On the BOM parts list you identify the working parts that can fail and the types of failure causes that they can suffer—error, induced stress, and wear-and-tear. Once you know the risks and where they will come from you use your business management systems to prevent them from happening.

Figure 8 – Bill of Materials Identifying Failure Types for Working Parts

Focus on Prevention Instead of Cure 

The end we want is to achieve a very particular outcome for our plant, machines and equipment—long, failure-free operating lives. Using predictive strategies like reliability modelling, condition monitoring and preventive maintenance are necessary but poor ways to care for machines and equipment because they permit their failure. They are not failure prevention strategies. They are maintenance and repair strategies that need failures to exist for them to be used and their use invites countless outages to replace failing parts and to fix frequent breakdowns. You will achieve far greater reliability more quickly by focusing your effort, time and money on preventing the risks that cause equipment failure than by looking for failures starting and then correcting the problems. Figure 9 shows the defect prevention strategy to adopt for long-lived, highly reliable equipment—remove the causes of failure by providing the ideal conditions for outstanding parts reliability.

Figure 9 – Prevent Machine Failures by Preventing the Risk of Failures for Working Parts

Conclusion 

Machines intentionally need be treated in the right ways that produce failure-free operation. It requires designing life-cycle business processes that remove the risks of failure to the ‘chain-of-parts’ in your machines by making sure every working components in your plant, machinery and equipment are actively given the necessary conditions for low-stress, error-free operation. Identify the design, operational and maintenance activities that produce high parts reliability and then build business process to deliver them successfully to your operating plant and equipment. Work on removing the risks to the ‘chain-of-parts’ in your machines so they cannot break. 

My best regards to you, 

Mike Sondalini 

Filed Under: Articles, Maintenance Management, on Maintenance Reliability

About Mike Sondalini

In engineering and maintenance since 1974, Mike’s career extends across original equipment manufacturing, beverage processing and packaging, steel fabrication, chemical processing and manufacturing, quality management, project management, enterprise asset management, plant and equipment maintenance, and maintenance training. His specialty is helping companies build highly effective operational risk management processes, develop enterprise asset management systems for ultra-high reliable assets, and instil the precision maintenance skills needed for world class equipment reliability.

« Communication Skills Really Matter
3 MTBF Stories »

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Headshot of Mike SondaliniArticles by Mike Sondalini
in the Maintenance Management article series

Join Accendo

Receive information and updates about articles and many other resources offered by Accendo Reliability by becoming a member.

It’s free and only takes a minute.

Join Today

Recent Posts

  • How to Get from Idea to Scale
  • Two Ways to Think and Talk about Reliability
  • Reliability Allocation
  • Good Project Management Leadership Will Reduce Project Risk
  • Using Condition Monitoring as a Reliability Improvement Strategy

© 2025 FMS Reliability · Privacy Policy · Terms of Service · Cookies Policy

Book the Course with John
  Ask a question or send along a comment. Please login to view and use the contact form.
This site uses cookies to give you a better experience, analyze site traffic, and gain insight to products or offers that may interest you. By continuing, you consent to the use of cookies. Learn how we use cookies, how they work, and how to set your browser preferences by reading our Cookies Policy.