Accendo Reliability

Your Reliability Engineering Professional Development Site


by Ray Harkins

Sturges’ Rule: A Method for Selecting the Number of Bins in a Histogram


If you’ve worked around the fields of reliability or quality for any length of time, you’ve certainly encountered and have likely assembled the humble yet mighty histogram. This specialized bar graph is one of the most common starting points for analyzing continuous data. It not only portrays the frequency of numerical data across its range of values, but also provides hints at the data’s underlying probability distribution.

While histograms like the one shown above can be automatically generated by statistical programs like Minitab, it’s not uncommon for analysts to build them “from scratch” using spreadsheet programs.
One of the more subjective aspects of building a histogram is selecting the number of bins, as there is no single right answer. Too few bins “over-smooth” the data, potentially masking those hints at the underlying distribution. Too many bins make the data look choppy and discontinuous.
In 1926, statistician Herbert Sturges proposed a method (now called Sturges’ Rule) for choosing a number of bins in a histogram that minimizes the potential for these pitfalls. His formula is simple:
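k = 1 + 3.322 log10(n)
Where:
k = the number of bins
n = the number of observations in the data set
log10 = the base-10 logarithm (the formula is equivalent to k = 1 + log2(n), since 3.322 ≈ 1 / log10(2)).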
Applying Sturges’ Rule to some common sample sizes, we obtain the following number of bins:
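As a rough illustration, the formula can be evaluated with a few lines of Python. This is a minimal sketch: the function name `sturges_bins` and the sample sizes listed are illustrative assumptions, not necessarily those used in the original table, and rounding the result up to the next whole number is a common convention assumed here rather than something the formula itself specifies.

```python
import math


def sturges_bins(n: int) -> int:
    """Bins suggested by Sturges' Rule, k = 1 + 3.322 * log10(n).

    Rounding up to a whole number is an assumption of this sketch.
    """
    if n < 1:
        raise ValueError("n must be a positive integer")
    return math.ceil(1 + 3.322 * math.log10(n))


# Illustrative sample sizes (assumed, not taken from the article's table)
for n in [30, 50, 100, 200, 500, 1000]:
    print(f"n = {n:>4}  ->  k = {sturges_bins(n)}")
```

Because k grows with the logarithm of n, the bin count rises slowly: a dataset more than thirty times larger (30 versus 1000 observations) gains only a handful of bins.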

Most data visualization practitioners agree that Sturges’ Rule provides the most attractive outcome where the data 1) are not heavily skewed, and 2) contain between 30 and 200 observations. Applying Sturges’ Rule to datasets with more than 200 points may again lead to over-smoothing, but it remains a good starting point.
Less popular, but not necessarily less effective, means of selecting the number of bins for your histogram include the Square-Root Choice, Rice’s Rule, Doane’s Formula, Scott’s Normal Reference Rule, and the Freedman–Diaconis Choice.
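For comparison, the alternatives named above can be sketched in plain Python using their commonly cited textbook forms. The function name `bin_count_rules` is hypothetical, rounding every result up is an assumption, and the data must contain at least three observations with some spread (a zero standard deviation or interquartile range would cause a division by zero).

```python
import math
import statistics


def bin_count_rules(data):
    """Suggested bin counts from several common rules (illustrative sketch)."""
    n = len(data)
    if n < 3:
        raise ValueError("need at least three observations")
    data_range = max(data) - min(data)
    cube_root_n = n ** (1 / 3)

    # Sample skewness g1, used by Doane's formula
    mean = statistics.fmean(data)
    m2 = sum((x - mean) ** 2 for x in data) / n
    m3 = sum((x - mean) ** 3 for x in data) / n
    g1 = m3 / m2 ** 1.5
    sigma_g1 = math.sqrt(6 * (n - 2) / ((n + 1) * (n + 3)))

    # Scott and Freedman-Diaconis give a bin width, not a bin count
    scott_width = 3.49 * statistics.stdev(data) / cube_root_n
    q1, _, q3 = statistics.quantiles(data, n=4)
    fd_width = 2 * (q3 - q1) / cube_root_n

    return {
        "sturges": math.ceil(1 + 3.322 * math.log10(n)),
        "square_root": math.ceil(math.sqrt(n)),
        "rice": math.ceil(2 * cube_root_n),
        "doane": math.ceil(1 + math.log2(n) + math.log2(1 + abs(g1) / sigma_g1)),
        "scott": math.ceil(data_range / scott_width),
        "freedman_diaconis": math.ceil(data_range / fd_width),
    }


# Made-up example data: the integers 1 through 100
for rule, k in bin_count_rules(list(range(1, 101))).items():
    print(f"{rule:>18}: {k}")
```

Scott's rule and the Freedman–Diaconis choice respond to the spread of the data rather than the sample size alone, which is why they are converted from a bin width to a bin count here.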
Once you’ve selected the number of bins, calculate the minimum bin width using the following formula:
Min Bin Width = (Max Observed Value – Min Observed Value) / k
It’s common practice then to round the Min Bin Width up to a convenient decimal to make the increments along the x-axis a little more readable.
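The width calculation and the rounding step can be sketched as follows. The function name `bin_width_and_edges`, the `decimals` parameter, and the example values are all hypothetical; starting the first bin at the minimum observed value is also an assumption of this sketch.

```python
import math


def bin_width_and_edges(data, k, decimals=1):
    """Round the minimum bin width up and return (width, bin edges)."""
    low, high = min(data), max(data)
    min_width = (high - low) / k

    # Round UP to the chosen number of decimals; the inner round() guards
    # against floating-point noise such as 11.000000000000002.
    scale = 10 ** decimals
    width = math.ceil(round(min_width * scale, 9)) / scale

    # k bins need k + 1 edges, starting at the minimum observed value
    edges = [round(low + i * width, 10) for i in range(k + 1)]
    return width, edges


# Made-up example: four observations split into k = 4 bins
width, edges = bin_width_and_edges([2.0, 3.5, 9.7, 12.0], k=4, decimals=0)
print(width)   # 3.0
print(edges)   # [2.0, 5.0, 8.0, 11.0, 14.0]
```

Because the width is rounded up rather than to the nearest value, the k bins always span at least the full observed range, so the maximum observation never falls outside the last bin.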
To learn more about quality engineering statistics, including histograms, control charts, and the normal probability distribution, sign up for the online short course titled “Process Capability Analysis”.


Comments

  1. Mark Fiedeldey says

    August 16, 2020 at 4:57 AM

    Ray,
    Using Kernel Density Estimating (KDE) techniques is a good way to guide the histogram development.
    Mark

  2. Mark Fiedeldey says

    August 16, 2020 at 5:14 AM

    Ray,
    I should have added that Jaroslav Stanek has a real nice YouTube video on a justification for KDE.

    https://www.youtube.com/watch?v=QR7mHqn14fk

    Mark


© 2023 FMS Reliability · Privacy Policy · Terms of Service · Cookies Policy
