
R - Quality Control Individual Range Chart Made Nice.

In R we have the qcc package, but its charts are not very nice, especially if you want to embed your chart in an HTML file.

Here I describe the process of creating the chart, starting with the qcc package and ending with our own calculations and a nice dygraphs chart.

You can skip all the commentary by going directly to my github.com repository:

https://github.com/LaranIkal/R-ANALYTICS

Note: due to GitHub restrictions on HTML file sizes, the HTML file needs to be downloaded before you can open it.

If you want to continue here, below is the R code and the outputs I copied from the HTML file ( QualityControl_IndividualRangeChart.html ) generated from the R Markdown file ( QualityControl_IndividualRangeChart.Rmd ) on my github.com repository:
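If you do not have the packages installed yet, a quick setup sketch (not part of the original Rmd) would be:

# Install the two packages used below, only if they are missing.
if( !requireNamespace( "qcc", quietly = TRUE ) ) install.packages( "qcc" )
if( !requireNamespace( "dygraphs", quietly = TRUE ) ) install.packages( "dygraphs" )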

# Loading needed libraries
# R quality control library
suppressWarnings( suppressMessages( library( qcc ) ) )
# One of the nice R charting libraries
suppressWarnings( suppressMessages( library( dygraphs ) ) )

measurements = c( -0.001, -0.011, .2, 0.001, -0.018, -0.019, -0.019, -0.012, -0.016, -.2 )

# Using the qcc library to display the Individual Range Chart.
iRangeChart = qcc( measurements, type = "xbar.one", add.stats = TRUE, plot = TRUE )

This is the very basic chart created using the qcc package:



# Getting qcc library values: 
qccCenterLimit = round( iRangeChart$center, 8 )
qccStandardDeviation = round( iRangeChart$std.dev, 8 )
qccUpperControlLimit = round( iRangeChart$limits[1,2], 8 )
qccLowerControlLimit = round( iRangeChart$limits[1,1], 8 )
qccBeyondLimits = iRangeChart$violations$beyond.limits # Positions in our data to be drawn in red in the chart.


###############################################################################
#                         Getting values manually.                            #
###############################################################################

# mean = average = sum( measurements[1..n] ) / n
ourCenterLimit = mean( measurements ) # CL

# By the formula:
# MovingRanges = absoluteValue( measurements[i+1] - measurements[i] )
# i = Measurement number
measurementsMovingRanges = abs( measurements[1:(length(measurements)-1)] -
                                  measurements[2:length(measurements)] )

movingRangeCenterLimit = mean( measurementsMovingRanges )

# According to the formula:
# myStandardDeviation = MovingRangeMean / d2, where d2 = 1.128 for Individual Range Chart
ourStandardDeviation = movingRangeCenterLimit / 1.128

# According to the formula:
#  UCL = SamplesMean + ( 3 * ( MovingRangeMean / d2 ) )
ourUpperControlLimit = ourCenterLimit + ( 3 * ( movingRangeCenterLimit / 1.128 ) ) # UCL

# According to the formula:
#  LCL = SamplesMean - ( 3 * ( MovingRangeMean / d2 ) )
ourLowerControlLimit = ourCenterLimit - ( 3 * ( movingRangeCenterLimit / 1.128 ) ) # LCL

# Getting the out-of-limit positions in our data.
# sort() keeps the indices in ascending data order, matching the order returned by qcc.
ourBeyondLimits = sort( c( which( measurements > ourUpperControlLimit )
                         , which( measurements < ourLowerControlLimit ) ) )


# To prove our calculations match the qcc values, we do these comparisons.

if( round( ourCenterLimit, 8 ) == qccCenterLimit ) {
  print( "Center Limit is The Same." )
} else cat( "Center Limit is DIFFERENT." )

> [1] "Center Limit is The Same."
if( round( ourStandardDeviation, 8 ) == qccStandardDeviation ) {
  cat( "Standard Deviation is The Same." )
} else cat( "Standard Deviation is DIFFERENT." )
> Standard Deviation is The Same.
if( round( ourUpperControlLimit, 8 ) == qccUpperControlLimit ) {
  cat( "Upper Control Limit is The Same." )
} else cat( "Upper Control Limit is DIFFERENT." )
> Upper Control Limit is The Same.
if( round( ourLowerControlLimit, 8 ) == qccLowerControlLimit ) {
  cat( "Lower Control Limit is The Same." )
} else cat( "Lower Control Limit is DIFFERENT." )
> Lower Control Limit is The Same.
if( identical( ourBeyondLimits, qccBeyondLimits ) ) {
  cat( "Beyond Limit Points Are The Same." )
} else cat( "Beyond Limit Points Are DIFFERENT." )
> Beyond Limit Points Are The Same.
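As a side note, the same checks can be written more compactly with all.equal(), which tolerates tiny floating point differences instead of rounding to 8 decimals. A small sketch using the objects created above:

# Compact comparison of our values against the qcc values.
all.equal( ourCenterLimit, iRangeChart$center )
all.equal( ourStandardDeviation, iRangeChart$std.dev )
all.equal( c( ourLowerControlLimit, ourUpperControlLimit ),
           as.numeric( iRangeChart$limits ) )   # LCL and UCL at once
identical( ourBeyondLimits, qccBeyondLimits )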
###############################################################################
#                 Creating a nice chart using dygraphs.                       #
###############################################################################

# Building the data frame in the shape dygraphs needs; note BeyondLimits is initialized to NA
measurementsData = data.frame( Sequence = seq( 1, length( measurements ), 1 )
                               , Values = measurements, BeyondLimits = NA )

# Now copying the measurement values of the beyond-limit points into the BeyondLimits column
measurementsData$BeyondLimits[ourBeyondLimits] = measurementsData$Values[ourBeyondLimits]

# You can also set specification limits: Lower Spec Limit ( LSL ) and Upper Spec Limit ( USL )
ourLSL = -0.1
ourUSL = 0.1

dygraph( measurementsData, main = NULL
         , xlab = "Sequence Number", ylab = "Value" ) %>%
  dySeries( name = "Values", label = "Normal", drawPoints = TRUE, pointShape = "dot"
            , color = "blue", pointSize = 2 ) %>%
  dySeries( name = "BeyondLimits", label = "Beyond Lims", drawPoints = TRUE, pointShape = "dot"
            , color = "red", pointSize = 3 ) %>%
  dyLimit( ourUpperControlLimit, color = "black"
           , label = "UCL", labelLoc = "left" ) %>% 
  dyLimit( ourCenterLimit, color = "black"
           , label = "CL", labelLoc = "left" ) %>% 
  dyLimit( ourLowerControlLimit, color = "black"
          , label = "LCL", labelLoc = "left" ) %>% 
  dyLimit( ourUSL, color = "blue", label = "USL", labelLoc = "left" ) %>% 
  dyLimit( ourLSL, color = "blue", label = "LSL", labelLoc = "left" ) %>% 
  dyRangeSelector()

NOTE: The chart below is only a static image so you can see how it looks. If you want the interactive chart, go to my github.com repository and download the HTML file, or download the Rmd source file and run the R code.




Summary statistics from the chart:
Number of measurements = 10
Center = -0.0095   LCL = -0.197148   Number beyond limits = 2
StdDev = 0.062549   UCL = 0.178148   Number violating runs = NA
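If you want to keep the interactive dygraphs chart as a standalone HTML file, like the one on my repository, a minimal sketch using htmlwidgets::saveWidget; the variable name niceChart and the output file name are just examples, and for brevity only a simplified chart is assigned here (in practice, assign the full dygraph pipeline above):

library( htmlwidgets )
# Simplified chart; in practice assign the full dygraph( ... ) %>% ... %>% dyRangeSelector() pipeline above.
niceChart = dygraph( measurementsData, xlab = "Sequence Number", ylab = "Value" ) %>%
  dyRangeSelector()
# selfcontained = TRUE embeds the JavaScript/CSS so the HTML file can be opened on its own.
saveWidget( niceChart, file = "IndividualRangeChart.html", selfcontained = TRUE )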


Some Theory.

d2 is a value taken from the table of control chart constants; for Individual Range Chart calculations (moving ranges of size 2) it is 1.128. The number 3 is the typical multiplier used in statistical control charts. These values are the same ones used by the qcc R package.
This is a good site for information about the constant values: https://andrewmilivojevich.com/xbar-and-r-chart/
An interesting link on how to interpret this qcc chart: https://www.spcforexcel.com/knowledge/control-charts-basics/interpreting-control-charts
Note: I am not affiliated with any of the above sites; I link them only as additional information.
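If you want to check where d2 = 1.128 comes from, it is the expected range of a subgroup of 2 standard normal values, which is 2 / sqrt( pi ) ≈ 1.1284. A quick simulation sketch, not part of the original analysis:

set.seed( 1 )
# Average range of many simulated subgroups of size 2 from a standard normal distribution.
simulatedRanges = replicate( 100000, diff( range( rnorm( 2 ) ) ) )
mean( simulatedRanges )   # Approximately 1.128
2 / sqrt( pi )            # Theoretical value: 1.128379...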

Violating Runs

In theory, violating runs are points that are out of control, or at least points you should be careful with.
The qcc package has an option called run.length that controls how violating runs are calculated. Just set the value before calling qcc by running the qcc.options function, for example (see also the sketch below):
qcc.options( run.length = 5 ) # In my tests, 7 is the default value in the qcc package
You can look at the qcc sources on GitHub to see how violating runs are calculated.
In my case, I did not need to go further with the violating runs calculation, which is why they are not included in my calculations above.
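For completeness, a small sketch of how you could look at violating runs with qcc; iRangeChart2 is just a new variable name, and the violating.runs field is the counterpart of the beyond.limits field used earlier (I have not verified this field on every qcc version):

qcc.options( "run.length" )   # Check the current value ( 7 in my tests ).
qcc.options( run.length = 5 ) # Use 5 as an example value.
iRangeChart2 = qcc( measurements, type = "xbar.one", plot = FALSE )
iRangeChart2$violations$violating.runs # Positions flagged as violating runs, if any.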
Note: I am not a specialist in quality control; I am just showing how I calculated the values in R and how to create a nice chart.



Enjoy it!

Carlos Kassab
https://www.linkedin.com/in/carlos-kassab-48b40743/

More information about R:
https://www.r-bloggers.com


