JR

Collecting and Processing Information

This note explains why we need data and how to collect and process it.


  • Why do we need data?
    • Data is used throughout the Engineering Design Process to optimize designs and improve efficiency during production
    • Statistics refers to the collection and organization of data (science) as well as the analysis and presentation of that data (mathematics)
    • Statistics are often calculated with the help of computers and software such as Microsoft Excel
    • Such tools allow users to easily access, collect, organize, maintain, manipulate, and interpret data, which helps to improve efficiency

Basic Statistics Vocabulary:

  • Mean - the average of a given data set

  • Median - the middle number in a given ordered set

  • Mode - the most frequently occurring number in a given data set

  • Standard Deviation - how much variation exists from the average (mean) in a given data set

  • Range - the spread of the data set, measured as the difference between the largest and smallest values

  • Tolerance - the amount a characteristic (product, part, dimension, etc.) can vary without compromising (affecting) the overall function or design of the product

  • Nominal size - the size used in the general description of a part or product

  • Basic size - the nominal size converted from a fraction to a decimal, from which deviations are measured

  • Upper Specification Limit - the highest acceptable deviation or value for a characteristic

  • Lower Specification Limit - the lowest acceptable deviation or value for a characteristic

  • Mean - the average of a given data set:

    • x = each value in the data set
    • ∑ = the sum of a mathematical operation
    • n = the total number of values in the data set
    • Equation for mean = ∑x / n
    • Add the numbers in the data set first, then divide the sum by the number of values
  • Median - the middle number in a given ordered data set

    • If the given data set has an even number of values, the median is the average of the two middle values
    • Add the two numbers together, then divide the sum by 2
  • Mode - the most frequently occurring number in a given data set
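
The mean, median, and mode rules above can be sketched in Python. This is a minimal illustration rather than a definitive implementation: the function names and the sample values are assumptions made for the example, and the mode function returns every value tied for the highest count.

```python
from collections import Counter

def mean(data):
    # Add the values, then divide the sum by the number of values.
    return sum(data) / len(data)

def median(data):
    # Order the data first; the median is the middle value.
    ordered = sorted(data)
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        return ordered[mid]
    # Even count: average the two middle values.
    return (ordered[mid - 1] + ordered[mid]) / 2

def mode(data):
    # Return every value that shares the highest count (there may be ties).
    counts = Counter(data)
    highest = max(counts.values())
    return [value for value, count in counts.items() if count == highest]

sample = [2, 4, 4, 5, 7, 8]  # hypothetical measurements
print(mean(sample))    # 5.0
print(median(sample))  # 4.5
print(mode(sample))    # [4]
```

Python's standard `statistics` module provides `mean`, `median`, and `multimode` functions that could be used instead of the hand-written versions.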

  • Standard Deviation - Standard Deviation (SD) measures how much the individual data values vary from the mean. It is expressed in the same units as the data.

    • The smallest value that the standard deviation can be is 0. Standard deviation cannot be negative.
    • Standard Deviation Steps Explained:
      • Find the mean by adding all the numbers in the data set and dividing by the number of values in the data set
      • Subtract the mean from each value
      • Square the differences
      • Add the squared differences and divide by n - 1 to find the variance
      • Take the square root of the variance to find the standard deviation
    • Calculating Standard Deviation
      • Equation for Standard Deviation:

        $$\text{SD} = \sqrt{\frac{\sum (x_i - \mu)^2}{n - 1}}$$

    • xi = each individual data value
    • μ = the mean of the data set
    • ∑ = the sum of a mathematical operation
    • n = the total number of values in the data set
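
The standard deviation steps can be followed one at a time in a short Python sketch. It assumes the n - 1 divisor shown in the equation; the function name and the sample values are hypothetical and chosen only for illustration.

```python
import math

def sample_standard_deviation(data):
    n = len(data)
    if n < 2:
        raise ValueError("need at least two values to divide by n - 1")
    # Step 1: find the mean.
    mu = sum(data) / n
    # Steps 2 and 3: subtract the mean from each value and square the differences.
    squared_diffs = [(x - mu) ** 2 for x in data]
    # Step 4: divide the sum of squared differences by n - 1 to get the variance.
    variance = sum(squared_diffs) / (n - 1)
    # Step 5: the standard deviation is the square root of the variance.
    return math.sqrt(variance)

sample = [2, 4, 4, 4, 5, 5, 7, 9]  # hypothetical measurements
print(round(sample_standard_deviation(sample), 3))  # 2.138
```

A data set whose values are all identical gives a result of 0, the smallest possible standard deviation.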
  • Range - the spread of the data set, measured as the difference between the largest and smallest values in the data set
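
As a quick illustration, the range can be computed by subtracting the smallest value from the largest. The function name and sample values below are assumptions made for the example.

```python
def data_range(data):
    # Range = largest value minus smallest value.
    return max(data) - min(data)

sample = [2, 4, 4, 5, 7, 8]  # hypothetical measurements
print(data_range(sample))  # 6
```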
  • Engineering tolerance is the amount a characteristic can vary without compromising the overall function or design of the product.

  • Tolerances generally apply to the following:

    • Physical dimensions (part and/or fastener)
    • Physical properties (materials, services, systems)
    • Calculated values (temperature, packaging)
  • Tolerance notation follows American National Standards Institute (ANSI) standards.

    • Examples:
    • Bilateral Tolerance (1.125 ± 0.025)
    • Unilateral Tolerance (2.575)
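
To connect tolerances with the upper and lower specification limits, here is a minimal Python sketch. It assumes the bilateral example means a basic size of 1.125 with 0.025 allowed in each direction; the function names and the measured values are hypothetical. `Decimal` is used so that the limits are exact decimal values rather than rounded binary floats.

```python
from decimal import Decimal

def specification_limits(basic_size, plus, minus):
    # Upper limit = basic size + allowed plus deviation.
    # Lower limit = basic size - allowed minus deviation.
    upper = basic_size + plus
    lower = basic_size - minus
    return lower, upper

def within_tolerance(measured, lower, upper):
    # A measurement is acceptable when it lies between the limits, inclusive.
    return lower <= measured <= upper

# Assumed reading of the bilateral example: 1.125 plus or minus 0.025.
lower, upper = specification_limits(Decimal("1.125"), Decimal("0.025"), Decimal("0.025"))
print(lower, upper)  # 1.100 1.150
print(within_tolerance(Decimal("1.140"), lower, upper))  # True
print(within_tolerance(Decimal("1.160"), lower, upper))  # False
```

A unilateral tolerance could be modeled with the same functions by setting either the plus or the minus deviation to zero.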