167 - Quality Methods - Topic 2

studied byStudied by 0 people
0.0(0)
learn
LearnA personalized and smart learning plan
exam
Practice TestTake a test on your terms and definitions
spaced repetition
Spaced RepetitionScientifically backed study method
heart puzzle
Matching GameHow quick can you match all your cards?
flashcards
FlashcardsStudy terms and definitions

1 / 41

encourage image

There's no tags or description

Looks like no one added any tags here yet for you.

42 Terms

1

What is a descriptive statistic?

Describes the characteristics of a product or process using information collected on it.

New cards
2

What are inferential statistics (inductive)?

Draws conclusions on unknown process parameters based on information contained in a sample.

Uses probability

New cards
3

What is attribute data?

Discrete data. Data values can only be integers. Counted data or attribute data.

New cards
4

What is variable data?

Continuous data. Data values can be any real number. Measured data.

New cards
5

What is precision?

The precision of a measurement is determined by how reproducible that measurement value is.

(For example if a sample is weighed by a student to be 42.58 g, and then measured by another student five different times with the resulting data: 42.09 g, 42.15 g, 42.1 g, 42.16 g, 42.12 g … )

then the original measurement is not very precise since it cannot be reproduced.

New cards
6

What is accuracy?

The accuracy of a measurement is determined by how close a measured value is to its “true” value.

(For example, if a sample is known to weigh 3.182 g, then weighed five different times by a student with the resulting data: 3.200 g, 3.180 g, 3.152 g, 3.168 g, 3.189 g)

The most accurate measurement would be 3.180 g, because it is closest to the true “weight” of the sample.

New cards
7

Six sigma application

knowt flashcard image
New cards
8

What three types of frequency distribution data are there?

Categorical, ungrouped and grouped

New cards
9

What is categorical frequency distributions?

Data that can be placed in specific categories, such as nominal- or ordinal-level data.

Examples - political affiliation, blood type, etc.

<p>Data that can be placed in specific categories, such as nominal- or ordinal-level data. </p><p>Examples - political affiliation, blood type, etc.</p>
New cards
10

What are ungrouped frequency distributions?

It can be used for data that can be enumerated and when the range of values in the data set is not large.

Examples - number of miles your instructors have to travel from home to campus, number of girls in a 4-child family etc.

<p>It can be used for data that can be enumerated and when the range of values in the data set is not large.</p><p>Examples - number of miles your instructors have to travel from home to campus, number of girls in a 4-child family etc.</p>
New cards
11

What are grouped frequency distributions?

Can be used when the range of values in the data set is very large. The data must be grouped into classes that are more than one unit in width.

Examples - the life of batteries in hours.

<p>Can be used when the range of values in the data set is very large. The data must be grouped into classes that are more than one unit in width. </p><p>Examples - the life of batteries in hours.</p>
New cards
12

What is a Relative Cumulative Frequency Histogram?

Helps us to observe (and hence find out) the number of data observations that lie below a particular range of data sets.

It also helps us to observe and understand how the values within a particular data set change. It also keeps us updated with the total frequency of everything.

<p>Helps us to observe (and hence find out) the number of data observations that lie below a particular range of data sets. </p><p>It also helps us to observe and understand how the values within a particular data set change. It also keeps us updated with the total frequency of everything.</p>
New cards
13

How do you construct a histogram?

Step 1: Find range of distribution, largest - smallest values

Step 2: Choose number of classes, 5 to 20

Step 3: Determine width of classes, one decimal place more than the data, class width = range/number of classes

Step 4: Determine class boundaries

Step 5: Draw frequency histogram

(If no. of observations < 100 – 5 to 9 cells Between 100-500 – 8 to 17 cells Greater than 500 – 15 to 20 cells)

New cards
14

What are the characteristics of different frequency distribution graphs?

knowt flashcard image
New cards
15

What is the different between location, spread and shape?

knowt flashcard image
New cards
16

Draw and label three graphs: symmetrical, positively skewed and negatively skewed

knowt flashcard image
New cards
17

What are the measures of central tendency?

A measure of central tendency of a distribution is a numerical value that describes the central position of the data or how data tend to build up in the centre

The three measures in common use are the: Average Median Mode

New cards
18

What are the three different techniques for calculating the average?

Ungrouped data, grouped data and weighted average

New cards
19

How do you calculate the average of ungrouped data?

knowt flashcard image
New cards
20

How do you calculate the average of grouped data?

knowt flashcard image
New cards
21

How do you calculate the weighted average of grouped data?

knowt flashcard image
New cards
22

How do you calculate the median of grouped data?

knowt flashcard image
New cards
23

What are the three measures of dispersion?

Range, standard deviation and variance

New cards
24

How do you calculate the sample standard deviation for ungrouped data?

This is the ungrouped technique and alternative formulas can be used (however are typically more complex

<p>This is the ungrouped technique and alternative formulas can be used (however are typically more complex</p>
New cards
25

How do you calculate the sample standard deviation for grouped data?

knowt flashcard image
New cards
26

In terms of quality, is a smaller or larger standard deviation better?

The smaller the value of the SD, the better the quality, because the distribution is compacted around the central value

New cards
27

What are the other three measures that are frequently used to analyse a collection of data?

Skewness, Kurtosis and the coefficient of variation

New cards
28

What is Skewness? And how do you calculate it?

Skewness a(3) is the lack of symmetry of the data.

Skewness is a number whose size tells us the extent of departure from symmetry

  • if the value of a3 is 0, the data are symmetrical

  • >0 = positive (skewed to the right; i.e. the long tail is on the right hand side)

  • <0 = negative (skewed to the left; i.e. the long tail is on the left hand side)

  • Values of +1 or -1 imply a strongly unsymmetrical distribution

<p>Skewness a(3) is the lack of symmetry of the data. </p><p>Skewness is a number whose size tells us the extent of departure from symmetry </p><ul><li><p>if the value of a3 is 0, the data are symmetrical </p></li><li><p> &gt;0 = positive (skewed to the right; i.e. the long tail is on the right hand side) </p></li><li><p>&lt;0 = negative (skewed to the left; i.e. the long tail is on the left hand side) </p></li><li><p>Values of +1 or -1 imply a strongly unsymmetrical distribution</p></li></ul>
New cards
29

What is Kurtosis? And how do you calculate it?

Kurtosis provides information regarding the shape of the population distribution (the ‘peakedness’ or ‘heaviness’ of the tails of a distribution).

<p>Kurtosis provides information regarding the shape of the population distribution (the ‘peakedness’ or ‘heaviness’ of the tails of a distribution).</p>
New cards
30

What is the coefficient of variation? And how do you calculate it?

Correlation variation (CV) is a measure of how much variation exists in relation to the mean.

Having the standard deviation alone is not particularly useful as it requires a context - the coefficient of variation provides a reference for this.

<p>Correlation variation (CV) is a measure of how much variation exists in relation to the mean.</p><p>Having the standard deviation alone is not particularly useful as it requires a context - the coefficient of variation provides a reference for this.</p>
New cards
31

What does it mean to have a small value for the coefficient of variation?

The smaller the value, the smaller is the amount of variation relative to the mean

New cards
32

What is a population/universe?

Set of all items that possess a characteristic of interest

New cards
33

What is a sample?

A subset of a population

New cards
34

What is the components that make up a statistic vs a comparison

Parameter is a characteristic of a population, i.e. it describes a population

• Example: average weight of the population, 50,000 cans made in a month.

Statistic is a characteristic of a sample, used to make inferences on the population parameters that are typically unknown, called an estimator

• Example: average weight of a sample of 500 cans from that month’s output, an estimate of the average weight of the 50,000 cans

<p>Parameter is a characteristic of a population, i.e. it describes a population </p><p>• Example: average weight of the population, 50,000 cans made in a month.</p><p>Statistic is a characteristic of a sample, used to make inferences on the population parameters that are typically unknown, called an estimator </p><p>• Example: average weight of a sample of 500 cans from that month’s output, an estimate of the average weight of the 50,000 cans</p>
New cards
35

What are the characteristics of a normal curve?

It is symmetrical -- Half the cases are to one side of the centre; the other half is on the other side.

  • The distribution is single peaked, not bimodal or multi-modal

  • Also known as the Gaussian distribution

  • Mean is best measure of central tendency

  • Most of the cases will fall in the centre portion of the curve and as values of the variable become more extreme they become less frequent, with "outliers" at the "tail" of the distribution few in number. It is one of many frequency distributions.

New cards
36

What is the standard normal distribution and how do you calculate it?

knowt flashcard image
New cards
37

What area/percentage is covered by one to 3 standard deviations?

knowt flashcard image
New cards
38

How do you test for normality?

By using Skewness (a3 ) and Kurtosis (a4 )

Skewed to the left or to the right (a3=0 for a normal distribution)

  • The data are peaked as the normal distribution (a4=3 for a normal distribution)

  • The larger the sample size, the better the judgment of normality (sample size of 100 is recommended).

New cards
39

What can a scatter diagram show?

Its a way to determine if a cause and effect relationship exists between to variables.

Supplies the data to confirm a hypothesis that two variables are related

• Provides both a visual and statistical means to test the strength of a relationship

• Provides a good follow-up to cause and effect diagrams

• Scatter diagram pattern

<p>Its a way to determine if a cause and effect relationship exists between to variables.</p><p>Supplies the data to confirm a hypothesis that two variables are related </p><p>• Provides both a visual and statistical means to test the strength of a relationship </p><p>• Provides a good follow-up to cause and effect diagrams </p><p>• Scatter diagram pattern</p>
New cards
40

What are the 6 scatter diagram patterns?

knowt flashcard image
New cards
41

What is the significance of the coefficient of correlation?

A useful statistic - coefficient of correlation (r)

  • Describes the goodness of fit of the linear model

  • It is a dimensionless number (r) that lies between +1 and -1

  • The + and – signs tell whether the there is a negative correlation or a positive correlation

  • The closer the value is to 1 the better the fit.

  • If the r value = 1, then all points fall on the line

New cards
42

How do you calculate the coefficient of correlation?

knowt flashcard image
New cards
robot