Stat 1000 - Class 1,2 (Unit 1)

0.0(0)
studied byStudied by 1 person
full-widthCall with Kai
GameKnowt Play
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/48

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

49 Terms

1
New cards

Statistics

Set of methods for obtaining, organizing, summarizing, presenting and analyzing data

2
New cards

Data

Comes from characteristics measured on individuals, or units

3
New cards

Individuals/ Units

Nearly anything: people, animals, places, things, etc

4
New cards

Observations

collected data values

5
New cards

Population

Totality of individuals about which we want information

6
New cards

Sample

Subset of the individuals in a population that we actually examine in order to gather information

7
New cards

Good sample

Representative of the populations

8
New cards

Identifying the population that a sample represents

replace the sample size with “all”

9
New cards

Variable

characteristic or property of an individual.

10
New cards

Examples of possible variables

Lifespan of a light bulb, The number of heads in five tosses of a quarter, Hair colour

11
New cards

Classifications of data

categorical and quantitative

12
New cards

Categorical data

  • values of categorical/qualitative variables.

  • These are variables that place individuals into one of several groups categories.

13
New cards

Categorical variables (examples)

  • Eye colour

  • Favourite singer

  • Reason for taking STAT 1000

14
New cards

Categorical and ordinal

meaningful, logical ordering to the values of a categorical variable.

15
New cards

Categorical and nominal

not a meaningful, logical ordering to the values of a categorical variable

16
New cards

Quantitative data

Represents quantitative variables

17
New cards

Quantitative variables are

Take numerical values for which arithmetic operations (such as subtracting, averaging, etc.) make sense (i.e. their results are meaningful).

18
New cards

Quantitative variables (examples)

  • Height

  • Volume of air in a balloon

  • Exam score

  • Time

19
New cards

Data distribution tells us:

What values a variable takes, and How often it takes these values

20
New cards

Bar Charts

Display variable values on one axis, and frequencies on the other.

  • Bars don’t touch (not continuous)

<p><span>Display variable values on one axis, and frequencies on the other.</span></p><ul><li><p>Bars don’t touch (not continuous)</p></li></ul><p></p>
21
New cards
<p>Pie charts</p>

Pie charts

visual representation of the relative frequency/proportion of the observed values for a categorical variable

22
New cards

Frequency distribution

count of how many of our data values fall into various predetermined classes or intervals

23
New cards
<p>Frequency Distribution <strong>Example</strong></p>

Frequency Distribution Example

knowt flashcard image
24
New cards

Relative frequency distributions

Dividing the number of data values in each class by the total number of data values, we get the relative frequency, or proportion of individuals in each class

25
New cards

Proportions (relative frequency distributions)

Values between 0 and 1 that are decimal representations of fractions. You can convert proportions to percentages by multiplying by 100.

26
New cards
<p>Relative frequency distribution <strong>Example</strong></p>

Relative frequency distribution Example

knowt flashcard image
27
New cards

Frequency distribution (intervals)

  • choose them ourselves

28
New cards

Frequency distribution (interval rules)

  • Our first interval must include the lowest data value (called the minimum)

  • Our last interval must contain the highest data value (called the maximum)

  • All intervals should be of equal length

  • Each interval includes the left endpoint, but not the right

29
New cards

Choosing the intervals (frequency distribution)

“nice choices”, that summarize our data well. We’d typically use around 5 - 10 intervals total

30
New cards

Why cant we just use non-overlapping intervals?

because of decimals (continuous variables)

  • 70-79 how about 79.5?

31
New cards

Continuous variables

These are quantitative variables that can take any value within a given range.

32
New cards

Continuous variables (examples)

Test scores, age, height, distance

33
New cards

Discrete variables

These are quantitative variables that can only take a “countable” number of values: i.e. they can only take a specific, distinct values.

34
New cards

discrete variables (examples)

  • The number of children in a family

  • The number of days of rain in a month

  • The number of books a person has read in their life

35
New cards

Histograms

More useful and commonly used display of continuous data

  • Graphical displays of the frequency (or relative frequency) of data values falling into each of several intervals.

  • Histograms are especially useful when we’re dealing with large data sets.

<p><span>More useful and commonly used display of continuous data</span></p><ul><li><p><span>Graphical displays of the frequency (or relative frequency) of data values falling into each of several intervals.</span></p></li><li><p><span>Histograms are especially useful when we’re dealing with large data sets.</span></p></li></ul><p></p>
36
New cards

What type of data is used for a histogram

continuous, quantitative data

37
New cards

Why is there no spaces between the bars in a histogram

because they are continuous data

38
New cards

What does the base of a histogram represent

length of the interval (equal length)

39
New cards

What does the height of a histogram represent

the frequency of the data in each interval

40
New cards

Distribution shape (histogram)

A histogram can be used to characterize the shape of the data distribution

  • Symmetric

  • Skewed to the right

  • Skewed to the left

41
New cards

Symmetric shape (histogram)

center divides it into two approximate mirror images

<p><span>center divides it into two approximate mirror images</span></p>
42
New cards

Skewed to the right (Histogram)

longer tail on the right side

  • most of the data values are concentrated on the left

<p><strong>longer tail on the right side </strong></p><ul><li><p>most of the data values are concentrated on the left</p></li></ul><p></p>
43
New cards

Skewed to the left (Histogram)

longer tail on the left side

  • most of the data values are concentrated on the right.

<p><strong>longer tail on the left side</strong></p><ul><li><p>most of the data values are concentrated on the right.</p></li></ul><p></p>
44
New cards

Distribution shape (!!WARNING!!)

Be careful interpreting the shape of a histogram if it’s displayed vertically!!

  • x-axis has to start at 0 (when flipped horizontal)

<p><span>Be careful interpreting the shape of a histogram if it’s displayed vertically!!</span></p><ul><li><p><strong>x-axis has to start at 0</strong> (when flipped horizontal)</p></li></ul><p></p>
45
New cards

Time series data

which are values for a variable measured over time

46
New cards

How can you visually display time series data

time plots

47
New cards

What constitutes a Time Plot

Time is plotted on the x - axis, and variable values are plotted on the y - axis

<p><span>Time is plotted on the x - axis, and variable values are plotted on the y - axis</span></p>
48
New cards

How is data represented on a Time Plot

Data values are represented by points. We connect these points to better visualize the pattern/trend.

49
New cards

Seasonal variation (time plot) {example}

fluctuations in data values that occur at regular intervals due to seasonal factors, showing predictable changes at specific times of the year.

<p>fluctuations in data values that occur at regular intervals due to seasonal factors, showing predictable changes at specific times of the year.</p>