- Definitions and Concepts

0.0(0)
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/25

flashcard set

Earn XP

Description and Tags

Statistics - Basic definitions and concepts

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

26 Terms

1
New cards

Use of statistics

  • Design of experiment / sampling methods

  • Data collection

  • Organization of data (and graphing)

  • Analysis

  • Estimation of uncertainties

  • Interpretation

2
New cards

Incorrect data causes

  • Sampling bias

  • Sample too small

  • Sample not representative

  • Leading question (also, people lie)

3
New cards

Data is correct, but presentation is misleading

  • Cherry picking data

  • Misleading graphs

4
New cards

Presentation of data

  • Percentages can be misleading with small numbers

  • Changes in absolute numbers can be misleading with large numbers

5
New cards

Leading question affects the data

Asking what makes a situation worse first is more likely to be voted first

6
New cards

Population

Collection of the entire set of all measurements of interest in a study

Ex. StatsCanada census where all Canadian households must respond to a survey

7
New cards

Parameter (relating to a population)

A numerical measurement that describes a population

Ex. Class average if you consider a class as an entire population

8
New cards

Sample

A smaller set of measurements taken from a population

9
New cards

Statistic (hint: relation to sample)

A numerical measurement describing a sample

10
New cards

Two types of stats

Descriptive statistics:

  • A description of the data (sample or population)

  • Find averages and dispersion

  • Graph the data

Inferential statistics:

  • An interpretation of sample data

  • Given a sample, what are the properties of a population?

11
New cards

Inference

Inferring data when a population is inaccessible and the parameters are unknown. (guessing/determining results)

12
New cards

Types of Data

  • Defined when experiment is designed

  • Qualitative and Quantitative data

13
New cards

Qualitative Data

not numerical, also called categorical data

14
New cards

Quantitative Data

  • numerical

  • can be discrete: counting, gaps b/w values

  • or continuous: measured, no gaps (size of gaps depends on precision of instruments)

15
New cards

Level of measurement

  • Nominal: categorical data that cannot be ordered

    • Ex. colours (red, blue, purple)

    • brands (Ford, GM, Toyota)

  • Ordinal: categorical data that has a natural order

    • Ex. high, low

    • large, medium, small

16
New cards

Level of measurements

  • Interval: numerical data for which differences are significant but not ratios

    • Zero is arbitrary, it has no special meaning

    • Ex. temperature: 20C is not twice as hot as 10C

  • Ratio: numerical data, differences and ratios are significant

    • Zero really means that nothing is measured

    • Ex. weight: 20 kg is twice as heavy as 10 kg

17
New cards

Level of measurement (Image order)

knowt flashcard image
18
New cards

Identifying types of data (Image order)

knowt flashcard image
19
New cards

Identifying types of data (examples)

If we record the colour of the cars passing on a bridge in a day, the variable is qualitative nominal

If we record the age of the cars in categories such as ‘new’, ‘fairly new’, ‘used’, ‘old’, and ‘antique’, the variable is qualitative ordinal

The time between each car measured by a chronometer is quantitative continuous ratio

The year of construction of the cars is quantitative discrete interval

20
New cards

Designing (a good) experiment

  • Random sampling

    • All subgroups represented

    • No arbitrary selection

  • Controlling effects

    • Placebo effect

    • Double blind experiment

  • Repeatable

    • Any sample can be unrepresentative of the population by fluke

21
New cards

Simple sampling schemes

22
New cards

Complex sampling schemes

  • Stratified sampling

    • Break the population in subgroups, f.ex. gender, country, age

    • Random selection in each subgroup

  • Cluster sampling

    • Divide population in many subgroups

    • Randomly select clusters

    • Study all members of selected clusters

23
New cards

Sampling methods (image)

knowt flashcard image
24
New cards

Designing an experiment

  • It should be possible to pick any member of a population

  • Every member should be equally likely to be picked (as much as possible)

  • In many modern surveys of people (for politics for example) random sampling has been partially abandoned because it is too hard to achieve.

  • Experiments that have known biases can still be valid if the biases are taken into account in the analysis.

25
New cards

Probabilistic Survey (Gallup)

  • eg. random telephone #'s sample

    • landlines

    • cellphones

==> UNCERTAINTIES

26
New cards

Non-probabilistic survey

  • eg. pool of 400 000 people

    • choose within those

==> NO UNCERTANTIES (but is not [completely] random)