TOPIC #7: Chance variability (the box model)

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/16

flashcard set

Earn XP

Description and Tags

Module 3 DATA1001

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

17 Terms

1
New cards

Observed value = expected value + chance error, what does this mean?

Every time you perform a chance process, there is chance variability.

2
New cards

What is the law of large numbers/law of averages

the proportion of heads becomes more stable as the length of the simulation increases and approaches a fixed number called the relative frequency. the chance error in the number of heads is likely to be large in absolute size, but small relative to the number of tosses.

3
New cards

what does the box model show and what do you need to know?

it is a simple way to describe many chance processes like the distinct numbers that go in the box (tickets), the number of each kind of tickets, the number of draws from the box

4
New cards

why do we use the box model?

the box is the summary of the population (what is in there and in what proportions), we take draws from the box to create the sample.

5
New cards

what does EV and OV mean?

Expected value and observed value

6
New cards

what is the chance error in terms of ov and ev and what is it modelled by?

OV - EV. it is modelled by the standard error
(SE)

7
New cards

How do you calculate EV and SE

number of draws x mean of the box , root number of draws x SD of the box

8
New cards

the SD of the box is the …

population SD

9
New cards

how do you calculate the SD of the box? (Three ways)

  1. formula: RMS(gaps) = root of the mean of the squared gaps

  2. R: Popsd() with package rafalib

  3. shortcut for simple binary boxes

    • if a box only contains two different numbers (“big” and “small”) then SD = (big - small)root proportion of big x proportion of small

10
New cards

How do you model the sum/mean of a sample for large number of draws from the box

normal distribution with mean - EV and sd = SE .

11
New cards

what are different types of histograms?

data histogram, probability histogram, simultation histogram

12
New cards

data histogram

represents the amount of data by area

<p>represents the amount of data by area </p>
13
New cards

probability histogram

represents chance by area

<p>represents chance by area </p>
14
New cards

simulation histogram

represents chance by area, for a simulation of a chance process

<p>represents chance by area, for a simulation of a chance process </p>
15
New cards

what is the relationship between probability and simulation histograms?

for repeated simulations of a chance process resulting in a sum, the simulation histogram of the observed values converges to the probability histogram

16
New cards

what is the central limit theorem?

when drawing at random with replacement from a box, if the sample size for the sum or average is sufficiently large then the probability histogram or distribution of chance for the sum or average will closely follow the normal curve, even if the contents of the box does not.

17
New cards