statistics midterm

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/44

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 5:14 PM on 4/4/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

45 Terms

1
New cards

boxplot

a plot that pictures quartile data

2
New cards

interquartile range

Q3-Q1

3
New cards

quantile

the data for proportion p

  • q

4
New cards

standard deviation

measures the average distance data have from the mean

5
New cards

z score

measure of interesting-ness or surprise

  • used to compare data values in different populations

6
New cards

resistant

when a measure is not impacted much by skewness or extreme data

7
New cards

resistant values

  • median

  • Q1

  • Q3

  • IQR

8
New cards

nonresistant values

  • mean

  • range

  • standard deviation

  • r

  • r squared

9
New cards

bivariant

data that consist of observations of two variables

  • pair of variables

10
New cards

x variable

  • independent variable

  • tells about y, not exact causation

11
New cards

y variable

dependent variable, depends on independent

12
New cards

scatterplot

used to plot bivariant data on the xy plane

13
New cards

time series data

independent variable is time

14
New cards

line graph

consecutive points in time are connected by line segments

ex: scatter plot

15
New cards

linear

an increase in the value of one variable roughly corresponds to a proportional increase in teh value of the other variable

  • has a constant slope

16
New cards

pearson’s correlation coefficent

magic number that measures how close data are to being perfectly linear

  • r

17
New cards

r squared

gives the proportion of variation in the y-variable that’s explained by the linear model

  • larger values of it correspond to data that is more perfectly linear

  • higher it is the closer it is to linear relation ship

18
New cards

unrelated

a correlation of r=0 does not imply the variables are _____

ex: quadratic data=perfectly quadratic, not linear and r is basically 0, but has a relationship

19
New cards

residual

distance of the point from the linear regression line

20
New cards

line of best fit

the line that runs centrally throughout all the data

  • also linear regression lines and least squares line

  • most accurately represents the data

  • makes the sum of squared errors small

21
New cards

interpolating

Xo is within the rance of x values [min, max] then we are _____ the data

  • internally extending the data with a prediction

22
New cards

extrapolating

Xo is outside the range of x-values, [min, max], and so we are _____ the data

  • we are extending the data externally

  • can do it within the advisable range (5% below and above range length)

23
New cards

meaningful

the y-intercept, when x=0, is often not _____.

24
New cards

0

the sum of all residual is ____

25
New cards

sum of squared deviation

  • measures variability of data

  • totals up total deviation from the mean

26
New cards

continuous

if x is ____ then Prob(x=a)= 0 and Prob(x>b) = Prob(x≥b) and vice versa

27
New cards

continuous random variable

normal with mean and standard deviation

28
New cards

normal

we say x is ___ if the density function of x is ____ to mean and standard deviation for some constants a and b>0

29
New cards

bell curve

  • symmetric

  • centered at the mean

  • steepest at mean-deviation and mean + deviation

  • curve has most of its area btwn the mean - 3(standard deviation) and mean + 3(standard deviation)

  • tapers off quickly

30
New cards

standardization

using z score

31
New cards

law of large numbers

as the number of trials grows very large, then the relative frequency of event e gets closer and closer to the probability of e

  • averaging many values of X (sample) gives an estimate for mean

  • can be confident that a relative frequency or average is a good estimate for a true probabity or mean mew

32
New cards

mean

the values of x tend to be near the ____

33
New cards

standard deviation

the average distance X takes from mean is the _____

34
New cards

probability

the study of likelihood, randomness, chance, etc

  • relative frequency at which the event occurs

  • equal to the proportion of instances where the event occurs when looking at a large collection of instances where it could occur

35
New cards

probability experiment

any situation that leads to a random result

  • consists of one or more trials

  • possible results are outcomes

36
New cards

sample space

the collection of all outcomes

37
New cards

event

any subcollection of outcomes

38
New cards

disjoint

variables have no outcomes in common

39
New cards

random

a rule assignment that assigns each outcome in S some real number

40
New cards

discrete

random variable X that only takes integer units

41
New cards

expected value

called the proportion mean of x

  • approximates the average value of x over those trials

  • does not have to be a value that x takes

42
New cards

continuous variable

takes on values from an interval

  • measure x finitely

43
New cards

density function

  • how to determine if a random variable is continuous

  • equal to or greater than 0

  • area under this curve=1

  • probability: prob(a<A≤b)

  • ddist

44
New cards

distribution function

pdist(a)=Prob(x ≤ a)

45
New cards

Quantive function

inverse of pdist= value of pdist(prob) is the input of in qdist and the input of pdist is the result of q dist

  • qdist

  • Prob(x ≤ q)

Explore top notes

Explore top flashcards