D2. Data and Variables

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/33

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 7:54 PM on 6/5/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

34 Terms

1
New cards

Categorical data is

Qualitative: often binary indicators

Ordinal: order is meaningful but differences aren’t

2
New cards

Numerical (Quantitative) data is

Cardinally meaningful, i.e. discrete or continuous

3
New cards

The support of a variable is

The set of values it can take

4
New cards

Multivariate variables are

The list of measured features when we measure several features of an object

5
New cards

Cross-sectional data sets have

One observation per unit
E.g. data on one attribute measured in N people a cross-section dataset would be indicated as

<p>One observation per unit<br>E.g. data on one attribute measured in N people a cross-section dataset would be indicated as</p>
6
New cards

A time series is

A series of data points indexed in time order

7
New cards

A time series on a multivariate variable is

A single observation

8
New cards

The function for discrete variables is a

Probability Mass Function

9
New cards

The function for continuous variables is a

Probability Density Function

10
New cards

A PDF for a continuous distribution with a = 1.5 and b = 2 looks like

knowt flashcard image
11
New cards

A PMF looks like

knowt flashcard image
12
New cards

The formal definition of a PDF is

knowt flashcard image
13
New cards

What are the parameters of the binomial distribution

N and p

14
New cards

What does the PMF of a binomial distribution with N = 10 and p = 0.5 look like relative to N = 15 and N = 20 (same p)

knowt flashcard image
15
New cards

All PDFs have the properties

They are non-negative: f (x) ≥ 0

The area underneath them is equal to one: ∫ f(x)dx = 1

16
New cards

The PDF for X~U (a, b) is

f(x) = 1/(b-a) if a≤x≤b

=0 otherwise

17
New cards

The cumulative distribution function (CDF) is

A function giving the probability that the variable X is less than or equal to some value x

18
New cards

The CDF is defined mathematically as

F(x) = P(X≤x) for all x in the support of X

19
New cards

For a variable X~U(a, b) the CDF is

knowt flashcard image
20
New cards

For any CDF

knowt flashcard image
21
New cards

In the CDF, the 1st quartile is the value such that

P(X ≤ x) = 0.25

22
New cards

The probability that the value of a Standard Normal variable would lie within ±1.645 of its central location (μ = 0) is

0.9

23
New cards

The probability that the value of a Standard Normal variable would lie within ±1.96 of its central location (μ = 0) is

0.95

24
New cards

The probability that the value of a Standard Normal variable would lie within ±2.58 of its central location (μ = 0) is

0.99

25
New cards

if X ∼ N(μ, σ^2) and Z = (X−μ)/σ then

Z ∼ N(0, 1)

26
New cards

Suppose X ∼ N(μ, σ^2) and further suppose that we want to know P(X ≤ x). What value do we look up in the statistical table?

knowt flashcard image
27
New cards

The PDF of a variable x, given the CDF, is

knowt flashcard image
28
New cards

The joint CDF is

F (x, y) = P (X ≤ x, Y ≤ y)

29
New cards

The joint PMF is

f (x, y) = P (X = x, Y = y)

30
New cards

The joint PDF is

knowt flashcard image
31
New cards

The marginal PDF of x, f(x), is

f(x, whatever the value of y)

32
New cards

The PDF of a variable x, given the CDF, is

knowt flashcard image
33
New cards

If X and Y are independent then the joint PDF is

The product of the marginals. f (x, y) = f (x) g (y)

34
New cards

If X and Y are independent then the joint CDF is

The product of the marginals. F (x, y) = F (x) G (y)