Data Analysis


Last updated 5:30 AM on 5/7/25

49 Terms

1
New cards

In what cases should you use the median over the mean?

When the dataset contains outliers or is strongly skewed, because the median is much less affected by extreme values than the mean.
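
A small sketch of why this matters. The numbers are made up for illustration and are not from the course material: adding one extreme value moves the mean a lot while the median stays put.

```python
from statistics import mean, median

# Hypothetical data: five typical values, then the same values plus one outlier.
values = [10, 12, 11, 13, 12]
with_outlier = values + [100]

# The mean is pulled towards the outlier; the median is not.
print("without outlier:", mean(values), median(values))
print("with outlier:   ", mean(with_outlier), median(with_outlier))
```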

2
New cards

Find IQR

IQR=Q3-Q1
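
A minimal sketch of the calculation on made-up data. Note that textbooks and software use slightly different quartile conventions; the "inclusive" method of Python's `statistics.quantiles` is an assumption here and may not match the convention used in the course.

```python
from statistics import quantiles

# Hypothetical data set (not from the flashcard image).
data = [1, 2, 3, 4, 5, 6, 7, 8, 9]

# n=4 splits the data into quartiles and returns the cut points Q1, Q2, Q3.
q1, q2, q3 = quantiles(data, n=4, method="inclusive")

iqr = q3 - q1
print(q1, q3, iqr)
```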

3
New cards
term image
knowt flashcard image
4
New cards

How do you use a probability table?

knowt flashcard image
5
New cards
term image
6
New cards

What is sensitivity?

What is specificity?

What is precision?

Sensitivity = TP/(TP+FN): true positives (read from the probability table) divided by actual positives (TP+FN).

Specificity = TN/(TN+FP): true negatives divided by actual negatives.

Precision = TP/(TP+FP): true positives divided by predicted positives.
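
The three rates can be sketched from the four cells of a probability (confusion) table. The function name and the counts below are hypothetical, chosen only to illustrate the formulas.

```python
def classification_rates(tp, fp, fn, tn):
    """Return sensitivity, specificity and precision from table counts."""
    return {
        "sensitivity": tp / (tp + fn),  # share of actual positives detected
        "specificity": tn / (tn + fp),  # share of actual negatives cleared
        "precision": tp / (tp + fp),    # share of positive results that are real
    }

# Hypothetical counts, not taken from the flashcard image.
rates = classification_rates(tp=40, fp=10, fn=10, tn=940)
print(rates)
```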
7
New cards
term image

Precision = TP/(FP+TP) = 0.566

With a precision of only 0.566, the positives cannot be trusted: about 43% of positive results are false positives.
8
New cards

What are random variables?

A variable whose value is the numerical outcome of a random process.

Flipping a coin.

Rolling a die.

Discrete: the random variable can take only certain distinct values.

Continuous: the random variable can take any value in an interval.

E.g. an individual's measured time in a 100 m sprint.

9
New cards

What is a probability mass function?

The probability that a discrete random variable X takes on a particular value x.

E.g. for a fair six-sided die: Pr(X=x) = 1/6 for x = 1, 2, 3, 4, 5, 6 and Pr(X=x) = 0 otherwise.
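
As a sketch, the die PMF above can be written as a dictionary, and the standard discrete formulas E(X) = Σ x·Pr(X=x) and Var(X) = Σ (x - µ)²·Pr(X=x) applied to it. Exact fractions are used to avoid rounding.

```python
from fractions import Fraction

# PMF of a fair six-sided die: each face has probability 1/6.
pmf = {x: Fraction(1, 6) for x in range(1, 7)}

# A valid PMF sums to 1.
total = sum(pmf.values())

# Mean: weight each value by its probability.
mean = sum(x * p for x, p in pmf.items())

# Variance: probability-weighted squared distance from the mean.
variance = sum((x - mean) ** 2 * p for x, p in pmf.items())

print(total, mean, variance)
```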

10
New cards

How do you calculate the mean?

11
New cards

What is variance and how do you calculate it?

knowt flashcard image
12
New cards

Find mean and variance.

E(x)=4.05

13
New cards
term image

Although E(X) is lower for A, the spread of its distribution suggests that it might not be the better treatment.

14
New cards

How do you find the mean and variance from a probability density function?

Mean: µ = integral from -∞ to ∞ of x·f(x) dx

Variance: σ² = integral from -∞ to ∞ of (x-µ)²·f(x) dx
15
New cards

Find the variance as well.

Var(X) = integral from 0 to 6 of (x-4)²·(x/18) dx = 2
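
A sketch that checks this result exactly. It assumes, reading off the integral above, that the density is f(x) = x/18 on [0, 6] with mean 4; the helper `integrate_poly` is a hypothetical name introduced here, and it uses the shortcut Var(X) = E(X²) - µ², which is equivalent to integrating (x-µ)²·f(x).

```python
from fractions import Fraction

def integrate_poly(coeffs, a, b):
    """Exact integral over [a, b] of sum(coeffs[k] * x**k)."""
    total = Fraction(0)
    for k, c in enumerate(coeffs):
        # Antiderivative of c*x^k is c*x^(k+1)/(k+1).
        total += Fraction(c) * (Fraction(b) ** (k + 1) - Fraction(a) ** (k + 1)) / (k + 1)
    return total

c = Fraction(1, 18)  # assumed density f(x) = x/18 on [0, 6]

total_prob = integrate_poly([0, c], 0, 6)            # integral of f(x)
mean = integrate_poly([0, 0, c], 0, 6)               # integral of x*f(x)
second_moment = integrate_poly([0, 0, 0, c], 0, 6)   # integral of x^2*f(x)
variance = second_moment - mean ** 2

print(total_prob, mean, variance)
```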
16
New cards
term image

You must know how to do this by hand. The first part uses integration by parts.
17
New cards

What is a cumulative distribution function?

knowt flashcard image
18
New cards
term image

19
New cards
term image
knowt flashcard image
20
New cards

What are the properties of mean?

For a symmetric bell curve, mean = median.
21
New cards

What are the properties of variance?

Independent: the result of one trial does not affect any other.
22
New cards
term image
knowt flashcard image
23
New cards

What are Bernoulli trials?

Experiment with two possible outcomes: success and failure.

p = probability of success, q = 1 - p = probability of failure.

24
New cards

What is the binomial distribution?

knowt flashcard image
25
New cards

Find µ and sd.

E(X) = np = 500 × 0.95 = 475

sd = sqrt(npq) = sqrt(np(1-p)) = sqrt(500 × 0.95 × 0.05) = 4.87
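
The same arithmetic as a short sketch, using n = 500 and p = 0.95 from the working above.

```python
from math import sqrt

n, p = 500, 0.95   # number of trials and success probability from the card
q = 1 - p          # failure probability

mean = n * p             # E(X) = np
sd = sqrt(n * p * q)     # sd = sqrt(npq)

print(mean, round(sd, 2))
```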
26
New cards

What is the probability that a person infects more than 17 people?

12 ± 2(2) = (8, 16)

z = (x - µ)/σ = (15 - 12)/2 = 1.5
27
New cards

What is a confounding variable?

A variable that influences both of the variables being studied and so distorts the apparent relationship between them.

It acts like an uncontrolled variable.

28
New cards

Use CLT to find sample mean and sd.

Find 95% CI

By the CLT, xbar ~ N(µ, sd²/n) = N(10, 4/100), so sd(xbar) = sqrt(4/100) = 0.2.

E(xbar) ± 1.96 × sd(xbar) = 10 ± 1.96 × 0.2 = (9.608, 10.392)
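
A sketch of the same interval, using the values from the working above (mean 10, variance 4, n = 100) and z = 1.96 for 95%.

```python
from math import sqrt

mu, variance, n = 10, 4, 100   # values from the card
z = 1.96                       # 95% normal critical value

se = sqrt(variance / n)        # sd of the sample mean under the CLT
lower, upper = mu - z * se, mu + z * se

print(se, round(lower, 3), round(upper, 3))
```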

29
New cards
term image

From the CLT, xbar ~ N(µ, 100/400), so sd(xbar) = 0.5 and 1.96 × 0.5 = 0.98.

µ ≤ xbar + 0.98 and µ ≥ xbar - 0.98
49.82≤µ≤50.8
30
New cards
term image
knowt flashcard image
31
New cards
term image

No. Since 2.9 µm does not lie within the 95% CI, it is not a plausible value for the population mean at the 95% confidence level.
32
New cards

What is the t-distribution?

Used when the population sd is not known and has to be estimated from the sample.

It looks similar to the normal distribution, but there is more mass in the tails (it is wider), which accounts for the extra uncertainty associated with not knowing the sd.

33
New cards

A new method for performing the above surgical procedure seeks to improve the consistency in the time required to complete the procedure. One day, 18 surgeries are performed. The sample mean of the time taken for the procedure is 29.47 minutes. What must the standard deviation be such that the 95% confidence interval for the mean does not include 30 minutes?

t score = 2.08 for t21 and 2.11 for t17
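
One way to set this up, as a sketch rather than the official solution: with n = 18 the interval uses t17 = 2.11, and since the sample mean is below 30, the upper limit xbar + t × s/sqrt(n) must fall below 30. Solving for s gives the largest allowable standard deviation.

```python
from math import sqrt

n = 18            # number of surgeries
xbar = 29.47      # sample mean (minutes)
t_crit = 2.11     # t17 critical value given on the card (df = n - 1)
target = 30       # value the interval must exclude

# Upper limit xbar + t * s / sqrt(n) < target  =>  s < (target - xbar) * sqrt(n) / t
max_sd = (target - xbar) * sqrt(n) / t_crit

print(round(max_sd, 3))
```

Any sample standard deviation below this bound keeps 30 minutes outside the interval.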

34
New cards
35
New cards

How do you find the variance of a sample proportion?

var(p̂) = var(X/n) = (1/n²) × var(X), where var(X) = npq, thus var(p̂) = pq/n

36
New cards

A disease is known to have a prevalence of 20%. If we randomly sample 200 members of the population, what is the 95% probability interval for the prevalence?

knowt flashcard image
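
A sketch of the prevalence calculation, assuming the normal approximation p ± 1.96 × sqrt(pq/n) with p = 0.2 and n = 200 from the question. The original answer is an image, so this may differ from it in rounding or method.

```python
from math import sqrt

p, n = 0.2, 200    # prevalence and sample size from the question
z = 1.96           # 95% normal critical value

se = sqrt(p * (1 - p) / n)     # sd of the sample proportion, sqrt(pq/n)
lower, upper = p - z * se, p + z * se

print(round(lower, 4), round(upper, 4))
```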
37
New cards

We suspect we have a positively biased coin. If we observe a sample proportion of p̂ = 0.6, how many observations would we need such that the 95% confidence interval only contains values of positive bias?

knowt flashcard image
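
A possible approach, sketched under assumptions: use the normal approximation with z = 1.96 and search for the smallest n whose lower limit p̂ - z × sqrt(p̂(1-p̂)/n) is above 0.5. The original answer is an image and may use a different critical value.

```python
from math import sqrt

p_hat = 0.6   # observed sample proportion from the question
z = 1.96      # assumed 95% normal critical value

# Increase n until the lower end of the interval exceeds 0.5 (a fair coin).
n = 1
while p_hat - z * sqrt(p_hat * (1 - p_hat) / n) <= 0.5:
    n += 1

print(n)
```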
38
New cards

What is the null hypothesis?

Always an equality statement. Examples:

A new chemotherapeutic drug does not change the 5-year survival rate of lung cancer.

Birth weight is not associated with the child’s IQ.

Rest days do not affect the chance of soft tissue injuries.

The alternative hypothesis, H1, is the opposite.

The logic is proof by contradiction: assume H0 is true and see whether the data contradict it.
39
New cards
term image

If the data are consistent with H0, say "do not reject H0", not "accept H0".
40
New cards

Suppose we flip a fair coin 100 times. We expect 50 heads. But in an experiment, 80 heads are observed.
What would the statement of the p-value be?

If the coin were truly fair (H0), what is the probability that we would see a result this extreme or more extreme?
If that probability is really low, like 0.001, then we conclude that it is not plausible that the coin is fair.

The usual cutoff is 0.05: reject H0 if the p-value is below it.
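
For the coin example, the p-value can be sketched with an exact binomial tail. Doubling the upper tail for a two-sided test is an assumption here (it is valid because the fair-coin distribution is symmetric).

```python
from math import comb

n, observed = 100, 80   # flips and observed heads from the card

# Under H0 (fair coin) every sequence has probability 1/2^n, so
# Pr(X >= 80) is the number of sequences with at least 80 heads over 2^n.
upper_tail = sum(comb(n, k) for k in range(observed, n + 1)) / 2 ** n

# Two-sided: results at least as extreme in either direction (<= 20 or >= 80 heads).
two_sided = 2 * upper_tail

print(upper_tail, two_sided, two_sided < 0.05)
```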

41
New cards

What is a type II error (β)?

A type II error is failing to reject H0 when H1 is true; β is the probability of making it.

The power = 1 - β, which is the probability of avoiding a type II error.
42
New cards

Pr(Zs<0.017)=0.507

43
New cards
term image
knowt flashcard image
44
New cards
term image
knowt flashcard image
45
New cards

knowt flashcard image
46
New cards
term image
knowt flashcard image
47
New cards

What are the things you need to check before you proceed with regression?

No patterns should arise from the residual plot.

48
New cards
term image
knowt flashcard image
49
New cards
term image

t8, because regression uses n - 2 degrees of freedom (here n - 2 = 8).
