$go to Math$

Statistics One-Variable & Two-Variable Data

Unit Four Review

Studied by 0 people

0.0(0)

Get a hint

Hint

When is the sample method independent?

1 / 54

Earn XP

Description and Tags

Statistics

One-Variable & Two-Variable Data

55 Terms

When is the sample method independent?

When an individual selected for one sample does not dictate which individual is to be in a second.

New cards

When is a sampling method dependent?

When an individual selected to be in one sample is used to determine the individual in the second sample.

New cards

What are dependent samples often referred to as?

Matched-pairs samples

New cards

What is paired data?

Two sets of observation that are uniquely paired so that an observation in one set matches an observation in the other based on a specific criterion or characteristic.

New cards

What is a natural measure of the effect of an observed object/action?

Difference between two groups' outcomes

New cards

What is the first step to testing hypotheses about the mean difference of matched pairs Data?

Determine the null and alternative hypotheses. The hypotheses can be structured in one of three ways, where µ_d is the population mean difference of the matched-pairs data. (Always null equal Zero)

<p>Determine the null and alternative hypotheses. The hypotheses can be structured in one of three ways, where µ<sub>d</sub> is the population mean difference of the matched-pairs data. (Always null equal Zero)</p>

New cards

What is the second step of testing hypotheses about the mean difference of matched-pairs data?

Select a level of significance, å, depending on the seriousness of making a type I error.

New cards

What is the third step of Testing Hypotheses about the mean difference of Matched-pairs data by Hand?

Calculate the test statistic using the sample data., where the values of d-bar, s_d are the mean and standard deviation of the differenced data.

<p>Calculate the test statistic using the sample data., where the values of d-bar, s<sub>d</sub> are the mean and standard deviation of the differenced data.</p>

New cards

What is the fourth step of testing hypotheses about the mean difference of matched-pairs?

If P-value < å, reject the null hypothesis

New cards

What is the fifth step of testing Hyptheses about the mena difference of Matched Pairs

Summarize the results and draw a conclusion.

New cards

What is the confidence interval for matched pairs data?

New cards

What is the formula to determine the t-distribution of two independent population means?

New cards

how do we use R-studio to find a t.test() function for the mean difference?

t.test(xdata, ydata, mu = ____, alternative = “_____”, conf.level = ____, paired = 1)

New cards

What is the first step in testing hypotheses about the difference of two independent population means?

Determine the null and alternative hypotheses, where the null hypothesis is that the means equal one another

New cards

What is the second step of testing hypotheses about the difference of two independent population means?

Select a level of significance, å, depending on the significance of making a type I error

New cards

How do we determine degrees of freedom in two Independent population means?

Using the smaller of n₁-1 & n₂-1

New cards

What is the third step of testing the Hypothesis about the difference of two Independent population?

Compute test statistic

New cards

What is the Fourth step of testing the Hypothesis about the difference of two Independent population?

If p-value < å, reject the null hypothesis

New cards

What is the Fifth step of testing the Hypothesis about the difference of two Independent population?

State the Conclusion

New cards

How do you compute the endpoints of the confidence interval for µ₁ - µ₂?

Use the formula: (point estimate) ± (critical value) × (standard error).

New cards

how is the mean of sampling distribution of the difference between two proportions (independent samples) found?

The mean of the sampling distribution is calculated as the difference between the two population proportions, denoted as p1 - p2.

New cards

How is the standard deviation of a sampling distribution of the difference between two proportions (independent Samples) found?

The standard deviation of the sampling distribution is found using the formula:

New cards

What is the z score of a sampling distribution of the difference between two proportions (independent samples) found

New cards

What is the first step of finding the confidence level for p₁ - p₂?

Find z_å/2

New cards

What is the second step of finding the confidence level for p₁ - p₂?

the endpoint of the confidence intervalf for p₁-p₂ are:

<p>the endpoint of the confidence intervalf for p<sub>1 </sub>-p<sub>2</sub> are: </p>

New cards

What is the third step of finding the confidence level for p₁ - p₂?

interpret confidence interval

New cards

what is the standard deviation of the hypothesis test for proportion of two independent samples?

New cards

How is the standard error computed since p is unknown in a relation of two proportion of independent samples?

using a weighted average point estimate known as the pooled estimate of p.

New cards

What is the first step of the hypothesis test of the sampling distribution. of two independent proportion samples?

Determine the null and alternative hypotheses. The hypothesis can be structured in one of three ways:

New cards

What is the second step of the hypothesis test of the sampling distribution. of two independent proportion samples?

Select a level of significance, å, depending on the seriousness of making a Type I error

New cards

What is the third step of the hypothesis test of the sampling distribution. of two independent proportion samples?

Compute the test statistic

New cards

What is the fourth step of the hypothesis test of the sampling distribution. of two independent proportion samples?

If P value < å, reject the null hypothesis

New cards

What is the fifth step of the hypothesis test of the sampling distribution. of two independent proportion samples?

state the conclusion

New cards

what is the code in RSTUdio for the sampling distribution for two proportions (independent samples)

prop.test(c(x₁,x₂),c(n₁,n₂), alternative = “two.sided”, “less”, “greater”, conf.level = , correct = FALSE)

New cards

What does r represent in statistics?

The correlation coefficient which measures the strength of a linear relationship between two variables. If r is close to 1 or -1, it indicates a strong relationship, while values near 0 suggest a weak relationship.

New cards

What does b₀ and b₁ equal in the following equation? y = b₀ + b₁x

y-intercept; slope

New cards

what is the least squares regression lines?

The line which minimizes the sum of the squared residuals for all the points in the plot. In other words, the least squares line is the line with coefficients b₀ and b₁ such that the quantity (e₁)² + (e₂)² + … + (e_n)²

New cards

What does ŷ represent?

The predicted value of the dependent variable (y) in a regression equation.

New cards

How do you find e_i?

y_i - ŷ_i where yi is the actual value of the dependent variable and ŷi is the predicted value.

New cards

If the observed line is above the estimation line in a regression model what does it mean?

It indicates that the actual value of the dependent variable is greater than the predicted value, suggesting a positive residual. This means that the model underestimated the observed.

New cards

What does it mean if the residuals show a pattern?

It suggests that the model is not capturing some aspect of the data, indicating potential issues with the model's fit or the presence of non-linearity.

New cards

What do smaller residuals imply?

imply that the predicted values are closer to the actual values, indicating a better fit of the regression model to the data.

New cards

What do larger residuals imply?

They suggest that the predicted values are further from the actual values, indicating a poorer fit of the regression model to the data.

New cards

How do residuals help to identify outliers?

Residuals can be analyzed to detect points that deviate significantly from the overall trend in the data, indicating potential outliers that do not fit the expected pattern.

New cards

What does it mean if the residuals widen or narrow systematically?

It indicates that the variance of the errors is changing, suggesting potential issues with the model, such as heteroscedasticity, which can affect the reliability of the regression results.

New cards

in the least squares regression line what does b₁ represent?

the slope of the regression line, indicating the change in the dependent variable for each one-unit change in the independent variable. b₁ = r•(s_y•s_x), where r is the correlation coefficient, s_y is the standard deviation of the dependent variable, and s_x is the standard deviation of the independent variable.

New cards

In the least squares regression line what does b₀ represent?

the y-intercept of the regression line, representing the predicted value of the dependent variable when the independent variable is zero. ÿ - b₁µ (where ÿ is sample mean of yand µ is sample mean of x)

New cards

If b₁is greater than 0?

There is positive linear association and vice versa

New cards

What is the code in RSTUDIO for least squares line?

Im(responsevariable~explanatoryvariable),, the plot() and abline() function allow you to see the scatterplot and regression line.

New cards

True or False:

When extrapolation is done to a regression model the linear relationship may not hold, meaning the boundaries of linearity are required.

True

New cards

When can the regression line be used>

When the value belongs inside of the observed minimum and maximum explanatory variable values, ensuring that predictions are made within the range of the data.

New cards

How do we determine the strength of our prediction if all the assumptions of the linear regression are satisfied?

The squared correlation coefficient (R²) indicates how well the regression line fits the data. Which is called the coefficient of determination

New cards

What does a value of R² near 0 suggest?

It indicates that the regression line does not explain the variability of the data well, suggesting a weak relationship between the variables.

New cards

What does a value of R² near 1 suggest?

It indicates that the regression line explains a large proportion of the variability in the data, suggesting a strong relationship between the variables.

New cards

How is R² determined in RSTUDIO?

>cor(x,y)²

New cards