Chi-squared Test

  • to find the expected frequency of each element in the table, find the two product of the two relevant probability and times by the total number of sample

  • If the expected frequency found is less than 5, then need to add to the next column to combine the data

  • This then can be used to calculate the chi-square statistic

  • Then use the formula booklet to find the corrosponding statistic value depending on the significance level, compare this with the calculated value, if calculated value is higher, then there is sufficient evidence to reject H0

  • Hypothesis testing (always one tailed test):

    • H0: the two variables are independent

    • H1: the two variables are not independent

    • or

    • H0: distribution is a good fit

    • H1: distribution is not a good fit

  • In a contingency table, the degree of freedom is always (m-1)(n-1)

  • But if not in a table, degree of freedom always minus one from the number of data, unless we estimated the value of mean or anything when finding the test statistics, then we minus another 1.

  • Note: In questions when asking whether a distribution is a good find, to find the mean, use expectation formula, then to find the expected frequency need to put in the data into the distributions to find probability then times by the frequency, think of expected frequency in this case the frequency if distribution is a good fit

  • degree of freedom needs to minus one if mean is calculated as this is an estimate

  • If test statistics contribution is higher, this means the data is more unexpected, so mention whether the observed frequency is more or less than the expected

  • if test statistics contribution is low, this means data does not contribute much to the test statistics