Continuous Data
Quantitative measurements on a continuous scale
Regression Equation
The line of best fit is described by a regression equation in the form of y = mx + c
We use this to predict values of y for a particular value of x (but only within the range of x values in our data set, as we cannot be sure the relationship holds beyond the limits of our data)
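A minimal sketch of fitting and using a regression equation (not part of the original card; assumes Python with numpy, and the data points are made up):

```python
import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])   # hypothetical data
y = np.array([2.1, 3.9, 6.2, 8.1, 9.8])

m, c = np.polyfit(x, y, deg=1)             # least-squares line of best fit
x_new = 3.5                                # within min(x)..max(x), so a valid prediction
print(f"predicted y = {m * x_new + c:.2f}")
```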
Nominal Data
Data in the form of categories with names. Data is non-quantitative (it is often counted to produce a discrete value)
Ordinal Data
Data that is ranked / on a rating scale. Data is not quantitative because we do not know the size of the difference between categories
Descriptive Characteristics
Measures calculated from a data set which summarise some characteristic of the data (quantifying patterns in the findings)
Median
The middle number in a sample when the values are placed in order. If there are two numbers in the middle, the median is the average of the two
Measures of Central Tendency
The mean, median and mode
Poisson distribution
Common distribution for discrete data; the shape depends on the mean.
- When the mean is near zero, the distribution is heavily skewed
- When the mean is large, the distribution looks approximately normal
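A small illustration of how the Poisson's shape follows its mean (assumes Python with scipy; the means 0.5 and 20 are arbitrary):

```python
from scipy.stats import poisson

for mu in (0.5, 20):
    pmf = [poisson.pmf(k, mu) for k in range(31)]   # P(count = k)
    peak = pmf.index(max(pmf))
    # Small mean: mass piled at zero (skewed). Large mean: peak near the
    # mean, roughly bell-shaped.
    print(f"mean={mu}: P(0)={pmf[0]:.3f}, most likely count={peak}")
```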
One tailed test
Specific; the null hypothesis states "there is no positive (or no negative) relationship"
- We are interested in only positive or only negative deviations of the test statistic
- The P value is HALVED
Pseudo-replication
Use of non-independent data points as if they were actually independent
Trend
A relationship between two variables, positive or negative
Correlation
A trend / relationship between two variables whose changes coincide, yet where causality is not implied
The variables covary
Spearman's rank correlation coefficient (rho)
Non-parametric statistic used to test the significance of correlations between variables
Can be used when normality / linearity violated
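A quick sketch (assumes Python with scipy; the made-up data are non-linear but monotonic, which rho handles because it works on ranks):

```python
from scipy.stats import spearmanr

x = [1, 2, 3, 4, 5, 6]
y = [1, 4, 9, 16, 25, 36]        # non-linear but perfectly monotonic
rho, p = spearmanr(x, y)
print(rho, p)                    # rho = 1.0: perfect rank correlation
```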
MS among
The average size of the difference between the group means and the grand mean
Covariate
The word used to describe a continuous independent variable in situations where there is a mixture of continuous and categorical independent variables, such as in ANCOVA
Induction
The derivation of general ideas from specific observations.
Hypothetico-deductive reasoning
An alternative to inductive reasoning; it argues that there is no way of proving a hypothesis to be true.
Hypothesis
An idea that is tentatively put forward to explain an observation. It may be generated by or contribute to a more general theory.
Theory
A set of general ideas or rules which are used to explain a group of observations.
Paradigm
A whole way of thinking / viewing the world
Paradigm Shift
A dramatic change in the way in which we think about a subject in science, when the evidence has accumulated in favour of rejecting a previous set of hypotheses or theories
Null Hypothesis
The form of hypothesis that we test formally, that predicts that nothing will happen / no effect will be observed / there is no difference or relationship between the two variables.
Statistics
The branch of mathematics that scientists use to provide a more objective assessment of patterns in data collected from experiments or observations
Sample Size
Number of individuals sampled (n)
Frequency
The number of times something occurs, or a count of the number of items in a particular category
Mean
The average of a sample of numbers - x̅
Mode
Most common number in a sample
Frequency Histogram
A graph showing the frequency of quantitative observations in each of a series of ordered numerical categories.
Discrete - Categories represent each possible total count made
Continuous - Categories are arbitrary
Distribution
The shape of a data set as seen on a frequency histogram.
Hypothetical distributions with mathematical equations include the normal, Poisson and binomial
Deviate
The distance between a particular data point / observation and the mean (also known as a residual in some contexts)
Sum of Squares
(SS)
Total of all the squared deviates for a particular data set. It quantifies the magnitude of the total variability in a data set, but ignores the direction of that variability
Variance
The average size of the squared deviates in a sample; a measure of variability in a data set.
The sample variance (s²) is an estimate of the population variance (σ²)
Standard Deviation
The average size of the deviates in a data set (s)
By taking the square root of the variance, we get a measure of the variation that is not affected by sample size and is in units we understand.
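A worked sketch tying the deviate, sum of squares, variance and standard deviation cards together (plain Python; the sample is made up):

```python
import math

data = [4.0, 7.0, 6.0, 5.0, 8.0]
mean = sum(data) / len(data)                  # x-bar = 6.0
deviates = [x - mean for x in data]           # distance of each point from the mean
ss = sum(d ** 2 for d in deviates)            # sum of squares = 10.0
variance = ss / (len(data) - 1)               # sample variance s² = 2.5
sd = math.sqrt(variance)                      # standard deviation s ≈ 1.58
print(ss, variance, sd)
```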
Population
All the individuals in a particular group
Sample
A subset of the population, chosen to represent the population
Normal Distribution
(Bell curve / Gaussian distribution)
A population of continuous data can have a "normal distribution", which has certain mathematical characteristics:
- It is bell shaped / symmetrical
- About 68% of all points lie within one S.D. of the mean
Standard Error of the Mean
(SEM)
A measure of the confidence we have in our sample mean as an estimate of the real population mean (μ)
It is defined as the standard deviation of the population of sample means
(SEM = s / √n)
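A tiny check of the formula against scipy's built-in (assumes scipy; same made-up sample as in the sketch above):

```python
import math
from scipy.stats import sem

data = [4.0, 7.0, 6.0, 5.0, 8.0]
n = len(data)
mean = sum(data) / n
s = math.sqrt(sum((x - mean) ** 2 for x in data) / (n - 1))   # sample SD
print(s / math.sqrt(n), sem(data))   # both ≈ 0.707 (scipy also uses n-1)
```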
Skew
The asymmetry of a sample's distribution
Skew to the right: a long tail on the right of the distribution
Skew to the left: a long tail on the left of the distribution
A skewed distribution is not symmetrical (so not normal)
Statistical (parametric) tests
Tests which make several key assumptions about the distribution of the data from which they are calculated.
Non-Parametric Tests
Tests which make fewer assumptions about data (such as normal distribution)
Often deal with ranked data
Binomial Distribution
A good description of discrete data, but only in situations where the maximum possible count is close to the mean
Bar chart
Graph used for visualising differences between samples
Scatter graph
Type of graph normally used for visualising trends between variables
Experiment
Manipulation of a variable of interest in order to observe the effects on other variables
Control
A default where the manipulation of the variable being tested is not performed, used for comparison against the results of the experiment
Observational Experiment
A scientific study where data are collected but no manipulation is performed
Measurement Precision
A measurement is imprecise if there is unbiased measurement error; the key is that imprecision is random (you are just as likely to overestimate as to underestimate)
Measurement Accuracy
A measurement is accurate if it is free from bias - which occurs when there is systematic error in your measurements resulting in a consistent over / underestimation
Confounding Variables
A variable that influences your results in a way that may be confused with the variable you are actually interested in. Confounds are caused by a lack of independence in data points, and are avoided by measuring such variables to account for them or by using appropriate control measurements
Can be confused with a real effect
Caused by systematic, non-random variation
Noise
Caused by random variation
Can make it tricky to spot a real effect
Order effect
The order of presenting the treatments affects the dependent variable
Replication
Repetition of an experimental manipulation or observation in identical circumstances. It allows you to gauge how much background or environmental variability there is in your data, regardless of the variable you are interested in. It increases the statistical power
Effect size
The size of the difference between two means, or the steepness of the slope of a trend (a large effect = a big difference / a steep slope)
Statistical power
The degree of ability to detect the signal of an effect that you're interested in. More replication, a larger effect and lower background variability result in higher statistical power
Floor and ceiling effects
When an effect cannot rise above a certain threshold (ceiling) or fall below a certain threshold (floor). Above / below these thresholds, the signal cannot become any greater / smaller
Cause and effect
A manipulative experiment that is conducted to show that changes in A CAUSE changes in B
Otherwise, in a significant relationship, we do not know which way around cause and effect are
Reverse Causation
When causation is in the opposite direction to the hypothesis
A/D Observation
- Easier
- Cheaper / quicker
- Realistic
Tells us less about cause and effect:
- More confounding variables
- Possible reverse causation
A/D Experiment
- Difficult / time consuming
- Expensive
- Artificial (floor / ceiling effects)
Tells us more about cause and effect:
- Fewer confounding variables
- No reverse causation
Statistical Test
A test performed on your data to assess the validity of your null hypothesis
P value
The probability that the observed differences / trends could have arisen by chance, if the null hypothesis were true
Test statistic
Summarises the difference between samples
Treatment
Manipulation performed in an experiment
-Manipulated and control
Statistical Significance
When we conduct a statistical test, we compare our obtained probability (P value) to an arbitrary threshold value.
If the probability is lower than this value, we say the effect is statistically significant and we can reject our Null Hypothesis.
Threshold values (significance level)
Threshold value is dependent on the particular scientific situation, and is set before data is collected so the decision is not influenced by subjective impressions of the data
Independent samples t-test
A parametric statistical test used to test for a difference between the means of two independent samples of continuous data - are the samples from the same population with a single mean
Independent samples t test - t test statistic
The test statistic 't' tells us about the size of the difference between the two samples.
t is big when variance is small and difference between means is big
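A minimal sketch (assumes Python with scipy; both samples are invented):

```python
from scipy.stats import ttest_ind

sample_a = [5.1, 4.9, 5.6, 5.2, 4.8, 5.4]
sample_b = [6.0, 6.3, 5.8, 6.1, 6.4, 5.9]
t, p = ttest_ind(sample_a, sample_b)   # equal variances assumed by default
print(t, p)                            # big |t| and small p: reject H0
```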
Degrees of Freedom
A modified form of the sample size n, it represents the power of the statistical test
- df = n1 + n2 - 2 for the independent samples t-test
The 2 is the number of parameters being estimated in the test (the two sample means)
Two tailed test
General; "there is / is not a relationship"
- We are interested in both positive and negative deviations of the test statistic
Type I error
Rejection of the Null hypothesis when it is in fact true.
With a threshold of 0.05, there is a 5% chance we will make a Type I error
Type II error
The failure to reject the Null Hypothesis when it is in fact false.
The chance is influenced by experimental design, sample size, the chosen test and our threshold value
Independence
Data points are independent if they have nothing special in common except for the treatment or variable of interest.
Non-independence
Arises from repeated measures or non-random sampling
Causes confounded results; we cannot tell if observed differences are the result of the treatments or of other confounding variables
Repeated Measures
Repeated observations made on the same subjects in an experiment.
Paired Design
An experimental design for collection of non-independent samples.
Before and after experiment
A paired design
Data are collected from the same group of individuals, before and after an experimental treatment. In this situation, the two data points are non-independent of each other, and the animals themselves act as the control.
Examines average change in variable
Time could still be confounding
Welch two-sample t-test
Used when the variances of the samples are significantly different, but the data are still normal.
There is a small tweak to the degrees of freedom
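The same scipy call covers Welch's version; passing equal_var=False applies the degrees-of-freedom tweak (a sketch with invented data):

```python
from scipy.stats import ttest_ind

low_var = [5.0, 5.1, 4.9, 5.0, 5.1]
high_var = [3.0, 8.0, 4.5, 7.5, 5.5]
print(ttest_ind(low_var, high_var, equal_var=False))   # Welch's t-test
```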
Paired samples Wilcoxon test
Non-parametric equivalent of the paired t-test.
It assumes the samples are paired, rather than independent
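A sketch using scipy's paired signed-rank test on invented before/after data:

```python
from scipy.stats import wilcoxon

before = [10, 12, 14, 16, 18, 20, 22, 24]
after = [11, 14, 17, 20, 23, 26, 29, 32]   # every subject increased
stat, p = wilcoxon(before, after)          # ranks the paired differences
print(stat, p)                             # small p: a consistent change
```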
Homogeneity of variance
When the variances in each sample in a statistical test are assumed to be the same (homogenous)
Transformation
We often assume data are normally distributed - if not, we can try to transform the data in order to obtain a normal distribution
- Square root / logarithm of data
Arcsine transformation
Proportion data is rarely normally distributed; taking the arcsine of the square root of the proportions transforms the distribution of the collected data towards normality
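A one-line version of the transform (assumes numpy; the proportions are made up):

```python
import numpy as np

proportions = np.array([0.02, 0.10, 0.25, 0.50, 0.75, 0.90, 0.98])
transformed = np.arcsin(np.sqrt(proportions))   # stretches the squashed tails
print(transformed)
```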
Levene's test
A test for the homogeneity of variance of samples
H0 is that the variances ARE THE SAME
Shapiro-Wilk test
A test for normality of sample distribution
H0 is that data ARE NORMALLY DISTRIBUTED
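A sketch of both assumption checks above (assumes scipy; the samples are invented). Large P values mean we cannot reject either H0:

```python
from scipy.stats import levene, shapiro

a = [5.1, 4.9, 5.6, 5.2, 4.8, 5.4, 5.0, 5.3]
b = [6.0, 6.3, 5.8, 6.1, 6.4, 5.9, 6.2, 6.0]

print(levene(a, b))    # H0: the variances are the same
print(shapiro(a))      # H0: the data are normally distributed
```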
Two-sample Wilcoxon test
Non-parametric equivalent of the independent samples t-test
Examines the difference between two samples of ranked data
H0 is that two samples come from a single population with a single mean rank
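A sketch (assumes scipy, which exposes this as the Wilcoxon rank-sum test; it is equivalent to the Mann-Whitney U test):

```python
from scipy.stats import ranksums

a = [3, 5, 4, 6, 2, 5]
b = [8, 9, 7, 10, 8, 9]
print(ranksums(a, b))   # small p: the two mean ranks differ
```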
Chi-Squared test (χ²)
A test used to examine differences between observed and expected counts / frequencies - we are asking if the frequencies of individual observations made in two or more categories are significantly different from the frequencies we would expect to find if H0 were true
df = (a - 1), where a is the number of categories
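A sketch (assumes scipy; the counts are invented) testing observed counts in three categories against an equal-frequency expectation:

```python
from scipy.stats import chisquare

observed = [50, 30, 20]        # counts in a = 3 categories
result = chisquare(observed)   # expected defaults to equal frequencies
print(result)                  # df = a - 1 = 2
```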
Contingency table
A table of observed counts or frequencies in a number of categories
Causal relationship
A trend / relationship between two variables where one variable causes changes in the other variable
Pearson's Correlation Coefficient
Parametric statistic used to test the significance of correlations between two variables
Both variables must be normally distributed and have a linear relationship
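A sketch (assumes scipy; the invented data are roughly linear):

```python
from scipy.stats import pearsonr

x = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
y = [2.2, 4.1, 5.9, 8.3, 9.8, 12.1]
r, p = pearsonr(x, y)
print(r, p)   # r near +1: a strong positive linear correlation
```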
Data dredging
Use of certain statistics to test large numbers of possible relationships between variables in the absence of specific hypotheses formulated in advance
- useful for spotting patterns and generating new hypotheses
ANOVA (analysis of variance)
A parametric statistical test for differences between any number of groups or samples; it can analyse differences in samples caused by more than one variable.
Factor (ANOVA)
An independent variable affecting a sample in analysis of variance
Level (ANOVA)
Each different value that a factor in ANOVA can take
Multi-way ANOVA
ANOVA that tests more than one null hypothesis simultaneously
F ratio
Statistic used to test the null hypothesis in ANOVA, calculated from the among- and within-group SS (as mean squares: F = MS among / MS within)
It allows us to compare the relative amounts of variation among and within groups
A large F shows a large variation among groups compared to within groups
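A sketch of a one-way ANOVA via scipy (three invented groups):

```python
from scipy.stats import f_oneway

g1 = [4.8, 5.1, 5.3, 4.9, 5.0]
g2 = [5.9, 6.2, 6.0, 6.3, 6.1]
g3 = [7.1, 6.8, 7.0, 7.2, 6.9]
F, p = f_oneway(g1, g2, g3)
print(F, p)   # large F: variation among groups >> variation within groups
```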
Grand Mean
X bar bar
Mean of all the data points in all the groups / samples in ANOVA
Group Mean
X bar
Mean of the data points in an individual group / sample in ANOVA
SS among
The total amount of variation among (between) groups - adding up the squared differences between each group mean and the grand mean
SS within
Total amount of variation within groups - adding up the squared differences between each data point and the relevant group mean.
MS within
The average size of the difference between the data points and the relevant group mean
ANOVA table
Results of ANOVA presented in a table, showing among / within SS, MS, df, F and P