Core Skills

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/115

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

116 Terms

1
New cards

Continuous Data

Quantitative measurements on a continuous scale

2
New cards

Regression Equation

The line of best fit is described by a regression equation in the form of y = mx + c
WE use this to predict values of Y for a particular value of X (but only within the range of X values of our data set, as we cannot be sure the relationship is the same beyond the limits of our data set)

3
New cards

Nominal Data

Data in the form of categories with names. Data is non-quantative (its often counted to produce a discrete value)

4
New cards

Ordinal data

Data that is ranked / on a rating scale. Data is not quantitive because we do not know the size of the difference between categories

5
New cards

Descriptive Characteristics

Measures calculated from a data set which summarise some characteristic of the data (Quantify patterns from findings)

6
New cards

Median

The middle number in a sample when they are placed in order. If there are two numbers in the middle, the median is an average of the two

7
New cards

Measures of central Tendency

The mean, median and mode

8
New cards

Poisson distribution

Common distribution for discrete data; the shape is dependent on the mean.
- Mean is near to zero, heavily skewed normal distribution
- Mean is big, looks normal distribution

9
New cards

One tailed test

Specific; states "no positive / negative relationship"
- We are interested in only positive or only negative deviations of the test statistics
P value is HALVED

10
New cards

Pseudo-replication

Use of non-independent data points as if they were actually independent

11
New cards

Trend

A relationship between two variables, positive or negative

12
New cards

Correlation

A trend / relationship of two variables where changes coincide, yet casualty is not important / related
Covary variables

13
New cards

Spearman's rank correlation coefficient (rho)

Non-parametric statistic used to test the significance of correlations between variables
Can be used when normality / linearity violated

14
New cards

MS among

The average size of the difference between group means and grand means

15
New cards

Covariate

The word used to describe a continuous independent variable in situations where there is a mixture of continuous and independent variables, such as during ANCOVA

16
New cards

Induction

The derivation of general ideas from specific observations.

17
New cards

Hypothetico-deductive reasoning

An alternative to inductive reasoning; it argues that there is no way of proving a hypothesis to be true.

18
New cards

Hypothesis

An idea that is tentatively put forward to explain an observation. It may be generated by or contribute to a more general theory.

19
New cards

Theory

A set of general ideas or rules which are used to explain a group of observations.

20
New cards

Paradigm

A whole way of thinking / viewing the world

21
New cards

Paradigm Shift

A dramatic change in the way in which we think about a subject on science, when the evidence has accumulated in favour of rejecting a previous set of hypotheses or theories

22
New cards

Null Hypothesis

The form of hypothesis that we test formally, that predicts that nothing will happen / no effect will be observed / there is no difference or relationship between the two variables.

23
New cards

Statisitcs

The branch of mathematics that scientists use to provide a more objective assessment of patterns in data collected from experiments or observations

24
New cards

Sample Size

Number of individuals sampled (n)

25
New cards

Frequency

The number of times something occurs, or a count of the number of items in a particular category

26
New cards

Mean

The average of a sample of numbers - x̅

27
New cards

Mode

Most common number in a sample

28
New cards

Frequency Histogram

A graph showing the frequency of quantitative observations in each of a series of ordered numerical categories.
Discrete - Categories represent each possible total count made
Continuous - Categories are arbitrary

29
New cards

Distribution

The shape of a data set as seen on a frequency histogram.
Hypothetical distributions with mathematical equations includes normal, poisson and binomial

30
New cards

Deviate

The distance between a particular data point / observation and the mean ( also known as a residual in some context )

31
New cards

Sum of Squares

Total of all the squared deviates for a particular data set. It quantifies the magnitude of the total variability in a data set, but ignores the direction of that variability
(SS)

32
New cards

Variance

The average size of the squared devotees in a sample; a measure of variability in a data set.
Sample variance (s2) is an estimation of the population variance (σ2)

33
New cards

Standard Deviation

The average size of the deviates in a data set (s)
By squaring rooting the variance, the get a measure of the variation that is not affected by sample size and is in units we understand.

34
New cards

Population

All the individuals in a particular group

35
New cards

Sample

A subset of the population, chosen to represent the population

36
New cards

Normal Distribution

(Bell curve / Gaussian distribution)
A population of continuous data can have a "normal distribution' which attains a certain mathematical characteristic.
- Its bell shaped / symmetrical
- 68.5% of all points = within one S.D from the mean

37
New cards

Standard Error of the Mean

(SEM)
A measure of the confidence we have in our sample mean as an estimate of the real population mean (μ)
It is defined as standard deviation of a population of sample means
(SEM = S / root N)

38
New cards

Skew

The skew of a distribution of a sample
Skew to the right, long tail to the distribution on the right
Skew to the left, long tail to the distribution on the left
Not symmetrical (so not normal)

39
New cards

Statistical (parametric) tests

Tests which make several key assumptions about the distribution of the data from which they are calculated.

40
New cards

Non-Parametric Tests

Tests which make fewer assumptions about data (such as normal distribution)
Often deal with ranked data

41
New cards

Binomial Distribution

Good description of discrete data but only in situations where maximum possible count is close to the mean

42
New cards

Bar chart

Graph used for visualising differences between samples

43
New cards

Scatter graph

Type of graph normally used for visualising trends between variances

44
New cards

Experiment

Manipulation of a variable of interest in order to observe the effects on other variables

45
New cards

Control

A default where the manipulation of the variable being tested is not performed, used for comparison against the results of the experiment

46
New cards

Observational Experiment

A scientific study where data are collected but no manipulation is performed

47
New cards

Measurement Precision

A measurement is not precise if there is unbiased measurement error - they key is that imprecision is random ( just as likely to overestimate as you are to underestimate

48
New cards

Measurement Accuracy

A measurement is accurate if it is free from bias - which occurs when there is systematic error in your measurements resulting in a consistent over / underestimation

49
New cards

Confounding Variables

A variable that influences your results in a way that may be confused with the variable in which you are actually interested in. They are caused by a lack of independence in data points, avoided by measuring such variables to account for them or using appropriate control measurements

Can be confused with a real effect
Caused by systematic, non-random variation

50
New cards

Noise

Cause via random variation
Can make it tricky to spot a real variation

51
New cards

Order effect

The order of presenting the treatments affects the dependent variable

52
New cards

Replication

Repetition of an experimental manipulation or observation in identical circumstances. It allows you to gauge how much background or environmental variability there is in your data, regardless of the variable you are interested in. It increases the statistical power

53
New cards

Effect size

A large difference between two means / a steep slope of a trend

54
New cards

Statistical power

The degree of ability to detect the signal of an effect that you're interested in. More replication, large effect and low background variability results in a higher statistical power

55
New cards

Floor and ceiling effects

When a variable produces an effect below a certain (ceiling) or above (floor) a certain threshold. Above / below these thresholds, the signals cannot become any greater / lesser

56
New cards

Cause and effect

A manipulative experiment that is conducted to show that changes in A CAUSE changes in B
We otherwise do no know which way around in a significant relationship, cause and effect are

57
New cards

Reverse Causation

When causation is in the opposite direction to the hypothesis

58
New cards

A/D Observation

-easier
-Cheaper / Quicker
-Realistic
Tells us less about the cause and effect
-more confounding variables
-Possible Reverse Causation

59
New cards

A/D Experiment

-Difficult / Time consuming
-Expensive
-Artificial (Floor/Ceiling)
Tells us more about cause and effect
-Less confounding variables
-No reverse causation

60
New cards

Statistical Test

A test perfumed on your data to assess the validity of your Null hypothesis

61
New cards

P value

The probability that differences / trends could have arisen by chance, if the null hypothesis was true

62
New cards

Test statistic

Summarises the difference between samples

63
New cards

Treatment

Manipulation performed in an experiment
-Manipulated and control

64
New cards

Statistical Significance

When we conduct a statistical test, we compare our obtained probability (P value) and compare it to our arbiter threshold value.
If the probability is lower than this value, we say the effect is statistically significant and we can reject our Null Hypothesis.

65
New cards

Threshold values (significance level)

Threshold value is dependent on the particular scientific situation, and is set before data is collected so the decision is not influenced by subjective impressions of the data

66
New cards

Independent samples t-test

A parametric statistical test used to test for a difference between the means of two independent samples of continuous data - are the samples from the same population with a single mean

67
New cards

Independent samples t test - t test statistic

The test statistic 't' tells us about the size of the difference between the two samples.
t is big when variance is small and difference between means is big

68
New cards

Degrees of Freedom

A modified form of the sample size n, it represents the power of the statistical test
- df = n1 + n2 - ( ? )
This ? relates to the number of parameters being estimated in the test

69
New cards

Two tailed test

general; "there is / is not a relationship"
- We are interested in both positive and negative devotions of the test statistic

70
New cards

Type I error

Rejection of the Null hypothesis when it is in fact true.
At a p value lower than 0.05, there is a 5% chance we will make a Type I error

71
New cards

Type II error

The failure to reject the Null Hypothesis when it is in fact false.
The chance is influenced by experimental design, sample size, the chosen test and our threshold value

72
New cards

Independence

Data points are independent if they have nothing special in common except for the treatment or variable of interest.

73
New cards

Non-independence

Arises from repeated measures or non-random sampling
Causes confounded results; we cannot tell if observed differences are result of treatments or other confounding variables

74
New cards

Repeated Measures

Repeated observations made on the same subjects in an experiment.

75
New cards

Paired Design

An experimental design for collection of non-independent samples.

76
New cards

Before and after experiment

A paired design
Data are collected from the same group of individuals, before and after an experimental treatment. In this situation, the two data points are non-independent of each other, and the animals themselves act as the control.
Examines average change in variable
Time could still be confounding

77
New cards

Welch two-sample t-test

Used when variances of sample are significantly different, but data are still normal.
There is a small tweak to the degrees of freedom

78
New cards

Paired samples Wilcoxon test

Non-parametric equivalent of the paired t-test.
It assumes the samples are paired, rather than independent

79
New cards

Homogeneity of variance

When the variances in each sample in a statistical test are assumed to be the same (homogenous)

80
New cards

Transformation

We often assume data is normally distributed - if not we can try to transform in order to maintain a normal distribution
- Square root / logarithm of data

81
New cards

Arcsine transformation

Proportion data is rarely normally distributed; taking the arcsine f the square root of the proportions will allow us to transform the distribution of the collected data

82
New cards

Levenes test

A test for the homogeneity of variance of samples
H0 is that the variances ARE THE SAME

83
New cards

Shapiro-wilk test

A test for normality of sample distribution
H0 is that data ARE NORMALLY DISTRIBUTED

84
New cards

Two-sample wilcoxon test

Non-parametric equivalente of independent samples t test
Examines the difference between two samples of ranked data
H0 is that two samples come from a single population with a single mean rank

85
New cards

Chi-Sqaured test (X2)

A test used to examine differences between observed and expected counts / frequencies - we are asking if the frequencies of individual observations made in two or more categories are significantly different from the frequencies we would expect to find if H0 was true

df = (a-1)

86
New cards

contingency table

A table of observed counts or frequencies in a number of categories

87
New cards

Causal relationship

A trend / relationship between two variables where one variable causes changes in the other variable

88
New cards

Pearsons Correlation Coefficient

Parametric statistic used to test the significance of correlations between two variables
Both variables must be normally distributed and have a linear relationship

89
New cards

Data dredging

Use of certain statistics to test large numbers of possible relationships between variables in the absence of specific hypotheses formulated in advance
- useful for spotting patterns and generating new hypotheses

90
New cards

ANOVA (analysis of variance)

A Parametric statistical test for differences between any number of groups or samples, and can analyse differences in samples causes by more than one variable.

91
New cards

Factor (anova)

An independent variable affecting a sample in analysis of variance

92
New cards

Level (anova)

Each different value that each factor of anova could take

93
New cards

Multi-way ANOVA

ANOVA that tests more than one null hypothesis simultaneously

94
New cards

F ratio

Statistic used to test the null hypothesis in ANOVA, from the between / within SS
It allows us to compare the relative amounts of variation among and within groups
Large F shows a large variation between groups compared to within groups

95
New cards

Grand Mean

X bar bar
Mean of all the data points in all the groups / samples in ANOVA

96
New cards

Group Mean

X bar
Mean of the data points in an individual group / sample in ANOVA

97
New cards

SS among

The total amount of variation among (between) groups - adding up the squared differences between each group mean and the grand mean

98
New cards

SS within

Total amount of variation within groups - adding up squared differences between each data point and the relative group net.

99
New cards

MS within

The average size of the difference between the data points and the relative group mean

100
New cards

ANOVA table

Results of ANOVA presented in a table, showing among / within SS, MS, df, F and P