Quantative methods for economics

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/43

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

44 Terms

1
New cards

population

the complete set of units (people, firms, etc.) we want

to study

2
New cards

sample

a subset of the population examined to learn about the

population.

3
New cards

representative sample

a sample that mirrors the population on

relevant characteristics

4
New cards

sampling bias 

systematic under- or over-representation of some

population members.

5
New cards

statistic

either a function applicable to data or the result of that

function, i.e. a number

6
New cards

parameter

a numerical characteristic of a population that a

statistic aims to estimate

7
New cards

Qualitive (categorical) data

the result of categorising or describing attributes of a population

8
New cards

Quantative (numerical) data

the result of counting or measuring attributes of a population

9
New cards

variable 

a characteristic of a unit being observed that may assume more than one set of values for each member of the population 

10
New cards

numerical variable

takes on values with equal units such as petals per flower

11
New cards

categorical variable

place a person or thing into a category such as colour of flower

12
New cards

data

the observed value of the variable(s)

13
New cards

quantative discrete variables

take on only certain numerical values, e.g. calls per week 

14
New cards

quantative continuous variables

take on all values in a defined range, e.g, length, weight, time

15
New cards

median

middle value seperating the greater and lesser halves of a data set

16
New cards

mode

most frequent value in a data set

17
New cards

function

a rule that assigns to each input exactly one output. it comes with a domain (allowed inputs) and a codomain (possible outputs). The set of outputs actually attained is the range. (image)

18
New cards

Domain and range

a function that maps every element in the domain to exactly 1 element in the range. Although each input can be sent to only one output, 2 different inputs can be sent to same output

19
New cards

statistical functions

when we aggregate data, we take a high dimensional domain and map it to a low dimensional range.

20
New cards

bar graph

the length of the bar for each category is proportional to the number or percent of individuals in each category. Bars may be vertical or horizontal. include the zero in the bar chart

21
New cards

simple random sampling

picking individuals out of proportion with equal chance

22
New cards

the problem, sampling bias

some members of population are not as likely to be chosen as others and we do not account for it

23
New cards

common type of sampling bias

self-selection, exclusion, survivorship

24
New cards

simple random sample

any group of individuals is equally likely to be chosen as any other groups of individuals

25
New cards

proportionate stratisified sample

divide the population into groups called strata and then take a proportionate number from each stratum. Advantage: sample is representative along the characteristic used for stratisfication

26
New cards

disproportionate stratisfied sample

over-sample (pick individuals with a higher chance from) groups with large variance, e.g, smaller groups. Leads to biased results if not adjusted

27
New cards

cluster sample

divide population into clusters (groups) and then randomly select some of the clusters. Include all the members from these clusters

28
New cards

convinience sample

use results that are readily available (already collected). cheaper but might be biased 

29
New cards

distribution

a description of how often each outcome occurs

30
New cards

empirical cumulative distribution function(ECDF)

A standard representation for an empirical (observed) distribution

31
New cards

Histograms

divides the span of our data into non-overlapping bins of the same size. Then, for each bin, we count the number of values that fall into that interval. The histogram plots these counts as bars with the base of the bar defined by the intervals, histograms are preffered over EDCFs as they are easier to interpret

32
New cards

Smooth density plots

basically smoothing out the edges of a histogram

33
New cards

Advantage of smooth density plot

prettier and easier to compate several distributions as less messy

34
New cards

Disadvantage of smooth density plot

Interpretation slightly more difficult, form is dependent on underlying smoothing, never a good idea to use methods we dont understand well 

35
New cards

what does it mean when data is not pretty?

it means asymetric for instance or carrying outlines

36
New cards

percentiles

the values for which p=0.01, 0.02,…0.99 of the data are less than or equal to that value respectively

37
New cards

median (percentile)

the most often used percentile is the 50% percentile, called the median

38
New cards

quaritiles

these are the percentiles at p= 0.025, 0.5, 0.75

39
New cards

range

the difference between the largest value and smallest value

40
New cards

box plot

provides a 5 number summary for data composed of the range along with quarities

41
New cards

stratification

often when we divide observations into groups based on the values of one or more variables associated with these observations

42
New cards

variance 

a measure of variation in the population. It is defined as the sum of squared deviations from the mean divided by the number of units 

43
New cards

standard deviation

the square root of the variance

44
New cards

what is the function of standard deviation?

it provides a numerical measure of the overall amount of variation in a data set, always positive or zero, it is small when the data are all concentrated close to the mean, exhibiting little variation or spread, can also be used to determine whether a particular data value is close to or far from the mean