Quantitative methods for economics

44 Terms

New cards

population

the complete set of units (people, firms, etc.) we want to study

New cards

sample

a subset of the population examined to learn about the population

New cards

representative sample

a sample that mirrors the population on relevant characteristics

New cards

sampling bias

systematic under- or over-representation of some population members

New cards

statistic

either a function applicable to data or the result of that function, i.e. a number

New cards

parameter

a numerical characteristic of a population that a statistic aims to estimate

New cards

Qualitative (categorical) data

the result of categorising or describing attributes of a population

New cards

Quantitative (numerical) data

the result of counting or measuring attributes of a population

New cards

variable

a characteristic of a unit being observed that can take different values for different members of the population

New cards

numerical variable

takes on numerical values measured in equal units, e.g. petals per flower

New cards

categorical variable

places a person or thing into a category, e.g. colour of flower

New cards

data

the observed values of the variable(s)

New cards

quantitative discrete variables

take on only certain numerical values, e.g. calls per week

New cards

quantitative continuous variables

take on all values in a defined range, e.g. length, weight, time

New cards

median

middle value separating the greater and lesser halves of a data set

New cards

mode

most frequent value in a data set
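
A minimal sketch of the median and mode cards, using Python's standard `statistics` module. The data values are made up for illustration.

```python
from statistics import median, multimode

# Hypothetical data set (e.g. calls per week for seven people)
data = [1, 2, 2, 4, 7, 9, 10]

# Median: the middle value of the sorted data (here the 4th of 7 values)
middle = median(data)        # 4

# Mode: the most frequent value; multimode returns a list because
# a data set can have more than one most-frequent value
most_frequent = multimode(data)   # [2]

print(middle, most_frequent)
```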

New cards

function

a rule that assigns to each input exactly one output. It comes with a domain (allowed inputs) and a codomain (possible outputs). The set of outputs actually attained is the range.

New cards

Domain and range

a function maps every element in the domain to exactly 1 element in the range. Although each input can be sent to only one output, 2 different inputs can be sent to the same output

New cards

statistical functions

when we aggregate data, we take a high-dimensional domain and map it to a low-dimensional range, e.g. the mean maps many observations to a single number

New cards

bar graph

the length of the bar for each category is proportional to the number or percent of individuals in that category. Bars may be vertical or horizontal. Always include zero on the axis of a bar chart

New cards

simple random sampling

picking individuals out of the population with equal chance

New cards

the problem, sampling bias

some members of the population are not as likely to be chosen as others and we do not account for it

New cards

common types of sampling bias

self-selection, exclusion, survivorship

New cards

simple random sample

any group of individuals is as likely to be chosen as any other group of the same size
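
A minimal sketch of simple random sampling with Python's standard `random` module. The population of unit IDs and the sample size are assumptions for illustration; the seed only makes the draw repeatable.

```python
import random

# Hypothetical population: unit IDs 1..100
population = list(range(1, 101))

# Seeded generator so the example is reproducible
rng = random.Random(0)

# Draw 10 units without replacement; every group of 10 units
# is equally likely to be the one drawn
sample = rng.sample(population, 10)

print(sample)
```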

New cards

proportionate stratified sample

divide the population into groups called strata and then take a proportionate number from each stratum. Advantage: the sample is representative along the characteristic used for stratification
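
A minimal sketch of proportionate stratified sampling. The function name `proportionate_stratified_sample`, the strata and the firm IDs are hypothetical. Rounding is one simple way to allocate the sample, and in general the rounded counts need not add up exactly to the requested size.

```python
import random

def proportionate_stratified_sample(strata, sample_size, seed=0):
    """Draw from each stratum in proportion to its share of the population."""
    rng = random.Random(seed)
    total = sum(len(units) for units in strata.values())
    sample = {}
    for name, units in strata.items():
        # Stratum's share of the population times the overall sample size
        k = round(sample_size * len(units) / total)
        sample[name] = rng.sample(units, k)
    return sample

# Hypothetical population of 100 firms split by size
strata = {
    "small_firms": [f"S{i}" for i in range(60)],
    "medium_firms": [f"M{i}" for i in range(30)],
    "large_firms": [f"L{i}" for i in range(10)],
}

drawn = proportionate_stratified_sample(strata, sample_size=20)
print({name: len(units) for name, units in drawn.items()})
# {'small_firms': 12, 'medium_firms': 6, 'large_firms': 2}
```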

New cards

disproportionate stratified sample

over-sample (pick individuals with a higher chance from) groups with large variance, e.g. smaller groups. Leads to biased results if the over-sampling is not adjusted for

New cards

cluster sample

divide population into clusters (groups) and then randomly select some of the clusters. Include all the members from these clusters
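
A minimal sketch of cluster sampling. The cluster names and members are made up; the point is that clusters are drawn at random and then every member of a chosen cluster is included.

```python
import random

# Hypothetical population already divided into clusters
clusters = {
    "school_A": ["a1", "a2", "a3"],
    "school_B": ["b1", "b2"],
    "school_C": ["c1", "c2", "c3", "c4"],
    "school_D": ["d1", "d2"],
}

rng = random.Random(1)

# Step 1: randomly select some of the clusters
chosen = rng.sample(sorted(clusters), 2)

# Step 2: include all members of the selected clusters
sample = [member for name in chosen for member in clusters[name]]

print(chosen, sample)
```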

New cards

convenience sample

use results that are readily available (already collected). Cheaper, but might be biased

New cards

distribution

a description of how often each outcome occurs

New cards

empirical cumulative distribution function (ECDF)

A standard representation for an empirical (observed) distribution: for each value x, it gives the share of observations that are less than or equal to x
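
A minimal sketch of an ECDF, assuming the usual definition F(x) = share of observations less than or equal to x. The helper name `ecdf` and the data are hypothetical.

```python
def ecdf(data):
    """Return a function F where F(x) is the share of observations <= x."""
    n = len(data)

    def F(x):
        return sum(1 for value in data if value <= x) / n

    return F

# Hypothetical observed data
F = ecdf([1, 2, 2, 4, 7])

print(F(0))  # 0.0, no observation is <= 0
print(F(2))  # 0.6, three of five observations are <= 2
print(F(7))  # 1.0, all observations are <= 7
```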

New cards

Histograms

divides the span of our data into non-overlapping bins of the same size. Then, for each bin, we count the number of values that fall into that interval. The histogram plots these counts as bars, with the base of each bar defined by its interval. Histograms are often preferred over ECDFs as they are easier to interpret
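
A minimal sketch of the binning step behind a histogram. The function name `histogram_counts` and the data are hypothetical, and it assumes the data contain at least two distinct values. Putting the maximum into the last bin is one common convention, not the only one.

```python
def histogram_counts(data, n_bins):
    """Count how many values fall into each of n_bins equal-width bins."""
    lo, hi = min(data), max(data)
    width = (hi - lo) / n_bins  # assumes hi > lo
    counts = [0] * n_bins
    for value in data:
        # Index of the bin containing value; the maximum goes in the last bin
        index = min(int((value - lo) / width), n_bins - 1)
        counts[index] += 1
    return counts

# Hypothetical data spanning 1..10, split into bins [1, 4), [4, 7), [7, 10]
data = [1, 2, 2, 3, 5, 6, 8, 9, 9, 10]
counts = histogram_counts(data, n_bins=3)

print(counts)  # [4, 2, 4]
```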

New cards

Smooth density plots

basically smoothing out the edges of a histogram

New cards

Advantage of smooth density plot

prettier, and easier to compare several distributions because the plot is less messy

New cards

Disadvantage of smooth density plot

Interpretation is slightly more difficult, and the form depends on the underlying smoothing. It is never a good idea to use methods we don't understand well

New cards

what does it mean when data is not pretty?

it means, for instance, that the data are asymmetric or contain outliers

New cards

percentiles

the values for which a share p = 0.01, 0.02, …, 0.99 of the data is less than or equal to that value, respectively
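
A minimal sketch of a percentile computed straight from this definition, using the nearest-rank method: take the smallest data value such that at least a share p of the data is less than or equal to it. This is only one of several conventions (others interpolate between values). The function name `percentile` and the data are hypothetical.

```python
import math

def percentile(data, p):
    """Nearest-rank percentile: smallest value with at least share p of data <= it."""
    ordered = sorted(data)
    # Number of observations that must be <= the returned value
    rank = max(math.ceil(p * len(ordered)), 1)
    return ordered[rank - 1]

# Hypothetical data
data = [15, 20, 35, 40, 50]

print(percentile(data, 0.25))  # 20
print(percentile(data, 0.50))  # 35
print(percentile(data, 0.75))  # 40
```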

New cards

median (percentile)

the most often used percentile is the 50th percentile, called the median

New cards

quartiles

these are the percentiles at p = 0.25, 0.5, 0.75

New cards

range

the difference between the largest value and smallest value

New cards

box plot

provides a five-number summary of the data: the minimum and maximum (which give the range) along with the three quartiles
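
A minimal sketch of the five-number summary behind a box plot, using `statistics.quantiles` from the standard library. The data are made up, and the `"inclusive"` method is an assumption: quartile conventions differ between textbooks and software, so other tools may report slightly different values.

```python
from statistics import quantiles

# Hypothetical data
data = [1, 2, 3, 4, 5, 6, 7, 8, 9]

# Quartiles: the percentiles at p = 0.25, 0.5 and 0.75
q1, q2, q3 = quantiles(data, n=4, method="inclusive")

# Five-number summary: minimum, three quartiles, maximum
five_numbers = (min(data), q1, q2, q3, max(data))

# Range: largest value minus smallest value
data_range = max(data) - min(data)

print(five_numbers)  # (1, 3.0, 5.0, 7.0, 9)
print(data_range)    # 8
```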

New cards

stratification

dividing observations into groups based on the values of one or more variables associated with these observations

New cards

variance

a measure of variation in the population. It is defined as the sum of squared deviations from the mean divided by the number of units

New cards

standard deviation

the square root of the variance
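
A minimal sketch of the variance and standard deviation cards, computed directly from the definitions (sum of squared deviations from the mean, divided by the number of units). The data are made up; `statistics.pvariance` is used only as a cross-check.

```python
import math
from statistics import pvariance

# Hypothetical population of 8 units
data = [2, 4, 4, 4, 5, 5, 7, 9]

mean = sum(data) / len(data)                      # 5.0

# Variance: sum of squared deviations from the mean, divided by N
variance = sum((x - mean) ** 2 for x in data) / len(data)   # 4.0

# Standard deviation: square root of the variance
std_dev = math.sqrt(variance)                     # 2.0

print(variance, std_dev)
print(pvariance(data))  # same population variance from the standard library
```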

New cards

what is the function of standard deviation?

it provides a numerical measure of the overall amount of variation in a data set. It is always positive or zero, and it is small when the data are all concentrated close to the mean, exhibiting little variation or spread. It can also be used to determine whether a particular data value is close to or far from the mean