BUSN5000 Midterm 1

54 Terms

Mutate

Tidyverse function for creating new variables.

Filter

Tidyverse function for keeping only the rows that match a certain condition

Select

Tidyverse function for keeping or dropping variables (columns)

Group_By

Tidyverse function for splitting data into groups so that later operations are applied within each group

Summarise

Tidyverse function for collapsing data into basic summary statistics (one row per group when the data is grouped)
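
As a rough illustration of what the five verbs above do, here is a plain-Python sketch on a small made-up table. It is only an analog: the tidyverse functions themselves are R, and the rows, column names (`dept`, `salary`, `years`, `salary_k`), and values below are hypothetical.

```python
from collections import defaultdict

# Hypothetical records; each dict is one row of the table.
rows = [
    {"name": "Ana", "dept": "Sales", "salary": 50000, "years": 2},
    {"name": "Ben", "dept": "Sales", "salary": 60000, "years": 5},
    {"name": "Cy", "dept": "Ops", "salary": 55000, "years": 3},
    {"name": "Di", "dept": "Ops", "salary": 45000, "years": 1},
]

# mutate: add a new variable computed from existing ones.
mutated = [{**r, "salary_k": r["salary"] / 1000} for r in rows]

# filter: keep only the rows that match a condition.
filtered = [r for r in mutated if r["years"] >= 2]

# select: keep only some of the variables (columns).
selected = [{"dept": r["dept"], "salary_k": r["salary_k"]} for r in filtered]

# group_by: split the rows into groups by a key.
groups = defaultdict(list)
for r in selected:
    groups[r["dept"]].append(r["salary_k"])

# summarise: collapse each group to a single summary value (the mean).
summary = {dept: sum(vals) / len(vals) for dept, vals in groups.items()}
print(summary)
```

Each step takes the previous step's output as its input, which mirrors how the verbs are usually chained in a pipeline.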

Population Average

The true average across the entire population; the quantity we are trying to figure out

Sample average

The average of the sample we actually have; used to estimate the population average.

records

A data set is made up of _____ that contain information on a specific entity.

fields

Each record is made of _____ that contain measurements of known types.

Panel Data

Data collected over time on multiple entities, such as individuals, firms, or countries.

Acquire, Transform, Analyze, Communicate

The 4 stages of analysis

Cross Section

Many units observed at a particular time

Time Series

A single unit observed over multiple time periods

Data Set

Multiple data tables structured for a particular analysis

Database

A collection of tables where each table has some known and meaningful relationship to the other tables.

Volume

A word to describe the literal size and scale of data

Velocity

A word to describe the speed of generation, collection, and storage of data

Variety

A word to describe the complexity of sources and forms of data

Veracity

A word to describe the degree of consistency and completeness of data.

Content

What a variable measures

Validity

Whether a variable measures what it's supposed to measure

Reliability

Whether repeated measurements return the same value

Comparability

Whether a variable is measured the same way across units

Coverage

Whether all units intended for inclusion are included

Selection

Whether selected units are representative of those not covered

Data Schema

A representation of the data structure that comprises all the attributes of the data and their data types

Vector

A sequence of data elements of the same type

Matrix

A two-dimensional array of data elements of the same type

Data Frame

A tabular data structure in which each column is a same-length vector and different columns can hold different types

List

An ordered collection of objects, which may be of different types

Factor

A vector that can contain only predefined values, and is used to store categorical data

Array

A multidimensional collection of same-type data elements.

Frequentism

The approach that defines the probability of an event as its long-run relative frequency: the number of times it happens divided by the number of random trials, as the number of trials grows large

Law of Large Numbers

The idea that as the number of independent random trials grows, the sample average gets closer and closer to the true (population) average.
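
A minimal simulation sketch of this idea, assuming a fair six-sided die as the (hypothetical) random trial. The helper name `sample_average` and the trial counts are illustrative choices.

```python
import random

random.seed(0)  # fixed seed so the run is repeatable

def sample_average(n):
    """Average of n rolls of a fair six-sided die."""
    return sum(random.randint(1, 6) for _ in range(n)) / n

# The population average of a fair die is (1 + 2 + ... + 6) / 6 = 3.5.
for n in (10, 1_000, 100_000):
    print(n, sample_average(n))

# With many rolls the sample average should sit very close to 3.5.
big = sample_average(200_000)
print(abs(big - 3.5))
```

Averages from small samples can land far from 3.5, while the gap for the large samples should be tiny.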

Estimand

The thing we want to estimate

Estimator

The formula or rule we apply to the data to make an estimate

Estimate

Our best guess for the estimand from one particular sample; it differs from the estimand by bias and sampling error.
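
The three terms (estimand, estimator, estimate) can be told apart in a short sketch. The population of incomes below is simulated and entirely hypothetical, and the sample mean is just one possible choice of estimator.

```python
import random
import statistics

random.seed(1)

# Hypothetical population of 10,000 incomes (made-up numbers).
population = [random.gauss(50_000, 10_000) for _ in range(10_000)]

# Estimand: the population average (unknown in practice, known here
# only because the population is simulated).
estimand = statistics.fmean(population)

# Estimator: the rule applied to a sample, here the sample mean.
def estimator(sample):
    return statistics.fmean(sample)

# Estimate: the number the estimator returns for one particular sample.
sample = random.sample(population, 200)
estimate = estimator(sample)

print(f"estimand={estimand:.0f} estimate={estimate:.0f} "
      f"error={estimate - estimand:.0f}")
```

Drawing a different sample would give a different estimate from the same estimator, while the estimand stays fixed.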

Measurement error

When a variable’s empirical measurement does not accurately capture the thing we are interested in

CEF

Conditional expectation function: the expected value (average) of one variable given the value of another variable, E(Y|X). The workhorse of data science.

Law of Iterated Expectations

The law that states the unconditional expectation is equal to the weighted average of the conditional expectations, weighted by how likely each value of X is. E(Y) = E(E(Y|X))
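
A small numeric check of the law, using a made-up data set in which X is a group label and Y is an outcome. The overall average of Y is compared with the average of the group averages, weighted by each group's share of the observations.

```python
# Hypothetical data: (X, Y) pairs, where X is a group label.
data = [("a", 10), ("a", 20), ("b", 30), ("b", 40), ("b", 50), ("b", 60)]

n = len(data)
overall_mean = sum(y for _, y in data) / n  # E(Y)

# Weighted average of the conditional means E(Y | X = x),
# with weights equal to the shares P(X = x).
weighted = 0.0
for x in sorted({gx for gx, _ in data}):
    ys = [y for gx, y in data if gx == x]
    cond_mean = sum(ys) / len(ys)  # E(Y | X = x)
    share = len(ys) / n            # P(X = x)
    weighted += share * cond_mean

print(overall_mean, weighted)
```

Here group a has mean 15 with share 1/3 and group b has mean 45 with share 2/3, so both calculations come to 35 (up to floating-point rounding).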

Covariance

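Indicates how two variables move together: its sign gives the direction of the linear relationship. Its size depends on the variables' units, so strength is judged with the correlation (the covariance divided by the product of the standard deviations).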
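
A short sketch of how covariance behaves, using made-up schooling and wage numbers. It shows that changing a variable's units (dollars to cents) rescales the covariance but leaves the correlation unchanged. The helper functions are written out by hand for illustration.

```python
import statistics

# Hypothetical data: years of schooling and hourly wage in dollars.
school = [10, 12, 12, 14, 16, 18]
wage_dollars = [12.0, 15.0, 14.0, 18.0, 22.0, 27.0]
wage_cents = [w * 100 for w in wage_dollars]

def covariance(xs, ys):
    """Sample covariance with the n - 1 denominator."""
    mx, my = statistics.fmean(xs), statistics.fmean(ys)
    return sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / (len(xs) - 1)

def correlation(xs, ys):
    """Covariance divided by the product of the standard deviations."""
    return covariance(xs, ys) / (statistics.stdev(xs) * statistics.stdev(ys))

print(covariance(school, wage_dollars), covariance(school, wage_cents))
print(correlation(school, wage_dollars), correlation(school, wage_cents))
```

The covariance in cents is 100 times the covariance in dollars, while the two correlations agree.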

Human Capital Theory

Models education as an investment, much like an investment in any other capital asset; predicts that the age-earnings profile will be concave.

Consistency

The property of an estimator whose bias and sampling error approach zero as the sample size increases

Central Limit Theorem

The theorem that under random sampling, given enough data, the distribution of the sample average approaches a normal distribution, whatever the distribution of the underlying variable.
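
A simulation sketch of this result, assuming draws from an exponential distribution with mean 1 (a deliberately skewed, hypothetical choice). The sample size of 50 and the 5,000 repetitions are arbitrary illustration values.

```python
import random
import statistics

random.seed(2)

def sample_mean(n):
    """Mean of n draws from an exponential distribution with mean 1."""
    return statistics.fmean([random.expovariate(1.0) for _ in range(n)])

n = 50
means = [sample_mean(n) for _ in range(5_000)]

center = statistics.fmean(means)  # should be near the true mean, 1
spread = statistics.stdev(means)  # should be near 1 / sqrt(n)

# For a normal distribution, about 95% of values fall within
# 1.96 standard errors of the center.
se = 1.0 / n ** 0.5
within = sum(abs(m - 1.0) <= 1.96 * se for m in means) / len(means)

print(center, spread, within)
```

Even though single draws are strongly skewed, the share of sample means within 1.96 standard errors should come out close to the normal benchmark of 0.95.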

Confidence Interval

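A range around an estimate that expresses how close the estimate is likely to be to its target in the population; a 95% interval is built so that, across repeated samples, it contains the target about 95% of the time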
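
One way to see what a confidence level means is to draw many samples and count how often the interval contains the true value. The sketch below assumes a simulated normal population with a known mean of 10 and uses the common "estimate plus or minus 1.96 standard errors" approximation; both are illustrative assumptions.

```python
import random
import statistics

random.seed(3)
TRUE_MEAN = 10.0  # known here only because the population is simulated

def ci_95(sample):
    """Approximate 95% interval: estimate +/- 1.96 standard errors."""
    m = statistics.fmean(sample)
    se = statistics.stdev(sample) / len(sample) ** 0.5
    return m - 1.96 * se, m + 1.96 * se

hits = 0
trials = 2_000
for _ in range(trials):
    sample = [random.gauss(TRUE_MEAN, 2.0) for _ in range(100)]
    lo, hi = ci_95(sample)
    hits += lo <= TRUE_MEAN <= hi  # True counts as 1

coverage = hits / trials
print(f"share of intervals containing the true mean: {coverage:.3f}")
```

The share should land near 0.95. Any single interval either contains the true mean or does not; the 95% describes the procedure across repeated samples.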

Item Nonresponse

When specific fields are missing because a respondent who took part refused or failed to answer those items

Unit Nonresponse

When an entire record is missing because no data was collected from that person (unit) at all

Missing Completely at Random

Selection into a dataset (whether a value is missing) is completely independent of X and Y.

Missing at Random

Selection into a dataset depends on X, but not other unobserved factors.
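
The difference between the two missingness patterns can be sketched with a simulation. Everything here is hypothetical: a 0/1 variable X, an outcome Y that depends on X, and made-up keep probabilities. The last step assumes the population shares of X are known, which is what makes the correction possible.

```python
import random
import statistics

random.seed(4)

# Hypothetical population: X is 0 or 1, Y depends on X plus noise.
pop = []
for _ in range(20_000):
    x = random.randint(0, 1)
    pop.append((x, 10 + 5 * x + random.gauss(0, 2)))

true_mean = statistics.fmean([y for _, y in pop])

# Missing completely at random: every record has the same chance of being kept.
mcar = [(x, y) for x, y in pop if random.random() < 0.5]
mcar_mean = statistics.fmean([y for _, y in mcar])

# Missing at random: the chance of being kept depends only on the observed X.
keep_prob = {0: 0.3, 1: 0.9}
mar = [(x, y) for x, y in pop if random.random() < keep_prob[x]]
mar_mean = statistics.fmean([y for _, y in mar])

# Because selection depends only on X, averaging within each X cell and
# weighting by that cell's population share undoes the distortion.
reweighted = 0.0
for value in (0, 1):
    share = sum(1 for x, _ in pop if x == value) / len(pop)
    cell = statistics.fmean([y for x, y in mar if x == value])
    reweighted += share * cell

print(true_mean, mcar_mean, mar_mean, reweighted)
```

In this setup the raw average under the second pattern overstates the true mean because records with X = 1 are over-represented, while the reweighted average should come back close to it.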

Exogenous

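Anything that went wrong with sampling is external: whatever drives selection into the sample is unrelated to the unobserved factors behind the outcome.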

Endogenous

Anything that went wrong with sampling is internal: whatever drives selection into the sample is related to the unobserved factors behind the outcome.

Imputation

The process of filling in the missing values based on data you observe
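
A minimal sketch of one simple approach, mean imputation, on a made-up list of ages where missing values are recorded as `None`. This is only one of several possible methods, chosen here for brevity.

```python
import statistics

# Hypothetical survey field with missing values recorded as None.
ages = [23, None, 31, 27, None, 39]

# Mean of the values we do observe.
observed = [a for a in ages if a is not None]
fill = statistics.fmean(observed)

# Replace each missing value with that mean.
imputed = [fill if a is None else a for a in ages]
print(imputed)  # [23, 30.0, 31, 27, 30.0, 39]
```

Filling every gap with the same number keeps the average unchanged but understates how much the variable really varies.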

Simpson’s Paradox

The idea that a lurking third variable can affect correlations so strongly that a relationship seen within every group weakens or reverses when the groups are combined
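
A worked numeric sketch of the paradox. The (successes, trials) counts are illustrative, chosen so that treatment A does better within each group while treatment B does better overall; the group label plays the role of the third variable.

```python
# Illustrative (successes, trials) counts for two treatments in two groups.
counts = {
    "A": {"small": (81, 87), "large": (192, 263)},
    "B": {"small": (234, 270), "large": (55, 80)},
}

def rate(successes, trials):
    return successes / trials

def overall(treatment):
    """Success rate after pooling both groups for one treatment."""
    s = sum(c[0] for c in counts[treatment].values())
    t = sum(c[1] for c in counts[treatment].values())
    return s / t

# Within each group, A has the higher success rate.
for group in ("small", "large"):
    a = rate(*counts["A"][group])
    b = rate(*counts["B"][group])
    print(f"{group}: A={a:.2f} B={b:.2f}")

# Pooled across groups, B has the higher success rate.
print(f"overall: A={overall('A'):.2f} B={overall('B'):.2f}")
```

The reversal happens because A is used mostly on the harder "large" group and B mostly on the easier "small" group, so the pooled rates mix in group composition as well as treatment performance.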

Bayes’ Rule

A mathematical formula used to update the probability of a hypothesis based on new evidence or information. It calculates the probability of an event occurring given prior knowledge and new data.
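
A small worked sketch of the update, assuming a hypothesis H and a piece of evidence E. The function name `posterior` and the numbers (a 1% base rate, a 95% true-positive rate, a 5% false-positive rate) are hypothetical.

```python
def posterior(prior, p_evidence_given_h, p_evidence_given_not_h):
    """P(H | E) = P(E | H) P(H) / P(E), with P(E) expanded over H and not-H."""
    numerator = p_evidence_given_h * prior
    evidence = numerator + p_evidence_given_not_h * (1 - prior)
    return numerator / evidence

# Hypothetical numbers: 1% prior, 95% true-positive rate, 5% false-positive rate.
p = posterior(prior=0.01, p_evidence_given_h=0.95, p_evidence_given_not_h=0.05)
print(round(p, 3))  # 0.161
```

Even with fairly strong evidence, the updated probability stays modest here because the prior probability was so low.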
