Analytics Exam 1

studied byStudied by 0 people
0.0(0)
learn
LearnA personalized and smart learning plan
exam
Practice TestTake a test on your terms and definitions
spaced repetition
Spaced RepetitionScientifically backed study method
heart puzzle
Matching GameHow quick can you match all your cards?
flashcards
FlashcardsStudy terms and definitions

1 / 70

encourage image

There's no tags or description

Looks like no one added any tags here yet for you.

71 Terms

1

Descriptive Analysis

Investigates what is happening currently or has occurred in the past

New cards
2

Diagnostic

Helps understand why something happened

New cards
3

Predictive

Forecasts what might happen in the future

New cards
4

Prescriptive

Helps understand what should happen to meet goals and objectives

New cards
5

Example of Descriptive Analysis

What were gross sales by region for the past two years

New cards
6

Example of Diagnostic Analysis

Why did sales decrease in Region 1 in the prior year

New cards
7

Example of Predictive Analysis

What will sales be next year if we increase market share by 10%

New cards
8

Example of Prescriptive Analysis

What is the most cost-effective way to ship our products

New cards
9

Stages of the data analysis process

Plan, Analyze, Report

New cards
10

Stage 1 : Plan

Identifying the motivation for the analysis, determining the objective and questions to answer, and devising a strategy

New cards
11

Stage 2: Analyze

Data preparation, building information models, and exploring the data

New cards
12

Stage 3: Report

interpreting the analyses, and communicate effectively

New cards
13

What is motivation in data analysis

The reason the analysis is being performed. Why are we doing the analysis

New cards
14

What are the four risks with data analysis?

Data, Analysis, Assumptions, Biases

New cards
15

Data

Choosing inappropriate data, or data that are incomplete or incorrect

New cards
16

Analysis

Choosing an inappropriate method, or applying a data analysis method incorrectly

New cards
17

Assumptions

Not understanding or evaluating assumptions about the data or the results

New cards
18

Biases

Mental shortcuts that can affect decisions

New cards
19

Relational Database

A collection of logically related data that can be retrieved, manipulated, and updated to meet users’ needs.

New cards
20

Inner Join

Will select all of the rows from both the tables basked on the matched values

New cards
21

Left Join

Returns all the rows from the left table and will show any matching data from the right table

New cards
22

Right Join

Returns all the rows form the right table and will show any matching data from the left table

New cards
23

Full Join

Will return all the rows from both tables

New cards
24

Primary Key

A uniquely identified key

New cards
25

Foreign Key

If a primary key appears in another table, it is referred to as the foreign key

New cards
26

Attribute

The columns if the source of the data was a database

New cards
27

Record

Each row in a data set from a database

New cards
28

Fields

the individual columns in a data set

New cards
29

Descriptive Statistics

Help understand characteristics of data

New cards
30

Application of descriptive stats

Average observations in the data, the data’s shape, the distribution of the data

New cards
31

Correlation Analysis

Reveals relationships in data by measuring the linear relationship between two variables

New cards
32

How does correlation analysis work?

The measure is numerical between -1 and +1, the higher the absolute number the stronger the relationship

New cards
33

Skewness graph

Positive skew shows a tail off to the right, whereas the negative skew shows a tail off to the left

New cards
34

Positive skew

From left to right goes mode, median, mean

New cards
35

Negative skew

From left to right goes mean, median, mode

New cards
36

Coefficient of skewness

absolute value of CS > 1, high degree of skewness/ .5 < CS < 1, moderate skewness/ CS < .5, relative symmetry

New cards
37

Coefficient of kurtosis

If the CK > 3, data is somewhat peaked with less dispersion. If the CK < 3, data is somewhat flat with a high dispersion

New cards
38

What includes motivation

Opportunity, professional issues, problem solving, process and performance assessment

New cards
39

four types of objectives

Descriptive, diagnostic, predictive, and prescriptive

New cards
40

Descriptive Objectives

Designed to better understand data to answer business questions

New cards
41

Diagnostic Objectives

To identify a problem or issue to understand why an outcome occurred

New cards
42

Predictive Objectives

Focus on what may happen in the future

New cards
43

Prescriptive objectives

Goal is to investigate how to take advantage of future opportunities or mitigate a future risk outcome

New cards
44

When given the objective what questions should be asked?

Questions that address the objective, focus on a single issue, is measurable, and if the data necessary to answer it are available

New cards
45

Given the question, what relevant analysis should be used?

N/A

New cards
46

Regression analysis

Linear regression builds mathematical and statistical models to explain the relationship between a dependent variable and one or more independent variables

New cards
47

Linear optimization

The process of selecting values of variables that minimize or maximize some quality of interest

New cards
48

Data Plan Analysis

Focus on the objective, select a data strategy, select an analysis strategy, consider risks, embed controls

New cards
49

Measured raw data

Data created or captured by a controlled process capturing the valueu of the data. Their format can be discrete or continuous data

New cards
50

Nonmeasured raw data

Data often created automatically by the computer or company policy for control. These field are typically formatted as discrete data

New cards
51

Calculated Data

Data created when one or more fields in a particular row have any number of mathematical operators applied. These field are formatted as discrete or continuous data.

New cards
52

Measurement scale descriptions

Categorical, Ordinal, interval, and ratio

New cards
53

Categorical Data

Labeled or named data that can be sorted into groups according to specific characteristics

New cards
54

Ordinal Data

Ordered or ranked categorical data

New cards
55

Interval Data

Ordinal data that have equal and constant differences between observations and an arbitrary zero point. Examples of temperature, time, or credit score

New cards
56

Rational data

Interval data with a natural zero point. Economic data, such as dollars or euros

New cards
57

Analysis Strategies for descriptive and diagnostic analystics

Data dispersion, visualizations, correlation, calculations

New cards
58

Data Dispersion

Min, Max, Range, Variance, STD dev, skewness, and kurtosis

New cards
59

Visualizations

Bar, pie charts, histograms, and box plots

New cards
60

Correlation

Scatterplot

New cards
61

Calculations

Totals, subtotals, percent change, percent of total

New cards
62

Risk and embedded controls

Implement controls to reduce risks within the data and analysis strategies

New cards
63

Data profiling

The process of investigating data quality and structure. It has 3 components: Data quality, data structure, and deciding/informing

New cards
64

Table Structures

Aggregate and slice. Aggregate - calculate the total sales amount. Slice - Break the total down by region to examine the regional sales in more detail.

New cards
65

Quality Issues

Issues within a spreadsheet such as gender has Male and M, Date is empty, Email doesn’t have @gmail.com

New cards
66

Imputation

using estimated values are substituted for missing data

New cards
67

Data matching

A process that compares data and determines whether they describe the same entity

New cards
68

Star Schema

The recommended data structure for analytical databases. They consist of fact tables and dimension tables

New cards
69

Dimension Table

Provide context to analysis and give meaning to facts.

New cards
70

Fact tables

Correspond to business transactions such as orders, sales, pruchases, and payments

New cards
71

Cardinalities

The relationship between two tables, specifying how many rows in one table can be associated with how many rows in another table

New cards

Explore top notes

note Note
studied byStudied by 14 people
1005 days ago
4.0(1)
note Note
studied byStudied by 162 people
624 days ago
5.0(1)
note Note
studied byStudied by 16 people
122 days ago
5.0(1)
note Note
studied byStudied by 22 people
743 days ago
5.0(1)
note Note
studied byStudied by 61 people
882 days ago
4.0(1)
note Note
studied byStudied by 8 people
176 days ago
5.0(1)
note Note
studied byStudied by 10 people
898 days ago
5.0(1)
note Note
studied byStudied by 255 people
686 days ago
4.8(9)

Explore top flashcards

flashcards Flashcard (127)
studied byStudied by 31 people
911 days ago
5.0(1)
flashcards Flashcard (20)
studied byStudied by 19 people
266 days ago
5.0(1)
flashcards Flashcard (20)
studied byStudied by 8 people
784 days ago
5.0(1)
flashcards Flashcard (28)
studied byStudied by 29 people
737 days ago
5.0(2)
flashcards Flashcard (67)
studied byStudied by 9 people
837 days ago
5.0(1)
flashcards Flashcard (315)
studied byStudied by 51 people
763 days ago
5.0(4)
flashcards Flashcard (29)
studied byStudied by 15 people
379 days ago
5.0(1)
flashcards Flashcard (26)
studied byStudied by 84 people
17 days ago
5.0(1)
robot