Marketing 338 - Exam 1

100 Terms

1

The Iceberg Principle

10% of the problem is seen and understood, 90% of the problem is not seen or understood

2

symptoms

visible or surface level issues

3

symptoms examples

company has declining sales

4

problem

underlying causes leading to symptoms

5

problem examples

poor marketing strategy, low quality products, ineffective customer service

6

primary data

data gathered by the researcher for the specific purpose of addressing a research problem

7

primary data examples

survey, observation, experiment, interviews

8

secondary data

data previously collected from other sources that are already published

9

secondary data examples

census data, internet information, library database, books, journal articles

10

internal secondary database

data from within an organization

11

internal secondary database examples

Sales records, accounting information, customer database

12

external secondary database

Data from outside the organization

13

external secondary database examples

Industry reports, government publications, commercial databases, academic research papers

14

data preparation (3 steps)

integrate multiple sets of data

clean data

transform data

15

data integration

merges multiple data sources into one dataset so that the combined data yields meaningful and valuable information

16

analyze data

Use appropriate analytical tools and techniques to derive insights from the data

17

report findings

Present the insights and recommendations in a clear and actionable manner.

18

relational database

A relational database is a collection of data organized into a set of tables. Each table contains data categories (variables) in columns and observations (cases) in rows

19

key variable

A variable common between two datasets, which can be used to link records between them.

20

vertical data integration

Combining datasets where each dataset contains different observations but shares the same structure (e.g., merging data from different time periods).

21

vertical integration excel command

append

22

horizontal data integration

Combining datasets where each dataset contains different variables but shares the same observations (e.g., merging data from different sources for the same time period)

23

horizontal integration excel command

Merge
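
The append and merge cards can be illustrated outside Excel with a small sketch. The tables, column names (`customer_id`, `sales`, `region`), and values below are hypothetical, and plain Python lists stand in for worksheets; this is not how Excel implements either command.

```python
# Two hypothetical quarterly tables with the same columns (same structure,
# different observations).
q1 = [{"customer_id": 1, "sales": 120}, {"customer_id": 2, "sales": 80}]
q2 = [{"customer_id": 3, "sales": 200}]

# Vertical integration (append): stack the rows; the columns do not change.
appended = q1 + q2

# A second hypothetical table holding a different variable for the same
# customers, indexed by the key variable customer_id.
regions = {1: "West", 2: "East", 3: "South"}

# Horizontal integration (merge): add a column by matching on the key variable.
merged = [{**row, "region": regions[row["customer_id"]]} for row in appended]

for row in merged:
    print(row)
```

The sketch appends first and merges second, so the lookup on `customer_id` runs once over the complete set of rows.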

24

=UPPER

converts text to uppercase

25

=PROPER

converts text to proper case (first letter of each word capitalized)

26

=TRIM

removes extra spaces from text
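
For comparison, here is a rough Python counterpart to the three text functions above. The sample string is made up, and `str.title()` only approximates =PROPER (the two can differ on apostrophes and digits).

```python
raw = "  jOHN   smith "

upper = raw.upper()              # like =UPPER: every letter uppercase
proper = raw.title()             # close to =PROPER: first letter of each word capitalized
trimmed = " ".join(raw.split())  # like =TRIM: drops leading/trailing spaces and
                                 # collapses runs of spaces between words to one

print(repr(upper))
print(repr(proper))
print(repr(trimmed))
```

Note that =TRIM does more than strip the ends: repeated spaces between words are reduced to a single space, which is why the sketch splits and rejoins rather than calling `strip()`.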

27

Remove Duplicates (Data tab command, not a formula)

deletes duplicate rows
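
A minimal sketch of the same idea in Python, assuming each row is a tuple and that the first occurrence of a repeated row is the one to keep. The names and addresses are invented.

```python
rows = [
    ("Ann", "ann@example.com"),
    ("Bob", "bob@example.com"),
    ("Ann", "ann@example.com"),  # exact duplicate of the first row
]

# dict.fromkeys keeps the first occurrence of each row and preserves order.
deduped = list(dict.fromkeys(rows))

print(deduped)
```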

28

text to columns

splits text into separate columns based on delimiters

29

select right delimiters

Choose the correct delimiters (e.g., comma, semicolon) to separate text into columns.
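
Why the delimiter choice matters can be seen in a short sketch using Python's `csv` module. The sample lines are hypothetical.

```python
import csv
import io

comma_line = 'Smith,"Portland, OR",42'
semicolon_line = "Smith;Portland;42"

def split_columns(line, delimiter):
    """Split one line of text into columns on the given delimiter."""
    return next(csv.reader(io.StringIO(line), delimiter=delimiter))

comma_cols = split_columns(comma_line, ",")          # quoted comma stays inside one column
semicolon_cols = split_columns(semicolon_line, ";")  # correct delimiter: three columns
wrong_cols = split_columns(semicolon_line, ",")      # wrong delimiter: nothing is split

print(comma_cols)
print(semicolon_cols)
print(wrong_cols)
```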

30

31

incomplete data

Data that lacks attribute values or has missing attributes of interest, also known as Missing Data

32

examples of missing data

  • Missing data as part of the data collection process.

  • Non-responses on survey questions.

33

noisy data

data that contains errors or outliers

34

noisy data examples

  • Salary = "-100" (error)

  • Age = "222" (outlier)

  • Errors due to human or computer mistakes during data entry

35

inconsistent data

data that contains discrepancies in codes or names

36

inconsistent data examples

  • Date represented as "2/11/2019" in one row and "2019 Feb 11" in another.

  • Different data sources leading to inconsistencies.
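
One way to repair the date inconsistency above is to parse every known format and write the result back in a single format. The sketch assumes "2/11/2019" means month/day/year (so both examples are February 11) and that month abbreviations are in English; the `to_iso` helper is hypothetical.

```python
from datetime import datetime

# Formats seen in the data, tried in order. Assumption: slashes mean month/day/year.
FORMATS = ["%m/%d/%Y", "%Y %b %d"]

def to_iso(text):
    """Return the date as YYYY-MM-DD, whichever known format it arrived in."""
    for fmt in FORMATS:
        try:
            return datetime.strptime(text, fmt).date().isoformat()
        except ValueError:
            continue
    raise ValueError(f"unrecognized date format: {text!r}")

print(to_iso("2/11/2019"))
print(to_iso("2019 Feb 11"))
```

Raising an error for unknown formats, rather than guessing, keeps new inconsistencies visible instead of silently mis-dating rows.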

37

listwise deletion

Excludes the entire case (record) with missing data.

38

disadvantages (listwise deletion)

  • Can be problematic if the data are not missing at random (NMAR).

  • Reduces the power of the model due to a smaller sample size

39

advantages (listwise deletion)

  • Can compare analyses because the sample is the same for different variables

40

pairwise deletion

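For each analysis, uses every case that has data on the variables involved in that analysis; a case is dropped only from the analyses where its values are missing.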

41

advantages (pairwise deletion)

  • Keeps as many cases as possible for each analysis.

  • Uses all available information for each analysis.

42

disadvantages (pairwise deletion)

  • Cannot compare analyses because the sample is different for different variables.

  • Can lead to different sample sizes across analyses.
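
The trade-off between the two deletion methods shows up in the sample sizes. In this sketch the records and variable names (`age`, `income`, `spend`) are invented, and `None` marks a missing value.

```python
records = [
    {"age": 25, "income": 50000, "spend": 200},
    {"age": 31, "income": None, "spend": 340},
    {"age": None, "income": 62000, "spend": 150},
    {"age": 44, "income": 71000, "spend": None},
]

def complete(row, variables):
    """True when the row has a value for every listed variable."""
    return all(row[v] is not None for v in variables)

# Listwise deletion: drop any case with a missing value anywhere.
listwise = [r for r in records if complete(r, ["age", "income", "spend"])]

# Pairwise deletion: each analysis keeps the cases complete on its own variables.
age_income = [r for r in records if complete(r, ["age", "income"])]
age_spend = [r for r in records if complete(r, ["age", "spend"])]

print(len(listwise), len(age_income), len(age_spend))
```

Listwise deletion leaves one case shared by every analysis. Pairwise deletion leaves two cases per analysis, but they are not the same two, which is why results across analyses are harder to compare.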

43

imputation

The process of replacing missing data with substituted values.

44

mean substitution

Filling missing values with the mean of the observed values.
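
A minimal sketch of mean substitution, assuming missing values are stored as `None`. The ages are made up.

```python
from statistics import mean

ages = [22, None, 30, 26, None]

# The mean is computed from the observed (non-missing) values only.
observed = [a for a in ages if a is not None]
fill_value = mean(observed)

# Every missing value is replaced by that mean.
imputed = [fill_value if a is None else a for a in ages]

print(fill_value)
print(imputed)
```

Because every gap receives the same value, the imputed column keeps its mean but shows less spread than the true data would.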

45

outliers

An observation that lies at an abnormal distance from other values in a random sample from a population.

46

wide format (unstacked)

Data presented with each different variable in a separate column.

47

long format (stacked)

Data presented with one column containing all the values and another column listing the context of the value

48

when to transform (long to wide)

If the datasets are merged without transformation, multiple identical rows for demographics are created.

49

when to transform (wide to long)

When summarizing wide-format datasets at a glance becomes difficult due to the number of variables.

50

transform long to wide (excel command)

pivot columns (convert rows to columns)

51

transform wide to long (excel command)

unpivot columns (convert columns to rows)
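
The two transformations are inverses of each other, which the following sketch tries to show with plain Python dictionaries. The table and its column names (`customer`, `q1`, `q2`, `quarter`, `sales`) are hypothetical.

```python
# Wide format: one row per customer, one column per quarter.
wide = [
    {"customer": "A", "q1": 10, "q2": 12},
    {"customer": "B", "q1": 7, "q2": 9},
]

# Unpivot (wide to long): each quarter column becomes its own row.
long_rows = [
    {"customer": row["customer"], "quarter": col, "sales": row[col]}
    for row in wide
    for col in ("q1", "q2")
]

# Pivot (long to wide): the values in "quarter" become column names again.
by_customer = {}
for row in long_rows:
    target = by_customer.setdefault(row["customer"], {"customer": row["customer"]})
    target[row["quarter"]] = row["sales"]
wide_again = list(by_customer.values())

print(long_rows)
print(wide_again)
```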

52

data visualization

the graphical representation of information and data. By using visual elements like charts, graphs, and maps, data visualization tools provide an accessible way to see and understand trends, outliers, and patterns in data.

53

qualitative data

Data that describes qualities or characteristics. It is non-numerical and can be observed but not measured. (Colors, textures, smells, tastes, appearances, beauty)

54

quantitative data

Data that can be measured and expressed numerically. It can be counted or compared on a numeric scale (Height, weight, length, price, temperature)

55

when to use a word cloud

  • Analyzing customer feedback and reviews to identify common themes or issues.

  • Monitoring social media discussions to gauge public sentiment about a brand or product.

  • Summarizing survey responses to highlight frequently mentioned topics

56

Four elements for telling a story with data

understand the audience, decide key takeaways, identify the data, good criteria for data visualization

57

understand the audience

Know who your audience is and what they care about. Tailor your data presentation to meet their needs and expectations.

58

Decide Key Takeaways (Identify the Task)

Determine the main messages or insights you want to convey with your data. Focus on the most important findings that will help your audience make informed decisions

59

Identify the Data Type

Recognize the type of data you have (qualitative or quantitative) to select the appropriate visualization methods and analyses.

60

Criteria for Good Data Visualization

  • Title and Labels: Ensure visuals have clear titles, labels, and axes information.

  • Accurate Representation: Avoid misleading elements like non-zero baselines in bar graphs or incorrect proportions in pie charts.

  • Simplicity: Remove unnecessary information or clutter to make the visuals easy to understand.

61

scales of measurement

The process of assigning numbers or labels to certain characteristics of objects according to pre-specified rules.

62

scale

A classification that describes the nature of information and the process of creating a continuum on which objects are located according to the amount of the measured characteristics they possess.

63

types of scales

nominal, ordinal, interval, ratio

64

nominal scale

Uses labels without any quantitative value. There is no ordering. Examples: gender (Male, Female), types of fruit (Apple, Banana, Cherry).

65

ordinal

  • Ranks objects or arranges them in order, but does not indicate the magnitude of difference between them. Examples: T-shirt sizes (Small, Medium, Large), rank in a competition (1st, 2nd, 3rd).

66

interval

  • Scales with equal intervals between values, but no true zero point. Examples: temperature scales (Celsius, Fahrenheit), IQ scores.

67

ratio

Scales with a true zero point, allowing for comparisons of absolute magnitude (Height, Weight, Income, Age)

68

Why We Should Learn Scales of Measures

Choosing the right scales is crucial as they influence our decisions about which statistical methods to use. Different scales have different properties and applications.

69

why are descriptive analyses important?

summarize data to provide meaningful insights. They help understand the central tendency, dispersion, and overall distribution of data

70

mean

the average of a dataset

71

mode

the most frequently occurring value in a dataset

72

median

the middle value when the data is ordered

73

range

The difference between the highest and lowest values in a dataset

74

variance

The average squared deviation of each value from the mean.

75

standard deviation

The square root of the variance, indicating the spread of data around the mean.
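
The measures of central tendency and dispersion defined in the cards above can be checked on a small made-up dataset with Python's `statistics` module. The sketch uses the population formulas (`pvariance`, `pstdev`) because the variance card describes an average of squared deviations; a sample variance would divide by n - 1 instead.

```python
import statistics

scores = [2, 4, 4, 4, 5, 5, 7, 9]

mean = statistics.mean(scores)           # sum 40 over 8 values
median = statistics.median(scores)       # average of the two middle values, 4 and 5
mode = statistics.mode(scores)           # 4 occurs three times
value_range = max(scores) - min(scores)  # 9 - 2
variance = statistics.pvariance(scores)  # squared deviations sum to 32, divided by 8
std_dev = statistics.pstdev(scores)      # square root of the variance

print(mean, median, mode, value_range, variance, std_dev)
```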

76

Why Do We Care About Dispersion?

Understanding dispersion helps us assess the variability and consistency within a dataset. It complements measures of central tendency by providing a fuller picture of data distribution.

77

Relationship Between Measures of Central Tendency and Dispersion and Scales of Measures:

The choice of descriptive statistics depends on the scale of measurement. For instance, mean and standard deviation are appropriate for interval and ratio scales, while mode is suitable for nominal data.

78

Frequency

The number of times a particular value occurs in a dataset.

79

percentage frequency

The frequency expressed as a percentage of the total number of observations
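
Frequency and percentage frequency can be computed together in a short sketch. The survey answers are invented.

```python
from collections import Counter

answers = ["red", "blue", "red", "green", "red", "blue", "red", "blue"]

# Frequency: how many times each value occurs.
frequency = Counter(answers)

# Percentage frequency: each count as a share of all observations.
total = len(answers)
percentage = {value: 100 * count / total for value, count in frequency.items()}

for value, count in frequency.most_common():
    print(value, count, percentage[value])
```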

80

bar chart

  • Displays categorical data with rectangular bars, one per category. Used for comparing discrete categories (e.g., favorite colors, types of pets).

81

histogram

Displays continuous data with adjacent bars representing a frequency distribution. Used for showing the distribution of continuous data (e.g., age, income).

82

cross tabulation

A method to examine relationships between two categorical variables by creating a contingency table.
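
A contingency table is a count of observations for every combination of the two variables. The sketch below builds one from hypothetical (region, purchased) pairs.

```python
from collections import Counter

# Each observation pairs two categorical variables: region and purchased.
responses = [
    ("East", "yes"),
    ("West", "no"),
    ("East", "yes"),
    ("West", "yes"),
    ("East", "no"),
]

cell_counts = Counter(responses)
regions = sorted({region for region, _ in responses})
outcomes = sorted({outcome for _, outcome in responses})

# Rows are regions, columns are purchase outcomes, cells are counts.
table = {r: {o: cell_counts[(r, o)] for o in outcomes} for r in regions}

for region, row in table.items():
    print(region, row)
```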

83

Side-by-Side Bar Chart

A bar chart that allows for the comparison of multiple categories side by side.

84

Segmented Bar Chart

A bar chart that displays stacked segments representing different sub-categories within each bar.

85

one variable

bar chart, histogram

86

two variables

cross tabulation, side by side bar chart, segmented bar chart

87

mode + scales of measure

nominal, ordinal, interval (if meaningful), ratio (if meaningful)

88

mean + scales of measure

interval, ratio

89

median + scales of measure

Ordinal (sometimes), interval, ratio

90

range + scales of measure

ordinal (sometimes), interval, ratio

91

variance + scales of measure

interval, ratio

92

standard deviation + scales of measure

interval, ratio

93

checklist for good data visualization

  • titles, labels, x- and y- axis labels

  • start with zero

  • accurate proportions (pie chart)

  • no unnecessary information or illustrations

94

when completing both vertical and horizontal integration, which do you complete first?

vertical (append) and then horizontal (merge)

95

inner join

96

left join (left outer join)

97

right join (right outer join)

98

Full Outer Join

99

Left Anti Join

100

Right Anti Join
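
The six join kinds differ only in which key values survive the merge. The sketch below follows the usual meaning of each name (inner: keys in both tables; left or right outer: all keys from that side; full outer: keys from either; anti: keys found on one side only). The two tables and the `customer_id` key are hypothetical, and set operations stand in for the merge itself.

```python
left = {1: "Ann", 2: "Bob", 3: "Cy"}  # customer_id -> name
right = {2: 250, 3: 90, 4: 40}        # customer_id -> order total

inner = sorted(left.keys() & right.keys())       # keys present in both tables
left_outer = sorted(left.keys())                 # every key from the left table
right_outer = sorted(right.keys())               # every key from the right table
full_outer = sorted(left.keys() | right.keys())  # keys from either table
left_anti = sorted(left.keys() - right.keys())   # left keys with no match on the right
right_anti = sorted(right.keys() - left.keys())  # right keys with no match on the left

# Rows of a left outer join: unmatched left rows get None for the right-hand column.
left_outer_rows = [(k, left[k], right.get(k)) for k in left_outer]

print(inner, left_outer, right_outer, full_outer, left_anti, right_anti)
print(left_outer_rows)
```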
