Marketing 338 - Exam 1

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/99

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

100 Terms

1
New cards

The Iceberg Principle

10% of the problem is seen and understood, 90% of the problem is not seen or understood

2
New cards

symptoms

visible or surface level issues

3
New cards

symptoms examples

company has declining sales

4
New cards

problem

underlying causes leading to symptoms

5
New cards

problem examples

poor marketing strategy, low quality products, ineffective customer service

6
New cards

primary data

data gathered by the researcher for a the specific purpose of addressing a research problem

7
New cards

primary data examples

survey, observation, experiment, interviews

8
New cards

secondary data

data previously collected from other sources that are already published

9
New cards

secondary data examples

census data, internet information, library database, books, journal articles

10
New cards

internal secondary database

data from within an organization

11
New cards

internal secondary database examples

Sales records, accounting information, customer database

12
New cards

external secondary database

Data from outside the organization

13
New cards

external secondary database examples

Industry reports, government publications, commercial databases, academic research papers

14
New cards

data preparation (3 steps)

integrate multiple sets of data

clean data

transform data

15
New cards

data integration

merges multiple data into one dataset for meaningful and valuable information

16
New cards

analyze data

Use appropriate analytical tools and techniques to derive insights from the data

17
New cards

report findings

Present the insights and recommendations in a clear and actionable manner.

18
New cards

relational database

A relational database is a collection of data organized into a set of tables. Each table contains data categories (variables) in columns and observations (cases) in rows

19
New cards

key variable

A variable common between two datasets, which can be used to link records between them.

20
New cards

vertical data integration

Combining datasets where each dataset contains different observations but shares the same structure (e.g., merging data from different time periods).

21
New cards

vertical integration excel command

append

22
New cards

horizontal data integration

Combining datasets where each dataset contains different variables but shares the same observations (e.g., merging data from different sources for the same time period)

23
New cards

horizontal integration excel command

Merge

24
New cards

=UPPER

converts text to uppercase

25
New cards

=PROPER

converts text to proper case (first letter of each word capitalized)

26
New cards

=TRIM

removes extra spaces from text

27
New cards

=Remove duplicates

deletes duplicate rows

28
New cards

text to colums

splits text into seperate colums based on delimiters

29
New cards

select right delimiters

Choose the correct delimiters (e.g., comma, semicolon) to separate text into columns.

30
New cards

31
New cards

incomplete data

Data that lacks attribute values or has missing attributes of interest, also known as Missing Data

32
New cards

examples of missing data

  • Missing data as part of the data collection process.

  • Non-responses on survey questions.

33
New cards

noisy data

data that contains errors or outliers

34
New cards

noisy data examples

  • Salary = "-100" (error)

  • Age = "222" (outlier)

  • Errors due to human or computer mistakes during data entry

35
New cards

inconsistent data

data that contains discrepancies in codes or names

36
New cards

inconsistent data examples

  • Date represented as "2/11/2019" in one row and "2019 Feb 11" in another.

  • Different data sources leading to inconsistencies.

37
New cards

listwise deletion

Excludes the entire case (record) with missing data.

38
New cards

disadvantages (listwise deletion)

  • Can be problematic if the missing items are not at random (NMAR).

  • Reduces the power of the model due to a smaller sample size

39
New cards

advantages (listwise deletion)

  • Can compare analyses because the sample is the same for different variables

40
New cards

pairwise deletion

Only analyzes cases with available data on each variable

41
New cards

advantages (pairwise deletion)

  • Keeps as many cases as possible for each analysis.

  • Uses all available information for each analysis.

42
New cards

disadvantages (pairwise deletion)

  • Cannot compare analyses because the sample is different for different variables.

  • Can lead to different sample sizes across analyses.

43
New cards

imputation

The process of replacing missing data with substituted values.

44
New cards

mean subsititution

Filling missing values with the mean of the observed values.

45
New cards

outliers

An observation that lies at an abnormal distance from other values in a random sample from a population.

46
New cards

wide format (unstacked)

Data presented with each different variable in a separate column.

47
New cards

long format (stacked)

Data presented with one column containing all the values and another column listing the context of the value

48
New cards

when to transform (long to wide)

If the datasets are merged without transformation, multiple identical rows for demographics are created.

49
New cards

when to transform (wide to long)

When summarizing wide-format datasets at a glance becomes difficult due to the number of variables.

50
New cards

transform long to wide (excel command)

pivot columns (convert rows to columns)

51
New cards

transform wide to long (excel command)

unpivot columns (convert columns to rows)

52
New cards

data visualization

the graphical representation of information and data. By using visual elements like charts, graphs, and maps, data visualization tools provide an accessible way to see and understand trends, outliers, and patterns in data.

53
New cards

qualitative data

Data that describes qualities or characteristics. It is non-numerical and can be observed but not measured. (Colors, textures, smells, tastes, appearances, beauty)

54
New cards

quantitative data

Data that can be measured and expressed numerically. It can be counted or compared on a numeric scale (Height, weight, length, price, temperature)

55
New cards

when to use a word cloud

  • Analyzing customer feedback and reviews to identify common themes or issues.

  • Monitoring social media discussions to gauge public sentiment about a brand or product.

  • Summarizing survey responses to highlight frequently mentioned topics

56
New cards

Four elements for telling a story with data

understand the audience, decide key takeaways, identify the data, good criteria for data visualization

57
New cards

understand the audience

Know who your audience is and what they care about. Tailor your data presentation to meet their needs and expectations.

58
New cards

Decide Key Takeaways (Identify the Task)

Determine the main messages or insights you want to convey with your data. Focus on the most important findings that will help your audience make informed decisions

59
New cards

Identify the Data Type

Recognize the type of data you have (qualitative or quantitative) to select the appropriate visualization methods and analyses.

60
New cards

Good Criteria for Good Data Visualization

  • Title and Labels: Ensure visuals have clear titles, labels, and axes information.

  • Accurate Representation: Avoid misleading elements like non-zero baselines in bar graphs or incorrect proportions in pie charts.

  • Simplicity: Remove unnecessary information or clutter to make the visuals easy to understand.

61
New cards

scales of measurement

The process of assigning numbers or labels to certain characteristics of objects according to pre-specified rules.

62
New cards

scale

A classification that describes the nature of information and the process of creating a continuum on which objects are located according to the amount of the measured characteristics they possess.

63
New cards

types of scales

nominal, ordinal, interval, ratio

64
New cards

nominal scale

Uses labels without any quantitative value. There is no ordering (Gender (Male, Female), Types of fruit (Apple, Banana, Cherry))

65
New cards

ordinal

  • Ranks objects or arranges them in order, but does not indicate the magnitude of difference between them. ( T-shirt sizes (Small, Medium, Large), Rank in a competition (1st, 2nd, 3rd)

66
New cards

interval

  • Scales with equal intervals between values, but no true zero point.(Temperature scales (Celsius, Fahrenheit), IQ scores)

67
New cards

ratio

Scales with a true zero point, allowing for comparisons of absolute magnitude (Height, Weight, Income, Age)

68
New cards

Why We Should Learn Scales of Measures

Choosing the right scales is crucial as they influence our decisions about which statistical methods to use. Different scales have different properties and applications.

69
New cards

why are descriptive analyses important?

summarize data to provide meaningful insights. They help understand the central tendency, dispersion, and overall distribution of data

70
New cards

mean

the average of a dataset

71
New cards

mode

the most frequently occurring value in a dataset

72
New cards

median

the middle value when the data is ordered

73
New cards

range

The difference between the highest and lowest values in a dataset

74
New cards

variance

The average squared deviation of each value from the mean.

75
New cards

standard deviation

The square root of the variance, indicating the spread of data around the mean.

76
New cards

Why Do We Care About Dispersion?

Understanding dispersion helps us assess the variability and consistency within a dataset. It complements measures of central tendency by providing a fuller picture of data distribution.

77
New cards

Relationship Between Measures of Central Tendency and Dispersion and Scales of Measures:

The choice of descriptive statistics depends on the scale of measurement. For instance, mean and standard deviation are appropriate for interval and ratio scales, while mode is suitable for nominal data.

78
New cards

Frequency

The number of times a particular value occurs in a dataset.

79
New cards

percentage frequency

The frequency expressed as a percentage of the total number of observations

80
New cards

bar chart

  • Displays categorical data with rectangular bars representing different categories (Comparing discrete categories (e.g., favorite colors, types of pets))

81
New cards

histogram

Displays continuous data with adjacent bars representing frequency distribution (Showing the distribution of continuous data (e.g., age, income))

82
New cards

cross tabulation

A method to examine relationships between two categorical variables by creating a contingency table.

83
New cards

Side-by-Side Bar Chart

A bar chart that allows for the comparison of multiple categories side by side.

84
New cards

Segmented Bar Chart

A bar chart that displays stacked segments representing different sub-categories within each bar.

85
New cards

one variable

bar chart, histogram

86
New cards

two variables

cross tabulation, side by side bar chart, segmented bar chart

87
New cards

mode + scales of measure

nominal, ordinal, interval (if meaningful), ratio (if meaningful)

88
New cards

mean + scales of measure

interval, ratio

89
New cards

median + scales of measure

Ordinal (sometimes), interval, ratio

90
New cards

range + scales of measure

ordinal (sometimes), interval, ratio

91
New cards

variance + scales of measure

interval, ratio

92
New cards

standard deviation + scales of measure

interval, ratio

93
New cards

checklist for good data visualization

  • titles, labels, x- and y- axis labels

  • start with zero

  • accurate proportions (pie chart)

  • no unnecessary information or illustrations

94
New cards

when completing both vertical and horizontal integration, which do you complete first?

vertical (append) and then horizontal (merge)

95
New cards

inner join

knowt flashcard image
96
New cards

left join (left outer join)

knowt flashcard image
97
New cards

right join (right outer join)

knowt flashcard image
98
New cards

Full Outer Join

knowt flashcard image
99
New cards

Lefti Anti Join

knowt flashcard image
100
New cards

Right Anti Join

knowt flashcard image