Statistics Clip 1
The Role of Statistics and Data
1. Introduction to Statistics
Definition of Statistics: The art and science of collecting, analyzing, presenting, and interpreting data.
Purpose: Providing information to support decision-making in various fields.
Modern Synonym: Often referred to as Data Science.
2. Application of Statistics in Marketing & Operations Management
Example - Large Online Retailer:
Utilizes extensive databases to manage:
Assortment of products available online.
Customer purchases.
Customer product returns.
Logistics data.
Marketing instruments.
Statistical Analysis Focus:
Drivers of Sales.
Drivers of Product Returns.
3. Current Trends in Data Science
Growth of Data Scientist Positions:
Strong growth observed globally, particularly in Australia.
Statistical Job Postings: A graph shows an increase in Australian data science job postings.
Numbers are reported in a 3-month moving average, reflecting demand:
0 to 1500 postings over years from 2014 to 2019.
Notable Quotations:
HAL VARIAN (Chief Economist at Google): “I keep saying that the sexy job in the next 10 years will be statisticians.”
4. Terminology in Statistics
Data Terminology:
Database: A structured set of data held in a computer.
Data Set: A collection of related data points.
Data Matrix:
Columns: Represent Variables.
Rows: Represent Observations, elements, cases, or subjects.
Cells: Each cell contains a Measurement or data point.
5. Types of Variables
Level of Measurement:
Categorical Data: Data divided into categories.
Metric/Numerical Data: Data represented by numbers.
Importance of Measurement Levels:
Categorical data offers limitations on statistical operations while metric data provides more powerful analytical options.
6. Types of Data Sets
Types of Data Sets:
Cross-sectional Data: All cases measured at one specific time (e.g., customer surveys).
Time-Series Data: Variables measured across time (e.g., stock prices).
Panel Data: Combines elements of both; multiple cases with the same variable measured at multiple time points (e.g., consumer panel data).
Data Source Consideration:
New or Existing Data?: Evaluate whether to collect new data or use existing datasets.
7. Research Data Sources
Primary Data:
Definition: Data collected directly from first-hand experience for specific research projects.
Methods of Collection:
Interviews, surveys, questionnaires, field observations, experiments, action research, case studies.
Secondary Data:
Definition: Data that has already been collected for another purpose, sourced from other researchers.
Sources:
Previous research, mass media, government reports, official statistics, historical data.
8. Comparison of Data Types
Primary Data vs. Secondary Data:
Time-Specificity: Primary data is tailored to researchers' needs, whereas secondary data may not be.
Cost: Primary data is often more expensive, while secondary data is usually low-cost or free.
Control Over Data Quality: Primary data provides high control, while secondary data lacks that level of control.
9. Key Statistical Concepts
Statistics is defined: "Statistics is a way to get information from data".
Key Concepts:
Population: The entire group of items/cases of interest.
Sample: A subset of items/cases drawn from the population for analysis.
10. Statistics in Empirical Cycle Theory
The Empirical Cycle includes these stages:
Hypothesis: A proposed explanation made on the basis of limited evidence.
Observation: Data collection phase.
Empirical Findings: Data analyzed to produce results.
Testing: Determining the validity of hypotheses.
Deduction: Drawing conclusions from the research findings.
Theory Development: Refining or creating theories based on empirical evidence.
11. Statistical Analysis
Descriptive Statistics: Involves organizing, summarizing, and presenting data using:
Graphical techniques.
Numerical techniques.
Inferential Statistics: Involves drawing inferences about characteristics of a population based on analysis of sample data.
Conclusion
Understanding the role of statistics is crucial for decision-making in business and beyond. The insights drawn from both descriptive and inferential statistics enable stakeholders to make informed choices based on data-driven evidence.
Additional Resources
Recommend checking further Knowledge Clips for in-depth analysis on specific statistical topics.