Statistics Clip 1

The Role of Statistics and Data

1. Introduction to Statistics

  • Definition of Statistics: The art and science of collecting, analyzing, presenting, and interpreting data.

  • Purpose: Providing information to support decision-making in various fields.

  • Modern Synonym: Often referred to as Data Science.

2. Application of Statistics in Marketing & Operations Management

  • Example - Large Online Retailer:

    • Utilizes extensive databases to manage:

    • Assortment of products available online.

    • Customer purchases.

    • Customer product returns.

    • Logistics data.

    • Marketing instruments.

    • Statistical Analysis Focus:

    • Drivers of Sales.

    • Drivers of Product Returns.

3. Current Trends in Data Science

  • Growth of Data Scientist Positions:

    • Strong growth observed globally, particularly in Australia.

  • Statistical Job Postings: A graph shows an increase in Australian data science job postings.

    • Numbers are reported in a 3-month moving average, reflecting demand:

    • 0 to 1500 postings over years from 2014 to 2019.

  • Notable Quotations:

    • HAL VARIAN (Chief Economist at Google): “I keep saying that the sexy job in the next 10 years will be statisticians.”

4. Terminology in Statistics

  • Data Terminology:

    • Database: A structured set of data held in a computer.

    • Data Set: A collection of related data points.

    • Data Matrix:

    • Columns: Represent Variables.

    • Rows: Represent Observations, elements, cases, or subjects.

    • Cells: Each cell contains a Measurement or data point.

5. Types of Variables

  • Level of Measurement:

    • Categorical Data: Data divided into categories.

    • Metric/Numerical Data: Data represented by numbers.

  • Importance of Measurement Levels:

    • Categorical data offers limitations on statistical operations while metric data provides more powerful analytical options.

6. Types of Data Sets

  • Types of Data Sets:

    • Cross-sectional Data: All cases measured at one specific time (e.g., customer surveys).

    • Time-Series Data: Variables measured across time (e.g., stock prices).

    • Panel Data: Combines elements of both; multiple cases with the same variable measured at multiple time points (e.g., consumer panel data).

  • Data Source Consideration:

    • New or Existing Data?: Evaluate whether to collect new data or use existing datasets.

7. Research Data Sources

  • Primary Data:

    • Definition: Data collected directly from first-hand experience for specific research projects.

    • Methods of Collection:

    • Interviews, surveys, questionnaires, field observations, experiments, action research, case studies.

  • Secondary Data:

    • Definition: Data that has already been collected for another purpose, sourced from other researchers.

    • Sources:

    • Previous research, mass media, government reports, official statistics, historical data.

8. Comparison of Data Types

  • Primary Data vs. Secondary Data:

    • Time-Specificity: Primary data is tailored to researchers' needs, whereas secondary data may not be.

    • Cost: Primary data is often more expensive, while secondary data is usually low-cost or free.

    • Control Over Data Quality: Primary data provides high control, while secondary data lacks that level of control.

9. Key Statistical Concepts

  • Statistics is defined: "Statistics is a way to get information from data".

  • Key Concepts:

    • Population: The entire group of items/cases of interest.

    • Sample: A subset of items/cases drawn from the population for analysis.

10. Statistics in Empirical Cycle Theory

  • The Empirical Cycle includes these stages:

    • Hypothesis: A proposed explanation made on the basis of limited evidence.

    • Observation: Data collection phase.

    • Empirical Findings: Data analyzed to produce results.

    • Testing: Determining the validity of hypotheses.

    • Deduction: Drawing conclusions from the research findings.

    • Theory Development: Refining or creating theories based on empirical evidence.

11. Statistical Analysis

  • Descriptive Statistics: Involves organizing, summarizing, and presenting data using:

    • Graphical techniques.

    • Numerical techniques.

  • Inferential Statistics: Involves drawing inferences about characteristics of a population based on analysis of sample data.

Conclusion

  • Understanding the role of statistics is crucial for decision-making in business and beyond. The insights drawn from both descriptive and inferential statistics enable stakeholders to make informed choices based on data-driven evidence.

Additional Resources

  • Recommend checking further Knowledge Clips for in-depth analysis on specific statistical topics.