EDA

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/20

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

21 Terms

1
New cards

Exploratory Data Analysis (EDA)

A systematic approach to summarizing and understanding the main characteristics of a dataset using graphical and non-graphical methods.

2
New cards

Descriptive Statistics

Statistics that report facts and summarize raw data, including measures like mean, median, mode, and distribution.

3
New cards

Inferential Statistics

Statistics that involve drawing conclusions and making predictions about a population based on a sample.

4
New cards

Uncertainty in Statistics

The degree of doubt or variability in data that affects the reliability of statistical conclusions.

5
New cards

Structured Data

Data that is organized in a defined format, often in rows and columns, allowing for easy access and analysis.

6
New cards

Unstructured Data

Data that doesn’t have a predefined format or structure, making it complex to process and analyze.

7
New cards

Continuous Data

Numerical data that can take on any value in a given range, such as temperature or time.

8
New cards

Discrete Data

Numerical data that can only take specific integer values, like the number of students.

9
New cards

Categorical Data

Data that represents characteristics and can only take on certain values, such as type of vehicle or gender.

10
New cards

Time Series Data

Records successive measurements of a variable over time, used for analysis of trends.

11
New cards

Mean

The average value of a dataset, calculated by summing all values and dividing by the count of values.

12
New cards

Median

The middle value in a dataset when it is ordered from least to greatest.

13
New cards

Mode

The most frequently occurring value in a dataset.

14
New cards

Outlier

An observation that lies far from the other values in a dataset, potentially indicating variability or error.

15
New cards

Central Tendency

A statistical measure that identifies a single score as representative of an entire distribution.

16
New cards

Statistics

The field of study concerned with collecting, analyzing, interpreting, presenting, and organizing data.

17
New cards

Field of Statistics

The practice and techniques used to analyze data to derive insights and inform decisions.

18
New cards

Inferential Drawings

Conclusions made based on observations in descriptive statistics to make predictions about a larger population.

19
New cards

Bias in Data Interpretation

A tendency to favor a particular perspective, which can lead to misinterpretation of data.

20
New cards

Graphical Methods in EDA

Visual techniques used to represent data characteristics and patterns, such as histograms and scatter plots.

21
New cards

Non-Graphical Methods in EDA

Statistical methods that summarize data numerically, like summary statistics and correlation coefficients.