intro to data science final

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/32

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

33 Terms

1
New cards

Data Literacy

Understanding and interpreting data effectively.

2
New cards

Causality

Establishing a cause-and-effect relationship.

3
New cards

Association

Identifying correlations between variables.

4
New cards

Observational Studies

Research analyzing existing data without manipulation.

5
New cards

Chocolate and Health

Example of association in health studies.

6
New cards

Death Penalty and Murder Rates

Example of potential causal analysis.

7
New cards

Public Data

Freely available datasets for experimentation.

8
New cards

Existing Product Data

User interaction data from current products.

9
New cards

Human-in-the-Loop Systems

Combining automation with human oversight.

10
New cards

Brute Force Collection

Costly data gathering methods for unique datasets.

11
New cards

Purchased Data

Acquired datasets from third-party vendors.

12
New cards

Filtering Impurities

Managing errors in raw data for quality.

13
New cards

Merging Diverse Data Sources

Integrating datasets from different origins.

14
New cards

Data Labeling

Annotating data for machine learning context.

15
New cards

External Services

Platforms for scalable data annotation tasks.

16
New cards

Internal Teams

In-house capabilities for data annotation.

17
New cards

User-Generated Labels

User contributions to data labeling processes.

18
New cards

Annotation Acceleration Tools

Technologies enhancing data annotation efficiency.

19
New cards

Data Science

Field combining statistics

20
New cards

Collaboration in Data Science

Teamwork essential for solving complex data problems.

21
New cards

Skills of Data Scientists

Mix of statistics

22
New cards

5 C's of Data Ethics

Consent

23
New cards

IBM Data Estimate

2.5 quintillion bytes of data generated daily.

24
New cards

Prediction in Data Science

Forecasting events based on data analysis.

25
New cards

Productivity Paradox

Technological shifts may delay visible economic benefits.

26
New cards

Big Data

Large datasets requiring responsible usage for impact.

27
New cards

Human Limitations

Memory and objectivity constraints in data interpretation.

28
New cards

Python Basics

Fundamental functions for data manipulation in Python.

29
New cards

NumPy

Library for numerical operations in Python.

30
New cards

Pandas

Data manipulation library for structured data.

31
New cards

PyPlot

Matplotlib module for creating visualizations.

32
New cards

Seaborn

Statistical data visualization library based on Matplotlib.

33
New cards

Linear Regression

Predicting continuous variables using linear relationships.