Introduction to Data Science

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/22

flashcard set

Earn XP

Description and Tags

These flashcards cover key concepts, definitions, and terms relevant to the field of data science as introduced in the lecture.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

23 Terms

1
New cards

Data Science

An interdisciplinary field that utilizes statistical theories and computer science to analyze data resources to solve problems and predict outcomes.

2
New cards

Structured Data

Data that is organized in a predefined format, easily searchable in databases.

3
New cards

Unstructured Data

Data that is not organized in a predefined manner, such as text, images, and social media posts.

4
New cards

Velocity

The speed at which data is generated and processed.

5
New cards

Variety

The different forms and sources of data, including structured and unstructured formats.

6
New cards

Data Engineering

The field focused on developing and maintaining systems that collect and store large amounts of data.

7
New cards

Data Analytics

The systematic computational analysis of data aimed at drawing conclusions and making decisions.

8
New cards

Business Intelligence (BI)

A set of tools and methodologies that assist organizations in accessing and analyzing historical and real-time data.

9
New cards

Machine Learning

A subset of artificial intelligence that enables systems to learn and improve from experience without being explicitly programmed.

10
New cards

CRISP-DM Model

A process model for data mining that outlines the phases of a data science project: Business Understanding, Data Understanding, Data Preparation, Modeling, Evaluation, and Deployment.

11
New cards

Predictive Analytics

Analysis that uses data, statistical algorithms, and machine learning techniques to identify the likelihood of future outcomes.

12
New cards

Descriptive Statistics

A branch of statistics that summarizes and describes the characteristics of a dataset.

13
New cards

Inferential Statistics

A branch of statistics that makes inferences and predictions about a population based on a sample of data.

14
New cards

Feature Selection

The process of identifying and selecting a subset of relevant features for model construction.

15
New cards

Artificial Intelligence (AI)

Field aiming to create systems that can perform tasks that would normally require human intelligence.

16
New cards

Natural Language Processing (NLP)

A field of AI that focuses on the interaction between computers and humans through natural language.

17
New cards

Data Mining

The process of discovering patterns and knowledge from large amounts of data.

18
New cards

Validation

The process of evaluating the performance of a model on a new set of data.

19
New cards

Bias in Data Science

Human biases that can affect the outcomes of data science models and lead to inaccurate predictions.

20
New cards

Causation vs. Correlation

Causation indicates that one event is the result of the occurrence of another event, while correlation indicates a relationship or association between two variables.

21
New cards

Training Set

The portion of the dataset used to train the model.

22
New cards

Testing Set

The portion of the dataset used to evaluate the model's performance.

23
New cards

Key Performance Indicators (KPIs)

Quantifiable measures that gauge the performance of an organization in achieving business objectives.