Introduction to Data Science Flashcards

Vocabulary flashcards for key terms and definitions from the Introduction to Data Science course book.

25 Terms

Data Science

The combination of business, analytical, and programming skills used to extract meaningful insights from raw data.

Deep Learning

The application of computational networks (with cascading layers of units) to learning tasks.

Artificial Intelligence

A set of approaches that enable a computer to emulate, and thus automate, cognitive processes, often by learning from data.

Machine Learning

A subset of artificial intelligence where mathematical models are developed to perform given tasks based on provided training examples.

Data Mining

The process of discovering patterns in large datasets.

Business Intelligence

A collection of routines used to analyze and deliver business performance metrics.

Training Set

The dataset from which a machine learning model learns its desired task.

Testing Set

The data used to measure the performance of the developed machine learning model. It is kept separate from the training set.
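
One way to obtain a training set and a testing set is to shuffle the records and hold back a fraction for testing. The sketch below is only an illustration: the function name `train_test_split`, the 20% test fraction, and the fixed seed are assumptions, not something the course book prescribes.

```python
import random


def train_test_split(records, test_fraction=0.2, seed=0):
    """Shuffle a copy of the records and split it into (train, test)."""
    if not 0.0 < test_fraction < 1.0:
        raise ValueError("test_fraction must be strictly between 0 and 1")
    shuffled = list(records)
    # A seeded generator makes the split reproducible (assumed seed value).
    random.Random(seed).shuffle(shuffled)
    n_test = max(1, round(len(shuffled) * test_fraction))
    # The first n_test shuffled records are held back for testing.
    return shuffled[n_test:], shuffled[:n_test]


train, test = train_test_split(range(10), test_fraction=0.2)
```

With ten records and a 20% test fraction, eight records go to the training set and two to the testing set, and no record appears in both.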

Outlier

A data record that is exceptional and lies outside the distribution of the normal input data.
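
There are many ways to flag outliers. A common rule of thumb, used here as an assumption rather than as the card's own method, marks values lying more than 1.5 interquartile ranges outside the first or third quartile. The function name `find_outliers` and the sample values are made up for illustration.

```python
from statistics import quantiles


def find_outliers(values, k=1.5):
    """Return values outside [Q1 - k*IQR, Q3 + k*IQR] (the IQR rule)."""
    q1, _, q3 = quantiles(values, n=4)  # quartile cut points
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < low or v > high]


readings = [10, 12, 11, 13, 12, 11, 95]  # hypothetical sample data
outliers = find_outliers(readings)
```

For this sample, only the value 95 falls outside the computed bounds.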

Data Cleansing

The process of removing redundant data, handling missing data entries, and removing, or at least alleviating, other data quality issues.
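
As a minimal sketch of two of these steps, the code below drops exact duplicate records and fills missing values with the mean of the known ones. Mean imputation is just one possible choice, and the `(name, age)` record layout and the function name `cleanse` are hypothetical.

```python
from statistics import mean


def cleanse(rows):
    """Drop duplicate (name, age) rows, then fill missing ages with the mean."""
    # dict.fromkeys keeps the first occurrence of each row and preserves order.
    unique = list(dict.fromkeys(rows))
    known = [age for _, age in unique if age is not None]
    if not known:
        return unique  # nothing to impute from
    fill = mean(known)
    return [(name, fill if age is None else age) for name, age in unique]


raw = [("ann", 30), ("bob", None), ("ann", 30), ("cy", 40)]  # made-up records
clean = cleanse(raw)
```

Here the repeated `("ann", 30)` row is removed and the missing age for `"bob"` is replaced by 35, the mean of 30 and 40.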

Feature

An observable measure of the data. The terms property, attribute, and characteristic are used as synonyms for feature.

Dimensionality Reduction

The process of reducing a dataset to fewer dimensions while ensuring that it still conveys similar information.

Feature Selection

The process of selecting the relevant features of the provided dataset.

Machine Learning

Algorithms or mathematical models that use information extracted from data in order to achieve a desired task or function.

Supervised Learning

The subset of Machine Learning that is based on labeled data. It is further divided into regression and classification.

Unsupervised Learning

The subset of Machine Learning that is based on unlabeled data. Typical unsupervised learning tasks are clustering and dimensionality reduction.

Deep Learning

The application of networks of computational units, arranged in cascading layers of information processing, to learning tasks.

Decision Model

A model that assesses the relationships between the elements of the provided data to recommend a possible decision for a given situation.

Regression

A forecasting technique that estimates the functional dependence between input and output variables.
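
The simplest case is a straight-line dependence between one input and one output. The sketch below fits such a line by ordinary least squares. The choice of a linear model, the function name `fit_line`, and the data points are assumptions made for illustration.

```python
def fit_line(xs, ys):
    """Fit y = slope * x + intercept by ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of squared deviations of x, and of cross deviations of x and y.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Made-up points lying exactly on y = 2x + 1.
slope, intercept = fit_line([1, 2, 3, 4], [3, 5, 7, 9])
prediction = slope * 5 + intercept  # forecast for a new input x = 5
```

Because the sample points lie exactly on a line, the fit recovers a slope of 2 and an intercept of 1, and the forecast for x = 5 is 11.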

Cluster Analysis

A type of unsupervised learning used to partition a set of data records into clusters. Records in a cluster are more similar to each other than to those in other clusters.
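
One widely used clustering method is k-means, shown here in a deliberately small one-dimensional form. This is a sketch under assumptions: the card does not name a specific algorithm, and the function name `kmeans_1d`, the starting centers, and the data are invented for the example.

```python
def kmeans_1d(values, centers, iterations=10):
    """Cluster numbers by repeatedly assigning each to its nearest center."""
    centers = list(centers)
    clusters = [[] for _ in centers]
    for _ in range(iterations):
        clusters = [[] for _ in centers]
        for v in values:
            # Index of the center closest to this value.
            nearest = min(range(len(centers)), key=lambda i: abs(v - centers[i]))
            clusters[nearest].append(v)
        # Move each center to the mean of its cluster (keep it if empty).
        centers = [
            sum(c) / len(c) if c else centers[i] for i, c in enumerate(clusters)
        ]
    return centers, clusters


centers, clusters = kmeans_1d([1, 2, 3, 10, 11, 12], centers=[0, 5])
```

For this data the procedure settles on two clusters, `[1, 2, 3]` and `[10, 11, 12]`, with centers 2 and 11: records inside each cluster are closer to each other than to the records in the other cluster.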

Classification

A machine learning approach that categorizes entities into predefined classes.

Probability

Quantification of how likely it is that a certain event occurs, or the degree of belief in a given proposition.

Standard Deviation

A measure of how spread out the data values are around their mean. It is the square root of the variance.
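
As a short worked example, the code below computes the population standard deviation twice: once from the definition and once with Python's `statistics.pstdev`. The data values are made up, and treating them as a full population rather than a sample is an assumption.

```python
from math import sqrt
from statistics import pstdev

values = [2, 4, 4, 4, 5, 5, 7, 9]  # hypothetical data, mean is 5

# From the definition: square root of the mean squared deviation.
mean_value = sum(values) / len(values)
variance = sum((v - mean_value) ** 2 for v in values) / len(values)
manual_std = sqrt(variance)

# The standard library gives the same population standard deviation.
library_std = pstdev(values)
```

Both approaches give 2 for this data. For a sample drawn from a larger population, `statistics.stdev` divides by n - 1 instead of n and would return a slightly larger value.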

Type I Error

A false positive: the actual value was negative, but it was predicted as positive.

Type II Error

A false negative: the actual value was positive, but it was predicted as negative.
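
Both error types can be counted by comparing actual labels with predicted labels. The following is a minimal sketch, assuming boolean labels where `True` means positive. The function name `count_errors` and the label lists are hypothetical.

```python
def count_errors(actual, predicted):
    """Return (type_1, type_2): counts of false positives and false negatives."""
    pairs = list(zip(actual, predicted))
    # Type I: actually negative, predicted positive.
    type_1 = sum(1 for a, p in pairs if not a and p)
    # Type II: actually positive, predicted negative.
    type_2 = sum(1 for a, p in pairs if a and not p)
    return type_1, type_2


actual = [True, False, True, False, False]     # made-up ground truth
predicted = [True, True, False, False, True]   # made-up model output
type_1, type_2 = count_errors(actual, predicted)
```

In this example the second and fifth predictions are Type I errors and the third is a Type II error, so the counts are 2 and 1.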