SQL and Scikit Learn

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/21

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

22 Terms

1
New cards

Aggregate functions

Functions that perform a calculation on a set of values and return a single value, like AVG, SUM, MIN, MAX, COUNT.

2
New cards

AVG

An aggregate function that calculates the average of a numeric column.

3
New cards

SUM

An aggregate function that returns the total sum of a numeric column.

4
New cards

MIN

An aggregate function that returns the smallest value in a set.

5
New cards

MAX

An aggregate function that returns the largest value in a set.

6
New cards

COUNT

An aggregate function that returns the number of rows that match a specified criterion.

7
New cards

ROUND function

Used to round a number to a specified number of decimal places.

8
New cards

Negative number with round

Rounding can affect negative numbers similarly to positive numbers, depending on the rule used.

9
New cards

Rounding to nearest thousandth

Rounding a number to three decimal places.

10
New cards

Arithmetic with SQL queries

SQL can perform arithmetic operations within SELECT statements.

11
New cards

Aliasing

Assigning a temporary name to a column or result set for easier referencing.

12
New cards

SELECT (gross - budget) AS profit

An SQL expression that creates an alias 'profit' for the result of 'gross - budget'.

13
New cards

Order of execution in SQL

The sequence in which SQL processes queries: FROM → WHERE → SELECT → LIMIT.

14
New cards

Integer division in SQL

When dividing two integers, SQL will return an integer result, discarding any decimal.

15
New cards

Supervised Learning

A type of machine learning where the model is trained on labeled data.

16
New cards

Feature

An individual measurable property or characteristic used for analysis in supervised learning.

17
New cards

Target

The output variable that the algorithm aims to predict during supervised learning.

18
New cards

Data Wrangling

The process of cleaning and transforming raw data into a usable format.

19
New cards

EDA (Exploratory Data Analysis)

Analyzing data sets to summarize their main characteristics, often using visual methods.

20
New cards

Feature Engineering

The process of using domain knowledge to extract features that make machine learning algorithms work.

21
New cards

Feature Matrix

A two-dimensional array where rows represent samples and columns represent features.

22
New cards

Target Vector

The vector representing the outcomes you want to predict, used in supervised learning.