Cogs 118B: Intro to Machine Learning - Google Docs

0.0(0)

Studied by 0 people

Call Kai

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Knowt Play

Card Sorting

1/24

There's no tags or description

Looks like no tags are added yet.

Last updated 5:31 PM on 7/5/26

Name	Mastery	Learn	Test	Matching	Spaced	Call with Kai	Chat

No analytics yet

Send a link to your students to track their progress

25 Terms

New cards

What is the primary objective of Machine Learning as discussed in the context of Cogs 118B?

To study and construct algorithms that can learn from and make predictions on data by building a model from observations.

New cards

How is Supervised Learning characterized in introductory machine learning?

A learning paradigm where the model is trained on a labeled dataset, meaning each training example is paired with an output label.

New cards

Define Unsupervised Learning.

A type of machine learning that looks for previously unknown patterns in a dataset without pre-existing labels.

New cards

What is a 'feature' in a machine learning dataset?

An individual measurable property or characteristic of a phenomenon being observed, often represented as a column in a data matrix.

New cards

What is the role of the 'label' or 'target' variable?

The ground truth or outcome that the machine learning model is tasked with predicting based on the input features.

New cards

Define the Hypothesis Function $h(x)$ .

The mathematical function that a learning algorithm uses to find the mapping from the input features to the output variable.

New cards

What is the goal of a Linear Regression model?

To approximate the relationship between dependent and independent variables using a linear equation, usually to predict continuous values.

New cards

What is the purpose of the Cost Function in machine learning?

To quantify the difference between the model's predictions and the actual target values, typically used to measure model performance.

New cards

Write the formula for the Mean Squared Error (MSE) cost function.

$J(\theta) = \frac{1}{2m} \sum_{i=1}^{m} (h_{\theta}(x^{(i)}) - y^{(i)})^2$

New cards

What is Gradient Descent?

A first-order iterative optimization algorithm for finding the local minimum of a differentiable function by moving proportional to the negative of the gradient.

New cards

In Gradient Descent, what does the parameter $\alpha$ represent?

The learning rate, which determines the magnitude of the steps taken toward the minimum of the cost function.

New cards

What distinguishes Logistic Regression from Linear Regression?

Logistic Regression is used for classification tasks producing discrete outputs, whereas Linear Regression is used for regression tasks producing continuous outputs.

New cards

What is the Sigmoid Function's equation?

$g(z) = \frac{1}{1 + e^{-z}}$

New cards

Define Overfitting.

A phenomenon where a model fits the training data too well, including the noise, resulting in poor generalization to new, unseen data.

New cards

Define Underfitting.

A state where a model is unable to capture the underlying trend of the data, typically due to the model being too simple for the complexity of the data.

New cards

Describe the Bias-Variance tradeoff.

The fundamental trade-off between the error from the model's assumptions (Bias) and the error from sensitivity to training data noise (Variance).

New cards

What is the specific purpose of Regularization?

To prevent overfitting by adding a penalty term for large parameter values to the cost function, encouraging simpler models.

New cards

What is L1 Regularization, also known as Lasso?

A technique that adds a penalty proportional to the sum of the absolute values of the weights: $\lambda \sum_{j=1}^{n} |\theta_j|$ .

New cards

What is L2 Regularization, also known as Ridge?

A technique that adds a penalty proportional to the sum of the squares of the weights: $\lambda \sum_{j=1}^{n} \theta_j^2$ .

New cards

In ML evaluation, what is a Training Set?

A subset of the original data used to train the model and adjust its internal parameters.

New cards

In ML evaluation, what is a Test Set?

A separate portion of the dataset used only at the end of the development process to provide an unbiased evaluation of the final model.

New cards

What is the K-Nearest Neighbors (KNN) algorithm?

A simple classification algorithm that assigns a label to a data point based on the majority label of its $k$ closest points in the feature space.

New cards

Define a Support Vector Machine (SVM).

A classification model that finds the hyperplane which maximizes the separation (margin) between two classes of data.

New cards

What is the formula for Precision in a classification task?

$\text{Precision} = \frac{TP}{TP + FP}$ where $TP$ is True Positives and $FP$ is False Positives.

New cards

What is the formula for Recall (Sensitivity) in a classification task?

$\text{Recall} = \frac{TP}{TP + FN}$ where $TP$ is True Positives and $FN$ is False Negatives.