GB307 Generalized Linear Model

New cards

What does standard linear regression assume?

the relationship between X’s and Y is linear
the errors are normally distributed

New cards

Standard Linear Regression Equation

Y = β₀ + β₁x₁ + ε, where ε ~ N(0, σ²)

New cards

What are Generalized Linear Models (GLMs)?

an extension of linear regression that allows for non-normal response variable distributions and a flexible link between predictors and the mean of the outcome

New cards

What are the two key features that make GLMs different from linear regression?

The response variable Y can have a non-normal distribution.
The mean of Y, E(Y|X), is linked to a linear combination of predictors through a function.

New cards

What does this equation mean in GLMs?

the mean function

New cards

What is the link function in a GLM?

G(⋅) connects the expected value of Y to the linear combination of predictors: G(E(Y|X)) = β₀ + β₁x₁

New cards

Why use GLMs instead of linear regression?

Because linear regression assumes normal errors and a constant variance. GLMs allow for different distributions (like binomial or Poisson) and more flexible relationships.

New cards

What is the identity link function in GLMs used for?

Used for linear relationships.
Link: β₀ + β₁x₁ = E(Y|X)
Mean: E(Y|X) = β₀ + β₁x₁

New cards

When should you use the log link function in a GLM?

When the mean must be positive, like with count data.
Link: β₀ + β₁x₁ = ln(E(Y|X))
Mean: E(Y|X) = e^(β₀ + β₁x₁)

New cards

What kind of relationships does the power link handle in GLMs?

Used for curved (non-linear) relationships.
Link: β₀ + β₁x₁ = E(Y|X)^a
Mean: E(Y|X) = (β₀ + β₁x₁)^(1/a)

New cards

How are the link and mean functions related in GLMs?

The link function transforms the mean so the model can be expressed as a linear combination of predictors. The mean function is the inverse of the link function.
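As a minimal sketch of that inverse relationship, the snippet below pairs each link from the cards above (identity, log, power) with its mean function and checks that applying one after the other returns the original mean. The power exponent a = 0.5 and the test value 4.0 are arbitrary choices for illustration, not values from the course.

```python
import math

# Power-link exponent; an arbitrary illustrative choice.
A = 0.5

# Each entry pairs a link G with its inverse, the mean function G^{-1}.
LINKS = {
    "identity": (lambda mu: mu, lambda eta: eta),
    "log": (lambda mu: math.log(mu), lambda eta: math.exp(eta)),
    "power": (lambda mu: mu ** A, lambda eta: eta ** (1 / A)),
}

def round_trip(name, mu):
    """Apply the link to a mean, then the mean function to the result."""
    link, mean = LINKS[name]
    eta = link(mu)    # value on the linear-predictor scale (β₀ + β₁x₁)
    return mean(eta)  # back on the scale of E(Y|X)

for name in LINKS:
    print(name, round(round_trip(name, 4.0), 6))
```

Each round trip should land back on the starting mean, which is what "the mean function is the inverse of the link function" means in practice.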

New cards

When should you use the Normal distribution for P(Y|X)?

Use it when the outcome is bell-shaped, can be positive or negative, and you're modeling averages (e.g., sales, stock changes)

New cards

What kind of data is appropriate for the Gamma distribution?

Use it when the response is always positive and may be skewed, like wait times, durations, or time between events.

New cards

How is the Normal distribution shaped and what can it model?

The Normal distribution is symmetric and bell-shaped, and it can model values that are negative or positive. Great for modeling things like sales or returns

New cards

Why would you choose the Gamma distribution over Normal?

Gamma is used when the data is strictly positive and may be skewed (e.g., time, rates). Normal can’t model skew or enforce positive-only outcomes.

New cards

When should you use the Bernoulli distribution for P(Y|X)?

Use it when the outcome is binary (0 or 1), like yes/no or success/failure questions

New cards

What kind of data does the Bernoulli distribution model?

Binary outcomes — data with only two possible values (0 or 1). Great for modeling the probability of an event happening.

New cards

When should you use the Poisson or Negative Binomial distribution?

Use them for count data:

Poisson: assumes the variance equals the mean
Negative Binomial: handles extra variation, where the variance exceeds the mean (overdispersion)
Examples:
Number of customers per hour
Number of defects per product
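One rough way to choose between the two is to compare the sample variance of the counts with their sample mean. The sketch below does that with made-up data; the variable names and values are hypothetical, and the check ignores predictors, so treat it as a first look rather than a formal test.

```python
from statistics import mean, variance

def dispersion_ratio(counts):
    """Sample variance divided by sample mean.

    A ratio near 1 is consistent with Poisson; a ratio well above 1
    suggests overdispersion, where Negative Binomial may fit better.
    """
    return variance(counts) / mean(counts)

# Hypothetical counts, invented for illustration only.
customers_per_hour = [3, 5, 4, 6, 2, 4, 7, 1]
defects_per_product = [0, 0, 1, 0, 12, 0, 2, 0, 9, 0]

print(round(dispersion_ratio(customers_per_hour), 2))
print(round(dispersion_ratio(defects_per_product), 2))
```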

New cards

What types of data are modeled with Poisson or Negative Binomial?

Non-negative integers (0, 1, 2, ...), like how many times something happens. These models are used when the outcome is a count.

New cards

Normal Distribution

bell shaped, continuous

New cards

Gamma Distribution

positive, continuous, skewed

New cards

Bernoulli Distribution

Binary

New cards

Poisson/Negative Binomial Distribution

non-negative integers (counts)

New cards

What is the goal when fitting a GLM?

To estimate the coefficients β̂ and use them to predict the expected value or other characteristics of Y through the fitted mean function, E(Y|X) = G⁻¹(β̂₀ + β̂₁x₁)

New cards

How do we estimate the coefficients β̂ in a GLM?

We use maximum likelihood estimation: we choose the values of β̂ that maximize the probability of observing our data given the model.
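To make the idea concrete, here is a small sketch that finds the maximum likelihood estimate for the simplest possible count model: a Poisson distribution with a single rate and no predictors. It uses a brute-force grid search rather than the iterative algorithms that statistical software uses, and it maximizes the log of the likelihood, which peaks at the same value. The data and the grid range are made up for illustration.

```python
import math

def poisson_log_likelihood(rate, counts):
    """Sum of log P(Y_i) under a Poisson distribution with the given rate."""
    return sum(y * math.log(rate) - rate - math.lgamma(y + 1) for y in counts)

# Hypothetical observed counts.
counts = [2, 3, 1, 4, 0, 2]

# Candidate rates 0.01, 0.02, ..., 10.00; keep the one with the highest likelihood.
grid = [i / 100 for i in range(1, 1001)]
best_rate = max(grid, key=lambda r: poisson_log_likelihood(r, counts))

print(best_rate, sum(counts) / len(counts))
```

For this model the best rate on the grid coincides with the sample mean, which is the known closed-form maximum likelihood estimate for a Poisson rate.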

New cards

What does maximizing the likelihood mean in GLMs?

It means finding β̂ that gives the highest joint probability of all observed Yᵢ values given their predictors Xᵢ:

L(β̂) = ∏ᵢ P(Yᵢ | Xᵢ; β̂), with the product taken over i = 1, ..., n

New cards

What is the average likelihood in a GLM?

It is the geometric mean of the individual likelihoods: (∏ᵢ P(Yᵢ | Xᵢ))^(1/n)
It represents the average probability of observing each outcome, mostly used in discrete cases
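The geometric mean described above can be computed directly for a binary outcome. This is a minimal sketch: the predicted probabilities stand in for the output of some fitted Bernoulli model and are invented for illustration, as are the function names.

```python
import math

def bernoulli_likelihoods(probs, outcomes):
    """P(Y_i | X_i) per observation: p if the outcome is 1, otherwise 1 - p."""
    return [p if y == 1 else 1 - p for p, y in zip(probs, outcomes)]

def average_likelihood(likelihoods):
    """Geometric mean: the n-th root of the joint likelihood."""
    return math.prod(likelihoods) ** (1 / len(likelihoods))

# Hypothetical predicted P(Y = 1) and observed outcomes.
predicted = [0.8, 0.5, 0.5, 0.8]
observed = [1, 1, 0, 1]

per_observation = bernoulli_likelihoods(predicted, observed)
print(round(average_likelihood(per_observation), 4))
```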

New cards

Why is the average log-likelihood often used in continuous cases?

Because the average likelihood can become very small, especially with many observations. Taking the log makes the value more interpretable and numerically stable.
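The numerical side of this can be seen in a short sketch. With many observations, the product of the individual likelihoods drops below what floating-point numbers can represent, while the mean of the logs stays well behaved. The sample size of 2000 and the per-observation likelihood of 0.3 are arbitrary illustrative values.

```python
import math

# Hypothetical: 2000 observations, each with likelihood 0.3 under the model.
likelihoods = [0.3] * 2000

# The joint likelihood underflows to zero in double precision.
joint = math.prod(likelihoods)

# The average log-likelihood is computed from sums, so it does not underflow.
avg_ll = sum(math.log(p) for p in likelihoods) / len(likelihoods)

print(joint)
print(round(avg_ll, 4), round(math.exp(avg_ll), 4))
```

Exponentiating the average log-likelihood recovers the geometric-mean likelihood, so no information is lost by working on the log scale.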

New cards

What is the formula for the average log-likelihood?

It’s the mean of the log-probabilities: (1/n) Σᵢ ln P(Yᵢ | Xᵢ)

New cards

What does the log-likelihood measure in a model?

It measures how well the model explains the data. Higher values mean the model assigns higher probability to the observed outcomes.

New cards

What does the Average Likelihood Ratio (ALR) compare?

It compares how well two models (A and B) explain the data by taking the geometric average of the per-observation likelihood ratios:

ALR = (∏ᵢ P_A(Yᵢ | Xᵢ) / P_B(Yᵢ | Xᵢ))^(1/n)

New cards

What does an Average Likelihood Ratio of 3 mean?

It means that, on average, each observation is 3 times more likely under model A than model B
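A small sketch of that interpretation: the per-observation likelihoods below are invented so that every observation is three times more likely under model A than under model B. The ratio is computed on the log scale, which gives the same geometric average as multiplying the ratios and taking the n-th root.

```python
import math

def average_likelihood_ratio(lik_a, lik_b):
    """Geometric mean of the per-observation ratios P_A(Y_i|X_i) / P_B(Y_i|X_i)."""
    log_ratios = [math.log(a) - math.log(b) for a, b in zip(lik_a, lik_b)]
    return math.exp(sum(log_ratios) / len(log_ratios))

# Hypothetical likelihoods of the same three observations under each model.
model_a = [0.6, 0.9, 0.3]
model_b = [0.2, 0.3, 0.1]

alr = average_likelihood_ratio(model_a, model_b)
print(round(alr, 4))
```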

New cards

Why use a geometric average for likelihood ratios?

Because individual likelihoods are multiplied together (not added), using the geometric mean gives a more balanced comparison across all observations

New cards

What is the Akaike Information Criterion (AIC) used for?

To compare models that may have different numbers of predictors. It adjusts for complexity so models with more variables don’t get an unfair advantage
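The complexity adjustment can be sketched as follows, assuming the standard textbook definition AIC = 2k − 2 ln(L), where k is the number of estimated parameters and ln(L) is the maximized log-likelihood. The course materials may scale it differently, and the log-likelihood values below are made up.

```python
def aic(log_likelihood, n_params):
    """Standard AIC: a penalty of 2 per parameter minus twice the log-likelihood.

    Lower values indicate a better trade-off between fit and complexity.
    """
    return 2 * n_params - 2 * log_likelihood

# Hypothetical fits: the larger model fits slightly better but uses 3 more parameters.
small_model = aic(log_likelihood=-120.0, n_params=2)
large_model = aic(log_likelihood=-119.5, n_params=5)

print(small_model, large_model)
```

Here the small gain in log-likelihood does not offset the extra parameters, so the smaller model has the lower AIC.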

New cards

What is the formula for AIC?