MARK 5343 Exam 2

0.0(0)

Studied by 0 people

Call Kai

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Knowt Play

Card Sorting

1/82

There's no tags or description

Looks like no tags are added yet.

Last updated 4:10 AM on 4/30/26

Name	Mastery	Learn	Test	Matching	Spaced	Call with Kai

No analytics yet

Send a link to your students to track their progress

83 Terms

New cards

Logistic Regression

Makes probabilistic predictions of whether you are a 1 or a 0

New cards

Logistic regression – what is different here regarding the DV? the IVs?

DV: Binary, dummy coded, nonmetric Y that is predicted

IV: Predictors that create a variate called Z

New cards

What are odds ratios?

New cards

What are log odds ratios?

New cards

What is the variate in logistic regression considered to be

ln(P1/P0)

a logit

New cards

What does it mean when Z is positive

Increased P(1) (probability of event 1)

New cards

What does it mean when Z is negative

Increased P(0) (probability of event 0)

New cards

We turn the variate into P(1) and P(0)… how?

e^z over 1 + e^z = P(1)

1 - P(1) = P(0)

Determines strongest alignment to DV

New cards

Null model

Specifies in advance the P(1) and P(0) based on the observed DV proportions. Then, we introduce X material and hopefully improve over the null.

ln(P1/P0) = Z + 0(X)

Model only has the intercept and no predictors

New cards

Parameter estimation to get the best bs is Iterative

New cards

Model produced probability of being “what you are”

New cards

Likelihood function (L)

The joint probability based on the model for the entire sample (Multiplication of actual probabilities)

Maximizes L to find the best b’s

L = 1 if probabilities are perfect

New cards

Log likelihood function (LL)

Sum of the log of each probability

Model fit diagnostic that has meatier numbers than L and should be minimized

LL = Ln(.8) + Ln(.45) + Ln(.9)

Can use to compute pseudo R ²

New cards

-2LL

Makes LL useable in Chi-squared test of significance

Produces x² statistic that follows a chi-squared distribution. The closer this is to 0, the better

We want the proposed model’s to be lower than the null’s

New cards

-2LL difference (Likelihood Ratio Test)

Difference from null model to test whether the proposed model’s estimation is significantly better than the null model

-2(LL_null - LL_proposed)

Can also use to compare two models

New cards

Logistic output pieces that parallel multiple regression output pieces, including if stepwise

Variate: Z vs. Y-hat

Model Fit: L, LL, pseudo R² vs. R²

b’s P-Value: Wald vs. t-statistic

Comparable X’s Impact: EXP(b) vs. Standardized coefficients (beta)

Equation: B vs. Unstandardized B

Variance Explained: pseudo R² vs. R²

Other Model Fit: F test vs. Chi-square test of -2LL difference

New cards

Pseudo R-square (use Nagelkerke)

Model fit test that determines how much variability a model can explain of its outcome (explanatory power)

Want this number to be close to 1

Ex. 64% of the variance in Y is explained by the predictors

New cards

Significance of individual variables in logistic regression

Wald test statistic

New cards

Sign and size of bs and the impact on P(1)

A 1 unit increase in the predictor variable results in b increase/decrease in the Y/log-odds

Can assess directionality of relationship

New cards

Exp(b)

Can compare IVs relative impacts on P(1)

1 = no change in the odds

exp(b) - 1 = % change in odds

New cards

What model fit test is distinct for a categorical DV

Classification Matrix

New cards

Understanding prediction accuracy in the classification matrix

correct prediction freq + correct prediction freq = % accuracy at prediction

look at the diagonal

New cards

What are the two benchmarks for classification accuracy?

Cpro and Cmax

New cards

Cmax

The larger percentage between percent positive and percent negative in a classification matrix

New cards

Cpro

percent positive²+ (percent negative)²

New cards

How to compute Cpro and Cmax criteria for judging classification (1.25 x Cpro, 1.25 x Cmax)

Cmax:

Cpro:

New cards

Cpro - square actual proportions and add together, the chance standard to beat by 25%

New cards

Cmax – assign everyone to largest category, percent accuracy doing that? Beat by 25%

New cards

Logistic Regression validation with a hold-out/split sample

New cards

Two group discriminant Analysis – nature of DV and IVs

New cards

Discriminant analysis big picture idea

Find best bs that make groups as separate / discriminated as possible

New cards

How many discriminant functions in relation to number of groups?

New cards

Read outputs from running in SPSS, direct (simultaneous) or stepwise

New cards

Know differences in coefficient types: unstandardized, standardized, loading/structure

New cards

Which two are most used for variable “importance”?

New cards

Discriminant analysis statistical test of overall model fit

New cards

Classification accuracy (another test of model fit/strength, higher percentages better)

New cards

How do I judge quality of classification accuracy with Cpro and Cmax? (same as logistic)

New cards

Discriminant analysis validation with a hold-out/split sample

New cards

Validation via the “one-at-a-time left out” method (labeled cross-validated in SPSS)

New cards

Three + groups discriminant analysis – nature of DV and IVs

New cards

How many discriminant functions in relation to number of groups 3+?

New cards

Tests of significance for each function

New cards

Types of coefficients for each function

New cards

Outputs for direct or stepwise estimation, SPSS, 2D plots

New cards

What is the potency index?

New cards

Use of potency index for variable “importance” – one number for each IV even if multiple functions

New cards

Classification matrix and accuracy interpretation (know what it looks like for 3+ groups)

New cards

Map with Centroids (green, blue, red, people from slides, pink x boxes are centroids)

New cards

What is conjoint, what does it allow you to do?

New cards

Ratings-based full profile method (previous textbook chapter)

New cards

Defining attributes and levels

New cards

Use of fractional factorial designs to build profiles [the design… the X side predicting ratings]

New cards

Concepts of total utility (rating) and part-worth utilities (bs)

New cards

Addition of other profiles for “validation” - hold out profiles

New cards

Addition of other profiles for simulation (see what would happen in a market with certain offerings)

New cards

Respondent task = ratings task (vs. choice task)

New cards

Effect coding (vs. dummy coding) the design matrix

New cards

Multivariate statistical technique used to estimate part-worths (regression)

New cards

The “nice” property of effects coded estimates (sum to 0)

New cards

Determine fit at individual level, statistics from the chapter (Multiple R, Tau nonparametric)

New cards

Deletion of cases with bad fit (estimation or hold out) and/or illogical part-worth patterns

New cards

Restating part worth utilities (book process – lowest level of each attribute 0)

New cards

Rescaling part worth utilities (book process – each attribute brought 100 to the table)

New cards

Computing attribute importance (High minus low for each attribute, over sum of all hi minus low, easy if low within each attribute was made 0, now just high for each attribute over the sum of all highs)

New cards

Ability to “Segment” people based on individual-level information

New cards

Use of simulation profiles to simulate market share (book uses 2 existing products, 1 new)

New cards

Get total utility for each product configuration in the simulation

New cards

Use values to compute discrete predicted choice (maximum utility rule)

New cards

Or use values to compute probabilistic predicted choice (BTL, or Logit Choice rule)

New cards

Core understanding of leap from ratings to “Choice-Based” (with HB estimation):

New cards

Choice sets

New cards

Choice alternatives

New cards

Probabilistic prediction within each choice set

New cards

HB estimation - borrowing from upper model

New cards

Individual part-worth utilities

New cards

Averaging over every nth draw from last set of draws (e.g., every 10th draw from last 1000 gives 100 values to average)

New cards

Uses of individual-level utility estimates as before

New cards

Very high level understanding of history/development: Ratings Based

New cards

Very high level understanding of history/development: Choice Based – aggregate logit model, one equation for everyone

New cards

Very high level understanding of history/development: Latent class – segments and estimates one logit model per segment (a way to represent respondent heterogeneity)

New cards

Very high level understanding of history/development: CBC-HB… individual level heterogeneity, one model (part worth utilities) per participant

New cards

Relative variable importance: Regression, Logistic, 2 group discriminant, 3 group discriminant, conjoint