Stat 542 exam 2 notes

Last updated 4:37 AM on 3/27/26

Wide format

“Spreadsheet style”

Each row is a country; the columns hold HIV values for a series of years

Better for viewing

Easier to create variables that compare response values across time periods

Long format

Each row is a different country-year combination

Each row represents only one distinct observation

Better for data analysis

Easier to add new variables or combine info in multiple tables

Tidy data

Array of rows and columns

Each row (item/case) is a specific, unique instance of the same sort of thing

Each column (variable) holds the same sort of value for every row

pivot_wider

Long → wide

values_from

name of the variable in the narrow format whose values are to be divided up into multiple variables in the resulting wide format

names_from

name of the variable in the narrow format that identifies, for each case, which column in the wide format will receive the value
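
A minimal sketch of the long-to-wide conversion, assuming the tidyr and tibble packages are installed. The data values and the name hiv_long are made up for illustration.

```r
library(tidyr)
library(tibble)

# Made-up long-format data: one row per country-year combination
hiv_long <- tribble(
  ~country, ~year, ~hiv,
  "A",      2000,  1.2,
  "A",      2001,  1.4,
  "B",      2000,  0.3,
  "B",      2001,  0.4
)

# names_from: the variable that decides which wide column receives each value
# values_from: the variable whose values get spread across those columns
hiv_wide <- pivot_wider(hiv_long, names_from = year, values_from = hiv)
hiv_wide
```

The result has one row per country and one column per year.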

pivot_longer

wide → long

names_to

names the new variable that will hold the wide-format column names, which become the categorical levels in the narrow form

values_to

names the variable that will hold the values from the columns being gathered; the name should reflect what those values actually represent
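
A minimal sketch of the wide-to-long conversion, assuming tidyr and tibble are installed. The data and the names hiv_wide, "year", and "hiv" are illustrative choices, not from the notes.

```r
library(tidyr)
library(tibble)

# Made-up wide-format data: one row per country, one column per year
hiv_wide <- tribble(
  ~country, ~`2000`, ~`2001`,
  "A",      1.2,     1.4,
  "B",      0.3,     0.4
)

hiv_long <- pivot_longer(
  hiv_wide,
  cols = -country,     # gather every column except country
  names_to = "year",   # old column names land here (as character)
  values_to = "hiv"    # cell values land here
)
hiv_long
```

Note that the gathered column names arrive as character strings, so year would still need converting if it is to be used as a number.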

list-columns

A column where each cell contains an entire mini dataset instead of a single value.

Each row is a group (e.g. one subject)

A variable of type list

nest function

Turns a long dataset into one row per group, with the remaining data packed into a tibble in a list-column.

‘unnest’ expands a list-column back into regular rows and columns

Example:

Before

subject time score

A 1 10

A 2 12

B 1 8

After

subject data

A <tibble of A’s rows>

B <tibble of B’s rows>
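
The before/after example can be reproduced with a short sketch, assuming tidyr and tibble are installed. The object names are illustrative.

```r
library(tidyr)
library(tibble)

scores <- tribble(
  ~subject, ~time, ~score,
  "A",      1,     10,
  "A",      2,     12,
  "B",      1,     8
)

# Pack time and score into a list-column called data: one row per subject
nested <- nest(scores, data = c(time, score))

# unnest() expands the list-column back into ordinary rows and columns
restored <- unnest(nested, cols = data)
```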

pull function

Extracts a single column from a data frame as a vector (a list, when the column is a list-column)

map function

Apply a function to each element of a list

Use map with pull to get a specific measured variable from each nested table

Use the result of that (a list) with map to perform calculations on each element
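
One way the pull-then-map steps might look, assuming dplyr, tidyr, purrr, and tibble are installed. The data and object names are made up.

```r
library(dplyr)
library(tidyr)
library(purrr)
library(tibble)

scores <- tribble(
  ~subject, ~time, ~score,
  "A",      1,     10,
  "A",      2,     12,
  "B",      1,     8
)
nested <- nest(scores, data = c(time, score))

# pull() returns the list-column as a plain list of tibbles
tables <- pull(nested, data)

# map() with a name extracts that variable from each tibble
score_list <- map(tables, "score")

# map_dbl() runs a calculation on each element and returns a numeric vector
mean_scores <- map_dbl(score_list, mean)
```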

Native R data file format

.rds: a single R object

Write: saveRDS

Read: readRDS

.rda or .RData: one or more named objects

Write: save

Read: load
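
A minimal sketch of the two base R write/read pairs: saveRDS/readRDS for a single object (conventionally a .rds file) and save/load for named objects (.rda or .RData). Temporary files and made-up data are used here.

```r
x <- data.frame(id = 1:3, score = c(10, 12, 8))

# saveRDS()/readRDS(): one object, assigned to any name on reading
rds_path <- tempfile(fileext = ".rds")
saveRDS(x, rds_path)
x_back <- readRDS(rds_path)

# save()/load(): objects are stored and restored under their own names
rda_path <- tempfile(fileext = ".rda")
save(x, file = rda_path)
rm(x)
load(rda_path)  # recreates an object named x
```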

packages for reading files

readxl: Excel

googlesheets4: google sheets

dbplyr/DBI: relational databases on remote servers

readr: .csv files

rvest: HTML tables

read.csv (base R)

readr::read_csv

reading HTML files

rvest converts HTML to an R structure, then converts the HTML tables to R data tables

read_html: reads the webpage into an R object; html_table then produces a list containing the tables from the page

purrr::pluck: extracts a chosen table from the list, which can be stored as a tibble
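
A small sketch of pulling a table out of HTML, assuming rvest (version 1.0 or later) and purrr are installed. To avoid depending on a live website it uses rvest's minimal_html on an inline snippet; with a real page, read_html(url) would take its place. The table contents are made up.

```r
library(rvest)
library(purrr)

# Stand-in for read_html("<some URL>"): parse a tiny inline page
page <- minimal_html("
  <table>
    <tr><th>country</th><th>hiv</th></tr>
    <tr><td>A</td><td>1.2</td></tr>
    <tr><td>B</td><td>0.3</td></tr>
  </table>
")

# Find every <table> and convert each one to a tibble (gives a list)
tables <- html_table(html_elements(page, "table"))

# pluck() extracts one table from the list by position
first_table <- pluck(tables, 1)
```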

data cleaning: strings and numbers

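parse_number: takes a character string and extracts its numeric value, dropping non-numeric characters such as $ or %

parse_character: parses values as character strings (as.character is the usual way to convert a numeric value/column into character)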
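
A quick sketch of string-to-number cleaning, assuming readr is installed. The input strings are made up, and as.character is shown as the base R route from numbers back to strings.

```r
library(readr)

raw <- c("$1,250", "45%", "3.5 kg")

# parse_number() drops the non-numeric characters around each number
clean <- parse_number(raw)

# Base R conversion in the other direction: numeric to character
as_text <- as.character(c(1.5, 2))
```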

Dates

Usually need to convert from character strings to date type

lubridate package

‘Date’ and ‘dttm’ (date-time) values

‘interval’ function creates a time span between two dates/times, used to compute differences

‘hour’ and ‘month’ functions extract parts of variables that are stored as dates or times
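
A minimal sketch of these lubridate steps, assuming the package is installed. The dates are made up.

```r
library(lubridate)

# Convert character strings to Date and date-time values
start <- ymd("2024-01-15")
end   <- ymd("2024-03-01")
stamp <- ymd_hms("2024-03-01 14:30:00")

# Extract parts of a date or date-time
start_month <- month(start)
stamp_hour  <- hour(stamp)

# interval() builds the span between two dates; time_length() measures it
span <- interval(start, end)
days_between <- time_length(span, unit = "day")
```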

Factors/strings

Factor: objects containing levels of a categorical variable

Allows custom ordering (fct_relevel)

readr::read_csv reads text columns as character strings by default, not factors

‘forcats’ package has tools for wrangling factor data
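
A short sketch of custom level ordering, assuming forcats is installed. The size labels are made up.

```r
library(forcats)

# factor() orders levels alphabetically by default
sizes <- factor(c("small", "large", "medium", "small"))
default_levels <- levels(sizes)

# fct_relevel() moves the named levels to the front, in the order given
sizes <- fct_relevel(sizes, "small", "medium", "large")
custom_levels <- levels(sizes)
```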

Vectorized operations

Behave like an implicit for loop over the elements, without writing the loop

Take vector as input, perform an operation on every element, return vector as output
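
To illustrate, an explicit loop and a vectorized call that give the same result (base R only, made-up values):

```r
x <- c(1, 4, 9)

# Explicit for loop: one element at a time
roots_loop <- numeric(length(x))
for (i in seq_along(x)) {
  roots_loop[i] <- sqrt(x[i])
}

# Vectorized: sqrt() is applied to every element in one call
roots_vec <- sqrt(x)
```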

map functions

Iteratively apply an R function to each element of a vector

map: collection of outputs stored as a list

map_dbl: numeric vector

map_lgl, map_int, map_chr: logical, int, character

map_dfr: collects results into data frame

Base R: lapply, tapply
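
A brief sketch comparing a few of these variants, assuming purrr is installed. The input list is made up.

```r
library(purrr)

x <- list(a = 1:3, b = 4:6)

as_list <- map(x, mean)                   # list of outputs
as_dbl  <- map_dbl(x, mean)               # numeric vector
as_int  <- map_int(x, length)             # integer vector
as_lgl  <- map_lgl(x, ~ all(.x > 2))      # logical vector

# Base R analogue of map()
base_list <- lapply(x, mean)
```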

across function

Apply a function across columns

Used with summarize, mutate

Select columns by type with ‘where’, e.g. where(is.numeric)
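
A minimal sketch of across inside summarize, assuming dplyr is installed. The data frame is made up.

```r
library(dplyr)

df <- tibble(
  group = c("a", "a", "b"),
  x = c(1, 3, 5),
  y = c(2, 4, 6)
)

# Apply mean() to every numeric column; the character column is skipped
means <- summarize(df, across(where(is.numeric), mean))
```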

iteration over subgroups

‘group_modify’ applies functions to subgroups of a data frame

Define groups with ‘group_by’
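
A small sketch of group_by followed by group_modify, assuming dplyr is installed. The data and the summary computed are illustrative; the function passed to group_modify must return a data frame.

```r
library(dplyr)

df <- tibble(
  subject = c("A", "A", "B"),
  score = c(10, 12, 8)
)

# .x is the data for one group; the results are stacked back together
result <- df %>%
  group_by(subject) %>%
  group_modify(~ summarize(.x, mean_score = mean(score)))
```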

Animation plots

gganimate

transition_time: continuous; specify the name of the time variable that changes across frames

transition_states: discrete; specify the name of the discrete variable whose levels the plot changes over
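
A sketch of an animated plot specification, assuming ggplot2 and gganimate are installed. The data are made up, and the code only builds the plot object; rendering it with animate(p) additionally needs a renderer package such as gifski.

```r
library(ggplot2)
library(gganimate)

hiv <- data.frame(
  country = rep(c("A", "B"), each = 3),
  year = rep(2000:2002, times = 2),
  hiv = c(1.2, 1.4, 1.5, 0.3, 0.4, 0.6)
)

# transition_time() takes the continuous variable that drives the frames;
# {frame_time} in the title shows the current value of that variable
p <- ggplot(hiv, aes(x = country, y = hiv)) +
  geom_point() +
  labs(title = "Year: {frame_time}") +
  transition_time(year)
```

For a discrete variable, transition_states(country) would replace the transition_time call.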
