Optimal-transport analysis of single-cell gene expression identifies developmental trajectories in reprogramming

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/101

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 3:00 PM on 4/1/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

102 Terms

1
New cards

Understanding molecular programs guiding differentiation/dedifferentiation

is a major challenge

2
New cards

Bulk analysis can't resolve two problems simultaneously

discovering cell classes AND tracing development of each class

3
New cards

scRNA-seq destroys cells during profiling

cannot follow the same cell across time

4
New cards

Existing trajectory methods largely designed for stationary processes (e.g., adult stem cell niches)

not true time-courses

5
New cards

Key limitations of prior methods:

Most don't leverage temporal information

Graph-theoretic models impose hard constraints (1D edges, 0D branch points) — can't capture gradual fate divergence

Most don't account for cellular growth and death

6
New cards

Model a differentiating cell population as a time-varying probability distribution

ℙt in gene expression space

7
New cards

Sample this distribution at multiple time points and infer

how it evolves

8
New cards

Key assumption: over short time scales, cells move short distances in expression space

use Optimal Transport (OT) to infer how mass (cells) is redistributed between time points

9
New cards

Historical Origin of OT

Originally developed by Monge (1781) to redistribute earth for fortifications with minimal work

10
New cards

OT implicitly assumes a cell's fate depends only on its current state,

not history (Markov process)

11
New cards

Captures only time-varying components of the distribution

(not applicable to systems in equilibrium, where ℙt is constant)

12
New cards

OT calculates couplings between

consecutive time points

13
New cards

Longer-interval couplings are inferred by

composing adjacent transport maps (chain rule)

14
New cards

Descendant distribution: given a cell set C at time tᵢ, transport it forward

mass distribution over cells at tᵢ₊₁

15
New cards

Ancestor distribution:

"rewind" the coupling backward in time → mass distribution at tᵢ₋₁

16
New cards

Shared ancestry

convergence of ancestor distributions from two different cell sets reveals a common origin

17
New cards

A full trajectory

sequence of descendant or ancestor distributions across all time points

18
New cards

Local model

identify TFs enriched in cells with many vs. few descendants in a target population

19
New cards

Global model

modules of TFs → modules of target genes; predict gene signature expression at later time points from TF expression at earlier ones

20
New cards

Three time lags tested: 6hr, 48hr, 96h

to capture regulation on different timescales

21
New cards

Force-Directed Layout Embedding (FLE)

better captures global structure than t-SNE

22
New cards

Pipeline: 100-dimensional diffusion components → 20 nearest neighbors

ForceAtlas2 algorithm (Gephi)

23
New cards

Initial growth rates estimated from

proliferation and apoptosis gene signatures

24
New cards

Sigmoid function maps expression scores

to birth rates (βMIN = 0.3, βMAX = 1.7)

25
New cards

Doubling times range from

~9.6 hours (fast) to ~55 hours (slow)

26
New cards

Growth rates are then iteratively refined

by the OT computation (growth_iters = 3)

27
New cards

System:

Secondary MEFs from single E13.5 female embryo

Dox-inducible polycistronic OKSM cassette (Oct4/Klf4/Sox2/Myc)

Oct4-IRES-EGFP reporter for successful reprogramming

Phase 1 (Dox): days 0–8

Phase 2: either serum-free 2i medium or continued serum; Dox withdrawn at day 8

Oct4-EGFP+ cells emerge ~day 10

28
New cards

Major cell populations identified (by gene signature):

Pluripotent-like (iPSCs)

Epithelial-like

Trophoblast-like

Neural-like

Stromal-like

MET (mesenchymal-to-epithelial transition) region

29
New cards

Key early divergence: Stromal ancestors diverge from all others as early as day 1.5, sharpening over following days.

All other populations (iPSC, trophoblast, neural) remain indistinguishable until after day 8 (post-Dox withdrawal), when cells undergo MET.

30
New cards

Geodesic interpolation approach:

Given time points t₁ < t₂ < t₃, use OT to predict distribution at t₂ by interpolating between t₁ and t₃

31
New cards

OT performance is roughly as good as

batch-to-batch variation (near-baseline performance)

32
New cards

Quality degrades slightly over longer intervals

(2-day vs. 1-day gaps)

33
New cards

Robustness testing: Results stable across:

Down to 200 cells/batch

Down to 1,000 UMIs/cell

Wide range of βMAX, βMIN, δMAX, δMIN

Entropy regularization ε from 5×10⁻⁵ to 0.5

Unbalanced transport λ from 0.1 to 32

34
New cards

Stromal Region (SR):

Shows ECM rearrangement, senescence, cell cycle inhibitors, secretory phenotype (SASP)

35
New cards

Does NOT simply reflect "MEF reversion"

enriched for neonatal muscle and skin signatures (20–30×)

36
New cards

Peaks in abundance days 10.5–11

then declines due to low proliferation and apoptosis

37
New cards

Genomic aberrations: 2.1% whole-chromosome aneuploidy

3.2% sub-chromosomal (vs. 1.1%/1.2% baseline)

38
New cards

Frequent amplification of region containing Cdkn2a, Cdkn2b, Cdkn2c (cell cycle inhibitors)

Frequent loss of Cdk13 (promotes cycling) and Mapk9 (loss promotes apoptosis)

39
New cards

MET Region:

Increased proliferation, loss of fibroblast identity

40
New cards

OKSM transgene expression explains

~50% of variance in fate ratio by day 2, ~75% by day 5

41
New cards

Shisa8

most differentially expressed gene at day 1.5; expressed in 50% of top-quartile MET cells vs. 5% in bottom quartile; mammalian-specific Shisa family transmembrane adaptor

42
New cards

Fut9

synthesizes SSEA-1 glycoantigen; known MET marker

43
New cards

Id3 association with stromal fate is surprising (forced expression increases reprogramming efficiency)

possibly acts by enhancing paracrine signals from stromal cells to iPSCs

44
New cards

Nfix

represses embryonic expression programs; Nfic/Prrx1: associated with mesenchymal programs

45
New cards

Bottleneck: iPSC trajectory encompasses ~40% of cells at day 8.5, but only ~10% at day 10 (2i) and ~1% at day 11 (serum)

only a small, distinct subset of MET cells can become iPSCs

46
New cards

iPSC progenitors reside along thin "strings" in the FLE;

have not yet acquired full pluripotency signature but are transitioning rapidly

47
New cards

Day 11.5–12.5

Nanog, Zfp42, Dppa4, Esrrb, elevated cell cycle signature

48
New cards

Day 11.5

iPSC-like cells = 12% of cells

49
New cards

Days 15–18

iPSC-like cells = 80–90%

50
New cards

In serum

process delayed and less efficient; 3.5% by day 12.5, peaks at 10–15%

51
New cards

2-cell (2C) stage signature

~1% of iPSCs in both conditions

52
New cards

Wave 1 (days 9–10):

Nanog, Sox2, Mybl2, Elf3, Tgif1, Klf2, Etv5, Cdc5l, Klf4, Esrrb, Spic, Zfp42, Hesx1, Msc

53
New cards

Wave 2 (days 12–14)

Obox6, Sohlh2, Ddit3, Bhlhe40

54
New cards

Obox6 and Sohlh2

not expressed in any other fate trajectory; roles in germ cell maintenance/survival; not previously implicated in pluripotency

55
New cards

X-chromosome reactivation

Xist downregulated

Pluripotency-associated proteins expressed

X-chromosome reactivated

56
New cards

Trajectory 3: Trophoblast-like Cells

Emerge from MET Region after day 8; gain strong epithelial signature by day 9; trophoblast signature by day 10.5

57
New cards

Peak

at day 12.5 (~20% of all cells)

58
New cards

Remarkable diversity of subtypes

previously only some trophoblast genes had been noted but coherent cell types not identified

59
New cards

Trophoblast progenitors (TPs)

found in both 2i and serum

60
New cards

Spongiotrophoblasts (SpTBs)

both conditions

61
New cards

Spiral artery trophoblast giant cells (SpA-TGCs)

serum only

62
New cards

Labyrinthine trophoblasts (LaTBs)

~200 cells in 2i only

63
New cards

Primitive endoderm / XEN-like cells

181 cells from single collection

64
New cards

4.0% whole-chromosome aneuploidy

(vs. 1.1% baseline)

65
New cards

Recurrent sub-chromosomal amplification (8.6% of trophoblasts)

74-gene region on chr. 15 containing Wnt7b, Prr5, and "core trophoblast genes"

66
New cards

Prolactin gene cluster amplification on

chr. 13 in 1% of cells

67
New cards

Trajectory 4: Neural-like Cells (serum only, not 2i)

Form a prominent "spike" in the FLE under serum conditions

Ancestors diverge from trophoblast and iPSC ancestors by day 9

Rapid transition at day 12.5: lose epithelial signatures, gain neural signatures

68
New cards

Radial glial cells (RGCs)

appear first, concurrent with loss of epithelial identity at day 12.5

69
New cards

Three types of radial glial cells

Id3-RGC, Gdf10-RGC, Neurog2-RGC

70
New cards

Mature neurons and glia

emerge day 14 onwards; ancestors concentrated in Gdf10-RGCs at day 13.5

71
New cards

Why neural cells absent in 2i

MEK inhibitor in 2i medium blocks Cntfr signaling required for neural differentiation

72
New cards

Genomic aberrations

Very low — 0.4% sub-chromosomal aberrations

73
New cards

TFs predictive of neural fate:

Early neurogenesis: Rarb, Foxp2, Emx1, Pou3f2, Nr2f1, Myt1l, Neurod4

Late neurogenesis: Scrt2, Nhlh2, Pou2f2

Neural subtype survival: Onecut1, Tal2, Barhl1, Pitx2

Neural tube formation: Msx1, Msx3

74
New cards

Paracrine Signaling Analysis

Collected 415 ligand genes (cytokines, growth factors, hormones from GO) + 2,335 receptor genes

Identified 580 ligand-receptor pairs from curated mouse protein-protein interactions

Interaction score = fraction of cells expressing ligand X in set A × fraction expressing receptor Y in set B

Standardized against 10,000 permutation-based null distributions

75
New cards

Neural cells upregulate Cntfr one day before

neural-like cells appear (day 11.5)

76
New cards

Stromal cells begin expressing Cntfr ligands

(Crlf1, Lif, Clcf1) at day 10.5

77
New cards

In 2i

same ligand-receptor pairs seen but MEK inhibitor blocks downstream Cntfr signaling

78
New cards

Obox6

homeobox gene normally expressed in oocyte, zygote, early embryos, ESCs; unknown function in reprogramming

79
New cards

Expressed in <1% of cells before day 12

94% of Obox6+ cells biased toward MET fate

80
New cards

Test

Dox-inducible lentivirus (Obox6 or Zfp42 as positive control, or empty vector) in secondary MEFs, days 0–8

81
New cards

Result

Both Obox6 and Zfp42 increased reprogramming efficiency ~2-fold in 2i, even more in serum

82
New cards

Confirmed in primary MEFs

(independent experiment)

83
New cards

Gdf9 × Tdgf1

highest paracrine interaction score for iPSC lineage

84
New cards

Tdgf1 known to maintain pluripotency

role in establishing pluripotency not previously reported

85
New cards

Prior attempts to boost reprogramming with

GDF9 at days 0–2 failed

86
New cards

New approach:

recombinant mouse GDF9 added daily starting day 8 (when interaction score begins rising)

87
New cards

Result

GDF9 increased reprogramming efficiency 4–5 fold at highest dose (1 μg/ml) in serum

Confirmed by (i) Oct4-EGFP+ colony counting, (ii) bulk RNA-seq, (iii) scRNA-seq

Dose-dependent, confirmed in multiple independent experiments

88
New cards

GDF9 also increased fraction of neural fate cells

competitive relationship with iPSCs; TGFβ superfamily role in neural lineage specification warrants further study

89
New cards

Monocle2

Day 18 cells give rise to Day 8 cells which give rise to Day 4 cells (completely reversed); cannot distinguish iPSC, neural, and trophoblast fates

90
New cards

URD

Trophoblast lineage specified by day 0.5; neural/iPS arise from stromal cells (biologically implausible); >85% of cells from days 4–8 unassigned

91
New cards

FateID

Fates appear divergent from day 0 (no shared ancestry); trajectories skip time points

92
New cards

AGA

Day 0 cells directly connected to days 14–18 stromal cells; extensive stromal→iPSC transitions inferred

93
New cards

STITCH

iPSCs largely arising from stromal region (ignores differential growth rates)

94
New cards

scDiff/GPfates:

Trajectories to incoherent destinations (mixtures of very different cell types)

95
New cards

Category 2:

Not incorporating time information → temporal inconsistencies

96
New cards

Category 3

Not modeling differential cell growth rates → apoptotic stromal cells incorrectly inferred as iPSC ancestors

97
New cards

Probabilistic, distributional view of trajectories

not deterministic paths but distributions over ancestors/descendants

98
New cards

Handles out-of-equilibrium systems

explicitly designed for systems where ℙt changes over time

99
New cards

Growth-rate modeling is critical

without it, rapidly proliferating iPSCs incorrectly inferred to arise from apoptotic stromal cells

100
New cards

Gradual fate divergence

fates emerge gradually, not at sharp branch points; challenges graph-theoretic trajectory models

Explore top notes

note
The Pearl - Summary Notes
Updated 1161d ago
0.0(0)
note
Finance Class Notes
Updated 1377d ago
0.0(0)
note
SAT Test Taking Guide
Updated 386d ago
0.0(0)
note
Excretion
Updated 1336d ago
0.0(0)
note
science
Updated 1268d ago
0.0(0)
note
The Pearl - Summary Notes
Updated 1161d ago
0.0(0)
note
Finance Class Notes
Updated 1377d ago
0.0(0)
note
SAT Test Taking Guide
Updated 386d ago
0.0(0)
note
Excretion
Updated 1336d ago
0.0(0)
note
science
Updated 1268d ago
0.0(0)

Explore top flashcards

flashcards
Stories Information
102
Updated 1059d ago
0.0(0)
flashcards
BIO 106 Final
49
Updated 852d ago
0.0(0)
flashcards
APUSH Period 3
43
Updated 1198d ago
0.0(0)
flashcards
Biology Review
72
Updated 1167d ago
0.0(0)
flashcards
Georgia Test-Southern History
22
Updated 768d ago
0.0(0)
flashcards
Biologi: växtprov v.49
58
Updated 1213d ago
0.0(0)
flashcards
Stories Information
102
Updated 1059d ago
0.0(0)
flashcards
BIO 106 Final
49
Updated 852d ago
0.0(0)
flashcards
APUSH Period 3
43
Updated 1198d ago
0.0(0)
flashcards
Biology Review
72
Updated 1167d ago
0.0(0)
flashcards
Georgia Test-Southern History
22
Updated 768d ago
0.0(0)
flashcards
Biologi: växtprov v.49
58
Updated 1213d ago
0.0(0)