ChatGPT Influence on Spoken Vocabulary
Key Facts
- ChatGPT reached 100 million users within 2 months of its late 2022 release (fastest-growing consumer app).
- Study analyzed >700000 hours of YouTube & podcast speech to detect vocabulary change.
- Researchers identified “GPT words” (e.g., delve, realm, meticulous) that ChatGPT frequently adds when asked to polish text.
Methodology
- Step 1: Used ChatGPT to edit millions of written pages; extracted repeatedly inserted words ⇒ labeled as GPT words.
- Step 2: Collected speech corpora
- YouTube: 360000 videos
- Podcasts: 771000 episodes
- Samples taken from periods before and after ChatGPT release.
- Step 3: Compared GPT-word frequency to “synthetic controls” (weighted synonyms not common in ChatGPT output).
- Observation window: 18 months post-launch.
Findings
- Sharp surge in GPT-word usage across both scripted and spontaneous speech during the 18-month window.
- Indicates AI-to-human “cultural feedback loop”: patterns learned by LLMs return to influence human language.
Implications & Concerns
- Rapid, large-scale diffusion of AI-generated phrasing may narrow linguistic diversity.
- People imitate sources perceived as knowledgeable; growing reliance on AI could amplify its authority over other cultural influences.
- Future monitoring should expand beyond individual words to sentence structure, idea framing, and broader discourse patterns.
Takeaway
- AI’s effect on spoken language is already measurable within 2.5 years; key question shifts from if to how profound its cultural reshaping will be.