Corpus Linguistics

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/18

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

19 Terms

1
New cards
  • Provide negative evidence

  • Explain why

  • Provide all possible language at a time

Corpus limitations (3)

2
New cards
  • Frequency of words/phrases

  • Differences between spoken and written English

  • Common grammatical structures and collocations

  • Usage of modal verbs, idioms, and formal/informal language

  • Vocabulary size needed for conversation

Questions Corpus Linguistics can answer

3
New cards

Frequency

counts how often a word or phrase appears in a corpus

4
New cards

Concordance

Shows every occurrence of a word/phrase with its surrounding words (context)

5
New cards

Collocation

identifies words that frequently occur together with a target word

6
New cards
  • Comparison

  • Focus on the top most frequent words

  • Calculate a number of other types of phenomena

Approaches in Frequency Methods (3)

7
New cards

Type-token ration

  • measure of the amount of lexical repetition within a text

8
New cards

Lexical richness

expresses the number of unique lexical adjectives in a corpus

9
New cards
  • Mutual Information (MI) Score

  • Dice coefficient

Approaches in Collocation Methods (2)

10
New cards

Mutual Information Score

a method of calculating collocation based on the strength of a relationship beteen two words

11
New cards

Dice coefficient

Generally reveals more frequent lexical collocates

12
New cards

Dispersion

Distribution of keywords in a corpus

13
New cards

low type-token ratio

high lexical repetition

14
New cards

high type-token ratio

wider range of vocabulary

15
New cards

Lexical richness

  • number of hapaxes (words that occur only once in a text) as a percentage of the whole text

  • helpful in building a general profile of a particular text or corpus

16
New cards

Concordance

  • Table of all of the occurrences of a linguistic item in a corpus, presented within their linguistic contexts

  • Allows qualitative analyses to be carried out on corpus data, letting the researcher explore individual cases in detail

17
New cards

Collocation

demonstrating (relatively) exclusive or frequent relationships between words (or other linguistic phenomena)

18
New cards

Mutual Information (MI) Score

Takes into account the relative positions of two words across a whole corpus

  • If they usually occur close together and rarely occur apart then they will receive a high score

  • If they often occur together, but equally often occur apart, then their score will be lower

19
New cards

Dice coefficient

Identifying the collocates around a word gives us an indication about subtle meanings and connotations that a word possesses, which are rarely explained in dictionaries