10. corpus linguistics 2

11 Terms

1

collocation

a predictable combination of words that occur together more often than chance would suggest (e.g. "strong tea", "heavy rain")
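
As a rough illustration, candidate collocations can be spotted by counting adjacent word pairs. This is a minimal sketch on an invented sentence; the tokeniser is a simplification that ignores punctuation, and raw pair counts are only a first approximation (corpus tools usually add a statistical association measure).

```python
import re
from collections import Counter

# Invented example text, for illustration only.
text = "strong tea, heavy rain, strong tea, strong coffee"

# Simplified tokenisation: lowercase alphabetic runs.
tokens = re.findall(r"[a-z]+", text.lower())

# Count each pair of adjacent tokens (bigrams).
pair_counts = Counter(zip(tokens, tokens[1:]))

print(pair_counts.most_common(1))  # [(('strong', 'tea'), 2)]
```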

2

construction

in frequency analysis, the unit being counted: a single word, all word forms of a specific lemma, or a fixed expression

3

frequency

how often something (a word or construction) occurs in a corpus; counting this is the basis of frequency analysis

4

normalised frequency

a frequency expressed relative to corpus size (e.g. per million words), so that counts from corpora of different sizes can be compared

5

pmw / pttw

units used when comparing frequencies: per million words (pmw) and per ten thousand words (pttw). Formula: (number of occurrences / total number of words) × 10^x, where x = 6 for pmw and x = 4 for pttw
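
The formula can be sketched in a few lines. The function name `normalised_frequency` and the counts below are hypothetical, chosen only to show the arithmetic.

```python
def normalised_frequency(occurrences: int, corpus_size: int, base: int = 1_000_000) -> float:
    """Return the number of occurrences per `base` words."""
    if corpus_size <= 0:
        raise ValueError("corpus_size must be positive")
    return occurrences * base / corpus_size

# Hypothetical counts: 150 hits in a 2,000,000-word corpus.
pmw = normalised_frequency(150, 2_000_000)           # per million words
pttw = normalised_frequency(150, 2_000_000, 10_000)  # per ten thousand words

print(pmw, pttw)  # 75.0 0.75
```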

6

qualitative

the focus is not on how often a feature occurs; the data serve only as a basis for identifying and describing aspects of language use and for providing real-life examples of particular phenomena

7

quantitative

occurrences are counted and frequencies compared, often with the help of statistics, to uncover patterns

8

regular expression (regex)

a pattern written in a formal notation that describes a set of strings; used to search a text for matching strings or to check whether a string fits the pattern
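
A small sketch of both uses with Python's standard `re` module. The pattern and sentence are invented for illustration; the pattern targets some word forms of the lemma "walk".

```python
import re

# "walk" optionally followed by -s, -ed or -ing, as a whole word.
pattern = re.compile(r"\bwalk(?:s|ed|ing)?\b", re.IGNORECASE)

text = "She walks to work, he walked home, and they are walking now. The sidewalk was wet."

# Searching: find every match in the text ("sidewalk" is not matched).
matches = pattern.findall(text)
print(matches)  # ['walks', 'walked', 'walking']

# Validating: check whether an entire string fits the pattern.
def is_form(word: str) -> bool:
    return pattern.fullmatch(word) is not None

print(is_form("walking"), is_form("sidewalk"))  # True False
```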

9

token (in frequency analysis)

the total number of words or constructions, with every occurrence counted, including repeats

10

type (in frequency analysis)

the number of different (distinct) words or constructions, with each counted once however often it occurs
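
The difference between tokens and types can be seen by counting both in a short text. This is a minimal sketch; the sentence is invented and the tokeniser is a simplification (lowercased alphabetic runs).

```python
import re
from collections import Counter

# Invented example text, for illustration only.
text = "The cat sat on the mat. The dog sat too."

tokens = re.findall(r"[a-z]+", text.lower())
counts = Counter(tokens)

n_tokens = len(tokens)  # every occurrence counts
n_types = len(counts)   # each distinct word counts once

print(n_tokens, n_types)  # 10 7
```

Here "the" contributes three tokens but only one type.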

11