# SLPA 456 Exam 4

1

An analog signal is _____ and __ _______

Continuous and time-varying

2

Speech is an example of a _________ signal

Analog

3

A digital signal is ______.

Discrete

4

3 main parameters of sound

frequency, time, and amplitude

5

3 types of errors that can occur during ADC

Jitter, Quantization noise, and Aliasing

6

Jitter:

deviation in periodicity

• can be a result of irregularities in sampling rate

7

Quantization noise:

deviation in amplitude measures

• can be a result of rounding errors in the process of quantization
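The rounding error behind quantization noise can be illustrated with a minimal Python sketch. The `quantize` helper, the ±1.0 full-scale range, and the bit depths tried are assumptions chosen for illustration, not part of the course material.

```python
def quantize(sample, bits, full_scale=1.0):
    """Round a sample to the nearest of 2**bits evenly spaced levels."""
    levels = 2 ** bits
    step = 2 * full_scale / levels          # width of one quantization step
    index = round(sample / step)            # nearest level (this is the rounding)
    # Clamp to the range a signed integer of this bit depth can hold.
    index = max(-(levels // 2), min(levels // 2 - 1, index))
    return index * step


sample = 0.3
for bits in (3, 8, 16):
    quantized = quantize(sample, bits)
    # The gap between the true and stored value is the quantization error.
    print(bits, quantized, abs(sample - quantized))
```

More bits give smaller steps, so the rounding error shrinks as bit depth grows.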

8

Aliasing:

distortion due to misidentification of frequency

• can be a result of an inappropriate sampling rate

9

Digital Signal Processing (DSP)

Pre-Processing of a digital signal

10

Steps of DSP

Speech Signal

1. Filtering

2. Digitization

3. Frame Selection

4. Windowing

5. Short-term analysis

6. Graphic display or numeric output

11

Elements of filtering

Pre-emphasis, presampling

12

Elements of digitization

time sampling, quantization

13

Elements of frame selection

Frame length, frame overlap

14

Elements of windowing

tapering function

15

Elements of short-term analysis

FFT, LPC, Cepstrum

16

Elements of graphic display or numeric output

spectrogram, spectrum, other

17

Goal of filtering:

retain wanted parts of the signal while removing parts that do not necessarily provide any information

18

Pre-Sampling:

“anti-aliasing” - applying filters that block frequencies above the Nyquist frequency (half the sampling rate) before the signal is sampled
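As a rough illustration of why frequencies above the Nyquist frequency are blocked, the Python sketch below computes the frequency a pure tone would appear to have after sampling. The function names and the 10 kHz sampling rate are hypothetical, chosen only for this example.

```python
def nyquist_frequency(sampling_rate_hz):
    """Highest frequency that can be represented at this sampling rate."""
    return sampling_rate_hz / 2


def apparent_frequency(signal_hz, sampling_rate_hz):
    """Frequency a pure tone appears to have after sampling.

    Tones above the Nyquist frequency fold back (alias) into the
    range 0..Nyquist.
    """
    folded = signal_hz % sampling_rate_hz
    return min(folded, sampling_rate_hz - folded)


rate = 10_000
for tone_hz in (3_000, 7_000, 12_000):
    print(tone_hz, nyquist_frequency(rate), apparent_frequency(tone_hz, rate))
```

With a 10 kHz sampling rate, a 3 kHz tone is preserved, while 7 kHz and 12 kHz tones are misidentified as lower frequencies unless a filter removes them first.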

19

Aliasing

misrepresentation of frequency because the sampling rate is too low, so the original signal is underrepresented (undersampled)

20

Example of anti-alias filter:

DC offset

21

Pre-Emphasis:

Equalizes energy (boosts weaker components) over a specified range of frequencies so that important aspects of the signal have enough energy to be captured accurately within the available quantization bits
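One common way to pre-emphasize speech is a first-order filter that boosts the weaker high-frequency energy. The sketch below assumes that filter form and a coefficient of 0.97; both are illustrative assumptions and are not stated in the course material.

```python
def pre_emphasize(samples, coefficient=0.97):
    """First-order pre-emphasis: y[n] = x[n] - coefficient * x[n-1].

    Slowly changing (low-frequency) content is attenuated, so the
    faster-changing (high-frequency) content is relatively boosted.
    """
    if not samples:
        return []
    output = [samples[0]]                  # first sample has no predecessor
    for n in range(1, len(samples)):
        output.append(samples[n] - coefficient * samples[n - 1])
    return output


slow = [1.0, 1.0, 1.0, 1.0]      # no change from sample to sample
fast = [1.0, -1.0, 1.0, -1.0]    # changes sign every sample
print(pre_emphasize(slow))
print(pre_emphasize(fast))
```

The slowly varying input is almost cancelled, while the rapidly varying input comes out larger than it went in.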

22

Practical example of filtering in Aud and SLP

Measuring Auditory Brainstem Responses (ABR)

Removes:

• direct current (DC) signals from other electronic equipment

• 60 Hz hum from alternating current (AC) power sources

• background EEG activity, unwanted brain activity

• uses pre-emphasis method called differential amplification

• boosts level of desired evoked potential response while removing the extra noise.

23

Frame Selection/Windowing:

process of selecting which parts of signal to be analyzed

24

Window/Frame:

the portion of the signal selected to perform an analysis on

25

Windowing option examples:

Rectangle

Bartlett

Hanning

Hamming*

Blackman

Gauss
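Frame selection and windowing can be sketched together in Python. The example uses the Hamming taper from the list above; the helper names and the frame length and overlap values are assumptions made for illustration.

```python
import math


def hamming(length):
    """Hamming taper: 0.54 - 0.46 * cos(2*pi*n / (N - 1))."""
    if length == 1:
        return [1.0]
    return [0.54 - 0.46 * math.cos(2 * math.pi * n / (length - 1))
            for n in range(length)]


def split_into_frames(samples, frame_length, overlap):
    """Cut samples into frames that share `overlap` samples with the next."""
    hop = frame_length - overlap
    if hop <= 0:
        raise ValueError("overlap must be smaller than frame_length")
    last_start = len(samples) - frame_length
    return [samples[start:start + frame_length]
            for start in range(0, last_start + 1, hop)]


def apply_window(frame, window):
    """Multiply each sample by the taper so the frame edges fade out."""
    return [sample * weight for sample, weight in zip(frame, window)]


signal = [1.0] * 10
frames = split_into_frames(signal, frame_length=4, overlap=2)
windowed = [apply_window(frame, hamming(4)) for frame in frames]
print(len(frames), windowed[0])
```

A rectangular window would leave every sample unchanged; the tapered windows reduce the weight of samples near the frame edges.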

26

How is ABR recording windowed?

Based on a TIME-specific analysis!

27

Types of graphic displays of acoustic data:

Waveform, Spectrum, Spectrogram, Profiles or contours

28

Dimensions of a waveform:

Amplitude by time

29

Types of waveforms (temporal analysis)

raw, envelope

30

Dimensions of a spectrum:

Amplitude by frequency

31

Types of spectrums (Spectral Analysis)

Fast Fourier Transform, Linear Prediction Coding, Cepstrum

32

Dimensions of a spectrogram:

Amplitude by frequency by time

33

Types of spectrograms (speech (complex) analysis)

Conventional, contour, waterfall

34

Dimensions of Profiles or contours

Parameter by time

35

Types of profiles/contours:

f0 trace (pitch contour), intensity profile

36

Temporal (time-based) analysis works directly on the ______.

Waveform

37

What information can you analyze from a waveform?

Fundamental frequency

Perturbation Measures

Signal-to-noise ratio

Voice onset time

Vowel duration

Envelope

38

Fundamental Frequency:

lowest frequency at which a system oscillates/resonates freely; for voice, the rate of vocal fold vibration

39

Signal Processing Strategy used to get fundamental frequency:

Pitch determination algorithm (PDA) or pitch extractor

40

Temporal methods used by PDA:

Zero crossing

Peak Picking

Auto correlation (most modern)
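The autocorrelation method can be sketched as follows: the waveform is compared with delayed copies of itself, and the delay (lag) that matches best is taken as the fundamental period. The function name, the 50 to 500 Hz search range, and the test tone are assumptions for illustration; real pitch extractors add normalization and voicing checks.

```python
import math


def autocorrelation_f0(samples, sampling_rate_hz, min_hz=50.0, max_hz=500.0):
    """Estimate f0 from the lag where the signal best matches itself."""
    min_lag = int(sampling_rate_hz / max_hz)
    max_lag = int(sampling_rate_hz / min_hz)
    best_lag, best_score = min_lag, float("-inf")
    for lag in range(min_lag, max_lag + 1):
        # Sum of products between the signal and its delayed copy.
        score = sum(samples[i] * samples[i + lag]
                    for i in range(len(samples) - lag))
        if score > best_score:
            best_lag, best_score = lag, score
    return sampling_rate_hz / best_lag


# Hypothetical test tone: 0.1 s of a 100 Hz sine sampled at 8000 Hz.
rate = 8000
tone = [math.sin(2 * math.pi * 100 * n / rate) for n in range(800)]
print(autocorrelation_f0(tone, rate))
```

For this tone the best-matching lag is one full period (80 samples), so the estimate should come out at about 100 Hz.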

41

Zero Crossing:

counts every time a wave passes through the zero line within a second, then divides by two to obtain the fundamental frequency
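A minimal Python sketch of the zero-crossing count described above. The function name and the 100 Hz test tone are assumptions for illustration, and the method only behaves this cleanly on simple, noise-free waveforms.

```python
import math


def zero_crossing_f0(samples, sampling_rate_hz):
    """Count sign changes per second and divide by two."""
    crossings = 0
    for previous, current in zip(samples, samples[1:]):
        if (previous < 0) != (current < 0):   # the wave crossed the zero line
            crossings += 1
    duration_s = len(samples) / sampling_rate_hz
    # Each cycle crosses zero twice (once rising, once falling).
    return crossings / duration_s / 2


# Hypothetical test tone: one second of a 100 Hz sine sampled at 8000 Hz.
rate = 8000
tone = [math.sin(2 * math.pi * 100 * n / rate + 0.1) for n in range(rate)]
print(zero_crossing_f0(tone, rate))
```

The tone crosses zero about 200 times in one second, which gives an estimate of about 100 Hz.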

42

Peak Picking:

Fundamental frequency is derived by identifying wave peaks and counting either the crests alone or the troughs alone, OR counting all peaks (crests and troughs) and dividing by 2
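Peak picking can be sketched in Python by counting crests only, so no division by two is needed. The function name and the test tone are assumptions for illustration; a complex speech waveform would need extra rules to skip the smaller peaks inside each cycle.

```python
import math


def peak_picking_f0(samples, sampling_rate_hz):
    """Count crests (local maxima) per second."""
    crests = 0
    for i in range(1, len(samples) - 1):
        # A crest is higher than the sample before it and not lower
        # than the sample after it.
        if samples[i] > samples[i - 1] and samples[i] >= samples[i + 1]:
            crests += 1
    duration_s = len(samples) / sampling_rate_hz
    return crests / duration_s


# Hypothetical test tone: one second of a 100 Hz sine sampled at 8000 Hz.
rate = 8000
tone = [math.sin(2 * math.pi * 100 * n / rate + 0.1) for n in range(rate)]
print(peak_picking_f0(tone, rate))
```

Counting both crests and troughs would double the count, which is why that variant divides by 2.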

43

Perturbation measures:

3 types we can measure

• jitter

• shimmer

• signal to noise ratio

44

Perturbation:

a deviation from truly periodic and regular patterns of vibration of the vocal folds

45

Jitter:

variability in the fundamental period of phonation

• reported in an absolute value (ms) or relative value (%)

46

Jitter Percent:

obtained by dividing the absolute jitter value by the mean fundamental period
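The calculation can be sketched in Python. Absolute jitter is taken here as the mean absolute difference between consecutive periods, which is one common definition and an assumption on my part; the period values are made up for illustration.

```python
def absolute_jitter_ms(periods_ms):
    """Mean absolute difference between consecutive periods (ms)."""
    differences = [abs(later - earlier)
                   for earlier, later in zip(periods_ms, periods_ms[1:])]
    return sum(differences) / len(differences)


def jitter_percent(periods_ms):
    """Absolute jitter divided by the mean period, as a percentage."""
    mean_period = sum(periods_ms) / len(periods_ms)
    return 100 * absolute_jitter_ms(periods_ms) / mean_period


# Hypothetical cycle-to-cycle periods in milliseconds.
periods = [10.0, 10.2, 9.8, 10.0]
print(absolute_jitter_ms(periods), jitter_percent(periods))
```

A perfectly periodic voice would have identical periods and therefore 0% jitter.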

47

Shimmer:

variability of amplitude of successive cycles of waveform

• reported in an absolute value (dB) or relative value (shimmer %)

48

Shimmer Percent:

obtained by dividing absolute shimmer value by the mean amplitude of the waveform
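A Python sketch of both shimmer values follows. It assumes shimmer in dB is the mean absolute level difference between consecutive cycles, and that shimmer percent uses the mean absolute amplitude difference in linear units divided by the mean amplitude. These are common definitions but are assumptions here, and the amplitudes are made up.

```python
import math


def shimmer_db(amplitudes):
    """Mean absolute level difference between consecutive cycles (dB)."""
    steps = [abs(20 * math.log10(later / earlier))
             for earlier, later in zip(amplitudes, amplitudes[1:])]
    return sum(steps) / len(steps)


def shimmer_percent(amplitudes):
    """Mean absolute amplitude difference relative to mean amplitude (%)."""
    differences = [abs(later - earlier)
                   for earlier, later in zip(amplitudes, amplitudes[1:])]
    mean_difference = sum(differences) / len(differences)
    mean_amplitude = sum(amplitudes) / len(amplitudes)
    return 100 * mean_difference / mean_amplitude


# Hypothetical peak amplitudes of successive cycles (arbitrary linear units).
cycle_peaks = [1.0, 1.1, 0.9, 1.0]
print(shimmer_db(cycle_peaks), shimmer_percent(cycle_peaks))
```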

49

Signal to Noise Ratio:

Ratio of Periodic energy to aperiodic energy in the voice waveform

50

With NO background noise, SNR = _________

The intensity of the signal

51

When background noise is louder than the signal, SNR = ________

A negative value
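Both SNR cards follow from expressing the ratio in decibels, where dividing powers becomes subtracting levels. The sketch below assumes levels in dB and treats "no background noise" as a noise level of 0 dB, which is an interpretation made to reproduce the card above; the function names and values are hypothetical.

```python
import math


def snr_from_powers(signal_power, noise_power):
    """SNR in dB from a ratio of powers."""
    return 10 * math.log10(signal_power / noise_power)


def snr_from_levels(signal_db, noise_db):
    """SNR in dB when both levels are already in dB."""
    return signal_db - noise_db


print(snr_from_levels(60, 0))     # noise at 0 dB: SNR equals the signal level
print(snr_from_levels(50, 60))    # noise louder than signal: negative SNR
print(snr_from_powers(1.0, 10.0)) # same idea expressed as a power ratio
```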

52

Voice Onset Time:

duration of the interval between release of a stop consonant and the onset of vocal fold vibration (vowel production)

53

Vowel Duration:

duration of the interval over which the formant pattern (specifically F1 and F2) is stable

54

Envelope:

overall profile of waveform

55

Spectral (frequency-based) analysis operates directly on a _______

spectrum

56

Commonly used software for spectral analyses:

Audacity

PRAAT

Computerized Speech Lab (CSL)

57

Which spectral analysis software has few spectral analysis options?

Audacity

58

Which spectral analysis software is the most widely used acoustic freeware?

PRAAT

59

Which spectral analysis software is professional software?

Computerized Speech Lab

60

Major types of Spectral Analysis:

Fourier Transform: Discrete (DFT) and Fast (FFT),

Linear Predictive Coding (LPC),

Cepstral based analyses,

Mel Frequency Cepstral Coefficients (MFCC)

61

Fourier Transform

Decomposes a waveform to reveal its frequency content, converting the waveform into a power spectrum

62

Discrete Fourier Transform

Fourier transform of a finite set of discrete samples from the waveform (determined by sampling rate and windowing)

• transforms data from samples into distinct frequency lines within a power spectrum
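A direct (slow) DFT can be written in a few lines of Python to show how samples become frequency lines in a power spectrum. The function name and the test tone are assumptions for illustration; analysis software would use an FFT implementation instead of this loop.

```python
import cmath
import math


def dft_power_spectrum(samples, sampling_rate_hz):
    """Return (frequency_hz, power) pairs for bins from 0 Hz to Nyquist."""
    count = len(samples)
    spectrum = []
    for k in range(count // 2 + 1):
        # Correlate the samples with a complex sinusoid at bin k.
        total = sum(samples[t] * cmath.exp(-2j * cmath.pi * k * t / count)
                    for t in range(count))
        frequency_hz = k * sampling_rate_hz / count   # spacing = rate / count
        spectrum.append((frequency_hz, abs(total) ** 2))
    return spectrum


# Hypothetical test tone: 80 samples of a 100 Hz sine sampled at 800 Hz.
rate = 800
tone = [math.sin(2 * math.pi * 100 * n / rate) for n in range(80)]
peak_frequency, peak_power = max(dft_power_spectrum(tone, rate),
                                 key=lambda pair: pair[1])
print(peak_frequency)
```

The spacing between frequency lines is the sampling rate divided by the number of samples in the window, which is how sampling rate and windowing determine the result.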

63

Fast Fourier Transform

optimized algorithm to calculate DFT

• all speech analysis software packages have an implementation of FFT

64

Linear Predictive Coding

Based on the quasi-periodic nature of speech: by knowing certain parts of the speech signal, other parts can be predicted

65

Cepstrum

A Fourier transform performed on the spectrum

• inverse/transposition of spectrum

66

What is a cepstrum useful in investigating?

Periodicity/ rate of change of a signal

67

Terms associated with Spectrum vs. Cepstrum:

Spectrum: frequency and amplitude → Harmonics → filtering

Cepstrum: Quefrency and amplitude → Rahmonics → liftering

68

2 important features of a cepstrum:

• preserves magnitude information about the signal and discards phase-related information

• emphasizes periodic nature of harmonics

69

What do cepstrum algorithms reveal in a signal?

Once the signal is converted and one formant is found, pattern-finding algorithms can use it to locate the others

70

What do rahmonics show?

They correlate with the perceptual “quality” measures of voice

71

Mel Frequency Cepstral Coefficients (MFCC)

represent the short-term power spectrum of a sound

• represents frequency bands as evenly spaced on the mel scale, whereas the cepstrum spaces frequency bands linearly

• more representative of human auditory sensitivity (perception of pitch)
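The difference between mel spacing and linear spacing can be seen with a short Python sketch. It uses one widely used Hz-to-mel formula, 2595 · log10(1 + f/700); the choice of that formula, the function names, and the 0 to 8000 Hz range are assumptions for illustration.

```python
import math


def hz_to_mel(frequency_hz):
    """Convert Hz to mels with a commonly used formula."""
    return 2595 * math.log10(1 + frequency_hz / 700)


def mel_to_hz(mels):
    """Inverse of hz_to_mel."""
    return 700 * (10 ** (mels / 2595) - 1)


def mel_spaced_centers(low_hz, high_hz, count):
    """Band centers evenly spaced in mels, reported in Hz."""
    low, high = hz_to_mel(low_hz), hz_to_mel(high_hz)
    step = (high - low) / (count - 1)
    return [mel_to_hz(low + i * step) for i in range(count)]


centers = mel_spaced_centers(0, 8000, 5)
print([round(center) for center in centers])
```

The centers are equally far apart in mels but increasingly far apart in Hz, so low frequencies get finer resolution than high frequencies.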

72

Practically, when is mel frequency cepstral coefficients most useful?

in audio compression and speech recognition systems (e.g., HA mapping)

73

How to obtain formants:

by using any spectral analysis method

74

Two main characteristics of formants:

• peak in spectrum of a vowel sound or energy bands in spectrogram

• resonance of vocal tract

75

Which formants are typically used to describe most speech sounds?

F1 and F2

76

For vowels, what does F1 describe?

Tongue Height

77

For vowels, what does F2 describe?

Tongue position (front-back advancement)

78

Formant Amplitude:

Relative amplitude of formants in a formant pattern

79

Formant Space:

aka acoustic working space, acoustic vowel space, vowel triangle

• plot of F1 vs F2

• measures speech intelligibility

• several other measures are derived from formant space.

80

Examples of measures based on (static) formant space:

• vowel space area

• formant centralization ratio

• four vowel articulation index

• Formant centroid

• Vocalic anatomical functional ratio

• long-term formant distribution

81

Measurements based on “dynamic” aspect of formants:

• Formant Transition

• Formant Locus

• Formant Slope

• Locus equation

82

Vowels, glides, and consonants differ in degree of ________.

Constriction

83

Sonorant Consonants

NO pressure build up at constriction

84

Nasal Consonants

lower the velum allowing airflow in nasal cavity

85

Continuant Consonants

do not block airflow in oral cavity

86

Resonators:

specific state of vocal tract that amplifies frequencies near the natural frequency of that system

87

Natural Frequency of a resonator is based on _____.

Length and diameter of the vocal tract

88

Relation of harmonic frequencies to resonating frequency

If close to resonating frequency: will be amplified

If far from resonating frequency: will be dampened

89

Relationship of two formants when they are close in frequency to one another:

They tend to boost each other’s amplitude

90

Formant Bandwidth:

difference (in Hz) between the frequencies on either side of a formant's center frequency at which the intensity is 3 dB below the intensity at the center frequency
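Reading a 3 dB bandwidth off a spectrum can be sketched in Python. The function name, the linear interpolation between spectrum points, and the example spectrum values are all assumptions made for illustration.

```python
def bandwidth_3db(freqs_hz, levels_db, peak_index):
    """Width (Hz) between the points 3 dB below the peak on each side."""
    target = levels_db[peak_index] - 3.0

    def crossing(step):
        # Walk away from the peak while the level stays above the target.
        i = peak_index
        while 0 <= i + step < len(levels_db) and levels_db[i + step] > target:
            i += step
        j = i + step
        if not 0 <= j < len(levels_db):
            return freqs_hz[i]            # ran off the edge of the spectrum
        # Interpolate between the last point above and the first point
        # at or below the target level.
        fraction = (levels_db[i] - target) / (levels_db[i] - levels_db[j])
        return freqs_hz[i] + fraction * (freqs_hz[j] - freqs_hz[i])

    return crossing(+1) - crossing(-1)


# Hypothetical spectrum around one formant peak at 500 Hz.
freqs = [400, 450, 500, 550, 600]
levels = [20.0, 26.0, 30.0, 26.0, 20.0]
print(bandwidth_3db(freqs, levels, peak_index=2))
```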

91

On which graphic representation can you find formant bandwidth?

on a Spectrum

92

Practical use of formant space measurements:

represents maximum working space of a talker

• representative of maximum performance

93

Vowel Space Area

aka F1-F2 area

• calculated using a formula for the area enclosed by the vowels plotted on the formant space graph

• Used to study variety of speech and voice disorders
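One way to compute such an area is the shoelace formula for a polygon whose corners are the (F1, F2) values of the corner vowels. The sketch below assumes that approach; the function name and the formant values are hypothetical and not taken from the course material.

```python
def vowel_space_area(corner_vowels):
    """Polygon area (Hz squared) from (F1, F2) points listed in order."""
    total = 0.0
    count = len(corner_vowels)
    for i in range(count):
        f1_a, f2_a = corner_vowels[i]
        f1_b, f2_b = corner_vowels[(i + 1) % count]   # wrap to first point
        total += f1_a * f2_b - f1_b * f2_a            # shoelace cross term
    return abs(total) / 2


# Hypothetical (F1, F2) values in Hz for /i/, /a/, /u/.
triangle = [(300, 2300), (750, 1200), (350, 900)]
print(vowel_space_area(triangle))
```

The same function works for a four-vowel quadrilateral as long as the points are listed in order around the shape.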

94

Long term formant distribution (LTF)

• average formant frequency of a given speaker

• calculated by taking average of all formants across all vowels in recorded sample

• used to study variety of speech and voice disorders
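The averaging step can be sketched in Python. This version assumes each formant is averaged separately across all measured vowels (giving one mean for F1, one for F2, and so on), which is one reading of the description above; the function name and formant values are hypothetical.

```python
def long_term_formant_means(measurements):
    """Mean of each formant across all measured vowels.

    measurements: list of tuples such as (F1, F2) in Hz, one per vowel.
    """
    count = len(measurements)
    formants_per_vowel = len(measurements[0])
    return tuple(sum(vowel[i] for vowel in measurements) / count
                 for i in range(formants_per_vowel))


# Hypothetical (F1, F2) measurements in Hz from a recorded sample.
sample = [(300, 2300), (700, 1200), (350, 900)]
print(long_term_formant_means(sample))
```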

95

Primary use of LTF:

forensic speaker identification and in studying effects of age and sex on speech

96

When is speech dynamic?

when there are changes as a result of consonants embedded along with vowels -- typical running speech

97

Formant transition:

relative change from a vowel to a consonant

98

What speech sounds are formant transitions specifically associated with?

stop consonants

99

Formant locus:

characteristic value for each place of consonant articulation

** helpful to judge phonemes and speech intelligibility

100

Formant slope:

the change in formant frequency over an interval of formant transition

** helpful in studying speech intelligibility in dysarthric speakers
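Formant slope reduces to a change in frequency divided by a change in time. The Python sketch below assumes units of Hz per millisecond; the function name and the example values are hypothetical.

```python
def formant_slope(start_hz, end_hz, duration_ms):
    """Change in formant frequency per millisecond of transition."""
    if duration_ms <= 0:
        raise ValueError("duration_ms must be positive")
    return (end_hz - start_hz) / duration_ms


# Hypothetical F2 transition: 1200 Hz rising to 1800 Hz over 50 ms.
print(formant_slope(1200, 1800, 50))
```

A falling transition gives a negative slope, and a shallower slope means the formant changes more slowly over the same interval.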

