SLPA 456 Exam 4

0.0(0)

Studied by 0 people

0.0(0)

Call Kai

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Knowt Play

Card Sorting

1/163

Earn XP

Description and Tags

Aerospace Engineering

University/Undergrad

Study Analytics

Name	Mastery	Learn	Test	Matching	Spaced

No study sessions yet.

164 Terms

New cards

An analog signal is _____ __and__ __ _______

Continuous and time-varying

New cards

Speech is an example of a _________ signal

Analog

New cards

A digital signal is ______.

Discrete

New cards

3 main parameters of sound

frequency, time, and amplitude

New cards

3 types of errors that can occur during ADC

Jitter, Quantization noise, and Aliasing

New cards

Jitter:

deviation in periodicity

* can be a result of irregularities in sampling rate

New cards

Quantization noise:

deviation in amplitude measures

* can be result of rounding errors in process of quanization

New cards

Aliasing:

distortion due to misidentification of frequency

* can be result of inappropriate sampling rate

New cards

Digital Signal Processing (DSP)

Pre-Processing of a digital signal

New cards

Steps of DSP

Speech Signal

1. Filtering
2. Digitization
3. Frame Selection
4. Windowing
5. Short-term analysis

1. Graphic display or numeric output

New cards

Elements of filtering

Pre-emphasis, presampling

New cards

Elements of digitization

time sampling, quantization

New cards

elements of Frame Selection

Frame length, frame overlap

New cards

elements of windowing

tapering function

New cards

elements of short-term analysis

FFT, LPC, Cepstrum

New cards

elements of Graphic display or numeric output

spectogram, spectrum, other

New cards

Goal of filtering:

retain wanted parts of the signal while removing parts that do not necessarily provide any information

New cards

Pre-Sampling:

“anti-aliasing” - applying filters that block frequencies above the Nyquist frequency for that sample

New cards

Aliasing

underrepresentation of the sampling rate because the original signal is underrepresented

New cards

Example of anti-alias filter:

DC Off-set

New cards

Pre-Emphasis:

Equalizes (boosts weaker) energies over a specified range of frequencies so important aspects of signal have sufficient energy to accurately capture within the quantization bits available

New cards

Practical example of filtering in Aud and SLP

Measuring Auditory Brainstem Responses (ABR)

Removes:

* direct current (DC) signals from other electronic equipment

* 60 Hz hum from alternating current (AC) power sources
* background EEG activity, unwanted brain activity
* uses pre-emphasis method called differential amplification
* boosts level of desired evoked potential response while removing the extra noise.

New cards

Frame Selction/Windowing:

process of selecting which parts of signal to be analyzed

New cards

Window/Frame:

the portion of the signal selected to perform an analysis on

New cards

Windowing option examples:

Rectangle

Bartlett

Hanning

Hamming\*

Blackman

Gauss

New cards

How is ABR recording windowed?

Based on a TIME-specific analysis!

New cards

Types of graphic displays of acoustic data:

Waveform, Spectrum, Spectogram, Profiles or contours

New cards

Dimensions of a waveform:

Amplitude by time

New cards

Types of waveforms (temporal analysis)

raw, envelope

New cards

Dimensions of a spectrum:

Amplitude by frequency

New cards

Types of spectrums (Spectral Analysis)

Fast Fourier Transform, Linear Prediction Coding, Cepstrum

New cards

Dimensions of spectogram:

Amplitude by frequency by time

New cards

Types of spectrograms (speech (complex) analysis

Conventional, countour, waterfall

New cards

Dimensions of Profiles or contours

Parameter by time

New cards

Types of profiles/contours:

f0 trace (pitch contour), intensity profile

New cards

Temporal (time-based) analysis works directly on the ______.

Waveform

New cards

What information can you analyze from a waveform?

Fundamental frequency

Perturbation Measures

Signal-to-noise ratio

Voice onset time

Vowel duration

Envelope

New cards

Fundamental Frequency:

frequency at which a system oscillates/resonates freely

New cards

Signal Processing Strategy used to get fundamental frequency:

Pitch determination algorithm (PDA) or pitch extractor

New cards

Temporal methods used by PDA:

Zero crossing

Peak Picking

Auto correlation (most modern)

New cards

Zero Crossing:

counts every time a wave passes through the zero line within a second, then divides by two to obtain the fundamental frequency

New cards

Peak Picking:

Fundamental frequency is derived by identifying wave peaks and counting either the total number of crests or troughs OR total number of peaks in general and dividing by 2

New cards

Perturbation measures:

3 types we can measure

* jitter
* shimmer
* signal to noise ratio

New cards

Perturbgation:

a deviation from truly periodic and regular patterns of vibration of the vocal folds

New cards

Jitter:

variability in the fundamental period of phonation

* reported in an absolute value (ms) or relative value (%)

New cards

Jitter Percent:

obtained by dividing absolute jitter value by mean fundamental frequency period

New cards

Shimmer:

variability of amplitude of successive cycles of waveform

* reported in an absolute value (dB) or relative value (shimmer %)

New cards

Shimmer Percent:

obtained by dividing absolute shimmer value by the mean amplitude of the waveform

New cards

Signal to Noise Ratio:

Ratio of Periodic energy to aperiodic energy in the voice waveform

New cards

With NO background noise, SNR = _________

The intensity of the signal

New cards

When background noise is louder than the signal, SNR = ________

A negative value

New cards

Voice Onset Time:

duration of the interval between release of a stop consonant and the onset of vocal fold vibration (vowel production)

New cards

Vowel Duration:

duration of the interval over which the formant pattern (specifically F1 and F2) is stable

* aka vowel steady rate

New cards

Envelope:

overall profile of waveform

New cards

Spectral (frequency based) analysis operate directly on a _______

spectrum

New cards

Commonly used software for spectral analyses:

Audacity

PRAAT

Computerized Speech Lab (CSL)

New cards

Which spectral analysis software has few spectral analyses options?

Audacity

New cards

Which spectral analysis software is most widely used acoustic freeware?

PRAAT

New cards

Which spectral analysis software is professional software?

Computerized Speech Lab

New cards

Major types of Spectral Analysis:

Fourier Transform: Discrete (DFT) and Fast (FFT),

Linear Predictive Coding (LPC),

Cepstral based analyses,

Mel Frequency Cepstral Coefficients (MFCC)

New cards

Fourier Transform

Decomposes a waveform to reveal its frequency content to convert a waveform to a power spectrum

New cards

Discrete Fourier Transform

Fourier transform of a finite set of discrete samples from the waveform (determined by sampling rate and windowing)

* transforms data from samples into distinct frequency lines within a power spectrum

New cards

Fast Fourier Transform

optimized algorithm to calculate DFT

* all speech analyses software packages have an implementation of FFT

New cards

Linear Predictive Coding

Based on Quazi-periodic nature of speech, by knowing certain parts of the speech signal, other parts can be predicted

New cards

Cepstrum

A fourier transfer performed on the spectrum

* inverse/transposition of spectrum

New cards

What is a cepstrum useful in investigating?

Periodicity/ rate of change of a signal

New cards

Terms associated with Spectrum vs. Cepstrum:

Spectrum: frequency and amplitude → Harmonics → filtering

Cepstrum: Quefrency and amplitude → Rahmonics → liftering

New cards

2 important features of a cepstrum:

* preserves magnitude information about the signal and discard phase related info
* emphasizes periodic nature of harmonics

New cards

What do cepstrum algorithms reveal in a signal?

Converting the signal and finding one formant enables algorithms that help find patterns to find the others

New cards

What do rahmonics show?

correlates to the perceptual “quality” measures of voice

New cards

Mel Frequency Cepstral Coefficients (MFCC)

represent short-term power within a second

* represents frequency bands as evenly spaced whereas cepstrum represents frequency bands linearly
* more representative of human auditory sensitivity (perception of pitch)

New cards

Practically, when is mel frequency cepstral coefficients most useful?

in audio compression and speech recognition systems (eg. HA mapping)

New cards

How to obtain formants:

by using any spectral analysis method

New cards

Two main characteristics of formants:

* peak in spectrum of a vowel sound or energy bands in spectrogram
* resonance of vocal tract

New cards

Which formants are typically used to describe most speech sounds?

F1 and F2

New cards

For vowels, what does F1 describe?

Tongue Height

New cards

For vowels, what does F2 describe?

tongue position

New cards

Formant Amplitude:

Relative amplitude of formants in a formant pattern?

New cards

Formant Space:

aka acoustic working space, acoustic vowel space, vowel triangle

* plot of F1 vs F2
* measures speech intelligibility
* several other measures are derived from formant space.

New cards

Examples of measures based on (static) formant space:

* vowel space area
* formant centralization ratio
* four vowel articulation index
* Formant centroid
* Vocalic anatomical functional ratio
* long-term formant distribution

New cards

Measurements based on “dynamic” aspect of formants:

* Formant Transition
* Formant Locus
* Formant Slope
* Locus equation

New cards

Vowels, glides, and consonants differ in degree of ________.

Constriction

New cards

Sonorant Consonants

NO pressure build up at constriction

New cards

Nasal Consonants

lower the velum allowing airflow in nasal cavity

New cards

Continuant Consonants

do not block airflow in oral cavity

New cards

Resonators:

specific state of vocal tract that amplifies frequencies near the natural frequency of that system

New cards

Natural Frequency of a resonator is based on _____.

Length and diameter of the vocal tract

New cards

Relation of harmonic frequencies to resonating frequency

If close to resonating frequency: will be amplified

If far from resonating frequency: will be dampened

New cards

Relationship of two formants when they are close in frequency to one another,

They tend to boost each other’s amplitude

New cards

Formant Bandwidth:

difference (in Hz) between frequencies at +/- 3 dB of the intensity of the center frequency within a formant

\

New cards

Which graphic representation can you find formant bandwidth?

on a Spectrum

New cards

Practical use of formant space measurements:

represents maximum working space of a talker

* representative of maximum performance

New cards

Vowel Space Area

aka F1-F2 area

* calculated using a specific formula identifying the area of formant space graph
* Used to study variety of speech and voice disorders

New cards

Long term formant distribution (LTF)

* average formant frequency of a given speaker
* calculated by taking average of all formants across all vowels in recorded sample
* used to study variety of speech and voice disorders

\

New cards

Primary use of LTF:

forensic speaker identification and in studying effects of age and sex on speech

New cards

When is speech dynamic?

when there are changes as a result of consonants embedded along with vowels -- typical running speech

New cards

Formant transition:

relative shange from a vowel to a consonant

New cards

What speech sounds are formant transitions specifically associated with?

stop consonants

New cards

Formant locus:

characteristic value for each place of consonant articulation

\*\* helpful to judge phonemes and speech intelligibility

100

New cards

Formant slope:

the change in formant frequency over an interval of formant transition

\*\* helpful in studying speech intelligibility in dysarthric speakers