Probability Function
Random variable X, f(xi) = Pr(X = xi)
Dense Sampling Space
A sampling space that’s uncountable and continuous.
Probability Density Function
For every closed interval [a, b], Pr(a <= X <= b) is the integral of f(x) from a to b, where f(x) >= 0 for all x and the integral of f(x) from negative infinity to infinity is 1.
S is a dense sampling space
Powerset(S) = {ci} is the set of all subsets of S
X: S → T is a dense random variable defined over S where T = {xi} and X(ci) = xi
Probability Density Function Properties
For all x, Pr(X = x) = 0
P(X >= a) = integral of f(x) from a to infinity
P(X <= a) = integral of f(x) from negative infinity to a
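A minimal sketch checking these properties numerically, assuming an exponential density and using scipy (the choice of distribution is purely illustrative):

```python
from scipy.integrate import quad
from scipy.stats import expon

# The density is non-negative and integrates to 1 over the whole real line
# (the exponential density is 0 below x = 0, so integrating from 0 suffices).
total, _ = quad(expon.pdf, 0, float("inf"))
print(total)                                   # ~1.0

# P(X >= a) as the integral of f(x) from a to infinity, cross-checked
# against 1 - F(a) from the CDF.
a = 1.5
tail, _ = quad(expon.pdf, a, float("inf"))
print(tail, 1 - expon.cdf(a))                  # both ~0.2231
```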
Cumulative Probability Distribution Function
F(x) = P(X <= x)
X is a random variable defined over a sampling space
Cumulative Probability Distribution Properties
Monotonically non-decreasing, but not necessarily strictly increasing - x1 < x2 → F(x1) <= F(x2)
lim(x → -inf.) F(x) = 0 and lim(x → inf.) F(x) = 1
Pr(X > x) = 1 - F(x)
x1 < x2 → Pr(x1 < X <= x2) = F(x2) - F(x1)
Note - F(x) is always right-continuous, but not necessarily left-continuous
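A small sketch of these properties using the standard normal CDF from scipy (the choice of distribution is an assumption made only for illustration):

```python
from scipy.stats import norm

F = norm.cdf                      # F(x) = P(X <= x) for a standard normal
x1, x2 = -0.5, 1.2

print(F(x1) <= F(x2))             # True: non-decreasing
print(F(-100), F(100))            # ~0 and ~1: the limits at -inf and +inf
print(1 - F(x2))                  # Pr(X > x2)
print(F(x2) - F(x1))              # Pr(x1 < X <= x2)
```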
Joint Distribution
A collection of probabilities over a series of random variables’ sampling spaces: Pr(X1 = c_{x1,j}, …, Xn = c_{xn,k})
X are the random variables
n is the number of random variables
Powerset(Xi) = {cxi}
j and k index elements of the corresponding event sets {cxi}
Denoted f(x, y) in the two-variable (bivariate) case
Bivariate Distribution
Joint distribution where n = 2 (i.e. there are two random variables)
Contingency Tables
Frequency matrices that express joint frequencies for 2 or more categorical variables.
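A toy sketch of a contingency table built with pandas.crosstab; the variable names and observations below are made up for illustration:

```python
import pandas as pd

# Two categorical variables observed on six (made-up) individuals.
df = pd.DataFrame({
    "smoker":   ["yes", "no", "no", "yes", "no", "yes"],
    "exercise": ["low", "high", "high", "low", "low", "high"],
})

# Joint frequencies of the two variables as a frequency matrix.
table = pd.crosstab(df["smoker"], df["exercise"])
print(table)
```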
Mutual exclusion (Disjoint)
Where two sets never contain common outcomes (A ∩ B = {}).
For more than two sets, for each pair of sets denoted by i, j where i ≠ j, Ai ∩ Aj = {}
Probability of a union of events
Probability of a union of disjoint events
The sum of the individual probabilities: P(A1 U A2 U … U An) = P(A1) + P(A2) + … + P(An)
Probability of a union of two sets
P(A U B) = P(A) + P(B) - P(A ∩ B)
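A tiny numeric check of the formula, assuming six equally likely outcomes (the sets are illustrative):

```python
# Outcomes 1..6, each with probability 1/6.
A = {1, 2, 3}
B = {3, 4}
p = lambda s: len(s) / 6

# Both sides of P(A U B) = P(A) + P(B) - P(A ∩ B) agree.
print(p(A | B), p(A) + p(B) - p(A & B))   # 0.666..., 0.666...
```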
Independence of two sets
P(A ∩ B) = P(A) * P(B)
The occurrence of one event does not give us information about another event.
Statistically Independent Random Variables
The joint probability distribution function is factorisable in the form P(X, Y) = P(X) * P(Y)
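A short sketch checking the factorisation on a small joint table; the 2x2 probabilities are made-up values chosen to be independent:

```python
import numpy as np

# Joint distribution P(X, Y) as a table over two binary variables.
joint = np.array([[0.12, 0.28],
                  [0.18, 0.42]])

p_x = joint.sum(axis=1)    # marginal P(X)
p_y = joint.sum(axis=0)    # marginal P(Y)

# X and Y are independent iff the joint table equals the outer product
# of the marginals, i.e. P(X, Y) = P(X) * P(Y) entrywise.
print(np.allclose(joint, np.outer(p_x, p_y)))   # True
```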
Information contained in the occurrence of an event
I(ei) = -logb(P(ei)) = logb(1/P(ei))
Units of information
Depends on the base used:
Bits if 2
Hartleys if 10
Nats if e
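A minimal sketch of the self-information formula in each of the three bases; the event probability 0.25 is an illustrative value:

```python
import math

def information(p, base=2):
    # I(e) = -log_b(P(e)) = log_b(1 / P(e))
    return -math.log(p, base)

p = 0.25
print(information(p, 2))          # 2.0 bits
print(information(p, 10))         # ~0.602 hartleys
print(information(p, math.e))     # ~1.386 nats
```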
Entropy
The average amount of information you get from the source - a measure of uncertainty.
Affected by statistical independence: the joint entropy of independent variables is the sum of their individual entropies
The more common the event is, the less information it carries
Information (re: Entropy)
Resolves uncertainty, tells us more about a random variable.
Entropy Formula
H(X) = sum over all i of P(xi) * I(xi) = -(sum over all i of P(xi) * logb P(xi))
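A minimal sketch of the formula in base 2 (so the result is in bits); the example distributions are illustrative:

```python
import math

def entropy(probs, base=2):
    # H(X) = -sum_i P(x_i) * log_b P(x_i); outcomes with P = 0 contribute 0.
    return -sum(p * math.log(p, base) for p in probs if p > 0)

print(entropy([0.5, 0.5]))    # fair coin: 1.0 bit
print(entropy([0.9, 0.1]))    # biased coin: ~0.47 bits (less uncertainty)
```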
Joint Entropy Formula
H(X, Y) = -(sum over all xi in X of the sum over all yj in Y of P(xi, yj) * logb P(xi, yj))
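A short sketch of the joint entropy, assuming the joint distribution is supplied as a 2-D table P(xi, yj) that sums to 1 (values illustrative):

```python
import math

def joint_entropy(joint, base=2):
    # H(X, Y) = -sum_i sum_j P(x_i, y_j) * log_b P(x_i, y_j)
    return -sum(p * math.log(p, base) for row in joint for p in row if p > 0)

# Two independent fair bits: joint entropy is H(X) + H(Y) = 2 bits.
print(joint_entropy([[0.25, 0.25], [0.25, 0.25]]))   # 2.0
```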
Independent Component Analysis
Used to decompose a signal into statistically independent sources (bases).
In the formula Y = XB, ICA solves for both X and B at the same time.
Mathematically, solving X ~= X^ = WY, where W ~= B^-1
Only produces approximations
Independent Component Analysis Formula (Long)
Observe J linear mixtures y1, …, yJ composed of I = J sources x1, …, xI
For each j, yj = B(1, j)x1 + B(2, j)x2 + … + B(I, j)xI
Independent Component Analysis Formula
Y = XB, i.e. yj = sum over all i in I (sources) of B(i, j)xi
Independent Component Analysis Limitations
Always assumes the components are mutually independent, mean-centred and have non-Gaussian distributions
Cannot identify the number of source signals or the proper scaling of them
Ignores sampling order (time dependency) and works over variables rather than signals
Solving for an estimation - assumption (ICA)
That the sources are mutually (statistically) independent.
Solving for an estimation (ICA)
Maximise the joint entropy of g(X^), where X^ = WY
g is the cumulative distribution function of X^ (a squashing nonlinearity)
Independence of the sources is obtained by adjusting the unmixing matrix W ~= B^-1
This maximises the joint entropy H(g(X^)), which in turn minimises the mutual information between the estimated sources
Then use gradient ascent to take a small step in the direction of the gradient of H(g(X^))
Wnew = Wold + h * gradient(H(g(X^)))
h is the learning rate
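A minimal NumPy sketch of this entropy-maximisation update (a natural-gradient, Infomax-style rule with a logistic sigmoid as g, signals stored as rows so the mixing reads Y = BX rather than XB); the function name, step count and learning rate are illustrative assumptions, not the canonical algorithm:

```python
import numpy as np

def infomax_ica(Y, n_steps=2000, h=0.01, seed=0):
    # Y: observed mixtures, shape (n_sources, n_samples), assumed mean-centred.
    # Returns an unmixing matrix W such that X_hat = W @ Y has (approximately)
    # statistically independent rows.
    rng = np.random.default_rng(seed)
    n, m = Y.shape
    W = np.eye(n) + 0.01 * rng.standard_normal((n, n))   # start near identity
    for _ in range(n_steps):
        X_hat = W @ Y                          # current source estimates
        g = 1.0 / (1.0 + np.exp(-X_hat))       # logistic squashing g(X_hat)
        # Natural-gradient step that increases the joint entropy H(g(W Y)):
        # Wnew = Wold + h * (I + (1 - 2 g(X_hat)) X_hat^T / m) Wold
        W += h * (np.eye(n) + (1.0 - 2.0 * g) @ X_hat.T / m) @ W
    return W

# Toy usage: mix two independent non-Gaussian sources, then unmix them.
rng = np.random.default_rng(1)
sources = rng.laplace(size=(2, 5000))              # non-Gaussian sources
B = np.array([[1.0, 0.5],
              [0.3, 1.0]])                          # mixing matrix
Y = B @ sources                                     # observed mixtures
Y -= Y.mean(axis=1, keepdims=True)                  # mean-centre
W = infomax_ica(Y)
X_hat = W @ Y       # recovered sources (up to scaling and permutation)
```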