Time Series, Sequential Pattern Mining, and Network Analysis – Vocabulary

0.0(0)
studied byStudied by 0 people
GameKnowt Play
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/60

flashcard set

Earn XP

Description and Tags

61 vocabulary flashcards covering fundamental terms from lectures on Time Series, Sequential Pattern Mining, and Network Analysis.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

61 Terms

1
New cards

A sequence of data points ordered chronologically over a period of time.

Time Series

2
New cards

Technique that uses historical values and their patterns to predict future observations.

Time Series Forecasting

3
New cards

Modeling approach where current values depend on previous values (lags) of the same series; represented by parameter p in ARIMA.

Autoregression (AR)

4
New cards

Transformation that subtracts an observation from a prior one to remove trend/seasonality and achieve stationarity; parameter d in ARIMA.

Differencing

5
New cards

Time-series model that uses past forecast errors (lagged residuals); parameter q in ARIMA.

Moving Average (MA)

6
New cards

The long-term upward or downward movement in a series.

Trend (Time Series)

7
New cards

Long-term oscillations around the trend that usually last at least two years.

Cyclical Component

8
New cards

Regular, periodic fluctuations that repeat during the same period each year.

Seasonality

9
New cards

Time-series decomposition where observation equals T + S + C + I; seasonal amplitude independent of level.

Additive Model

10
New cards

Decomposition where observation equals T × S × C × I; seasonal amplitude varies with level.

Multiplicative Model

11
New cards

The unpredictable residual part of a time series after other components are modeled.

Irregular / Noise Component

12
New cards

Series with constant mean, variance, and autocorrelation over time.

Stationary Series

13
New cards

Plotting moving averages or variances to visually assess stationarity.

Rolling Statistics

14
New cards

Statistical test whose null hypothesis states the series is non-stationary (has a unit root).

Dickey-Fuller Test

15
New cards

Extension of the Dickey-Fuller test that includes lagged difference terms to detect a unit root.

Augmented Dickey-Fuller (ADF) Test

16
New cards

Kwiatkowski-Phillips-Schmidt-Shin test where the null hypothesis is trend stationarity.

KPSS Test

17
New cards

Auto-Regressive Integrated Moving Average model combining AR, differencing, and MA terms for forecasting.

ARIMA

18
New cards

Average of the absolute forecast errors.

Mean Absolute Error (MAE)

19
New cards

Average of the squared forecast errors, giving more weight to large errors.

Mean Squared Error (MSE)

20
New cards

Square root of MSE, expressing error in original units.

Root Mean Squared Error (RMSE)

21
New cards

Frequency at which observations in a time series are collected.

Sampling Rate

22
New cards

Data-mining task that discovers frequently occurring ordered events or subsequences in a set of sequences.

Sequential Pattern Mining (SPM)

23
New cards

Ordered list of itemsets or symbols in SPM.

Sequence

24
New cards

Single symbol within a sequence.

Item

25
New cards

Unordered set of distinct items occurring together at one time-stamp.

Itemset

26
New cards

Sequence derived by deleting zero or more items from another sequence without altering order.

Subsequence

27
New cards

Subsequence whose support meets or exceeds the minimum support threshold.

Sequential Pattern

28
New cards

Number of sequences in the database that contain a given sequence.

Support (SPM)

29
New cards

User-defined cut-off that a sequence’s support must reach to be considered frequent.

Minimum Support Threshold (minsup)

30
New cards

Principle stating that if a sequence is not frequent, none of its super-sequences can be frequent.

Apriori Property

31
New cards

Apriori-based algorithm that mines sequential patterns using a horizontal data format.

GSP (Generalized Sequential Pattern)

32
New cards

Apriori-based algorithm that mines sequential patterns using a vertical data format.

SPADE

33
New cards

Pattern-growth algorithm for sequential pattern mining that reduces candidate generation.

FreeSpan

34
New cards

Pattern-growth algorithm that projects databases based on sequence prefixes to find frequent patterns.

PrefixSpan

35
New cards

SPM type applied to data with limited alphabets, such as DNA sequences.

String Mining

36
New cards

SPM type focusing on ordered sets of itemsets, common in marketing and sales.

Itemset Mining (Sequential Context)

37
New cards

Implication X → Y indicating that if items in X occur, they are followed by items in Y.

Sequential Rule

38
New cards

Support of the rule divided by support of X; estimates the conditional probability P(Y|X).

Confidence (Sequential Rule)

39
New cards

Association measure where Lift > 1 indicates Y is more likely after X than under independence.

Lift

40
New cards

Dependence measure; values greater than 1 show strong dependence of X on Y.

Conviction

41
New cards

Association measure ranging from −1 (negative) to +1 (positive); near 0 suggests no association.

Zhang’s Metric

42
New cards

Collection of nodes connected by edges representing relationships.

Network (Graph)

43
New cards

Fundamental unit in a graph representing an entity.

Node (Vertex)

44
New cards

Connection between two nodes; may be directed or undirected.

Edge (Link)

45
New cards

Number of edges incident to a node.

Degree

46
New cards

Overall structural arrangement of connections in a graph.

Network Topology

47
New cards

Topology where every node has the same degree.

Regular Network

48
New cards

Graph whose edges are formed randomly between nodes.

Random Network

49
New cards

Network whose degree distribution follows a power law with a few highly connected hubs.

Scale-free Network

50
New cards

Group of nodes more densely connected internally than with the rest of the network.

Cluster / Community

51
New cards

Algorithms and techniques used to identify clusters within a graph.

Community Detection

52
New cards

Centrality metric equal to the number of connections a node has.

Degree Centrality

53
New cards

Metric counting the number of shortest paths that pass through a node, indicating its bridging role.

Betweenness Centrality

54
New cards

Centrality measure that rewards connections to other high-scoring nodes.

Eigenvector Centrality

55
New cards

Measure of how near a node is to all others based on shortest path lengths.

Closeness Centrality

56
New cards

Heuristic algorithm that maximizes modularity for community detection in large networks.

Louvain Method

57
New cards

Community detection algorithm that iteratively removes edges with highest betweenness centrality.

Girvan-Newman Method

58
New cards

Network growth model producing scale-free graphs via preferential attachment.

Barabási–Albert Model

59
New cards

Process of simulating or describing how networks form and evolve over time.

Network Modeling

60
New cards

Study of how nodes, edges, and network properties change over time.

Network Dynamics

61
New cards

Methodology for visualizing and analyzing relationships among entities in a social context.

Social Network Analysis (SNA)