Blockchain & Machine Learning Exam Notes

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/48

flashcard set

Earn XP

Description and Tags

These flashcards cover key concepts and definitions related to blockchain, Bitcoin, and machine learning based on exam notes.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

49 Terms

1
New cards

What does UTXO stand for in the context of Bitcoin?

Unspent Transaction Outputs.

2
New cards

What is mining in the blockchain context?

The process where miners take unconfirmed transactions from the mempool, form a block, and solve a Proof-of-Work (PoW) puzzle to add it to the blockchain.

3
New cards

What is the average time to mine a Bitcoin block?

10 minutes.

4
New cards

What is the maximum supply of Bitcoin?

21 million BTC.

5
New cards

How does Bitcoin determine consensus?

By accepting the longest valid chain.

6
New cards

What type of system does Bitcoin operate on?

Permissionless system where anyone can join.

7
New cards

What is the current reward for mining a block as per the notes?

3.125 BTC.

8
New cards

What is the mempool?

A temporary storage area where unconfirmed transactions wait before being included in a block.

9
New cards

What is overfitting in machine learning?

When a model learns too much detail or noise from the training data, resulting in poor generalization to new data.

10
New cards

What is one method to prevent overfitting?

Cross-validation.

11
New cards

What distinguishes supervised learning from unsupervised learning?

Supervised learning uses labeled data for predictions; unsupervised learning finds patterns in unlabeled data.

12
New cards

What does precision measure in classification metrics?

The number of true positive predictions out of all positive predictions made.

13
New cards

What is the purpose of K-Fold cross-validation?

To ensure that every data point is used for both training and testing, providing a robust performance estimate.

14
New cards

What is the F1 score?

The harmonic mean of precision and recall.

15
New cards

What is tokenization in text preprocessing?

The process of splitting text into individual words or tokens.

16
New cards

What does TF-IDF stand for?

Term Frequency-Inverse Document Frequency.

17
New cards

What is a Random Forest model?

An ensemble method that combines multiple decision trees to improve classification accuracy.

18
New cards

Why is the aggregation step important in Random Forest?

It combines predictions from multiple trees to determine the final output, reducing overfitting and improving accuracy.

19
New cards

What should you prioritize when false positives are costly?

Precision.

20
New cards

What does AUC-ROC measure in model evaluation?

The area under the ROC curve, assessing how well a model distinguishes between different classes.

21
New cards

What does UTXO stand for in the context of Bitcoin?

Unspent Transaction Outputs.

22
New cards

What is mining in the blockchain context?

The process where miners take unconfirmed transactions from the mempool, form a block, and solve a Proof-of-Work (PoW) puzzle to add it to the blockchain.

23
New cards

What is the average time to mine a Bitcoin block?

10 minutes.

24
New cards

What is the maximum supply of Bitcoin?

21 million BTC.

25
New cards

How does Bitcoin determine consensus?

By accepting the longest valid chain.

26
New cards

What type of system does Bitcoin operate on?

Permissionless system where anyone can join.

27
New cards

What is the current reward for mining a block as per the notes?

3.125 BTC.

28
New cards

What is the mempool?

A temporary storage area where unconfirmed transactions wait before being included in a block.

29
New cards

What is overfitting in machine learning?

When a model learns too much detail or noise from the training data, resulting in poor generalization to new data.

30
New cards

What is one method to prevent overfitting?

Cross-validation.

31
New cards

What distinguishes supervised learning from unsupervised learning?

Supervised learning uses labeled data for predictions; unsupervised learning finds patterns in unlabeled data.

32
New cards

What does precision measure in classification metrics?

The number of true positive predictions out of all positive predictions made.

33
New cards

What is the purpose of K-Fold cross-validation?

To ensure that every data point is used for both training and testing, providing a robust performance estimate.

34
New cards

What is the F1 score?

The harmonic mean of precision and recall.

35
New cards

What is tokenization in text preprocessing?

The process of splitting text into individual words or tokens.

36
New cards

What does TF-IDF stand for?

Term Frequency-Inverse Document Frequency.

37
New cards

What is a Random Forest model?

An ensemble method that combines multiple decision trees to improve classification accuracy.

38
New cards

Why is the aggregation step important in Random Forest?

It combines predictions from multiple trees to determine the final output, reducing overfitting and improving accuracy.

39
New cards

What should you prioritize when false positives are costly?

Precision.

40
New cards

What does AUC-ROC measure in model evaluation?

The area under the ROC curve, assessing how well a model distinguishes between different classes.

41
New cards

What is the purpose of splitting data into training and testing sets?

To evaluate the model's performance on unseen data and prevent overfitting.

42
New cards

What is "training data" used for in machine learning?

The dataset used to train a machine learning model, allowing it to learn patterns and relationships.

43
New cards

What is "testing data" used for in machine learning?

The dataset used to evaluate the performance of a trained machine learning model on unseen data.

44
New cards

What is the primary benefit of cross-validation in machine learning?

Provides a more reliable estimate of a model's performance on unseen data by reducing bias and variance in evaluation.

45
New cards

What is stop word removal in text preprocessing?

The process of removing common words (e.g., "the", "a", "is") from text that do not carry significant meaning for analysis.

46
New cards

What is stemming or lemmatization in text preprocessing?

Techniques used to reduce words to their root form (e.g., "running" to "run") to normalize text and reduce vocabulary size.

47
New cards

How can you reduce model complexity in machine learning?

By techniques such as regularization, feature selection, or using simpler model architectures to prevent overfitting.

48
New cards

What is the main goal of machine learning?

To enable systems to learn from data to identify patterns, make predictions, or improve performance on a specific task without explicit programming.

49
New cards

What is "feature engineering" in machine learning?

The process of creating new input features from existing ones to improve the performance of a machine learning model.