CS6262 Lecture 16 - Machine Learning for Security

0.0(0)

Studied by 0 people

0.0(0)

Call Kai

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Knowt Play

Card Sorting

1/68

There's no tags or description

Looks like no tags are added yet.

Study Analytics

Name	Mastery	Learn	Test	Matching	Spaced

No study sessions yet.

69 Terms

New cards

What is the primary goal of applying machine learning to intrusion detection?

To automatically and quickly identify new attacks and stop bad behavior early

New cards

Which type of analytics models normal network behavior and identifies deviations?

Anomaly detection

New cards

Which analytics approach combines misuse and anomaly detection?

Hybrid detection

New cards

Which type of analytics detects known attacks using signatures?

Misuse detection

New cards

Which type of analytics is capable of detecting zero-day attacks?

Anomaly detection

New cards

What is the main objective of machine learning given training examples?

To learn a function that can predict outputs from inputs

New cards

What occurs during the training phase of machine learning?

The algorithm learns a function by minimizing prediction error using labeled examples

New cards

What happens during the testing phase of machine learning?

The learned function is applied to new, unseen data to predict output values

New cards

Why should data used in machine learning be drawn from real-world applications?

To ensure that training and test data reflect realistic conditions

New cards

How is real-world data typically prepared for machine learning?

It is randomly split into training and test datasets

New cards

What is used as input for learning a predictive function in supervised machine learning?

Feature vectors with associated labels

New cards

What is a feature vector in the context of object recognition?

A numerical representation of an object based on extracted characteristics

New cards

What is the purpose of labeling feature vectors during training?

To provide correct outputs for learning the predictive function

New cards

What determines which features are useful in a machine learning model?

Features depend on the specific application being analyzed

New cards

What is the most important property of a good machine learning model?

The ability to generalize from training data to unseen test data

New cards

What does generalization mean in machine learning?

Correctly predicting outputs for new examples not seen during training

New cards

Which type of machine learning finds patterns or structure in unlabeled data?

Unsupervised learning

New cards

Which type of machine learning uses labeled data to learn a model that maps inputs to outputs?

Supervised learning

New cards

Which type of machine learning uses datasets where only some examples are labeled?

Semi-supervised learning

New cards

What is error rate in machine learning performance metrics?

The fraction of false predictions

New cards

What is accuracy in machine learning performance metrics?

The fraction of correct predictions

New cards

What is precision in machine learning metrics?

The fraction of correct positive predictions among all predicted positives

New cards

What is recall in machine learning metrics?

The fraction of correct positive predictions among all actual positives

New cards

Can precision and recall be used beyond binary classification?

Yes, they can be generalized to multi-class applications

New cards

What type of problems involve training datasets with attributes and class labels?

Classification problems

New cards

What is the goal of a machine learning classification model?

To output a class label based on a set of input attributes or features

New cards

How is a decision tree constructed in the training process?

By repeatedly partitioning data until each partition contains examples from only one class

New cards

What alternative perspective can a decision tree be viewed as?

A set of rules describing the decision logic

New cards

What is the first step in building a decision tree?

Find the best attribute to use as the root

New cards

What metrics are used to determine which attribute best partitions a dataset into subsets?

Entropy and information gain

New cards

What does entropy measure in the context of decision trees?

The purity of examples in a dataset based on class label distribution

New cards

When is entropy at its maximum?

When examples are evenly distributed among different classes

New cards

When is entropy at its minimum?

When all examples in a dataset belong to a single class

New cards

What does high information gain for an attribute indicate?

The attribute produces purer subsets when partitioning the dataset

New cards

What is the purpose of information gain in decision tree construction?

To evaluate how well an attribute separates samples according to their classifications

New cards

What does clustering do with training examples?

Assigns them into different clusters based on a distance measure

New cards

What is commonly used to measure similarity between examples in clustering?

A distance function

New cards

What is typically predetermined before beginning clustering?

The number of clusters

New cards

How is cluster membership determined?

By assigning samples to the cluster with the closest centroid

New cards

When does the clustering process stop?

When clusters converge and membership no longer changes

New cards

For detecting botnet command-and-control using supervised learning, what type of data is needed?

Labeled data with known C&C communication examples

New cards

What makes classifiers effective in machine learning security applications?

Smart features that capture domain knowledge

New cards

How can intrusion detection be viewed in machine learning terms?

As a classification problem distinguishing normal traffic from attack traffic

New cards

What is the goal when applying machine learning to intrusion detection?

To partition mixed traffic into pure-class subsets such as normal or specific attack types

New cards

What type of features are useful when partitioning traffic for intrusion detection?

Features with high information gain

New cards

What does raw network data need to be summarized into for machine learning processing?

Connection records

New cards

What types of attributes are typically included in a connection record?

Timestamp, duration, source IP, destination IP, bytes, service, and flag

New cards

What does the connection flag SF indicate?

That the connection completed both SYN and FIN

New cards

What does the connection flag REJ indicate?

That the connection request was rejected

New cards

What is one method for constructing useful intrusion detection features?

Using temporal and statistical patterns associated with attacks

New cards

What is an example of a temporal/statistical pattern useful for detecting intrusions?

Many S0 connections to the same service or host in a short time period

New cards

What is the first step in the high-level process of building intrusion detection models from network data?

Collect raw audit data such as packets

New cards

After capturing raw data, what is the next step in preparing it for machine learning?

Summarize data into connection records

New cards

What do we search for in connection records to help build intrusion detection features?

Frequent patterns

New cards

How do we identify unique intrusion-related behaviors from discovered patterns?

Compare frequent patterns to determine those associated specifically with intrusions

New cards

What do we construct after identifying unique intrusion-related patterns?

Features used to train classification models

New cards

How is the machine learning feature construction and improvement process described?

Iterative, with each step repeated to improve performance

New cards

What approach is used to discover patterns within the data?

Data mining algorithms

New cards

What is the purpose of association rule mining in this context?

To find associations among features, such as many S0 HTTP connections

New cards

Why may basic association rule and frequent episode algorithms produce useless patterns?

They can include irrelevant attributes not useful for intrusion detection

New cards

What modification is applied to basic pattern-finding algorithms for intrusion detection?

Restricting results to patterns involving essential or reference attributes

New cards

What is an axis attribute in the context of intrusion detection pattern mining?

The most important attribute, such as the service, that must appear in any association

New cards

Why is the axis attribute required in computed associations?

To eliminate patterns that involve only non-essential attributes

New cards

After computing associations involving axis attributes, what is the next step?

Compute sequential patterns involving the associations

New cards

What is the purpose of constructing features from intrusion-specific patterns?

To build classifiers that detect intrusions

New cards

Why is dataset selection challenging in intrusion detection?

Because there is no perfect way to label data and thus no perfect IDS dataset

New cards

What evaluation dataset is used to assess the intrusion detection approach?

The DARPA evaluation dataset

New cards

How many attack types are included in the DARPA dataset used for evaluation?

38 attack types

New cards

What are the four categories of attack types in the DARPA dataset?

Denial-of-Service, probing, remote-to-local, and user-to-root