Overview of the session on Large Language Models and Generative AI.
Instructor: Ivika Jäger, PhD
Date: November 2024
Institution: Mittuniversitetet, Mid Sweden University
Topics to be covered:
Machine learning methods
MIUN guidelines
Generative AI
Prompt design
Introduction to different methods of machine learning.
Characteristics of AI:
Highly adaptable and flexible
Lacks a user manual; requires trial and error
Applications based on user understanding of problems
Definition: Machines learning from data without explicit programming.
Key functions:
Recognizes patterns
Makes decisions
Improves with experience
Essential to modern AI applications.
Various types and applications:
Image Compression
Customer Retention
Discovery and Classification
Types of ML:
Supervised Learning
Unsupervised Learning
Reinforcement Learning
Fields of application:
Market predictions
Fraud detection
Weather forecasting
Uses labeled data to train the model.
Outputs predictions based on learned mapping.
Ideal for classification problems (image recognition, spam detection).
Tools like Teachable Machine for practical model training.
Operates on unlabeled data to discover patterns.
Analyzes data for clustering and anomaly detection.
Suitable for market segmentation and fraud detection.
Steps in K-means clustering:
Initialize centroid positions.
Assign labels to data points.
Update centroid positions.
Repeat until convergence.
Agents learn through interactions and receive rewards/penalties.
Focuses on maximizing reward in complex environments (e.g., autonomous vehicles).
Example use case in a gaming scenario (Trackmania) over days.
Utilizes neural networks with multiple layers.
Effective for large unstructured data and complex problems (e.g., navigation).
Structured vs. unstructured data processing.
Manual feature definition vs. automated learning.
Development of Large Language Models (LLMs) via deep learning.
Discussion on the integration of AI in educational practices.
Importance of 'when' and 'how' to use Generative AI.
Considerations:
Limitations and opportunities in academic settings.
Issues of privacy, security, and ethics.
AI concept recognition among students and its impact on academic integrity.
MIUN resources for students regarding academic integrity and AI use.
Encouragement of responsible AI use while maintaining academic integrity.
Practices include brainstorming, feedback, and tutoring.
Hands-on approach to learning with AI.
Emphasis on transparency and discussing outcomes.
Risks of direct copy-pasting without engagement.
Importance of critical interaction and proper prompting.
Contrast between Generative AI and traditional AI.
Generative AI creates content mimicking human output.
Trained on vast datasets to generate responses based on prompts.
Traditional AI focuses on tasks like classification, while Generative AI synthesizes new data.
Significant developments:
2020: OpenAI GPT-3
2017: Google Brain
Other notable milestones.
Learning patterns before fine-tuning for specific applications.
Techniques:
Next Token Prediction
Masked Language Modeling
Involves Reinforcement Learning with Human Feedback (RLHF).
Incorporates human ratings to understand and improve model performance.
Prioritization of user satisfaction over accuracy.
Risks of anthropomorphism and confirmation bias.
Functions such as text generation, translation, data analysis, and creative assistance.
Recognized limitations include lack of comprehension and ethical concerns.
Analysis of academic articles using OpenAlex concerning specific keywords.
Distinction between LLMs as core technology and AI applications.
Importance of skill in guiding AI outcomes through effective prompt design.
Initially relevant to software developers but valuable to all users.
Process of designing prompts for AI to achieve desired responses.
Specific and clear prompts lead to more insightful AI outputs.
Consistency, accuracy, efficiency resulting from structured prompts.
API automation for repeated utilization of prompts with varying outputs.
Emphasis on clarity, specificity, precision, and iterative improvements.
Defining expectations for AI outputs, specifying required details, and framing prompts positively.
Requesting character identification from a provided text context.
Using delimiters to separate tasks, context, and constraints within the input.
Control the length and depth of responses based on task requirements.
Instructions for summarizing text and specifying length limitations.
Examples of formats for AI outputs and guidelines for when conditions are unmet.
Tasking AI to classify sentiments of social media comments.
Sample output categorizing sentiment from social media comments.
Providing examples to guide AI style and structure.
Generating a catchphrase for a product with given constraints and context.
Importance of revising prompts based on initial outputs for optimal results.
Tasking AI to craft a structured professional email with constraints.
Techniques for guiding AI reasoning and maintaining consistent responses.
Generating creative ideas for user engagement in social media contexts.
Options available for further engagement with AI models.
Links to guides and platforms for effective prompt engineering and AI usage.
Instructions for analyzing sentiment and translations of social media comments.
Tasks involving summarizing, proofreading, and translating emails for clarity.
Extracting key information from passages for deeper understanding.
Measuring understanding and confidence in using LLMs post-lecture.