Study Notes on Multi-Modal Interaction Models and Flipped Interaction Patterns
Overview of Multi-Modal Interaction Models
These models and patterns are not limited to language, enabling interpretation across various information types.
It is crucial to understand that these systems can receive diverse types of information beyond human language.
Flipped Interaction Pattern
The flipped interaction pattern allows the system to ask questions or give commands to users or databases, providing a two-way communication channel.
Key concept: Feedback can be in various modalities, not solely text-based responses.
Example: Diagnosing Internet Connection
Scenario: The user requests help in diagnosing their computer's Internet connection problem step by step.
Initial Prompt: User instructs the system to ask for specific actions and expects images as feedback rather than text.
First Step: Asking for a screenshot of the user's current system to assess the environment properly.
User Action: Takes a screenshot of their laptop and uploads it.
System Response: "Thank you for the screenshot. It looks like you're using a Mac."
Detailed Steps in the Diagnosis Process
WiFi Connection Check
System prompts the user to check the WiFi connection by clicking the icon in the menu bar.
Instructions:
Check if WiFi is turned on.
Confirm connection to a WiFi network.
User Action: Takes a screenshot of their WiFi settings.
System Response: Identifies the WiFi network from the screenshot, confirming the user is connected to "Home 699E."
Network Preferences Verification
Next, the system instructs to open network preferences:
Click on WiFi settings after opening the WiFi dropdown.
Instructions:
Ensure WiFi connection status shows as connected with an assigned IP address.
User Action: Takes a screenshot of the network preferences window.
System Response: Confirms that WiFi has an IP address assigned, indicating the computer is connected.
Checking Internet Connectivity
The system proceeds to check for internet connectivity:
Instructions:
Open the terminal application and type "ping 8.8.8.8" and press enter.
Adaptation: System expertly turns a high-level task into a command that the computer can execute.
Feedback Feedback Mechanism: System continues to provide steps that require user compliance and screenshots for validation.
User Action: Instead of a screenshot, the user copies and pastes output from the ping command.
System Response: Interprets results of the ping command, indicating connectivity to Google's DNS with no packet loss.
Adaptability of Systems
The system continuously adapts based on the information provided at each step, enhancing the interaction experience.
Even with diverse types of input (screenshots vs. text), the system effectively understands and responds accordingly.
Manual Interfacing vs. Future Automation
Current interaction remains manual, requiring human involvement in the process.
Future potential: The expectation is that these systems will evolve to directly interface with machines to fulfill tasks and retrieve information autonomously.
Emphasis on eventual capabilities where automated systems communicate directly with technology rather than relying on user input for every action.