Data Processing and Information
Lesson Objectives
Understand the differences between data and information.
Distinguish between direct and indirect data.
Key Concepts
Difference between Data and Information
Data: Raw facts and figures without context or meaning.
Information: Data that has been processed and given context, thereby gaining meaning.
Examples:
Postal Codes (e.g.,
110053, 641609) are simply data until identified as such in context (like regions in India).Similarly, a number like
5is data until context makes it relevant (e.g., defining it as a prime number).
Data Processing
Data is converted to information through processing.
Binary Digits: Data is stored in computer systems as sequences of 1s and 0s.
Storage Media: Includes hard disks, SSDs, RAM, and more.
Data Processing Operations: Convert source data into usable information using software applications.
Example of processing: Transforming a CSV file in a spreadsheet to create summaries.
Real-Life case: A website creation firm identifying different data points (customer info, Page layout) as information.
Types of Data
Direct Data vs. Indirect Data
Direct Data: Collected for a specific purpose; examples include responses from questionnaires and surveys.
Example: A survey to modify bus routes involves collecting data specifically for that decision.
Indirect Data: Collected for one purpose but used for another; examples include statistical data or census data.
Example: A construction firm purchasing weather data originally collected for forecasting.
Methods of Data Collection
Interviews: Structured (same questions for all) vs. Unstructured (open responses).
Observational Methods: Watching behaviors, counting occurrences (e.g., traffic counts).
Data Logging: Using sensors and software to collect and process data automatically.
Questionnaires: Schedule simple or complex forms for instructing respondents.
Quality of Information
Factors affecting quality include:
Accuracy: Precision and error-free data.
Relevance: If data fulfills the intended purpose and context.
Timeliness: Information should be current.
Completeness: Must encompass all necessary parts of the problem.
Detail Level: Must be neither too detailed (overkill) nor too vague.
Validation and verification are critical to ensuring quality:
Validation: Ensuring reasonableness of data before processing.
Verification: Confirming data entry accuracy.
Data Processing Methods
Batch Processing: Processes data in large groups. Common uses: payroll calculations, credit card transactions, utility billing.
Online Processing: Data processed immediately, as seen in point-of-sale systems; known for real-time customer interaction.
Real-Time Processing: Systems act instantaneously based on immediate input; examples include automated temperature systems in greenhouses or bank transactions.
Advantages and Disadvantages
Batch Processing:
Advantages: Lower operational costs, effective under high load, and works during off-peak hours.
Disadvantages: Delayed output can be less effective for urgent needs.
Online Processing:
Advantages: Immediate responses and interactions.
Disadvantages: More expensive computational resources needed.
Real-Time Processing:
Advantages: Instantaneous responses impacting real-time environments.
Disadvantages: Requires significant computational power and constant monitoring.
Conclusion
Understanding data and its processing is crucial in a digital economy. Direct data collection enables specific insights, while handling and shaping that data into actionable information can significantly improve decision-making and operational efficiency across sectors.