Intro to Analytics Chapter 1

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/108

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

109 Terms

1
New cards

Descriptive Analytics

Focuses on summarizing and describing historical data to provide insights into past trends and patterns.

2
New cards

Diagnostic Analytics

Analyzes past data to identify the root causes of specific outcomes or events, answering the question 'Why did it happen?'

3
New cards

Predictive Analytics

Uses historical data to forecast future outcomes, addressing questions like 'What is likely to happen in the future?'

4
New cards

Prescriptive Analytics

Recommends actions to optimize or improve a situation, answering questions such as 'What should we do?'

5
New cards

Exploratory Analytics

Involves exploring and analyzing data to identify potential trends and insights when there is no clear objective.

6
New cards

Data Collection

The process of gathering large datasets from various sources, including databases and surveys.

7
New cards

Data Cleaning

The process of correcting data errors, removing duplicates, and ensuring dataset accuracy for analysis.

8
New cards

Data Reporting

Creating reports and visualizations that communicate findings to team members and stakeholders clearly.

9
New cards

Predictive Analysis

Utilizes algorithms to predict future trends and outcomes based on historical data.

10
New cards

Data-Driven Decision-Making

Making decisions based on the analysis of data to identify opportunities for improvement.

11
New cards

Continuous Improvement in Data Analytics

Involves professional development and adapting to new data sources, tools, and techniques.

12
New cards

Data Pipeline Development

Designing and developing processes that move data from source systems to data storage systems.

13
New cards

Data Storage Management

Managing data in databases, data lakes, and warehouses, including tasks like data partitioning and indexing.

14
New cards

Data Quality Control

Ensuring that data is accurate, consistent, and free of errors.

15
New cards

Machine Learning Model Training

The process of training machine learning models on data using various algorithms.

16
New cards

Data Integration

Combining data from various sources to create a unified view for analysis.

17
New cards

A/B Testing

A statistical method used to compare two versions of a product or service to determine which performs better.

18
New cards

Data Storytelling

The practice of presenting complex technical concepts and insights in a clear and understandable way.

19
New cards

Collaboration in Data Science

Working with engineers, developers, and stakeholders to design solutions that align with business goals.

20
New cards

Cloud Computing in Data Engineering

Using cloud platforms to deploy and manage data infrastructure.

21
New cards

Data Mining

A business process for exploring large amounts of data to discover meaningful patterns and rules.

22
New cards

Customer Relationship Management (CRM)

A broad topic focusing on managing a company's interactions with current and potential customers to improve business relationships.

23
New cards

Data Warehouse

A system that stores large amounts of historical data from various sources in a consistent format for analysis.

24
New cards

Meaningful Patterns

Significant data trends discovered through data mining that can help improve business operations.

25
New cards

Learning Relationships

The connections formed by businesses understanding and utilizing knowledge about their customers over time.

26
New cards

Predictive Modeling

A statistical technique used to predict future behavior by analyzing patterns in existing data.

27
New cards

Oeno-philes

Wine enthusiasts or experts who have extensive knowledge about wines.

28
New cards

Analytic CRM

The practice of using data mining and analytical techniques to enhance customer relationship management.

29
New cards

Computing Power

The capability of a computer to process data, which has become increasingly affordable and influential in data mining.

30
New cards

The Virtuous Cycle of Data Mining

A cyclical process involving discovering patterns, responding to them, and driving value through actionable insights.

31
New cards

Data Science

The practice of using data to understand and solve real-world problems.

32
New cards

Data Scientist

A professional who analyzes and interprets complex data to help companies make decisions.

33
New cards

Analytics

The process of taking data and putting it in front of the right people to support decision-making.

34
New cards

Machine Learning

A field of artificial intelligence that uses algorithms to allow software to improve its performance on tasks through experience.

35
New cards

Decision Science

The discipline of turning raw data into information that helps make informed business decisions.

36
New cards

Data Cleaning

The process of detecting and correcting (or removing) corrupt or inaccurate records from a dataset.

37
New cards

SQL

A programming language used for managing and querying data in relational databases.

38
New cards

ETL

Extract, Transform, Load - a process that involves extracting data from various sources, transforming it into a format suitable for analysis, and loading it into a database.

39
New cards

Venn Diagram of Data Science

A representation that shows the intersection of math/statistics, domain expertise, and programming skills necessary to succeed in data science.

40
New cards

Business Intelligence Analyst

A professional who uses data analysis tools and software to understand and improve business processes.

41
New cards

Data Science Bootcamp

An intensive, short-term program lasting 8 to 15 weeks that teaches data science skills.

42
New cards

Graduate Degree

An advanced academic program, usually requiring two years, focusing on data science or a related field.

43
New cards

Self-Teaching

The process of learning without a formal instructor, often through books, online courses, and personal projects.

44
New cards

Industry Connections

Relationships between educational programs and businesses that can help with internships and job placements.

45
New cards

Project Work

Hands-on tasks that involve applying data science skills to real-world problems, often included in bootcamp and academic curricula.

46
New cards

Application Process

The series of steps taken to apply to a graduate program, including submitting documents like transcripts and letters of recommendation.

47
New cards

Networking

Building professional relationships that can help in job searches, commonly emphasized in bootcamp programs.

48
New cards

Cost of Education

The financial expense associated with pursuing a degree or bootcamp, which can include tuition and opportunity costs.

49
New cards

Teaching Assistantship

A position often offered in graduate programs that involves assisting professors and may include funding for tuition.

50
New cards

Alumni Network

Groups of former students from a program that can provide support, connections, and job opportunities.

51
New cards

Big Data

Data that is so large, fast, or complex that it's difficult or impossible to process using traditional methods.

52
New cards

3 Vs of Big Data

Volume, Variety, and Velocity - these are the three key characteristics that define Big Data.

53
New cards

Data Scientist

A professional who uses statistical and computational skills to analyze and interpret complex data.

54
New cards

Business Intelligence (BI)

Technologies and strategies used by enterprises for data analysis of business information.

55
New cards

Data Science

An interdisciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge and insights from structured and unstructured data.

56
New cards

Analytic Sandbox

A secure environment for analysts to test and develop models using raw and aggregated data without affecting production systems.

57
New cards

Structured Data

Data that adheres to a predefined data model and is easily searchable, typically organized in rows and columns.

58
New cards

Unstructured Data

Data that does not have a predefined data model or structure, making it more complex to analyze.

59
New cards

Semi-structured Data

Data that does not conform strictly to a data model but contains tags or markers to separate elements.

60
New cards

Machine Learning

A subset of artificial intelligence that focuses on building systems that learn from data and improve their performance over time without being explicitly programmed.

61
New cards

Predictive Analytics

A form of advanced analytics that uses historical data, machine learning, and statistical algorithms to predict future outcomes.

62
New cards

Genomics

The branch of molecular biology concerned with the structure, function, evolution, and mapping of genomes.

63
New cards

Data Deluge

The overwhelming amount of data being generated today from various sources, requiring new strategies for data management.

64
New cards

OLAP

Online Analytical Processing, a category of software technology that enables analysts, managers, and executives to gain insight into data through fast, consistent, interactive access.

65
New cards

Hadoop

An open-source framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models.

66
New cards

Clickstream Data

Data generated from a user's interactions on a website, tracking their navigation path and how they interact with different elements.

67
New cards

Data Warehousing

The process of collecting and managing data from various sources to provide meaningful business insights.

68
New cards

Data Mining

The practice of examining large databases to generate new information, often using complex algorithms.

69
New cards

Project sponsor

The executive who authorizes the project and ensures alignment with the company's strategic goals.

70
New cards

Project manager

The individual overseeing day-to-day operations of the project, coordinating with stakeholders and ensuring the project stays on track.

71
New cards

Financial operations

The team responsible for managing the project's budget and finances to ensure it stays within budget.

72
New cards

Database administrator (DBA)

The professional managing the data infrastructure, ensuring secure and efficient data storage and access.

73
New cards

Researchers

Individuals responsible for interpreting data results and making recommendations based on statistical and computational analysis.

74
New cards

Partners

External stakeholders collaborating with the company, such as software vendors or consultants, providing expertise or tools.

75
New cards

End users (customers)

The ultimate beneficiaries of the project, who will benefit from an optimized inventory management system.

76
New cards

Budgeting (in financial operations)

The process of creating and maintaining the project budget, ensuring it reflects the project's needs.

77
New cards

Communication (in project management)

The project manager's responsibility to inform stakeholders about the project's progress, challenges, and successes.

78
New cards

Quality assurance (in project management)

The process of ensuring that the project meets quality standards and that deliverables meet project requirements.

79
New cards

Database security (DBA responsibility)

Implementing measures to protect project data from unauthorized access, theft, or corruption.

80
New cards

Testing and feedback (in end users)

The process in which end users test the project at various stages and provide feedback on usability and performance.

81
New cards

Providing recommendations (by researchers)

The act of researchers giving suggestions to inform project objectives and guide decision-making based on findings.

82
New cards

Collaborating with partners

The need for partners to work together to achieve project objectives effectively.

83
New cards

Providing user documentation (by end users)

The process in which end users review and ensure that project documentation meets their needs.

84
New cards

Providing project support (by project sponsor)

The project sponsor's role in guiding the project team and removing obstacles to stay on track.

85
New cards

Failure Criteria

Guidelines that help a team understand when to stop trying or settle for results during a project.

86
New cards

Key Stakeholders

Individuals or groups that will benefit from the project or be significantly impacted by it.

87
New cards

Success Criteria

Standards defined by stakeholders to judge the success of the project.

88
New cards

Analytics Sponsor

The person who funds the project and provides high-level requirements.

89
New cards

Active Listening

A communication technique that involves fully concentrating, understanding, responding, and remembering what is being said.

90
New cards

Interview Preparation

The process of drafting questions and reviewing them with colleagues before an interview.

91
New cards

Open-ended Questions

Questions designed to encourage detailed responses rather than simple yes or no answers.

92
New cards

Probing Questions

Follow-up questions intended to elicit more detailed information in an interview.

93
New cards

Documenting Findings

The practice of recording what was discussed in an interview for future reference.

94
New cards

Project Scope

The boundaries of what will be covered in a project, including its goals and objectives.

95
New cards

Stakeholder

A person, group, or organization that is actively involved in a project, is affected by its outcome, or can influence its outcome.

96
New cards

Business Stakeholders

Individuals from departments like marketing or product who oversee business decisions and request analyses to make better decisions.

97
New cards

Communication

The process of sharing information and updates with stakeholders regarding project timelines, progress, and findings.

98
New cards

Engineering Stakeholders

Teams responsible for maintaining software and hardware products who rely on data science work to inform technical decisions.

99
New cards

Corporate Leadership

High-level executives such as directors and vice presidents who need data to guide organizational strategies.

100
New cards

Your Manager

The individual who supervises your work, provides guidance, and can act as a stakeholder by influencing project direction.