0% found this document useful (0 votes)
43 views15 pages

Data Analytics 1 - THE History and Concepts of Data Analytics

The document discusses the history and concepts of data analytics. It outlines key events from 1890 to present day, including the invention of the Hollerith Tabulating Machine, the coining of the term "business intelligence," and the development of data mining, MapReduce, and the term "data scientist." Definitions are provided for business intelligence, data mining, data analytics, and big data. Advantages of data analytics include smarter decision making and improved efficiency, customer relationships, and risk management. Challenges of data analytics include strategic alignment, organizational culture, technology, skills, and data issues.

Uploaded by

Pauline Jawi
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
43 views15 pages

Data Analytics 1 - THE History and Concepts of Data Analytics

The document discusses the history and concepts of data analytics. It outlines key events from 1890 to present day, including the invention of the Hollerith Tabulating Machine, the coining of the term "business intelligence," and the development of data mining, MapReduce, and the term "data scientist." Definitions are provided for business intelligence, data mining, data analytics, and big data. Advantages of data analytics include smarter decision making and improved efficiency, customer relationships, and risk management. Challenges of data analytics include strategic alignment, organizational culture, technology, skills, and data issues.

Uploaded by

Pauline Jawi
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 15

Data Analytics 1 - THE

HISTORY AND CONCEPTS OF


DATA ANALYTICS
“The purpose of computing
is insight, not numbers”
Richard Hamming 1961
GIDEON & THE 300 SOLDIERS?
KEY EVENTS IN THE HISTORY OF DATA
ANALYTICS
• 1890-Herman Hollerith invents the Hollerith Tabulating Machine
which reduced crunching of census data from 10years to 3months!
KEY EVENTS IN THE HISTORY OF DATA
ANALYTICS
• 1962 John Tukey writes a paper title “The Future Of Data Analysis”, where he brought into
question the relationship between statistics and analysis.
• 1970-Edgar F. Codd presents his framework for relational databases.
• 1989-Howard Dresner at Gartner proposes the term “Business Intelligence.”
• 1990s-Data Mining is born following the success of the concept of data warehouses introduced by
William H. Inman.
• 1991-Tim Bernes Lee sets out the specifications for a worldwide, interconnected web of data
accessible to anyone across the world, now the internet.
• 2004-A whitepaper on MapReduce from Google inspires open source software projects like
Apache Hadoop and Apache Cassandra to deal with huge volumes of data through distributed
computing.
• 2008-Jeff Hammerbacher and DJ Patil, then at Facebook and LinkedIn coin the term “data
scientist” to describe their work and it then becomes a buzzword.
• 2013-IBM shows statistics that 90% of the world’s data was created in the preceding 2 years!
DEFINITION OF KEY TERMS
Business intelligence (BI) is an umbrella term that includes the
applications, infrastructure and tools, and best practices that enable
access to and analysis of information to improve and optimize decisions
and performance.(Gartner)
The purpose of Business Intelligence is to support better business
decision making. Essentially, Business Intelligence systems are data-
driven Decision Support Systems (DSS). Business Intelligence is
sometimes used interchangeably with briefing books, report and query
tools and executive information systems.
DEFINITION OF KEY TERMS
• Data Mining: is the computational process to discover patterns in
large datasets stored in relational databases and data warehouses.
• It is an intersection of artificial intelligence, machine learning,
statistics and database systems.
DATA ANALYTICS
DEFINITION OF DATA ANALYTICS
• Data Analytics also known as Predictive Analytics, is all about
automating insights into a dataset through usage of queries and data
aggregation procedures. It can represent various dependencies
between input variables and discover hidden patterns in the dataset
under analysis.
• Data Analytics is the science of examining raw data with the purpose
of finding and drawing conclusions about the information in the data
using methods from statistics and machine learning.
• Data Analytics goes beyond the concept of data mining by analysing
semi-structured and unstructured data from different sources and in
different formats e.g. text mining.
Advantages of Data Analytics
• Smart Decision Making
• Clearer insights into the inner workings of the business
• Improved Efficiency in business processes
• Better responds to market changes
• Better customer relationship management
• Better risk management
• Products and services tailored to customer need
BIG DATA
DEFINITION OF BIG DATA
• Big Data implies huge data volumes that cannot be processed effectively with
traditional applications. Big Data processing begins with raw data that is not
aggregated and it is often impossible to store such data in the memory of a single
computer.
• In other words big data analytics is an extension of traditional data analytics
which was mainly done on structured data e.g. stored in relational databases, to a
more complex analysis on structured, semi-structured and unstructured data.
• Big data is characterized by the famous 3 ‘Vs’-{volume, velocity and variety}
• Big data deals with huge volumes of data, whose rate of growing is very fast and
is from diverse sources and in different formats e.g. texts, tweets, web click
streams, satellite images, sensors, web log data etc.
• Big data analytics is fuelled by improvements in bandwidth and connectivity,
advancement in processing power and the falling price of hard disk.
SOURCES OF BIG DATA
THE SKILL SET FOR DATA
ANALYTICS
CHALLENGES OF DATA ANALYTICS
• Strategic Alignment
• Organizational Culture
• Technology
• Skills
• Data

You might also like