The Next Frontier for Innovation, Competition and Productivity
• ‘Big Data’ is similar to ‘small data’, but bigger
• …but bigger data requires different approaches:
• Techniques, tools and architecture
• …with the aim of solving new problems
• …or old problems in a better way
Dan Ariely
Professor of Psychology
Duke University, NC, USA
Eric Schmidt
CEO Google
Volume
• Data quantity
Velocity
• Data speed
Variety
• Data types
Veracity
• Messiness
Gartner (2001), IBM (2012)
4.4 zettabytes of data
4.4 trillion gigabytes
In 2013 there was as much data
as there are known stars in the whole universe
44 zettabytes of data
44 trillion gigabytes
62 times the number of grains of
sand on all the beaches on Earth
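
The gigabyte figures on this slide follow from the unit definitions. Here is a minimal sketch of the conversion, assuming decimal (SI) units, where 1 ZB is 10^21 bytes and 1 GB is 10^9 bytes. The function name is hypothetical and chosen only for illustration.

```python
from decimal import Decimal

# Assumption: decimal (SI) units, i.e. 1 ZB = 10**21 bytes and 1 GB = 10**9 bytes.
BYTES_PER_ZETTABYTE = 10**21
BYTES_PER_GIGABYTE = 10**9


def zettabytes_to_gigabytes(zettabytes: str) -> Decimal:
    """Convert a zettabyte figure to gigabytes.

    The figure is passed as a string so that Decimal keeps it exact
    instead of inheriting binary floating-point rounding.
    """
    gigabytes_per_zettabyte = BYTES_PER_ZETTABYTE // BYTES_PER_GIGABYTE  # 10**12
    return Decimal(zettabytes) * gigabytes_per_zettabyte


if __name__ == "__main__":
    # Express each slide figure in trillions of gigabytes.
    for figure in ("4.4", "44"):
        trillions = zettabytes_to_gigabytes(figure) / 10**12
        print(f"{figure} ZB = {trillions} trillion GB")
```

Under that assumption one zettabyte is exactly one trillion gigabytes, so the zettabyte and trillion-gigabyte figures are the same quantity stated twice.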
1 TB OF INFORMATION
captured by the NYSE
in every trading session
18.9 BILLION
network connections by 2016
2.5 network connections
per person on Earth
30 BILLION
PIECES OF CONTENT
shared each month on Facebook
400 MILLION TWEETS
are sent every day by 200 million
monthly active users
US$3.1 TRILLION A YEAR
is what poor-quality data costs
the US economy
1 IN 3 BUSINESS LEADERS
don’t trust the information they use
to make decisions
Where were you
15 days ago at 17:38?
Google collects all our digital
transactions:
Our trajectory …
Our clicks …
Our choices …
Our comments …
Our purchases …
Our search queries …
Why Big Data
• The key enablers of the emergence and growth of Big Data
are
–Increased storage capacity
–Increased processing power
–Availability of data
–Every day we create 2.5 quintillion bytes of data;
90% of the data in the world today has been
created in the last two years alone
Big Data Analytics
• Examining large amounts of data
• Extracting the appropriate information
• Identification of hidden patterns, unknown correlations
• Competitive advantage
• Better business decisions: strategic and operational
• Effective marketing, customer satisfaction, increased
revenue
Applications for Big Data Analytics
Homeland Security
Finance
Smarter Healthcare
Multi-channel sales
Telecom
Manufacturing
Traffic Control
Trading Analytics
Fraud and Risk
Log Analysis
Search Quality
Retail: Churn, NBO (next best offer)
Healthcare
• 80% of medical data is unstructured and is clinically
relevant
• Data resides in multiple places, such as individual EMRs,
lab and imaging systems, physician notes, medical
correspondence and claims
• Leveraging Big Data can help to:
–Build sustainable healthcare systems
–Collaborate to improve care and outcomes
–Increase access to healthcare
Market Size
Source: Wikibon, Taming Big Data
By 2015, 4.4 million IT jobs in Big Data; 1.9 million of them in the US alone
Market for Big Data
• Gaining traction
• Huge market opportunities for IT services (82.9% of
revenues) and analytics firms (17.1%)
• Current market size is $200 million; by 2015 it is expected to reach $1
billion
• The opportunity for service providers lies in offering
services around Big Data implementation and
analytics for global multinationals
NoSQL: non-relational, or at least non-SQL, database
solutions such as HBase (also part of the Hadoop
ecosystem), Cassandra, MongoDB, Riak, CouchDB and
many others.
Hadoop: an ecosystem of software packages,
including MapReduce, HDFS and a whole host of
others.
Thank you 
Hassen DHRIF
hassen.dhrif@openvision.tn

Editor's Notes

  • #2 ICP: according to IBM
  • #7 According to IBM
  • #8 According to IBM
  • #9 According to IBM
  • #10 According to IBM
  • #11 According to IBM
  • #16 Explain well. Quote practical examples
  • #21 NoSQL: an approach to data management and database design that is useful for very large sets of distributed data. Hadoop: a free, Java-based programming framework that supports the processing of large data sets in a distributed computing environment. MapReduce: a software framework that lets developers write programs that process massive amounts of unstructured data in parallel across a distributed cluster of processors or stand-alone computers. Map is a function that parcels out work to the different nodes in the cluster; Reduce is a function that collates the work and resolves the results into a single value.
  • #22 No need to explain. Mention some company names.
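
The Map and Reduce functions described in note #21 can be illustrated with a word count. This is a minimal single-process sketch in plain Python, not Hadoop's actual API: the names `map_phase`, `shuffle`, `reduce_phase` and `word_count` are hypothetical, and the chunks that a real cluster would send to different nodes are simply processed one after another here.

```python
from collections import defaultdict
from functools import reduce


def map_phase(chunk: str) -> list[tuple[str, int]]:
    """Map: turn one chunk of text into (word, 1) pairs."""
    return [(word.lower(), 1) for word in chunk.split()]


def shuffle(pairs: list[tuple[str, int]]) -> dict[str, list[int]]:
    """Group the intermediate pairs by key, between the two phases."""
    grouped: dict[str, list[int]] = defaultdict(list)
    for word, count in pairs:
        grouped[word].append(count)
    return grouped


def reduce_phase(counts: list[int]) -> int:
    """Reduce: collapse all the values for one key into a single value."""
    return reduce(lambda total, count: total + count, counts, 0)


def word_count(chunks: list[str]) -> dict[str, int]:
    # Assumption: each chunk stands in for the share of the input that
    # one node of a distributed cluster would receive.
    intermediate = [pair for chunk in chunks for pair in map_phase(chunk)]
    grouped = shuffle(intermediate)
    return {word: reduce_phase(counts) for word, counts in grouped.items()}


if __name__ == "__main__":
    print(word_count(["big data needs big tools", "data about data"]))
```

The grouping step between the two phases is what lets every key be reduced independently, which is why the work can be spread across many machines.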