SlideShare a Scribd company logo
DATA
DRIVEN
INNOVATION
Rome 2017 | Open Summit
MACHINE LEARNING
REAL LIFE APPLICATIONS
BY EXAMPLES
SPEAKER
MARIO CARTIA
MARIO@BIG-DATA.NINJA
DDI
R O M E| 2017
M A RI O C A RTI A
Can	machines
think?
Computing	 machinery and	intelligence.	
Mind,	59,	433-460	(1950)
Turing A.M.
DDI
R O M E| 2017
M A RI O C A RTI A
1968
2001:	A	Space	Odyssey
“I'm sorry Dave, I'm afraid I can't do that”
DDI
R O M E| 2017
M A RI O C A RTI A
1982
Supercar
DDI
R O M E| 2017
M A RI O C A RTI A
1983
Wargames
DDI
R O M E| 2017
M A RI O C A RTI A
1996
Kasparov	vs.
Deep Blue
DDI
R O M E| 2017
M A RI O C A RTI A
Does Deep Blue use artificial intelligence?
The short answer is "no." Earlier computer designs that tried
to mimic human thinking weren't very good at it. No formula
exists for intuition. So Deep Blue's designers have gone
"back to the future." Deep Blue relies more on computational
power and a simpler search and evaluation function.
The long answer is "no." "Artificial Intelligence" is more
successful in science fiction than it is here on earth, and you
don't have to be Isaac Asimov to know why it's hard to
design a machine to mimic a process we don't understand
very well to begin with.
Source:
https://www.research.ibm.com/deepblue/meet/html/d.3.3a.shtml
DDI
R O M E| 2017
M A RI O C A RTI A
Decision Tree (IF...	THEN)
DDI
R O M E| 2017
M A RI O C A RTI A
“Machine learning is the
subfield of computer science
that gives computers the
ability to learn without being
explicitly programmed.”
Arthur Samuel, 1959
DDI
R O M E| 2017
M A RI O C A RTI A
Spam	Email	Filtering
DDI
R O M E| 2017
M A RI O C A RTI A
Email	Category Tabs
DDI
R O M E| 2017
M A RI O C A RTI A
“If you can't
explain it simply
you don't
understand
it well enough”
DDI
R O M E| 2017
M A RI O C A RTI A
SUPERVISED	LEARNING
Supervised learning is where you
have input variables (x) and an
output variable (y) and you use an
algorithm to learn the mapping
function from the input to the
output
y=f(x)
DDI
R O M E| 2017
M A RI O C A RTI A
SUPERVISED	LEARNING
Classification is a general process
related to categorization, the process
in which ideas and objects are
recognized, differentiated, and
understood
A classification system is an approach
to accomplishing classification
DDI
R O M E| 2017
M A RI O C A RTI A
CLASSIFICATION
In Machine Learning, Naive
Bayes Classifiers are a family of
simple probabilistic classifiers
based on applying Bayes'
theorem with strong (naive)
independence assumptions
between the features
DDI
R O M E| 2017
M A RI O C A RTI A
NAIVE	BAYES	CLASSIFIERS
Naive Bayes has been studied
extensively since the 1950s and
remains a popular (baseline) method for
text categorization, the problem of
judging documents as belonging to one
category or the other (such as spam or
legitimate, sports or politics, etc.) with
word frequencies as the features
DDI
R O M E| 2017
M A RI O C A RTI A
TEXT CATEGORIZATION
SUPERVISED LEARNING
CLASSIFICATION
NAIVE BAYES CLASSIFIER
?
DDI
R O M E| 2017
M A RI O C A RTI A
DDI
R O M E| 2017
M A RI O C A RTI A
DDI
R O M E| 2017
M A RI O C A RTI A
DDI
R O M E| 2017
M A RI O C A RTI A
DDI
R O M E| 2017
M A RI O C A RTI A
DDI
R O M E| 2017
M A RI O C A RTI A
Recommendation system
DDI
R O M E| 2017
M A RI O C A RTI A
Recommendation system
DDI
R O M E| 2017
M A RI O C A RTI A
Recommendation system
DDI
R O M E| 2017
M A RI O C A RTI A
Recommendation system
DDI
R O M E| 2017
M A RI O C A RTI A
Recommendation system
DDI
R O M E| 2017
M A RI O C A RTI A
UNSUPERVISED	LEARNING
Unsupervised learning algorithms
are machine learning algorithms
that work without a desired output
label
Essentially, the algorithm attempts
to estimate the underlying structure
of the population of input data
DDI
R O M E| 2017
M A RI O C A RTI A
UNSUPERVISED	LEARNING
Collaborative Filtering is a method of making
automatic predictions (filtering) about the
interests of a user by collecting preferences or
taste information from many users
(collaborating)
In the more general sense, Collaborative
Filtering is the process of filtering for
information or patterns using techniques
involving collaboration among multiple agents,
viewpoints, data sources, etc.
DDI
R O M E| 2017
M A RI O C A RTI A
COLLABORATIVE	FILTERING
Applications of Collaborative Filtering typically
involve very large data sets
As the numbers of users and items grow,
traditional CF algorithms will suffer serious
scalability problems
Large web companies use clusters of
machines to scale recommendations for their
millions of users
DDI
R O M E| 2017
M A RI O C A RTI A
RECOMMENDATION SYSTEM
UNSUPERVISED LEARNING
COLLABORATIVE FILTERING
USER BASED / ITEM BASED / OTHER
?
DDI
R O M E| 2017
M A RI O C A RTI A
DDI
R O M E| 2017
M A RI O C A RTI A
DDI
R O M E| 2017
M A RI O C A RTI A
DDI
R O M E| 2017
M A RI O C A RTI A
DDI
R O M E| 2017
M A RI O C A RTI A
Targeted Advertising
DDI
R O M E| 2017
M A RI O C A RTI A
Targeted Advertising
DDI
R O M E| 2017
M A RI O C A RTI A
UNSUPERVISED	LEARNING
Cluster analysis or Clustering is the
task of grouping a set of objects in
such a way that objects in the same
group (called a cluster) are more
similar (in some sense or another)
to each other than to those in other
groups (clusters)
DDI
R O M E| 2017
M A RI O C A RTI A
CLUSTERING
K-means clustering is a method of vector
quantization, originally from signal
processing, that is popular for cluster
analysis in data mining
K-means clustering aims to partition n
observations into k clusters in which each
observation belongs to the cluster with
the nearest mean
DDI
R O M E| 2017
M A RI O C A RTI A
TARGETED ADVERTISING
UNSUPERVISED LEARNING
CLUSTERING
K-MEANS CLUSTERING
?
DDI
R O M E| 2017
M A RI O C A RTI A
DDI
R O M E| 2017
M A RI O C A RTI A
DDI
R O M E| 2017
M A RI O C A RTI A
TYPICAL	ML	WORKFLOW
ü Data and problem definition
ü Data collection
ü Data preprocessing
ü Data analysis and modeling with
unsupervised and supervised
learning
ü Process evaluation
DDI
R O M E| 2017
M A RI O C A RTI A
EVALUATION	METRICS
The root-mean-square deviation
(RMSD) or root-mean-square error
(RMSE) is a frequently used
measure of the differences between
values (sample and population
values) predicted by a model or an
estimator and the values actually
observed
DDI
R O M E| 2017
M A RI O C A RTI A
BEYOND	ML
Deep learning is a branch of
machine learning based on a set of
algorithms that attempt to model
high level abstractions in data
Deep learning is part of a broader
family of machine learning methods
based on learning representations
DDI
R O M E| 2017
M A RI O C A RTI A
BEYOND	ML
One of the promises of Deep Learning
is replacing handcrafted features with
efficient algorithms for unsupervised
or semi-supervised feature learning
and hierarchical feature extraction
Some of the representations are
inspired by advances in neuroscience
DDI
R O M E| 2017
M A RI O C A RTI A
BEYOND	ML
Various Deep Learning architectures
such as deep neural networks have
been applied to fields like computer
vision, automatic speech recognition,
natural language processing, audio
recognition and bioinformatics where
they have been shown to produce state-
of-the-art results on various tasks
DDI
R O M E| 2017
M A RI O C A RTI A
DDI
R O M E| 2017
M A RI O C A RTI A
DDI
R O M E| 2017
M A RI O C A RTI A
ML	&	BIG	DATA
“We don’t have better algorithms.
We just have more data.”
Peter Norvig
Google’s Research Director
DDI
R O M E| 2017
M A RI O C A RTI A
ML	&	BIG	DATA
Apache Hadoop is an open-source
software framework used for distributed
storage and processing of big data sets
using clusters built from commodity
hardware
DDI
R O M E| 2017
M A RI O C A RTI A
ML	&	BIG	DATA
Apache Spark is a fast and general-purpose
cluster computing system
It provides high-level APIs in Scala, Java, Python
and R, and an optimized engine that supports
general execution graphs
It also supports a rich set of higher-level tools
including Spark SQL for SQL and structured data
processing, MLlib for machine learning, GraphX
for graph processing, and Spark Streaming
DDI
R O M E| 2017
M A RI O C A RTI A
ML	&	BIG	DATA
DDI
R O M E| 2017
M A RI O C A RTI A
WHY	TO	USE	SCALA?
Spark Survey 2016
DDI
R O M E| 2017
M A RI O C A RTI A
WHY	TO	USE	SCALA?
Scala is one of the most exciting languages to be created in the 21st
century. It is a multi-paradigm language that fully supports functional,
object-oriented, imperative and concurrent programming. It also has a
strong type system, and from our point of view, strong type is a
convenient form of self-documenting code.
Scala works on the JVM and has access to the riches of the Java
ecosystem, but it is less verbose than Java. As we employ it for ND4J,
its syntax is strikingly similar to Python, a language that many data
scientists are comfortable with. Like Python, Scala makes
programmers happy, but like Java, it is quite fast.
Finally, Apache Spark is written in Scala, and any library that purports
to work on distributed run times should at the very least be able to
interface with Spark
Source: https://deeplearning4j.org/scala
DDI
R O M E| 2017
M A RI O C A RTI A
GRAZIE!
MARIO@BIG-DATA.NINJA

More Related Content

What's hot (20)

PPT
Machine Learning
Rahul Kumar
 
PDF
Artificial Intelligence and the Future of Humanity
Gerd Leonhard
 
PDF
Principles of Artificial Intelligence & Machine Learning
Jerry Lu
 
PPTX
Machine Learning and Real-World Applications
MachinePulse
 
PPTX
Artificial intelligence PPT (AI PPT)
RAONEvv
 
PPTX
Artificial Intelligence
Javaria Chiragh
 
PDF
History of AI, Current Trends, Prospective Trajectories
Giovanni Sileno
 
PPTX
Artificial intelligence
Guduru Lakshmi Kiranmai
 
PPTX
Artificial intelligence- The science of intelligent programs
Derak Davis
 
DOCX
Artificial intelligence-full -report.doc
daksh Talsaniya
 
PDF
Generative AI and Security (1).pptx.pdf
Priyanka Aash
 
PDF
The Danger and Risk of AI
Apac Business Headlines
 
PPTX
Intro/Overview on Machine Learning Presentation
Ankit Gupta
 
PPTX
Machine learning
Rajesh Chittampally
 
PPTX
History of AI
Megha Sharma
 
PPTX
Image recognition
Harika Nalla
 
PPTX
Elements Of Artificial Intelligence
MeghaGambhire
 
PPTX
Artificial Intelligence Course | AI Tutorial For Beginners | Artificial Intel...
Simplilearn
 
PDF
Lecture1 AI1 Introduction to artificial intelligence
Albert Orriols-Puig
 
Machine Learning
Rahul Kumar
 
Artificial Intelligence and the Future of Humanity
Gerd Leonhard
 
Principles of Artificial Intelligence & Machine Learning
Jerry Lu
 
Machine Learning and Real-World Applications
MachinePulse
 
Artificial intelligence PPT (AI PPT)
RAONEvv
 
Artificial Intelligence
Javaria Chiragh
 
History of AI, Current Trends, Prospective Trajectories
Giovanni Sileno
 
Artificial intelligence
Guduru Lakshmi Kiranmai
 
Artificial intelligence- The science of intelligent programs
Derak Davis
 
Artificial intelligence-full -report.doc
daksh Talsaniya
 
Generative AI and Security (1).pptx.pdf
Priyanka Aash
 
The Danger and Risk of AI
Apac Business Headlines
 
Intro/Overview on Machine Learning Presentation
Ankit Gupta
 
Machine learning
Rajesh Chittampally
 
History of AI
Megha Sharma
 
Image recognition
Harika Nalla
 
Elements Of Artificial Intelligence
MeghaGambhire
 
Artificial Intelligence Course | AI Tutorial For Beginners | Artificial Intel...
Simplilearn
 
Lecture1 AI1 Introduction to artificial intelligence
Albert Orriols-Puig
 

Viewers also liked (20)

PDF
Codemotion 2012 - Da web a mobile... senza spargimento di sangue
Mario Cartia
 
PPTX
Malware Detection Using Machine Learning Techniques
ArshadRaja786
 
PPTX
Introduction to Machine Learning
Lior Rokach
 
PPT
CSCI 6505 Machine Learning Project
butest
 
PPTX
Production and Beyond: Deploying and Managing Machine Learning Models
Turi, Inc.
 
PDF
Tech Webinar: Big Data: Introduzione al Machine Learning ed all'Analisi Predi...
Codemotion
 
PPTX
Markov Model for TMR System with Repair
Sujith Jay Nair
 
PDF
Tech Webinar: Come ottimizzare il workflow nello sviluppo di Web App
Codemotion
 
PPTX
Facial recognition
Kartik Raghuvanshi
 
PDF
Metamorphic Malware Analysis and Detection
Grijesh Chauhan
 
PDF
Big Data, a space adventure - Mario Cartia - Codemotion Rome 2015
Codemotion
 
PPTX
Machine Learning for Search at LinkedIn
Viet Ha-Thuc
 
PDF
The Heist Simulation - Participants' Briefing Pack
Disruptive Learning Solutions
 
PPTX
Machine learning prediction of stock markets
Nikola Milosevic
 
PPT
Data mining techniques for malware detection.pptx
Aditya Deshmukh
 
PPTX
Diventare famosi con lo stack ELK - Alfonso Iannotta
Data Driven Innovation
 
PPTX
Empresa.
José Parejo
 
PPTX
Airline passenger profiling based on fuzzy deep machine learning
Ayman Qaddumi
 
PPTX
Jeni Paula Eric
jennihr
 
PDF
Machine Learning in Modern Medicine with Erin LeDell at Stanford Med
Sri Ambati
 
Codemotion 2012 - Da web a mobile... senza spargimento di sangue
Mario Cartia
 
Malware Detection Using Machine Learning Techniques
ArshadRaja786
 
Introduction to Machine Learning
Lior Rokach
 
CSCI 6505 Machine Learning Project
butest
 
Production and Beyond: Deploying and Managing Machine Learning Models
Turi, Inc.
 
Tech Webinar: Big Data: Introduzione al Machine Learning ed all'Analisi Predi...
Codemotion
 
Markov Model for TMR System with Repair
Sujith Jay Nair
 
Tech Webinar: Come ottimizzare il workflow nello sviluppo di Web App
Codemotion
 
Facial recognition
Kartik Raghuvanshi
 
Metamorphic Malware Analysis and Detection
Grijesh Chauhan
 
Big Data, a space adventure - Mario Cartia - Codemotion Rome 2015
Codemotion
 
Machine Learning for Search at LinkedIn
Viet Ha-Thuc
 
The Heist Simulation - Participants' Briefing Pack
Disruptive Learning Solutions
 
Machine learning prediction of stock markets
Nikola Milosevic
 
Data mining techniques for malware detection.pptx
Aditya Deshmukh
 
Diventare famosi con lo stack ELK - Alfonso Iannotta
Data Driven Innovation
 
Empresa.
José Parejo
 
Airline passenger profiling based on fuzzy deep machine learning
Ayman Qaddumi
 
Jeni Paula Eric
jennihr
 
Machine Learning in Modern Medicine with Erin LeDell at Stanford Med
Sri Ambati
 
Ad

Similar to Machine Learning Real Life Applications By Examples (20)

PPTX
Introduction overviewmachinelearning sig Door Lucas Jellema
Getting value from IoT, Integration and Data Analytics
 
PPTX
The Art of Intelligence – A Practical Introduction Machine Learning for Oracl...
Lucas Jellema
 
PDF
The Art of Intelligence – A Practical Introduction Machine Learning for Orac...
Lucas Jellema
 
PDF
Applying Machine Learning to IOT: End to End Distributed Pipeline for Real-Ti...
Carol McDonald
 
PPTX
Introduction to Machine Learning - An overview and first step for candidate d...
Lucas Jellema
 
PPTX
Machine Learning - Challenges, Learnings & Opportunities
CodePolitan
 
PPTX
Big Data & Machine Learning - TDC2013 São Paulo - 12/0713
Mathieu DESPRIEE
 
PPTX
Workshop_Presentation.pptx
RUDRAPRASADSABAR
 
PPTX
Data Science Crash Course
DataWorks Summit
 
PDF
Live Tutorial – Streaming Real-Time Events Using Apache APIs
MapR Technologies
 
PDF
ICPSR - Complex Systems Models in the Social Sciences - Lecture 6 - Professor...
Daniel Katz
 
PPTX
Machine Learning using Big data
Vaibhav Kurkute
 
PDF
Artificial Intelligence for Business
Nicola Mattina
 
PPTX
Introduction to Big Data/Machine Learning
Lars Marius Garshol
 
PDF
Applying Machine learning to IOT: End to End Distributed Distributed Pipeline...
Carol McDonald
 
PDF
AI meets Big Data
Jan Wiegelmann
 
PDF
An Elementary Introduction to Artificial Intelligence, Data Science and Machi...
Dozie Agbo
 
PPTX
Big Data & Machine Learning - TDC2013 Sao Paulo
OCTO Technology
 
PDF
Intro to ai application emeritus uob-final
Luis Fernando Gonzalez Sanchez
 
PDF
Machine Learning
IBA Group
 
Introduction overviewmachinelearning sig Door Lucas Jellema
Getting value from IoT, Integration and Data Analytics
 
The Art of Intelligence – A Practical Introduction Machine Learning for Oracl...
Lucas Jellema
 
The Art of Intelligence – A Practical Introduction Machine Learning for Orac...
Lucas Jellema
 
Applying Machine Learning to IOT: End to End Distributed Pipeline for Real-Ti...
Carol McDonald
 
Introduction to Machine Learning - An overview and first step for candidate d...
Lucas Jellema
 
Machine Learning - Challenges, Learnings & Opportunities
CodePolitan
 
Big Data & Machine Learning - TDC2013 São Paulo - 12/0713
Mathieu DESPRIEE
 
Workshop_Presentation.pptx
RUDRAPRASADSABAR
 
Data Science Crash Course
DataWorks Summit
 
Live Tutorial – Streaming Real-Time Events Using Apache APIs
MapR Technologies
 
ICPSR - Complex Systems Models in the Social Sciences - Lecture 6 - Professor...
Daniel Katz
 
Machine Learning using Big data
Vaibhav Kurkute
 
Artificial Intelligence for Business
Nicola Mattina
 
Introduction to Big Data/Machine Learning
Lars Marius Garshol
 
Applying Machine learning to IOT: End to End Distributed Distributed Pipeline...
Carol McDonald
 
AI meets Big Data
Jan Wiegelmann
 
An Elementary Introduction to Artificial Intelligence, Data Science and Machi...
Dozie Agbo
 
Big Data & Machine Learning - TDC2013 Sao Paulo
OCTO Technology
 
Intro to ai application emeritus uob-final
Luis Fernando Gonzalez Sanchez
 
Machine Learning
IBA Group
 
Ad

Recently uploaded (20)

PPTX
Thermal runway and thermal stability.pptx
godow93766
 
PPTX
Element 11. ELECTRICITY safety and hazards
merrandomohandas
 
PPTX
Arduino Based Gas Leakage Detector Project
CircuitDigest
 
PPTX
GitOps_Repo_Structure for begeinner(Scaffolindg)
DanialHabibi2
 
PPTX
美国电子版毕业证南卡罗莱纳大学上州分校水印成绩单USC学费发票定做学位证书编号怎么查
Taqyea
 
PDF
Pressure Measurement training for engineers and Technicians
AIESOLUTIONS
 
PPTX
Hashing Introduction , hash functions and techniques
sailajam21
 
PDF
GTU Civil Engineering All Semester Syllabus.pdf
Vimal Bhojani
 
PDF
MAD Unit - 2 Activity and Fragment Management in Android (Diploma IT)
JappanMavani
 
DOCX
8th International Conference on Electrical Engineering (ELEN 2025)
elelijjournal653
 
PPTX
The Role of Information Technology in Environmental Protectio....pptx
nallamillisriram
 
PPTX
原版一样(Acadia毕业证书)加拿大阿卡迪亚大学毕业证办理方法
Taqyea
 
PDF
Book.pdf01_Intro.ppt algorithm for preperation stu used
archu26
 
PDF
Set Relation Function Practice session 24.05.2025.pdf
DrStephenStrange4
 
PPTX
Product Development & DevelopmentLecture02.pptx
zeeshanwazir2
 
PDF
Water Design_Manual_2005. KENYA FOR WASTER SUPPLY AND SEWERAGE
DancanNgutuku
 
PPTX
Server Side Web Development Unit 1 of Nodejs.pptx
sneha852132
 
PPTX
Shinkawa Proposal to meet Vibration API670.pptx
AchmadBashori2
 
PDF
Basic_Concepts_in_Clinical_Biochemistry_2018كيمياء_عملي.pdf
AdelLoin
 
PPTX
Heart Bleed Bug - A case study (Course: Cryptography and Network Security)
Adri Jovin
 
Thermal runway and thermal stability.pptx
godow93766
 
Element 11. ELECTRICITY safety and hazards
merrandomohandas
 
Arduino Based Gas Leakage Detector Project
CircuitDigest
 
GitOps_Repo_Structure for begeinner(Scaffolindg)
DanialHabibi2
 
美国电子版毕业证南卡罗莱纳大学上州分校水印成绩单USC学费发票定做学位证书编号怎么查
Taqyea
 
Pressure Measurement training for engineers and Technicians
AIESOLUTIONS
 
Hashing Introduction , hash functions and techniques
sailajam21
 
GTU Civil Engineering All Semester Syllabus.pdf
Vimal Bhojani
 
MAD Unit - 2 Activity and Fragment Management in Android (Diploma IT)
JappanMavani
 
8th International Conference on Electrical Engineering (ELEN 2025)
elelijjournal653
 
The Role of Information Technology in Environmental Protectio....pptx
nallamillisriram
 
原版一样(Acadia毕业证书)加拿大阿卡迪亚大学毕业证办理方法
Taqyea
 
Book.pdf01_Intro.ppt algorithm for preperation stu used
archu26
 
Set Relation Function Practice session 24.05.2025.pdf
DrStephenStrange4
 
Product Development & DevelopmentLecture02.pptx
zeeshanwazir2
 
Water Design_Manual_2005. KENYA FOR WASTER SUPPLY AND SEWERAGE
DancanNgutuku
 
Server Side Web Development Unit 1 of Nodejs.pptx
sneha852132
 
Shinkawa Proposal to meet Vibration API670.pptx
AchmadBashori2
 
Basic_Concepts_in_Clinical_Biochemistry_2018كيمياء_عملي.pdf
AdelLoin
 
Heart Bleed Bug - A case study (Course: Cryptography and Network Security)
Adri Jovin
 

Machine Learning Real Life Applications By Examples

  • 1. DATA DRIVEN INNOVATION Rome 2017 | Open Summit MACHINE LEARNING REAL LIFE APPLICATIONS BY EXAMPLES SPEAKER MARIO CARTIA [email protected]
  • 2. DDI R O M E| 2017 M A RI O C A RTI A Can machines think? Computing machinery and intelligence. Mind, 59, 433-460 (1950) Turing A.M.
  • 3. DDI R O M E| 2017 M A RI O C A RTI A 1968 2001: A Space Odyssey “I'm sorry Dave, I'm afraid I can't do that”
  • 4. DDI R O M E| 2017 M A RI O C A RTI A 1982 Supercar
  • 5. DDI R O M E| 2017 M A RI O C A RTI A 1983 Wargames
  • 6. DDI R O M E| 2017 M A RI O C A RTI A 1996 Kasparov vs. Deep Blue
  • 7. DDI R O M E| 2017 M A RI O C A RTI A Does Deep Blue use artificial intelligence? The short answer is "no." Earlier computer designs that tried to mimic human thinking weren't very good at it. No formula exists for intuition. So Deep Blue's designers have gone "back to the future." Deep Blue relies more on computational power and a simpler search and evaluation function. The long answer is "no." "Artificial Intelligence" is more successful in science fiction than it is here on earth, and you don't have to be Isaac Asimov to know why it's hard to design a machine to mimic a process we don't understand very well to begin with. Source: https://www.research.ibm.com/deepblue/meet/html/d.3.3a.shtml
  • 8. DDI R O M E| 2017 M A RI O C A RTI A Decision Tree (IF... THEN)
  • 9. DDI R O M E| 2017 M A RI O C A RTI A “Machine learning is the subfield of computer science that gives computers the ability to learn without being explicitly programmed.” Arthur Samuel, 1959
  • 10. DDI R O M E| 2017 M A RI O C A RTI A Spam Email Filtering
  • 11. DDI R O M E| 2017 M A RI O C A RTI A Email Category Tabs
  • 12. DDI R O M E| 2017 M A RI O C A RTI A “If you can't explain it simply you don't understand it well enough”
  • 13. DDI R O M E| 2017 M A RI O C A RTI A SUPERVISED LEARNING Supervised learning is where you have input variables (x) and an output variable (y) and you use an algorithm to learn the mapping function from the input to the output y=f(x)
  • 14. DDI R O M E| 2017 M A RI O C A RTI A SUPERVISED LEARNING Classification is a general process related to categorization, the process in which ideas and objects are recognized, differentiated, and understood A classification system is an approach to accomplishing classification
  • 15. DDI R O M E| 2017 M A RI O C A RTI A CLASSIFICATION In Machine Learning, Naive Bayes Classifiers are a family of simple probabilistic classifiers based on applying Bayes' theorem with strong (naive) independence assumptions between the features
  • 16. DDI R O M E| 2017 M A RI O C A RTI A NAIVE BAYES CLASSIFIERS Naive Bayes has been studied extensively since the 1950s and remains a popular (baseline) method for text categorization, the problem of judging documents as belonging to one category or the other (such as spam or legitimate, sports or politics, etc.) with word frequencies as the features
  • 17. DDI R O M E| 2017 M A RI O C A RTI A TEXT CATEGORIZATION SUPERVISED LEARNING CLASSIFICATION NAIVE BAYES CLASSIFIER ?
  • 18. DDI R O M E| 2017 M A RI O C A RTI A
  • 19. DDI R O M E| 2017 M A RI O C A RTI A
  • 20. DDI R O M E| 2017 M A RI O C A RTI A
  • 21. DDI R O M E| 2017 M A RI O C A RTI A
  • 22. DDI R O M E| 2017 M A RI O C A RTI A
  • 23. DDI R O M E| 2017 M A RI O C A RTI A Recommendation system
  • 24. DDI R O M E| 2017 M A RI O C A RTI A Recommendation system
  • 25. DDI R O M E| 2017 M A RI O C A RTI A Recommendation system
  • 26. DDI R O M E| 2017 M A RI O C A RTI A Recommendation system
  • 27. DDI R O M E| 2017 M A RI O C A RTI A Recommendation system
  • 28. DDI R O M E| 2017 M A RI O C A RTI A UNSUPERVISED LEARNING Unsupervised learning algorithms are machine learning algorithms that work without a desired output label Essentially, the algorithm attempts to estimate the underlying structure of the population of input data
  • 29. DDI R O M E| 2017 M A RI O C A RTI A UNSUPERVISED LEARNING Collaborative Filtering is a method of making automatic predictions (filtering) about the interests of a user by collecting preferences or taste information from many users (collaborating) In the more general sense, Collaborative Filtering is the process of filtering for information or patterns using techniques involving collaboration among multiple agents, viewpoints, data sources, etc.
  • 30. DDI R O M E| 2017 M A RI O C A RTI A COLLABORATIVE FILTERING Applications of Collaborative Filtering typically involve very large data sets As the numbers of users and items grow, traditional CF algorithms will suffer serious scalability problems Large web companies use clusters of machines to scale recommendations for their millions of users
  • 31. DDI R O M E| 2017 M A RI O C A RTI A RECOMMENDATION SYSTEM UNSUPERVISED LEARNING COLLABORATIVE FILTERING USER BASED / ITEM BASED / OTHER ?
  • 32. DDI R O M E| 2017 M A RI O C A RTI A
  • 33. DDI R O M E| 2017 M A RI O C A RTI A
  • 34. DDI R O M E| 2017 M A RI O C A RTI A
  • 35. DDI R O M E| 2017 M A RI O C A RTI A
  • 36. DDI R O M E| 2017 M A RI O C A RTI A Targeted Advertising
  • 37. DDI R O M E| 2017 M A RI O C A RTI A Targeted Advertising
  • 38. DDI R O M E| 2017 M A RI O C A RTI A UNSUPERVISED LEARNING Cluster analysis or Clustering is the task of grouping a set of objects in such a way that objects in the same group (called a cluster) are more similar (in some sense or another) to each other than to those in other groups (clusters)
  • 39. DDI R O M E| 2017 M A RI O C A RTI A CLUSTERING K-means clustering is a method of vector quantization, originally from signal processing, that is popular for cluster analysis in data mining K-means clustering aims to partition n observations into k clusters in which each observation belongs to the cluster with the nearest mean
  • 40. DDI R O M E| 2017 M A RI O C A RTI A TARGETED ADVERTISING UNSUPERVISED LEARNING CLUSTERING K-MEANS CLUSTERING ?
  • 41. DDI R O M E| 2017 M A RI O C A RTI A
  • 42. DDI R O M E| 2017 M A RI O C A RTI A
  • 43. DDI R O M E| 2017 M A RI O C A RTI A TYPICAL ML WORKFLOW ü Data and problem definition ü Data collection ü Data preprocessing ü Data analysis and modeling with unsupervised and supervised learning ü Process evaluation
  • 44. DDI R O M E| 2017 M A RI O C A RTI A EVALUATION METRICS The root-mean-square deviation (RMSD) or root-mean-square error (RMSE) is a frequently used measure of the differences between values (sample and population values) predicted by a model or an estimator and the values actually observed
  • 45. DDI R O M E| 2017 M A RI O C A RTI A BEYOND ML Deep learning is a branch of machine learning based on a set of algorithms that attempt to model high level abstractions in data Deep learning is part of a broader family of machine learning methods based on learning representations
  • 46. DDI R O M E| 2017 M A RI O C A RTI A BEYOND ML One of the promises of Deep Learning is replacing handcrafted features with efficient algorithms for unsupervised or semi-supervised feature learning and hierarchical feature extraction Some of the representations are inspired by advances in neuroscience
  • 47. DDI R O M E| 2017 M A RI O C A RTI A BEYOND ML Various Deep Learning architectures such as deep neural networks have been applied to fields like computer vision, automatic speech recognition, natural language processing, audio recognition and bioinformatics where they have been shown to produce state- of-the-art results on various tasks
  • 48. DDI R O M E| 2017 M A RI O C A RTI A
  • 49. DDI R O M E| 2017 M A RI O C A RTI A
  • 50. DDI R O M E| 2017 M A RI O C A RTI A ML & BIG DATA “We don’t have better algorithms. We just have more data.” Peter Norvig Google’s Research Director
  • 51. DDI R O M E| 2017 M A RI O C A RTI A ML & BIG DATA Apache Hadoop is an open-source software framework used for distributed storage and processing of big data sets using clusters built from commodity hardware
  • 52. DDI R O M E| 2017 M A RI O C A RTI A ML & BIG DATA Apache Spark is a fast and general-purpose cluster computing system It provides high-level APIs in Scala, Java, Python and R, and an optimized engine that supports general execution graphs It also supports a rich set of higher-level tools including Spark SQL for SQL and structured data processing, MLlib for machine learning, GraphX for graph processing, and Spark Streaming
  • 53. DDI R O M E| 2017 M A RI O C A RTI A ML & BIG DATA
  • 54. DDI R O M E| 2017 M A RI O C A RTI A WHY TO USE SCALA? Spark Survey 2016
  • 55. DDI R O M E| 2017 M A RI O C A RTI A WHY TO USE SCALA? Scala is one of the most exciting languages to be created in the 21st century. It is a multi-paradigm language that fully supports functional, object-oriented, imperative and concurrent programming. It also has a strong type system, and from our point of view, strong type is a convenient form of self-documenting code. Scala works on the JVM and has access to the riches of the Java ecosystem, but it is less verbose than Java. As we employ it for ND4J, its syntax is strikingly similar to Python, a language that many data scientists are comfortable with. Like Python, Scala makes programmers happy, but like Java, it is quite fast. Finally, Apache Spark is written in Scala, and any library that purports to work on distributed run times should at the very least be able to interface with Spark Source: https://deeplearning4j.org/scala
  • 56. DDI R O M E| 2017 M A RI O C A RTI A GRAZIE! [email protected]