SlideShare a Scribd company logo
When a perfect
algorithm meets real
data
the challenge of getting
insight from data
Alessandra Cagnazzo – Data Scientist
Big Data Oslo
25 June 2019
Data
Image
Recognition
Photos by CERN
Big Data Oslo v 4 | "When a Perfect Algorithm Meets Real Data" -  Alessandra Cagnazzo
Use the superpower
Of
Machine Learning
• It learns formulas partly by itself
• Give insight, indications
• One still has to engineer structures
and formulas but is a less
constrained way.
Threats are behind
the corner
Careful assuming the generality of a machine learning algorit
Deep is not always efficient.
Sometimes it is better to go shallow, at the cost of losing s
details.
1 + 10 + 20 + 3 ≈ 5.83
1 + 10 + 20 + 3 ≈ 8.11
Parallelising, batching, and simplifying
ⅇlog
𝑎
2 𝑒log 2 +
𝑒 𝑎 log
𝑎
𝑏
ⅇlog
𝑎
2
+ 2
𝑎 𝑏
𝑐
+
𝑎 − 𝑒log 𝑎+log 𝑏
𝑐
−1
= 𝑎 + 𝑏 + 𝑐
Big Data Oslo v 4 | "When a Perfect Algorithm Meets Real Data" -  Alessandra Cagnazzo

More Related Content

More from Dataconomy Media (20)

PDF
Data Natives Vienna v 7.0 | "The Ingredients of Data Innovation" - Robbert de...
Dataconomy Media
 
PPTX
Data Natives Cologne v 4.0 | "The Data Lorax: Planting the Seeds of Fairness...
Dataconomy Media
 
PDF
Data Natives Cologne v 4.0 | "How People Analytics Can Reveal the Hidden Aspe...
Dataconomy Media
 
PPTX
Data Natives Amsterdam v 9.0 | "Ten Little Servers: A Story of no Downtime" -...
Dataconomy Media
 
PDF
Data Natives Amsterdam v 9.0 | "Point in Time Labeling at Scale" - Timothy Th...
Dataconomy Media
 
PDF
Data Natives Hamburg v 6.0 | "Interpersonal behavior: observing Alex to under...
Dataconomy Media
 
PDF
Data Natives Hamburg v 6.0 | "About Surfing, Failing & Scaling" - Florian Sch...
Dataconomy Media
 
PDF
Data NativesBerlin v 20.0 | "Serving A/B experimentation platform end-to-end"...
Dataconomy Media
 
PPTX
Data Natives Berlin v 20.0 | "Ten Little Servers: A Story of no Downtime" - A...
Dataconomy Media
 
PDF
Big Data Frankfurt meets Thinkport | "The Cloud as a Driver of Innovation" - ...
Dataconomy Media
 
PPTX
Thinkport meets Frankfurt | "Financial Time Series Analysis using Wavelets" -...
Dataconomy Media
 
PPTX
Big Data Helsinki v 3 | "Distributed Machine and Deep Learning at Scale with ...
Dataconomy Media
 
PPTX
Big Data Helsinki v 3 | "Federated Learning and Privacy-preserving AI" - Oguz...
Dataconomy Media
 
PDF
Big Data Helsinki v 3 | "What you should know about PSD2 APIs?" - Joonas Tomperi
Dataconomy Media
 
PPTX
Big Data Stockholm v 7 | "Federated Machine Learning for Collaborative and Se...
Dataconomy Media
 
PPTX
Big Data Oslo v 4 Sci Code: "Current Industry Projects within AI and the Best...
Dataconomy Media
 
PDF
Big Data Warsaw v 4 I "Startups: Lifeguards of the Corporate Data Lake" - Fel...
Dataconomy Media
 
PDF
Big Data Warsaw v 4 I "Precise Data Integration Thanks to Big Data Analysis &...
Dataconomy Media
 
PPTX
Big Data Warsaw v 4 I "The Role of Hadoop Ecosystem in Advance Analytics" - R...
Dataconomy Media
 
PPTX
Big Data Paris v 9.0 I 'Startups: Lifeguards of the Corporate Data Lake" - Ma...
Dataconomy Media
 
Data Natives Vienna v 7.0 | "The Ingredients of Data Innovation" - Robbert de...
Dataconomy Media
 
Data Natives Cologne v 4.0 | "The Data Lorax: Planting the Seeds of Fairness...
Dataconomy Media
 
Data Natives Cologne v 4.0 | "How People Analytics Can Reveal the Hidden Aspe...
Dataconomy Media
 
Data Natives Amsterdam v 9.0 | "Ten Little Servers: A Story of no Downtime" -...
Dataconomy Media
 
Data Natives Amsterdam v 9.0 | "Point in Time Labeling at Scale" - Timothy Th...
Dataconomy Media
 
Data Natives Hamburg v 6.0 | "Interpersonal behavior: observing Alex to under...
Dataconomy Media
 
Data Natives Hamburg v 6.0 | "About Surfing, Failing & Scaling" - Florian Sch...
Dataconomy Media
 
Data NativesBerlin v 20.0 | "Serving A/B experimentation platform end-to-end"...
Dataconomy Media
 
Data Natives Berlin v 20.0 | "Ten Little Servers: A Story of no Downtime" - A...
Dataconomy Media
 
Big Data Frankfurt meets Thinkport | "The Cloud as a Driver of Innovation" - ...
Dataconomy Media
 
Thinkport meets Frankfurt | "Financial Time Series Analysis using Wavelets" -...
Dataconomy Media
 
Big Data Helsinki v 3 | "Distributed Machine and Deep Learning at Scale with ...
Dataconomy Media
 
Big Data Helsinki v 3 | "Federated Learning and Privacy-preserving AI" - Oguz...
Dataconomy Media
 
Big Data Helsinki v 3 | "What you should know about PSD2 APIs?" - Joonas Tomperi
Dataconomy Media
 
Big Data Stockholm v 7 | "Federated Machine Learning for Collaborative and Se...
Dataconomy Media
 
Big Data Oslo v 4 Sci Code: "Current Industry Projects within AI and the Best...
Dataconomy Media
 
Big Data Warsaw v 4 I "Startups: Lifeguards of the Corporate Data Lake" - Fel...
Dataconomy Media
 
Big Data Warsaw v 4 I "Precise Data Integration Thanks to Big Data Analysis &...
Dataconomy Media
 
Big Data Warsaw v 4 I "The Role of Hadoop Ecosystem in Advance Analytics" - R...
Dataconomy Media
 
Big Data Paris v 9.0 I 'Startups: Lifeguards of the Corporate Data Lake" - Ma...
Dataconomy Media
 

Recently uploaded (20)

PPTX
Building Search Using OpenSearch: Limitations and Workarounds
Sease
 
PPTX
Webinar: Introduction to LF Energy EVerest
DanBrown980551
 
PDF
CIFDAQ Market Insights for July 7th 2025
CIFDAQ
 
PDF
Achieving Consistent and Reliable AI Code Generation - Medusa AI
medusaaico
 
PDF
Windsurf Meetup Ottawa 2025-07-12 - Planning Mode at Reliza.pdf
Pavel Shukhman
 
PDF
Newgen 2022-Forrester Newgen TEI_13 05 2022-The-Total-Economic-Impact-Newgen-...
darshakparmar
 
PDF
Presentation - Vibe Coding The Future of Tech
yanuarsinggih1
 
PDF
New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
PDF
Agentic AI lifecycle for Enterprise Hyper-Automation
Debmalya Biswas
 
PDF
NewMind AI - Journal 100 Insights After The 100th Issue
NewMind AI
 
PPTX
UiPath Academic Alliance Educator Panels: Session 2 - Business Analyst Content
DianaGray10
 
PDF
"Beyond English: Navigating the Challenges of Building a Ukrainian-language R...
Fwdays
 
PDF
HCIP-Data Center Facility Deployment V2.0 Training Material (Without Remarks ...
mcastillo49
 
PDF
Blockchain Transactions Explained For Everyone
CIFDAQ
 
PDF
Log-Based Anomaly Detection: Enhancing System Reliability with Machine Learning
Mohammed BEKKOUCHE
 
PDF
"AI Transformation: Directions and Challenges", Pavlo Shaternik
Fwdays
 
PDF
Transcript: New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
PDF
SWEBOK Guide and Software Services Engineering Education
Hironori Washizaki
 
PPTX
✨Unleashing Collaboration: Salesforce Channels & Community Power in Patna!✨
SanjeetMishra29
 
PDF
Empower Inclusion Through Accessible Java Applications
Ana-Maria Mihalceanu
 
Building Search Using OpenSearch: Limitations and Workarounds
Sease
 
Webinar: Introduction to LF Energy EVerest
DanBrown980551
 
CIFDAQ Market Insights for July 7th 2025
CIFDAQ
 
Achieving Consistent and Reliable AI Code Generation - Medusa AI
medusaaico
 
Windsurf Meetup Ottawa 2025-07-12 - Planning Mode at Reliza.pdf
Pavel Shukhman
 
Newgen 2022-Forrester Newgen TEI_13 05 2022-The-Total-Economic-Impact-Newgen-...
darshakparmar
 
Presentation - Vibe Coding The Future of Tech
yanuarsinggih1
 
New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
Agentic AI lifecycle for Enterprise Hyper-Automation
Debmalya Biswas
 
NewMind AI - Journal 100 Insights After The 100th Issue
NewMind AI
 
UiPath Academic Alliance Educator Panels: Session 2 - Business Analyst Content
DianaGray10
 
"Beyond English: Navigating the Challenges of Building a Ukrainian-language R...
Fwdays
 
HCIP-Data Center Facility Deployment V2.0 Training Material (Without Remarks ...
mcastillo49
 
Blockchain Transactions Explained For Everyone
CIFDAQ
 
Log-Based Anomaly Detection: Enhancing System Reliability with Machine Learning
Mohammed BEKKOUCHE
 
"AI Transformation: Directions and Challenges", Pavlo Shaternik
Fwdays
 
Transcript: New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
SWEBOK Guide and Software Services Engineering Education
Hironori Washizaki
 
✨Unleashing Collaboration: Salesforce Channels & Community Power in Patna!✨
SanjeetMishra29
 
Empower Inclusion Through Accessible Java Applications
Ana-Maria Mihalceanu
 
Ad

Big Data Oslo v 4 | "When a Perfect Algorithm Meets Real Data" - Alessandra Cagnazzo

  • 1. When a perfect algorithm meets real data the challenge of getting insight from data Alessandra Cagnazzo – Data Scientist Big Data Oslo 25 June 2019
  • 6. Use the superpower Of Machine Learning • It learns formulas partly by itself • Give insight, indications • One still has to engineer structures and formulas but is a less constrained way.
  • 8. Careful assuming the generality of a machine learning algorit Deep is not always efficient. Sometimes it is better to go shallow, at the cost of losing s details.
  • 9. 1 + 10 + 20 + 3 ≈ 5.83 1 + 10 + 20 + 3 ≈ 8.11 Parallelising, batching, and simplifying ⅇlog 𝑎 2 𝑒log 2 + 𝑒 𝑎 log 𝑎 𝑏 ⅇlog 𝑎 2 + 2 𝑎 𝑏 𝑐 + 𝑎 − 𝑒log 𝑎+log 𝑏 𝑐 −1 = 𝑎 + 𝑏 + 𝑐