SlideShare a Scribd company logo
simon@metabase.com

@sbelak
Doing data science with
Clojure
My perspective
• Core developer at Metabase, an open source BI/analytics
tool. 3rd largest BI tool in the world. 21k+ companies use
us daily, including N26, Revolut, Swisscom, Confluent, …

• Built analytics department at GoOpti from the ground up

• Helped 20+ companies become data-driven
Goal: answer 80% of
questions stemming from
data in <20min
The analytics chasm
2 min 20 min project
Ideal. Almost real-time. Can be
done during brainstorming
without disrupting the flow.
:(
Added to roadmapSqueeze in
somewhere
in the day
Doing our job better is
often a matter of speed
and quantity
Doing data science with clojure
Clojure at a glance
• (lisp (running-on :JVM))
• Functional, dynamic, immutable

• Excellent concurrency and state-management primitives

• Unparalleled data manipulation
Affordances of our tools
shape how we approach
problems
Data frames considered
harmful
• Data frame (=table) conflates representation and
abstraction

• Clojure excels in structure manipulation/encoding
Composability is key to
quick iterating
• Curried versions where possible 

• ->> and partial friendly 

• Side benefit: consistent API
“This is possibly Clojure’s most important property: the
syntax expresses the code’s semantic layers. An
experienced reader of Clojure can skip over most of the
code and have a lossless understanding of its high- level
intent.”
— Z. Tellman, Elements of Clojure
Live programming
Catching errors early more
context easier debugging
faster iterating
The power of
sharing runtime
clojure.spec
Queryable data
descriptions
The ecosystem
What about ML?
Farm it out!
(to sklearn)
Mini compilers targeting
a specific library in
another language
huri.plot
• DSL that compiles to ggplot2

• Targets Gorilla REPL

• bar chart, scatter plot, line chart, box & violin plot,
heatmap, histogram
Doing data science with clojure
Takeouts
• Speed-of-answer matters 

• You don’t have to reinvent every wheel in Clojure 

• Clojure is fantastic at structure manipulation, play to its
strengths 

• Blurring the line between environment and work is 

a powerful idea
Questions
P.S. We’re hiring!
github.com/metabase/metabase

github.com/clojupyter/clojupyter

github.com/uncomplicate/neanderthal

github.com/scicloj

github.com/sbelak/huri

More Related Content

What's hot (20)

PDF
Webinar - Patient Readmission Risk
Turi, Inc.
 
PPTX
DataCanvas: Big Data Analytic Flow in Cloud
Lei Fang
 
PDF
Data science team (new version)
Omid Mogharian
 
PPTX
Machine Learning with GraphLab Create
Turi, Inc.
 
PDF
The 3 Insights Defining Modern Analytics
Looker
 
PPTX
Webinar - Fraud Detection - Palombo (20160428)
Turi, Inc.
 
PDF
Data-driven design (UX Antwerp 24/09/19)
Peter Vermaercke
 
PDF
Semantic Web in Enterprise: an agile startup perspective
Jeen Broekstra
 
PDF
Dataiku productive application to production - pap is may 2015
Dataiku
 
PPTX
When and Where to Embed Business Intelligence
Looker
 
PDF
Join 2017_Deep Dive_Integrating Looker with R and Python
Looker
 
PPTX
Krish Swamy + Balaji Gopalakrishnan, Wells Fargo - Building a World Class Dat...
Sri Ambati
 
PPTX
Helping Business Leaders Get Over Their Learning Curve in Advanced Analytics
IDEAS - Int'l Data Engineering and Science Association
 
PPTX
Webinar - Know Your Customer - Arya (20160526)
Turi, Inc.
 
PDF
Let's analyze how world reacts to road traffic by sentiment analysis final
Sajeetharan
 
PPTX
Extreme Excel: How a 35-Year-Old Desktop App Smashed Through the Big Data Bar...
SamanthaBerlant
 
PDF
Data Modeling in Looker
Looker
 
PPTX
Yellowbrick MicroStrategy webcast
Yellowbrick Data
 
PPTX
“How to Succeed with Machine Learning” by Arturs Valujevs from Intrum Global ...
DevClub_lv
 
PPTX
LogiqApps LowCode Norwegian Hackathon Challenge №3
LogiqAppsAS
 
Webinar - Patient Readmission Risk
Turi, Inc.
 
DataCanvas: Big Data Analytic Flow in Cloud
Lei Fang
 
Data science team (new version)
Omid Mogharian
 
Machine Learning with GraphLab Create
Turi, Inc.
 
The 3 Insights Defining Modern Analytics
Looker
 
Webinar - Fraud Detection - Palombo (20160428)
Turi, Inc.
 
Data-driven design (UX Antwerp 24/09/19)
Peter Vermaercke
 
Semantic Web in Enterprise: an agile startup perspective
Jeen Broekstra
 
Dataiku productive application to production - pap is may 2015
Dataiku
 
When and Where to Embed Business Intelligence
Looker
 
Join 2017_Deep Dive_Integrating Looker with R and Python
Looker
 
Krish Swamy + Balaji Gopalakrishnan, Wells Fargo - Building a World Class Dat...
Sri Ambati
 
Helping Business Leaders Get Over Their Learning Curve in Advanced Analytics
IDEAS - Int'l Data Engineering and Science Association
 
Webinar - Know Your Customer - Arya (20160526)
Turi, Inc.
 
Let's analyze how world reacts to road traffic by sentiment analysis final
Sajeetharan
 
Extreme Excel: How a 35-Year-Old Desktop App Smashed Through the Big Data Bar...
SamanthaBerlant
 
Data Modeling in Looker
Looker
 
Yellowbrick MicroStrategy webcast
Yellowbrick Data
 
“How to Succeed with Machine Learning” by Arturs Valujevs from Intrum Global ...
DevClub_lv
 
LogiqApps LowCode Norwegian Hackathon Challenge №3
LogiqAppsAS
 

Similar to Doing data science with clojure (20)

PDF
Store, Extract, Transform, Load, Visualize. Untagged Conference
Ani Lopez
 
PDF
Lendon mc kinney resume 2019 updated
LendonMcKinney
 
PDF
How Celtra Optimizes its Advertising Platform with Databricks
Grega Kespret
 
PPTX
Creating a Culture of Data @ Facebook - TCCEU13
Andy Kriebel
 
PDF
Data Culture Series - Keynote & Panel - 19h May - London
Jonathan Woodward
 
PPTX
Webinar with SnagAJob, HP Vertica and Looker - Data at the speed of busines s...
Looker
 
PDF
Mongo db ops mug pres
David Erickson
 
PPTX
Architecting for Big Data: Trends, Tips, and Deployment Options
Caserta
 
PDF
Continuum Analytics and Python
Travis Oliphant
 
PDF
Data Engineer's Lunch #85: Designing a Modern Data Stack
Anant Corporation
 
PDF
Thinking Outside the Cube: How In-Memory Bolsters Analytics
Inside Analysis
 
PDF
The Right Data Warehouse: Automation Now, Business Value Thereafter
Inside Analysis
 
PPTX
Partner webinar presentation aws pebble_treasure_data
Treasure Data, Inc.
 
PPTX
Moving from BI to AI : For decision makers
zekeLabs Technologies
 
PDF
Webinar: Introducing the MongoDB Connector for BI 2.0 with Tableau
MongoDB
 
PPTX
From Business Intelligence to Big Data - hack/reduce Dec 2014
Adam Ferrari
 
PPTX
Power BI - 2016 - Public
Julian Payne
 
PDF
An overview of modern scalable web development
Tung Nguyen
 
PDF
Power to the People: A Stack to Empower Every User to Make Data-Driven Decisions
Looker
 
PDF
Agile & Data Modeling – How Can They Work Together?
DATAVERSITY
 
Store, Extract, Transform, Load, Visualize. Untagged Conference
Ani Lopez
 
Lendon mc kinney resume 2019 updated
LendonMcKinney
 
How Celtra Optimizes its Advertising Platform with Databricks
Grega Kespret
 
Creating a Culture of Data @ Facebook - TCCEU13
Andy Kriebel
 
Data Culture Series - Keynote & Panel - 19h May - London
Jonathan Woodward
 
Webinar with SnagAJob, HP Vertica and Looker - Data at the speed of busines s...
Looker
 
Mongo db ops mug pres
David Erickson
 
Architecting for Big Data: Trends, Tips, and Deployment Options
Caserta
 
Continuum Analytics and Python
Travis Oliphant
 
Data Engineer's Lunch #85: Designing a Modern Data Stack
Anant Corporation
 
Thinking Outside the Cube: How In-Memory Bolsters Analytics
Inside Analysis
 
The Right Data Warehouse: Automation Now, Business Value Thereafter
Inside Analysis
 
Partner webinar presentation aws pebble_treasure_data
Treasure Data, Inc.
 
Moving from BI to AI : For decision makers
zekeLabs Technologies
 
Webinar: Introducing the MongoDB Connector for BI 2.0 with Tableau
MongoDB
 
From Business Intelligence to Big Data - hack/reduce Dec 2014
Adam Ferrari
 
Power BI - 2016 - Public
Julian Payne
 
An overview of modern scalable web development
Tung Nguyen
 
Power to the People: A Stack to Empower Every User to Make Data-Driven Decisions
Looker
 
Agile & Data Modeling – How Can They Work Together?
DATAVERSITY
 
Ad

More from Simon Belak (20)

PDF
The subtle art of recommendation
Simon Belak
 
PDF
Metabase Ljubljana Meetup #2
Simon Belak
 
PDF
Metabase lj meetup
Simon Belak
 
PDF
Sketch algorithms
Simon Belak
 
PDF
Transducing for fun and profit
Simon Belak
 
PDF
Your metrics are wrong
Simon Belak
 
PDF
Writing smart contracts the sane way
Simon Belak
 
PDF
Online statistical analysis using transducers and sketch algorithms
Simon Belak
 
PDF
Save the princess
Simon Belak
 
PDF
Data driven going to market strategy
Simon Belak
 
PDF
Spec: a lisp-flavoured type system
Simon Belak
 
PDF
A data layer in clojure
Simon Belak
 
PDF
Odkrivanje segmentov iz podatkov
Simon Belak
 
PDF
Using Onyx in anger
Simon Belak
 
PDF
Spec + onyx
Simon Belak
 
PDF
Dao of lisp
Simon Belak
 
PDF
Predicting the future with goopti
Simon Belak
 
PDF
Living with-spec
Simon Belak
 
PDF
Living with-spec
Simon Belak
 
PDF
Doing data science with Clojure
Simon Belak
 
The subtle art of recommendation
Simon Belak
 
Metabase Ljubljana Meetup #2
Simon Belak
 
Metabase lj meetup
Simon Belak
 
Sketch algorithms
Simon Belak
 
Transducing for fun and profit
Simon Belak
 
Your metrics are wrong
Simon Belak
 
Writing smart contracts the sane way
Simon Belak
 
Online statistical analysis using transducers and sketch algorithms
Simon Belak
 
Save the princess
Simon Belak
 
Data driven going to market strategy
Simon Belak
 
Spec: a lisp-flavoured type system
Simon Belak
 
A data layer in clojure
Simon Belak
 
Odkrivanje segmentov iz podatkov
Simon Belak
 
Using Onyx in anger
Simon Belak
 
Spec + onyx
Simon Belak
 
Dao of lisp
Simon Belak
 
Predicting the future with goopti
Simon Belak
 
Living with-spec
Simon Belak
 
Living with-spec
Simon Belak
 
Doing data science with Clojure
Simon Belak
 
Ad

Recently uploaded (20)

PDF
apidays Helsinki & North 2025 - REST in Peace? Hunting the Dominant Design fo...
apidays
 
PDF
apidays Helsinki & North 2025 - How (not) to run a Graphql Stewardship Group,...
apidays
 
PPTX
apidays Helsinki & North 2025 - From Chaos to Clarity: Designing (AI-Ready) A...
apidays
 
PPTX
SlideEgg_501298-Agentic AI.pptx agentic ai
530BYManoj
 
PDF
Merits and Demerits of DBMS over File System & 3-Tier Architecture in DBMS
MD RIZWAN MOLLA
 
PPTX
ER_Model_with_Diagrams_Presentation.pptx
dharaadhvaryu1992
 
PDF
Building Production-Ready AI Agents with LangGraph.pdf
Tamanna
 
PPTX
apidays Munich 2025 - Building Telco-Aware Apps with Open Gateway APIs, Subhr...
apidays
 
PPTX
apidays Singapore 2025 - The Quest for the Greenest LLM , Jean Philippe Ehre...
apidays
 
PPTX
apidays Helsinki & North 2025 - Vero APIs - Experiences of API development in...
apidays
 
PDF
Development and validation of the Japanese version of the Organizational Matt...
Yoga Tokuyoshi
 
PDF
AUDITABILITY & COMPLIANCE OF AI SYSTEMS IN HEALTHCARE
GAHI Youssef
 
PDF
Product Management in HealthTech (Case Studies from SnappDoctor)
Hamed Shams
 
PDF
What does good look like - CRAP Brighton 8 July 2025
Jan Kierzyk
 
PPT
tuberculosiship-2106031cyyfuftufufufivifviviv
AkshaiRam
 
PPTX
ER_Model_Relationship_in_DBMS_Presentation.pptx
dharaadhvaryu1992
 
PPTX
Module-5-Measures-of-Central-Tendency-Grouped-Data-1.pptx
lacsonjhoma0407
 
PPTX
AI Presentation Tool Pitch Deck Presentation.pptx
ShyamPanthavoor1
 
PDF
Simplifying Document Processing with Docling for AI Applications.pdf
Tamanna
 
PDF
The European Business Wallet: Why It Matters and How It Powers the EUDI Ecosy...
Lal Chandran
 
apidays Helsinki & North 2025 - REST in Peace? Hunting the Dominant Design fo...
apidays
 
apidays Helsinki & North 2025 - How (not) to run a Graphql Stewardship Group,...
apidays
 
apidays Helsinki & North 2025 - From Chaos to Clarity: Designing (AI-Ready) A...
apidays
 
SlideEgg_501298-Agentic AI.pptx agentic ai
530BYManoj
 
Merits and Demerits of DBMS over File System & 3-Tier Architecture in DBMS
MD RIZWAN MOLLA
 
ER_Model_with_Diagrams_Presentation.pptx
dharaadhvaryu1992
 
Building Production-Ready AI Agents with LangGraph.pdf
Tamanna
 
apidays Munich 2025 - Building Telco-Aware Apps with Open Gateway APIs, Subhr...
apidays
 
apidays Singapore 2025 - The Quest for the Greenest LLM , Jean Philippe Ehre...
apidays
 
apidays Helsinki & North 2025 - Vero APIs - Experiences of API development in...
apidays
 
Development and validation of the Japanese version of the Organizational Matt...
Yoga Tokuyoshi
 
AUDITABILITY & COMPLIANCE OF AI SYSTEMS IN HEALTHCARE
GAHI Youssef
 
Product Management in HealthTech (Case Studies from SnappDoctor)
Hamed Shams
 
What does good look like - CRAP Brighton 8 July 2025
Jan Kierzyk
 
tuberculosiship-2106031cyyfuftufufufivifviviv
AkshaiRam
 
ER_Model_Relationship_in_DBMS_Presentation.pptx
dharaadhvaryu1992
 
Module-5-Measures-of-Central-Tendency-Grouped-Data-1.pptx
lacsonjhoma0407
 
AI Presentation Tool Pitch Deck Presentation.pptx
ShyamPanthavoor1
 
Simplifying Document Processing with Docling for AI Applications.pdf
Tamanna
 
The European Business Wallet: Why It Matters and How It Powers the EUDI Ecosy...
Lal Chandran
 

Doing data science with clojure