SlideShare a Scribd company logo
Solving the app market
with only grit, hustle, and a Spark cluster
Johnathan Mercer
VP of Analytics
mercer@apptopia.com
@Apptopia
30 minutes is a lot to ask
3
4
I want to fill your pockets
5
Who are we?
Apptopia is a mobile app
intelligence company
6
7
Company Overview
>$4m raised from major investors like:
~40 employees
~6 years in business
Trusted by > 35,000 publishers
worldwide.
8
Standing “on the shoulders of giants”
Eli Sapir
Co-Founder & CEO
Jonathan Kay
Co-Founder & COO
Serge Balyuk
VP of Engineering
apptopia.com 9
We power mobile intel data for the best
Apptopia empowers mobile
stakeholders
10
apptopia.com 11
There is so much more to mobile than games
12
User demographics and behavior
3%97% 40% 60%
GoToWebinarWealthfront
13
Connecting the dots
14
Competitive intelligence
5M
10M
15M
20M
25M
Worldwide Downloads
15
Investment decisions
2M
4M
6M
8M
10M
12M
Monthly Active Users
16
Advertising optimization
Apptopia is on a mission to
solve the app market
17
has empowered us.
18
19
We combine public and proprietary data
Public Data
Ranks
Ratings
Reviews
Price
Publisher
Descriptions
Screenshots
Versions
Historical
Performance of
>200K Apps
20
Public data is predictive of performance
21
Rank is the most predictive feature
Instagram
22
Rank is also the most complex (i.e., fun)
feature
App
. . .
. . .
Downloads Revenue
23
There are many categories and sub-categories
https://developer.apple.com/app-store/categories/
24
The constellation of ranks is complex and
constantly evolving
25
We build models with data from
connected apps
Connected Apps
All Apps
26
For any app, day, country
How
27
transformed us
28
First generation training and scoring
Rserve
Slow scoring (~2 months)
29
This was not a viable path forward
30
Project Khaleesi
Dev
2 months to 2 days
31
Systematically compare
hundreds of models using
20x more data
32
Spark and other open source
tools have transformed us
33
34
35
Where do we go from here?
The global app market is a
complex dynamical system
36
37
AppSimilarityBundled’nessCompetitiveness
38
Focus as we scale
Here’s 4 things we learned
39
First solve for the human
side of the equation
40
Avoidance should precede
optimization
41
Make big data small data
42
Edges are more important
than nodes
43
Thank You
Solving the app market
with only grit, hustle, and a Spark cluster
Johnathan Mercer
VP of Analytics
mercer@apptopia.com

More Related Content

PDF
Artificial Intelligence: How Enterprises Can Crush It With Apache Spark: Keyn...
Spark Summit
 
PDF
Locus.sh - Seed Round Pitch Deck
Pranav Divakar
 
PDF
Reflen 2018
Huangmao(Homer) Quan
 
PDF
Data Science: Your Secret Weapon to Closing More Deals
InsideSales.com
 
PDF
500 Demo Day Batch 19: Eventxtra
500 Startups
 
PDF
Optimisator : 8 Ways Analytics Helps Your Business Grow
Optimisator
 
PPTX
State of Salesforce Report 2016 - 2017
Lindsey M Phillips
 
PDF
Front App 10Million Series-A Funding
Pranav Divakar
 
Artificial Intelligence: How Enterprises Can Crush It With Apache Spark: Keyn...
Spark Summit
 
Locus.sh - Seed Round Pitch Deck
Pranav Divakar
 
Data Science: Your Secret Weapon to Closing More Deals
InsideSales.com
 
500 Demo Day Batch 19: Eventxtra
500 Startups
 
Optimisator : 8 Ways Analytics Helps Your Business Grow
Optimisator
 
State of Salesforce Report 2016 - 2017
Lindsey M Phillips
 
Front App 10Million Series-A Funding
Pranav Divakar
 

What's hot (20)

PPTX
500 Miami Launch: Atexto
500 Startups
 
PPTX
Built to Scale — Or Offer at Turing 2015
Turing Fest
 
PPTX
Jean-Paul Edwards, omd: AI, Marketing and Creativity, applications and implic...
ad:tech London, MMS & iMedia
 
PPTX
Gridley iab Summit Discussion
Linda Gridley
 
PDF
SteadyBudget's Seed Funding Pitch Deck
Shape Integrated Software
 
PDF
Numina > 500 Demo Day Batch 20
500 Startups
 
PDF
Rakam: 500 Demo Day Batch 21
500 Startups
 
PPTX
SSS_Web
madhavi K
 
PPTX
InnerTrends - Batch 25 Demo Day
500 Startups
 
PDF
500’s Demo Day Batch 15 >> Beeketing
500 Startups
 
PDF
Mark Edmondson slides
IIHEvents
 
PPTX
The Future of AI, Open Source, and Enterprise SaaS: Where It’s All Going with...
saastr
 
PDF
How to Triple Your Organic Search Using SimilarWeb
SimilarWebEvents
 
PDF
Digitaal werven presentatie wonderkind
Bas Haterd
 
PPTX
The Startup’s Guide to Building a Trusted Brand with OneTrust's Chief Ethics ...
saastr
 
PDF
FriendlyData > 500 Demo Day Batch 20
500 Startups
 
PPTX
Optimizing SaaS Productivity for CEOs, CFOs & CIOs with LeanIX's CEO
saastr
 
PDF
Credii
500 Startups
 
PPTX
See Your Business Take Off with SAP® Leonardo
SAP Customer Experience
 
PDF
graff_diamonds
malcolm maclean
 
500 Miami Launch: Atexto
500 Startups
 
Built to Scale — Or Offer at Turing 2015
Turing Fest
 
Jean-Paul Edwards, omd: AI, Marketing and Creativity, applications and implic...
ad:tech London, MMS & iMedia
 
Gridley iab Summit Discussion
Linda Gridley
 
SteadyBudget's Seed Funding Pitch Deck
Shape Integrated Software
 
Numina > 500 Demo Day Batch 20
500 Startups
 
Rakam: 500 Demo Day Batch 21
500 Startups
 
SSS_Web
madhavi K
 
InnerTrends - Batch 25 Demo Day
500 Startups
 
500’s Demo Day Batch 15 >> Beeketing
500 Startups
 
Mark Edmondson slides
IIHEvents
 
The Future of AI, Open Source, and Enterprise SaaS: Where It’s All Going with...
saastr
 
How to Triple Your Organic Search Using SimilarWeb
SimilarWebEvents
 
Digitaal werven presentatie wonderkind
Bas Haterd
 
The Startup’s Guide to Building a Trusted Brand with OneTrust's Chief Ethics ...
saastr
 
FriendlyData > 500 Demo Day Batch 20
500 Startups
 
Optimizing SaaS Productivity for CEOs, CFOs & CIOs with LeanIX's CEO
saastr
 
Credii
500 Startups
 
See Your Business Take Off with SAP® Leonardo
SAP Customer Experience
 
graff_diamonds
malcolm maclean
 
Ad

Viewers also liked (20)

PDF
Using Apache Spark for Intelligent Services: Keynote at Spark Summit East by ...
Spark Summit
 
PDF
Spark Summit EU talk by Debasish Das and Pramod Narasimha
Spark Summit
 
PDF
Spark Summit EU talk by Emlyn Whittick
Spark Summit
 
PDF
Spark Summit EU talk by Yaroslav Nedashkovsky and Andy Starzhinsky
Spark Summit
 
PDF
Spark Summit EU talk by Debasish Das and Pramod Narasimha
Spark Summit
 
PDF
Spark Summit EU talk by Stephan Kessler
Spark Summit
 
PDF
Spark Summit EU talk by Pat Patterson
Spark Summit
 
PDF
Using Apache Spark for Intelligent Services by Alexis Roos
Spark Summit
 
PDF
Spark Summit EU talk by Ruben Pulido and Behar Veliqi
Spark Summit
 
PDF
Spark Summit EU talk by Tug Grall
Spark Summit
 
PDF
Fighting Cybercrime: A Joint Task Force of Real-Time Data and Human Analytics...
Spark Summit
 
PPTX
Virtualizing Analytics with Apache Spark: Keynote by Arsalan Tavakoli
Spark Summit
 
PDF
Horizontally Scalable Relational Databases with Spark: Spark Summit East talk...
Spark Summit
 
PDF
Debugging PySpark: Spark Summit East talk by Holden Karau
Spark Summit
 
PDF
Spark Summit EU talk by Miklos Christine paddling up the stream
Spark Summit
 
PDF
Spark Summit EU talk by Shaun Klopfenstein and Neelesh Shastry
Spark Summit
 
PDF
Teaching Apache Spark Clusters to Manage Their Workers Elastically: Spark Sum...
Spark Summit
 
PDF
Building a Dataset Search Engine with Spark and Elasticsearch: Spark Summit E...
Spark Summit
 
PDF
Processing Terabyte-Scale Genomics Datasets with ADAM: Spark Summit East talk...
Spark Summit
 
PDF
Apache Spark in Cloud and Hybrid: Why Security and Governance Become More Imp...
Spark Summit
 
Using Apache Spark for Intelligent Services: Keynote at Spark Summit East by ...
Spark Summit
 
Spark Summit EU talk by Debasish Das and Pramod Narasimha
Spark Summit
 
Spark Summit EU talk by Emlyn Whittick
Spark Summit
 
Spark Summit EU talk by Yaroslav Nedashkovsky and Andy Starzhinsky
Spark Summit
 
Spark Summit EU talk by Debasish Das and Pramod Narasimha
Spark Summit
 
Spark Summit EU talk by Stephan Kessler
Spark Summit
 
Spark Summit EU talk by Pat Patterson
Spark Summit
 
Using Apache Spark for Intelligent Services by Alexis Roos
Spark Summit
 
Spark Summit EU talk by Ruben Pulido and Behar Veliqi
Spark Summit
 
Spark Summit EU talk by Tug Grall
Spark Summit
 
Fighting Cybercrime: A Joint Task Force of Real-Time Data and Human Analytics...
Spark Summit
 
Virtualizing Analytics with Apache Spark: Keynote by Arsalan Tavakoli
Spark Summit
 
Horizontally Scalable Relational Databases with Spark: Spark Summit East talk...
Spark Summit
 
Debugging PySpark: Spark Summit East talk by Holden Karau
Spark Summit
 
Spark Summit EU talk by Miklos Christine paddling up the stream
Spark Summit
 
Spark Summit EU talk by Shaun Klopfenstein and Neelesh Shastry
Spark Summit
 
Teaching Apache Spark Clusters to Manage Their Workers Elastically: Spark Sum...
Spark Summit
 
Building a Dataset Search Engine with Spark and Elasticsearch: Spark Summit E...
Spark Summit
 
Processing Terabyte-Scale Genomics Datasets with ADAM: Spark Summit East talk...
Spark Summit
 
Apache Spark in Cloud and Hybrid: Why Security and Governance Become More Imp...
Spark Summit
 
Ad

Similar to Spark Summit EU talk by Johnathan Mercer (20)

PDF
Mobile Apps Competitive Analysis Done Right
SafeDK
 
PDF
One-Pager: Store Intelligence
Wes McCabe
 
DOCX
App Store Optimization Tips 101
HarendraSingh Rajput
 
PDF
WebCamp2016:BizDev_Алексей Иваница_Как построить и монетизировать мобильный п...
WebCamp
 
PDF
Reimagine Growth 3 - Session 2 - Planning your ASO strategy from 0 to 100
CleverTap
 
PPTX
Running your App as a Business
Bill Magnuson
 
PDF
#Product management for start ups @NIC, #ProductISB talk
Zishan A. Mohammad
 
PDF
Entering new markets in mobile: how to gather insights and succeed
AppFollow
 
PPT
5 Must Haves for Launching a Successful Mobile Product
Robert Woo
 
PDF
Can You Solve Real World Problems - Insights of Startups
Sridhar Chimalakonda
 
PPT
OpenMIC #7 Talk
Will
 
PPTX
StartupFlux Pitch Draft
VAIBHAV JAIN
 
PPTX
The Mobile Perspective
eugenelin89
 
PPTX
Mobile Healthcare Apps: 7 things to remember to get your app noticed
Scott Hague
 
PPTX
Maximize Your App Downloads Using App Store Optimization
Pushpraj Singh Verma
 
PDF
App Marketing: Discover all of ASO techniques
SlashMobility.com
 
PDF
New Mobile App Development PowerPoint Presentation Slides
SlideTeam
 
PDF
mobile app business plan
ECorp
 
PDF
Mobile app business plan Example
upmetrics.co
 
PDF
New Mobile App Development Powerpoint Presentation Slides
SlideTeam
 
Mobile Apps Competitive Analysis Done Right
SafeDK
 
One-Pager: Store Intelligence
Wes McCabe
 
App Store Optimization Tips 101
HarendraSingh Rajput
 
WebCamp2016:BizDev_Алексей Иваница_Как построить и монетизировать мобильный п...
WebCamp
 
Reimagine Growth 3 - Session 2 - Planning your ASO strategy from 0 to 100
CleverTap
 
Running your App as a Business
Bill Magnuson
 
#Product management for start ups @NIC, #ProductISB talk
Zishan A. Mohammad
 
Entering new markets in mobile: how to gather insights and succeed
AppFollow
 
5 Must Haves for Launching a Successful Mobile Product
Robert Woo
 
Can You Solve Real World Problems - Insights of Startups
Sridhar Chimalakonda
 
OpenMIC #7 Talk
Will
 
StartupFlux Pitch Draft
VAIBHAV JAIN
 
The Mobile Perspective
eugenelin89
 
Mobile Healthcare Apps: 7 things to remember to get your app noticed
Scott Hague
 
Maximize Your App Downloads Using App Store Optimization
Pushpraj Singh Verma
 
App Marketing: Discover all of ASO techniques
SlashMobility.com
 
New Mobile App Development PowerPoint Presentation Slides
SlideTeam
 
mobile app business plan
ECorp
 
Mobile app business plan Example
upmetrics.co
 
New Mobile App Development Powerpoint Presentation Slides
SlideTeam
 

More from Spark Summit (20)

PDF
FPGA-Based Acceleration Architecture for Spark SQL Qi Xie and Quanfu Wang
Spark Summit
 
PDF
VEGAS: The Missing Matplotlib for Scala/Apache Spark with DB Tsai and Roger M...
Spark Summit
 
PDF
Apache Spark Structured Streaming Helps Smart Manufacturing with Xiaochang Wu
Spark Summit
 
PDF
Improving Traffic Prediction Using Weather Data with Ramya Raghavendra
Spark Summit
 
PDF
A Tale of Two Graph Frameworks on Spark: GraphFrames and Tinkerpop OLAP Artem...
Spark Summit
 
PDF
No More Cumbersomeness: Automatic Predictive Modeling on Apache Spark Marcin ...
Spark Summit
 
PDF
Apache Spark and Tensorflow as a Service with Jim Dowling
Spark Summit
 
PDF
Apache Spark and Tensorflow as a Service with Jim Dowling
Spark Summit
 
PDF
MMLSpark: Lessons from Building a SparkML-Compatible Machine Learning Library...
Spark Summit
 
PDF
Next CERN Accelerator Logging Service with Jakub Wozniak
Spark Summit
 
PDF
Powering a Startup with Apache Spark with Kevin Kim
Spark Summit
 
PDF
Improving Traffic Prediction Using Weather Datawith Ramya Raghavendra
Spark Summit
 
PDF
Hiding Apache Spark Complexity for Fast Prototyping of Big Data Applications—...
Spark Summit
 
PDF
How Nielsen Utilized Databricks for Large-Scale Research and Development with...
Spark Summit
 
PDF
Spline: Apache Spark Lineage not Only for the Banking Industry with Marek Nov...
Spark Summit
 
PDF
Goal Based Data Production with Sim Simeonov
Spark Summit
 
PDF
Preventing Revenue Leakage and Monitoring Distributed Systems with Machine Le...
Spark Summit
 
PDF
Getting Ready to Use Redis with Apache Spark with Dvir Volk
Spark Summit
 
PDF
Deduplication and Author-Disambiguation of Streaming Records via Supervised M...
Spark Summit
 
PDF
MatFast: In-Memory Distributed Matrix Computation Processing and Optimization...
Spark Summit
 
FPGA-Based Acceleration Architecture for Spark SQL Qi Xie and Quanfu Wang
Spark Summit
 
VEGAS: The Missing Matplotlib for Scala/Apache Spark with DB Tsai and Roger M...
Spark Summit
 
Apache Spark Structured Streaming Helps Smart Manufacturing with Xiaochang Wu
Spark Summit
 
Improving Traffic Prediction Using Weather Data with Ramya Raghavendra
Spark Summit
 
A Tale of Two Graph Frameworks on Spark: GraphFrames and Tinkerpop OLAP Artem...
Spark Summit
 
No More Cumbersomeness: Automatic Predictive Modeling on Apache Spark Marcin ...
Spark Summit
 
Apache Spark and Tensorflow as a Service with Jim Dowling
Spark Summit
 
Apache Spark and Tensorflow as a Service with Jim Dowling
Spark Summit
 
MMLSpark: Lessons from Building a SparkML-Compatible Machine Learning Library...
Spark Summit
 
Next CERN Accelerator Logging Service with Jakub Wozniak
Spark Summit
 
Powering a Startup with Apache Spark with Kevin Kim
Spark Summit
 
Improving Traffic Prediction Using Weather Datawith Ramya Raghavendra
Spark Summit
 
Hiding Apache Spark Complexity for Fast Prototyping of Big Data Applications—...
Spark Summit
 
How Nielsen Utilized Databricks for Large-Scale Research and Development with...
Spark Summit
 
Spline: Apache Spark Lineage not Only for the Banking Industry with Marek Nov...
Spark Summit
 
Goal Based Data Production with Sim Simeonov
Spark Summit
 
Preventing Revenue Leakage and Monitoring Distributed Systems with Machine Le...
Spark Summit
 
Getting Ready to Use Redis with Apache Spark with Dvir Volk
Spark Summit
 
Deduplication and Author-Disambiguation of Streaming Records via Supervised M...
Spark Summit
 
MatFast: In-Memory Distributed Matrix Computation Processing and Optimization...
Spark Summit
 

Recently uploaded (20)

PPTX
Data-Users-in-Database-Management-Systems (1).pptx
dharmik832021
 
PPTX
MR and reffffffvvvvvvvfversal_083605.pptx
manjeshjain
 
PDF
TIC ACTIVIDAD 1geeeeeeeeeeeeeeeeeeeeeeeeeeeeeer3.pdf
Thais Ruiz
 
PPTX
World-population.pptx fire bunberbpeople
umutunsalnsl4402
 
PPTX
Databricks-DE-Associate Certification Questions-june-2024.pptx
pedelli41
 
PDF
WISE main accomplishments for ISQOLS award July 2025.pdf
StatsCommunications
 
PDF
Mastering Financial Analysis Materials.pdf
SalamiAbdullahi
 
PPTX
Multiscale Segmentation of Survey Respondents: Seeing the Trees and the Fores...
Sione Palu
 
PPTX
Presentation (1) (1).pptx k8hhfftuiiigff
karthikjagath2005
 
PPTX
Future_of_AI_Presentation for everyone.pptx
boranamanju07
 
PPTX
Pipeline Automatic Leak Detection for Water Distribution Systems
Sione Palu
 
PPTX
Introduction-to-Python-Programming-Language (1).pptx
dhyeysapariya
 
PDF
SUMMER INTERNSHIP REPORT[1] (AutoRecovered) (6) (1).pdf
pandeydiksha814
 
PPTX
short term project on AI Driven Data Analytics
JMJCollegeComputerde
 
PPTX
Presentation on animal welfare a good topic
kidscream385
 
PDF
Technical Writing Module-I Complete Notes.pdf
VedprakashArya13
 
PDF
D9110.pdfdsfvsdfvsdfvsdfvfvfsvfsvffsdfvsdfvsd
minhn6673
 
PPTX
M1-T1.pptxM1-T1.pptxM1-T1.pptxM1-T1.pptx
teodoroferiarevanojr
 
PPTX
Introduction to computer chapter one 2017.pptx
mensunmarley
 
PPTX
Data-Driven Machine Learning for Rail Infrastructure Health Monitoring
Sione Palu
 
Data-Users-in-Database-Management-Systems (1).pptx
dharmik832021
 
MR and reffffffvvvvvvvfversal_083605.pptx
manjeshjain
 
TIC ACTIVIDAD 1geeeeeeeeeeeeeeeeeeeeeeeeeeeeeer3.pdf
Thais Ruiz
 
World-population.pptx fire bunberbpeople
umutunsalnsl4402
 
Databricks-DE-Associate Certification Questions-june-2024.pptx
pedelli41
 
WISE main accomplishments for ISQOLS award July 2025.pdf
StatsCommunications
 
Mastering Financial Analysis Materials.pdf
SalamiAbdullahi
 
Multiscale Segmentation of Survey Respondents: Seeing the Trees and the Fores...
Sione Palu
 
Presentation (1) (1).pptx k8hhfftuiiigff
karthikjagath2005
 
Future_of_AI_Presentation for everyone.pptx
boranamanju07
 
Pipeline Automatic Leak Detection for Water Distribution Systems
Sione Palu
 
Introduction-to-Python-Programming-Language (1).pptx
dhyeysapariya
 
SUMMER INTERNSHIP REPORT[1] (AutoRecovered) (6) (1).pdf
pandeydiksha814
 
short term project on AI Driven Data Analytics
JMJCollegeComputerde
 
Presentation on animal welfare a good topic
kidscream385
 
Technical Writing Module-I Complete Notes.pdf
VedprakashArya13
 
D9110.pdfdsfvsdfvsdfvsdfvfvfsvfsvffsdfvsdfvsd
minhn6673
 
M1-T1.pptxM1-T1.pptxM1-T1.pptxM1-T1.pptx
teodoroferiarevanojr
 
Introduction to computer chapter one 2017.pptx
mensunmarley
 
Data-Driven Machine Learning for Rail Infrastructure Health Monitoring
Sione Palu
 

Spark Summit EU talk by Johnathan Mercer