SlideShare a Scribd company logo
Delivering near real-time mobility
insights at Swisscom
François Garillot
francois.garillot@swisscom.com
@huitseeker
Agenda
Intro
Smart-Data
Big Data Architecture
Streaming
Data challenges
Introduction : Positioning
Positioning users in a modern
network
no radio-goniometer at scale
cell of attachment has position, beam characteristics
over history, best position ~200m
Positioning at specific locations
handovers at specific cell-to-cell location
phone needs to be active
Positioning with more precision
better positioning with excellent data sources:
3G : GPEH
4G: LTE-CTR
Trajectory
data mining
time series reconstruction
trajectory segmentation
map matching, clustering
mode of transport detection
...
How to create value with
positioning at Swisscom ?
with competitive analytics & data sources,
and by making sure it embodies the right values.
Smart Data
On (not) tracking (any users)
"Swisscom strictly complies with all applicable legislations, in
particular with the telecommunications law and the data
protection initiative."
Jürg Studerus, Swisscom Senior Manager, Corporate Responsibility
Smart Data : Big Data without Big Brother
Privacy preservation is an asset
It makes sense to care as much about your customer as they do about you.
We technically enforce this
answering only synoptic questions, no individual ones,
with data flow control : we neutralize quasi-identifiers at every stage
Swisscom mobile subscribers
source: xavierstuder.com, MD&A reports
Our choices
public good applications: making Switzerland run better,
understanding places, not individuals,
all results presented aggregated, anonymized.
Markets
A first product : City
"It's a dream for civil engineers" -- Alexandre Machu, Urban
systems engineer, Pully
Demo time
Usages
New roads to divert transit traffic out of downtown (informs a 50M$
project)
Parking lot expansion and transformation (informs a 10M$ project)
Electric car charging station deployment
Big Data architecture
In the backend
Spark configuration essentials for enterprise
jobs
spark.executor.memory="not the default 1g"
spark.kryo.registrator="something custom"// and companions
spark.shuffle.service.enabled="true"
spark.dynamicAllocation.enabled="true"
spark.deploy.recoveryMode="ZOOKEEPER"
spark.deploy.recoveryDirectory="/path/to/state"
spark.deploy.zookeeper.url="quorumMachine1:2181, ..."
NOT the only valuable settings, see https://techsuppdiva.github.io
for more
See Also
In the front-end
Scala (1/2)
typeChronoHistory = List[UEupdate] @@ Chronological
typeAnteChronoHistory = List[UEupdate] @@ AnteChronological
implicit classChrono(l: List[UEupdate]){
def asChrono: ChronoHistory = {
chronoCheck(l)
l.asInstanceOf[ChronoHistory]
}
def asAnteChrono: AnteChronoHistory = {
anteChronoCheck(l)
l.asInstanceOf[AnteChronoHistory]
}
}
Scala (2/2)
implicit def reverseChrono(l: ChronoHistory): AnteChronoHistory = l.reve
implicit def reverseAnteChrono(l: AnteChronoHistory): ChronoHistory = l.
Streaming Analytics
Selecting users on a path of Interest
Massive discrepancy between # of users (2-3E6)
and # of interesting users (1.5E3 on test segments)
Filtering interesting time series.
Graph matching
Locality-sensitive hashing short histories
A family H of hashing functions is -sensitive if:(r, cr, , )p1 p2
if then
if then
p–q ≤ r P [h(q) = h(p)] ≥rH p1
p–q ≥ cr P [h(q) = h(p)] ≤rH p2
More :
Locality Sensitive Hashing By Spark, Uber, Spark Summit
A Gentle Introduction to Locality-Sensitive Hashing with Apache Spark,
Scala by The Bay
Computing speeds: Solving graph
constraints
a speed comes from a user well-positioned, twice
plus route knowledge
given a history of cells, where was the user, exactly ?
Solving graph constraints
just a few users left in computation at this stage
so a lot invested in > linear complexity algorithms
Data Challenges
Crucial elements
Quality, reliability of data sources
Automated ground truth checking
sensors
TEMS fleet
What's the ground truth for mode of transport, domicile, etc ?
Colleagues and friends volunteers
In the works
Accuracy improvements
More features (see you Spark Summit EU!)
Streaming for city
Thank you

More Related Content

PDF
Mobility insights at Swisscom - Understanding collective mobility in Switzerland
François Garillot
 
PDF
Static Energy Prediction in Software: A Worst-Case Scenario Approach
GreenLabAtDI
 
PPTX
A First Look at HPC Midlands
Martin Hamilton
 
PDF
Access to Open Earth Observation Data, an Overview and Outlook Raymond Sluit...
CommunicatieSURF
 
PPTX
Taras Lehinevych "Shadows Generation in the Wild"
Fwdays
 
PPTX
Countdown to Zero - Counter Use Cases in Aerospike
Ronen Botzer
 
PPTX
Exploring Modeling - Best Practices with Aerospike Data Types
Ronen Botzer
 
PDF
Practical machine learning: rational approach
Dzianis Pirshtuk
 
Mobility insights at Swisscom - Understanding collective mobility in Switzerland
François Garillot
 
Static Energy Prediction in Software: A Worst-Case Scenario Approach
GreenLabAtDI
 
A First Look at HPC Midlands
Martin Hamilton
 
Access to Open Earth Observation Data, an Overview and Outlook Raymond Sluit...
CommunicatieSURF
 
Taras Lehinevych "Shadows Generation in the Wild"
Fwdays
 
Countdown to Zero - Counter Use Cases in Aerospike
Ronen Botzer
 
Exploring Modeling - Best Practices with Aerospike Data Types
Ronen Botzer
 
Practical machine learning: rational approach
Dzianis Pirshtuk
 

What's hot (16)

PDF
Landset 8 的雲層去除技巧實作
鈵斯 倪
 
PDF
Rachel Leuthold: Shape Optimization for Rigid Airfoils in Multiple-Kite AWE S...
Roland Schmehl
 
PPTX
A Highly Parallel Semi-Dataflow FPGA Architecture for Large-Scale N-Body Simu...
NECST Lab @ Politecnico di Milano
 
PDF
Image Caption Generation: Intro to Distributed Tensorflow and Distributed Sco...
ICTeam S.p.A.
 
PPTX
Exploring Modeling - Doing More with Lists
Ronen Botzer
 
PDF
S1170143 2
s1170143
 
PPTX
MATLAB Based Research Projects List Assistance
Matlab Simulation
 
PDF
Low Energy Task Scheduling based on Work Stealing
LEGATO project
 
PPTX
Composable Energy Modeling for ML-Driven Drone Applications
Demetris Trihinas
 
PDF
Big Data Analytics in R using sparklyr
Nicola Lambiase
 
PDF
cnsm2011_slide
rerngvit yanggratoke
 
PDF
FabSim: Facilitating computational research through automation on large-scale...
Derek Groen
 
PPTX
StreamSight - Query-Driven Descriptive Analytics for IoT and Edge Computing
Demetris Trihinas
 
PDF
Device Data Directory and Asynchronous execution: A path to heterogeneous com...
LEGATO project
 
PDF
Products go Green: Worst-Case Energy Consumption in Software Product Lines
GreenLabAtDI
 
Landset 8 的雲層去除技巧實作
鈵斯 倪
 
Rachel Leuthold: Shape Optimization for Rigid Airfoils in Multiple-Kite AWE S...
Roland Schmehl
 
A Highly Parallel Semi-Dataflow FPGA Architecture for Large-Scale N-Body Simu...
NECST Lab @ Politecnico di Milano
 
Image Caption Generation: Intro to Distributed Tensorflow and Distributed Sco...
ICTeam S.p.A.
 
Exploring Modeling - Doing More with Lists
Ronen Botzer
 
S1170143 2
s1170143
 
MATLAB Based Research Projects List Assistance
Matlab Simulation
 
Low Energy Task Scheduling based on Work Stealing
LEGATO project
 
Composable Energy Modeling for ML-Driven Drone Applications
Demetris Trihinas
 
Big Data Analytics in R using sparklyr
Nicola Lambiase
 
cnsm2011_slide
rerngvit yanggratoke
 
FabSim: Facilitating computational research through automation on large-scale...
Derek Groen
 
StreamSight - Query-Driven Descriptive Analytics for IoT and Edge Computing
Demetris Trihinas
 
Device Data Directory and Asynchronous execution: A path to heterogeneous com...
LEGATO project
 
Products go Green: Worst-Case Energy Consumption in Software Product Lines
GreenLabAtDI
 
Ad

Similar to Delivering near real time mobility insights at swisscom (20)

PDF
Spark Summit EU talk by Francois Garillot and Mohamed Kafsi
Spark Summit
 
PDF
Lessons Learnt from Running Thousands of On-demand Spark Applications
Itai Yaffe
 
PDF
FIWARE Wednesday Webinars - Short Term History within Smart Systems
FIWARE
 
PDF
Getting Started with Apache Spark on Kubernetes
Databricks
 
PDF
Build Large-Scale Data Analytics and AI Pipeline Using RayDP
Databricks
 
PDF
Systems Bioinformatics Workshop Keynote
Deepak Singh
 
PDF
Automated ML Workflow for Distributed Big Data Using Analytics Zoo (CVPR2020 ...
Jason Dai
 
PDF
A Secure and Dynamic Multi Keyword Ranked Search over Encrypted Cloud Data
IRJET Journal
 
PDF
Bending the IoT to your will with JavaScript
All Things Open
 
PPT
Fiware IoT Proposal & Community
TIDChile
 
PPTX
Introduction to FPGA acceleration
Marco77328
 
PDF
Session 7 - Connecting to Legacy Systems, IoT and other Systems | Train the T...
FIWARE
 
PDF
DevDays: Profiling With Java Flight Recorder
Miro Wengner
 
PDF
Project Tungsten: Bringing Spark Closer to Bare Metal
Databricks
 
PDF
Big Data Tools in AWS
Shu-Jeng Hsieh
 
PDF
Relevance trilogy may dream be with you! (dec17)
Woonsan Ko
 
PDF
bakalarska_praca
Severin Simko
 
PDF
ASML_FlightRecorderMeetsJava.pdf
Miro Wengner
 
PDF
eBay Pulsar: Real-time analytics platform
KyoungMo Yang
 
PPTX
DECK36 - Log everything! and Realtime Datastream Analytics with Storm
Mike Lohmann
 
Spark Summit EU talk by Francois Garillot and Mohamed Kafsi
Spark Summit
 
Lessons Learnt from Running Thousands of On-demand Spark Applications
Itai Yaffe
 
FIWARE Wednesday Webinars - Short Term History within Smart Systems
FIWARE
 
Getting Started with Apache Spark on Kubernetes
Databricks
 
Build Large-Scale Data Analytics and AI Pipeline Using RayDP
Databricks
 
Systems Bioinformatics Workshop Keynote
Deepak Singh
 
Automated ML Workflow for Distributed Big Data Using Analytics Zoo (CVPR2020 ...
Jason Dai
 
A Secure and Dynamic Multi Keyword Ranked Search over Encrypted Cloud Data
IRJET Journal
 
Bending the IoT to your will with JavaScript
All Things Open
 
Fiware IoT Proposal & Community
TIDChile
 
Introduction to FPGA acceleration
Marco77328
 
Session 7 - Connecting to Legacy Systems, IoT and other Systems | Train the T...
FIWARE
 
DevDays: Profiling With Java Flight Recorder
Miro Wengner
 
Project Tungsten: Bringing Spark Closer to Bare Metal
Databricks
 
Big Data Tools in AWS
Shu-Jeng Hsieh
 
Relevance trilogy may dream be with you! (dec17)
Woonsan Ko
 
bakalarska_praca
Severin Simko
 
ASML_FlightRecorderMeetsJava.pdf
Miro Wengner
 
eBay Pulsar: Real-time analytics platform
KyoungMo Yang
 
DECK36 - Log everything! and Realtime Datastream Analytics with Storm
Mike Lohmann
 
Ad

More from François Garillot (8)

PDF
Modern multi-proposer consensus implementations
François Garillot
 
PDF
Growing Your Types Without Growing Your Workload
François Garillot
 
PDF
Deep learning on a mixed cluster with deeplearning4j and spark
François Garillot
 
PDF
Spark Streaming : Dealing with State
François Garillot
 
PDF
A Gentle Introduction to Locality Sensitive Hashing with Apache Spark
François Garillot
 
PDF
Ramping up your Devops Fu for Big Data developers
François Garillot
 
PDF
Diving In The Deep End Of The Big Data Pool
François Garillot
 
PDF
Scala Collections : Java 8 on Steroids
François Garillot
 
Modern multi-proposer consensus implementations
François Garillot
 
Growing Your Types Without Growing Your Workload
François Garillot
 
Deep learning on a mixed cluster with deeplearning4j and spark
François Garillot
 
Spark Streaming : Dealing with State
François Garillot
 
A Gentle Introduction to Locality Sensitive Hashing with Apache Spark
François Garillot
 
Ramping up your Devops Fu for Big Data developers
François Garillot
 
Diving In The Deep End Of The Big Data Pool
François Garillot
 
Scala Collections : Java 8 on Steroids
François Garillot
 

Recently uploaded (20)

PDF
AI-Cloud-Business-Management-Platforms-The-Key-to-Efficiency-Growth.pdf
Artjoker Software Development Company
 
PDF
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
Neo4j
 
PDF
Using Anchore and DefectDojo to Stand Up Your DevSecOps Function
Anchore
 
PDF
Orbitly Pitch Deck|A Mission-Driven Platform for Side Project Collaboration (...
zz41354899
 
PDF
CIFDAQ's Market Wrap : Bears Back in Control?
CIFDAQ
 
PDF
BLW VOCATIONAL TRAINING SUMMER INTERNSHIP REPORT
codernjn73
 
PDF
Get More from Fiori Automation - What’s New, What Works, and What’s Next.pdf
Precisely
 
PDF
Brief History of Internet - Early Days of Internet
sutharharshit158
 
PPTX
What-is-the-World-Wide-Web -- Introduction
tonifi9488
 
PDF
Automating ArcGIS Content Discovery with FME: A Real World Use Case
Safe Software
 
PDF
REPORT: Heating appliances market in Poland 2024
SPIUG
 
PDF
Google I/O Extended 2025 Baku - all ppts
HusseinMalikMammadli
 
PDF
Accelerating Oracle Database 23ai Troubleshooting with Oracle AHF Fleet Insig...
Sandesh Rao
 
PDF
Presentation about Hardware and Software in Computer
snehamodhawadiya
 
PDF
Oracle AI Vector Search- Getting Started and what's new in 2025- AIOUG Yatra ...
Sandesh Rao
 
PDF
Doc9.....................................
SofiaCollazos
 
PDF
Responsible AI and AI Ethics - By Sylvester Ebhonu
Sylvester Ebhonu
 
PPTX
AI in Daily Life: How Artificial Intelligence Helps Us Every Day
vanshrpatil7
 
PDF
How-Cloud-Computing-Impacts-Businesses-in-2025-and-Beyond.pdf
Artjoker Software Development Company
 
PDF
Advances in Ultra High Voltage (UHV) Transmission and Distribution Systems.pdf
Nabajyoti Banik
 
AI-Cloud-Business-Management-Platforms-The-Key-to-Efficiency-Growth.pdf
Artjoker Software Development Company
 
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
Neo4j
 
Using Anchore and DefectDojo to Stand Up Your DevSecOps Function
Anchore
 
Orbitly Pitch Deck|A Mission-Driven Platform for Side Project Collaboration (...
zz41354899
 
CIFDAQ's Market Wrap : Bears Back in Control?
CIFDAQ
 
BLW VOCATIONAL TRAINING SUMMER INTERNSHIP REPORT
codernjn73
 
Get More from Fiori Automation - What’s New, What Works, and What’s Next.pdf
Precisely
 
Brief History of Internet - Early Days of Internet
sutharharshit158
 
What-is-the-World-Wide-Web -- Introduction
tonifi9488
 
Automating ArcGIS Content Discovery with FME: A Real World Use Case
Safe Software
 
REPORT: Heating appliances market in Poland 2024
SPIUG
 
Google I/O Extended 2025 Baku - all ppts
HusseinMalikMammadli
 
Accelerating Oracle Database 23ai Troubleshooting with Oracle AHF Fleet Insig...
Sandesh Rao
 
Presentation about Hardware and Software in Computer
snehamodhawadiya
 
Oracle AI Vector Search- Getting Started and what's new in 2025- AIOUG Yatra ...
Sandesh Rao
 
Doc9.....................................
SofiaCollazos
 
Responsible AI and AI Ethics - By Sylvester Ebhonu
Sylvester Ebhonu
 
AI in Daily Life: How Artificial Intelligence Helps Us Every Day
vanshrpatil7
 
How-Cloud-Computing-Impacts-Businesses-in-2025-and-Beyond.pdf
Artjoker Software Development Company
 
Advances in Ultra High Voltage (UHV) Transmission and Distribution Systems.pdf
Nabajyoti Banik
 

Delivering near real time mobility insights at swisscom

  • 1. Delivering near real-time mobility insights at Swisscom François Garillot [email protected] @huitseeker
  • 4. Positioning users in a modern network no radio-goniometer at scale cell of attachment has position, beam characteristics over history, best position ~200m
  • 5. Positioning at specific locations handovers at specific cell-to-cell location phone needs to be active
  • 6. Positioning with more precision better positioning with excellent data sources: 3G : GPEH 4G: LTE-CTR
  • 7. Trajectory data mining time series reconstruction trajectory segmentation map matching, clustering mode of transport detection ...
  • 8. How to create value with positioning at Swisscom ? with competitive analytics & data sources, and by making sure it embodies the right values.
  • 10. On (not) tracking (any users) "Swisscom strictly complies with all applicable legislations, in particular with the telecommunications law and the data protection initiative." Jürg Studerus, Swisscom Senior Manager, Corporate Responsibility
  • 11. Smart Data : Big Data without Big Brother Privacy preservation is an asset It makes sense to care as much about your customer as they do about you. We technically enforce this answering only synoptic questions, no individual ones, with data flow control : we neutralize quasi-identifiers at every stage
  • 12. Swisscom mobile subscribers source: xavierstuder.com, MD&A reports
  • 13. Our choices public good applications: making Switzerland run better, understanding places, not individuals, all results presented aggregated, anonymized.
  • 15. A first product : City "It's a dream for civil engineers" -- Alexandre Machu, Urban systems engineer, Pully
  • 17. Usages New roads to divert transit traffic out of downtown (informs a 50M$ project) Parking lot expansion and transformation (informs a 10M$ project) Electric car charging station deployment
  • 20. Spark configuration essentials for enterprise jobs spark.executor.memory="not the default 1g" spark.kryo.registrator="something custom"// and companions spark.shuffle.service.enabled="true" spark.dynamicAllocation.enabled="true" spark.deploy.recoveryMode="ZOOKEEPER" spark.deploy.recoveryDirectory="/path/to/state" spark.deploy.zookeeper.url="quorumMachine1:2181, ..." NOT the only valuable settings, see https://techsuppdiva.github.io for more
  • 23. Scala (1/2) typeChronoHistory = List[UEupdate] @@ Chronological typeAnteChronoHistory = List[UEupdate] @@ AnteChronological implicit classChrono(l: List[UEupdate]){ def asChrono: ChronoHistory = { chronoCheck(l) l.asInstanceOf[ChronoHistory] } def asAnteChrono: AnteChronoHistory = { anteChronoCheck(l) l.asInstanceOf[AnteChronoHistory] } }
  • 24. Scala (2/2) implicit def reverseChrono(l: ChronoHistory): AnteChronoHistory = l.reve implicit def reverseAnteChrono(l: AnteChronoHistory): ChronoHistory = l.
  • 26. Selecting users on a path of Interest Massive discrepancy between # of users (2-3E6) and # of interesting users (1.5E3 on test segments) Filtering interesting time series.
  • 28. Locality-sensitive hashing short histories A family H of hashing functions is -sensitive if:(r, cr, , )p1 p2 if then if then p–q ≤ r P [h(q) = h(p)] ≥rH p1 p–q ≥ cr P [h(q) = h(p)] ≤rH p2 More : Locality Sensitive Hashing By Spark, Uber, Spark Summit A Gentle Introduction to Locality-Sensitive Hashing with Apache Spark, Scala by The Bay
  • 29. Computing speeds: Solving graph constraints a speed comes from a user well-positioned, twice plus route knowledge given a history of cells, where was the user, exactly ?
  • 30. Solving graph constraints just a few users left in computation at this stage so a lot invested in > linear complexity algorithms
  • 32. Crucial elements Quality, reliability of data sources Automated ground truth checking sensors TEMS fleet What's the ground truth for mode of transport, domicile, etc ? Colleagues and friends volunteers
  • 33. In the works Accuracy improvements More features (see you Spark Summit EU!) Streaming for city Thank you