SlideShare a Scribd company logo
Telling the Story of People's
Behavior with Smartphone Data
J oerg Blumtritt
@jbenno
1
Mobile
2
Mobile
3
• Two billion people use smartphones (three times more than users of PCs)
• Smartphones are far more than just „mobile computers“: they carry more than twenty sensors or
probes, continuously monitoring our behavior and our environment.
• Mobile is becoming the most important source of human generated data and surpasses social
networks.
• Apart from people using their phones, there are more than five billion mobile devices, connected to
objects, like e.g. cars. These build the Internet of Things.
4
5
• Smartphones carry a phalanx of sensors and track all kind of environmental data.
• Our movements and immediate surroundings are monitored by gyroscope,
accelerometer, luminosity sensor in the camera, microphone etc.
• The location is captured by satellite connection and mobile network.
• Proximity can be trackt via bluetooth or Wifi signal (which just becomes
systematically useable with the iBeacon)
6
Satellites
MEMS
Radio
GPS
NFC
Wifi, Bluetooth
4G
Gyroscope,
Accelerometer
Microphone,
Camera
Temperature,
Air Pressure,
Compass, ...
Supplies Battery
7
Rich Context
Simple Context
Events
Raw Sensor Data
After Pei et.al.: "Human Behavior Cognition Using Smartphone Sensors", Sensors 2013, 13, 1402-1424;
doi:10.3390/s130201402
Our App: explore
• We started our own app 'explore':
• explore tracks all kinds of sensor data on the
smartphone. The data can be collected for
analysis, and it can trigger interactions (like
asking questions or offering suggestions).
• The open beta is available on Google Play
Store; the iOS version should be ready by
J uly 2014.
8 Datarella - Joerg Blumtritt
9
Rich Context
Simple Context
Events
Raw Sensor Data
Data
• Sensor data is generated mostly in forms of
tables, locally stored as SQL databases for
each app. We transfer the data to analyze
it, e.g. visualize geo-location on a map.
10 Datarella - Joerg Blumtritt
• Not all data is telling streightforeward like geolocation.
Gyroscope data e.g. is measured in three dimensions.
• This plot shows typical artefacts: the spikes shooting
out of the clutter in regular intervals. These are
caused by hardware inaccuracies, or also by aliasing
effects.
• The artefacts are unique to each device, like a
fingerprint, and can identify the source of the data.
11
12
Rich Context
Simple Context
Events
Raw Sensor Data
Events
• To see what happens, we have to process the
data. How people move arround is visible through
the gyroscope - you see the turns, changes in
directions ect.
• With gyroscopic data in combination with
acceleration and speed, also the means of
transportation can be revealed: walking has a
distinct signature, driving by car shows more
changes in directions then sitting on a train, etc.
• However: the data is noisy; artefacts emerge from
different brands of the sensors, of glitches in the
operating systems, and also can be caused by
environmental influences.
• Take e.g. the rhytmik spikes in the picture below:
nobody would turn rhythmically and so fast.
• So we have to preprocess the data in the app, to
really see, what happens.
13 Datarella - Joerg Blumtritt
What is behavior?
• The normalized gyroscopic data on the right
shows the movements of a person going
from her desk into the kitchen, fixing a pot of
tea, leaving the kitchen and returning to her
desk.
• Sampling rate was 10s, timeframe is 15min.
• We notice episodes of different behavior:
• turning sharply
• walking
• turning smoothly
• walking again
• entering the kitchen, preparing the pot
• waiting for the water to boil
• standing up, leaving the kitchen
• sitting down again
14
0,0
1,0
2,0
3,0
4,0
5,0
6,0
7,0
8,0
9,0
1 4 7 1013161922252831343740434649525558616467707376798285
Datarella - Joerg Blumtritt
Complex Event Processing
• Simple events, like changing direction,
entering an area of specific geo-
coordinates, or having moved for a
specific time span can be combined to
complex events.
• EPL (event processing language)
offeres a way to listen to the data
stream and detect the occurance of
events.
• EPL looks like SQL, but instead of
tables, the search goes into the data
stream.
• For our app, we define events, boolean-
combine these events in a GUI and
parse the definition in the app via J SON
doc.
• The event processing itself takes place
in the app - no network connection is
needed.
15
SELECT ID AS sensorId
FROM ExampleStream
RETAIN 60 SECONDS
WHERE Observation= '' Outlet"
Datarella - Joerg Blumtritt
16
Rich Context
Simple Context
Events
Raw Sensor Data
Travel
• Studying the means of transportation, they
paths, people choose for their communte or
travel, is a streightforward application of our
data.
• We work e.g. for airports to optimize the
shops they would offer to passengers. Since
many passengers come from other cultures,
it is not an easy task for an airport (or in
general for a shopping mall) to learn the
preferences of potential clients - not
consitent shopping data or market research
is available.
• So, e.g. we incentivize passengers from
China to let us accompany their stay in
Euorpe with our app 'explore'. So we can
understand, what they wanted to buy, if they
succeded and if they would have missed
anything, that an airport could have offered
to them.
17 Datarella - Joerg Blumtritt
18
Rich Context
Simple Context
Events
Raw Sensor Data
Usability
• To avoid the shortcommings of the
other social research apps, we focused
on the user interface, to make it as
"mobile" as possible.
19 Datarella - Joerg Blumtritt
Interactions
• explore can ask questions or offer suggestions, triggered by data.
20 Datarella - Joerg Blumtritt
Quantified Self
• explore offers clear and simple analytics of
the data collected. People can also get their
raw-data for their own purposes.
• We want people to be aware what we (and
other apps) do on the phone. So we do not
only tell in advance, we also show what
sensors are activated and give the opportunity
to opt-out per sensor.
• Since we reflect the results of our tracking as
well as questionnairs and interactions in form
of diagrams and sumaries, we hope, people
will realize what we are doing and can act
self-determined.
• Of couse we respect take-down notices: if
people ask for their data to be deleted, we
follow their request (which btw is also
required by German data protection laws);
this is also a reason for us not to use common
cloud storage and cloud computing platforms,
since we would not have control over the
back-up.
Datarella - Joerg Blumtritt21
22
• Use case: Tracking Chinese
Passengers.
• An international airport wants to learn,
what Chinese passengers buy, what
they consider a pleasent shopping
experience, and what they expect.
• We recruited a panel of Chinese
passengers before they left from
China to Europe, and accompanied
them with our app.
• We learned where they went and
could ask them about their
experiences.
23
• Use case: Driver Timeline
• We built an app that tracks the driving and computes more abstract events from its data,
like „stuck in a traffic jam“.
• The events are displayed in form of a timeline, and can be shared to others.
Battery
• Battery data is both interesting in itself,
and also important to maintain the app
usable.
• Battery consumptions is telling a lot
about the environment of the phone:
temperature, moisture, even air pressure
can be derived using the change in
charge.
24 Datarella - Joerg Blumtritt
Identify whereabouts by
location-specific magnetic field
• Every place has a distinct
signature of the magnetic field
(in strenght like shown on my
own tracking data on the right
as well as in bearing).
• So even if someone decided
to not-track geolocation, we
might still get sufficient
information on their
whereabouts via other
measurements.
• That this is not hypothetical
can be seen on the diagram:
the field's signature of my
home is different from other
places, I stayed during that
week.
25 Datarella - Joerg Blumtritt
26
• We try to help people understand what
their phone does:
• First, in our apps, we provide info on
which probes are active and give the
choice to switch a single probe off to
the user.
• Second, we also feedback
visualizations on the data readings to
the users in our apps (next slide).
• Third, we give users access to their
data via web api.
27
28
Spooky Wifi
Self-Tracking vs. Others-Tracking
• You can't avoid tracking others
involunarily, too. This is a problem.
People might be aware, what they
themselves are doing. But others
might be tracked along without giving
their consent.
• Wifi is a good example of "others-
tracking": all wifi signals within reach
are tracked by the phone. It tells a lot
about other people; not only about the
devices they use.
29 Datarella - Joerg Blumtritt
Postprivacy, and communalization of
private life
• In Neal Stephenson's "Snow Crash", we read about the 'Central
Intelligence Corporation' - a commercialized version of today's
NSA. Mobile health, computational social science, and mass
measurement of environmental influences are obvious and benign
applications of QS for the public good. With quantifying and
making public, what European data protection law defines as "the
most intimate personal data", however do we transform the
current "knowledge-database" character of the Net along with its
communication-networks to something new, something that might
become similar to Stephenson's vision?
• Could this even lead to Teilhard's (resp. McLuhan's) angelization
of humans, not only connected via social media but bodily knit
into the data? Would we rather end up in a rally bucolic global
village with moral control by the panoptic community (and an
inherent abelism that comes with a village life)? In both aspects,
representative aggregates like society as well as the concept of
the individual might be rendered obsolete.
30 Datarella - Joerg Blumtritt
"Privacy"
31 Datarella - Joerg Blumtritt
"Data Protection"
32 Datarella - Joerg Blumtritt
Becoming cyborgs?
• The bodily extension into the data
squere - this is what cyborgism is
really about.
• People like Neal Harbison or Enno
Park are pushing the discussion in
that direction: How do we maintain
posession of our bodies? What ethic
framework has there to be set-up?
How do we avoid technological
extensions becoming "black boxes"
that control us, rather than we do
them?
• So it is worthwhile to follow the
proceedings of the Cyborg e.V that
Enno founded.
33 Datarella - Joerg Blumtritt
Some links
• http://datarella.com/blog
• http://beautifuldata.com my blogs.
• http://twitter.com/jbenno/bigdata a data science related twitter list.
34 Datarella - Joerg Blumtritt
J örg Blumtritt
@jbenno
Datarella GmbH
Oskar-von-Miller-Ring 36
80333 München
089/44 23 69 99
info@datarella.com
Datarella - Joerg Blumtritt35
36
@jbenno

More Related Content

PDF
Mobile Data Analytics
Joerg Blumtritt
 
PDF
Quantified Self and Philosophy
Joerg Blumtritt
 
PPTX
The sixth sense
Himanshu Garg
 
PPTX
The Programmable Internet of Things
Rich Miller
 
PPT
Location Privacy for Mobile Computing, Cylab Talk on Feb 2011
Jason Hong
 
PDF
10 reasons why now is the perfect time to get started with the mobile web
Tijs Vrolix
 
PDF
Evgeny Morozov
soren-knudsen.dk
 
PPTX
Privacy, Ethics, and Big (Smartphone) Data, at Mobisys 2014
Jason Hong
 
Mobile Data Analytics
Joerg Blumtritt
 
Quantified Self and Philosophy
Joerg Blumtritt
 
The sixth sense
Himanshu Garg
 
The Programmable Internet of Things
Rich Miller
 
Location Privacy for Mobile Computing, Cylab Talk on Feb 2011
Jason Hong
 
10 reasons why now is the perfect time to get started with the mobile web
Tijs Vrolix
 
Evgeny Morozov
soren-knudsen.dk
 
Privacy, Ethics, and Big (Smartphone) Data, at Mobisys 2014
Jason Hong
 

What's hot (14)

PPTX
Into the next dimension
Ed Charbeneau
 
PDF
Freedom and Privacy in the Mobile World
Jessie Helfrich
 
PDF
Off the Page Into the Wild: Designing For the Internet of Things
frog
 
PPT
Wearable Computing - Part I: What is Wearable Computing?
Daniel Roggen
 
PPTX
Current And Future Trends in Media and Information - Media and Information Li...
Mark Jhon Oxillo
 
PPTX
Transforming instagram data into location intelligence
suresh sood
 
PPTX
Bigdataforesight
suresh sood
 
PDF
Privacy, Emerging Technology, and Information Professionals
Centre for Advanced Management Education
 
PDF
IoT is Something to Figure Out
Peter Hoddie
 
PDF
Digital Britain
- Irv -
 
PDF
Interactivity Beyond the Screen
Ericsson
 
PDF
WUD2013_Quantified_Self_Talk_TBB1
Tina Bejian-Binnion
 
PDF
Augmented reality
Abdelrahman Ali
 
PDF
Thought: The Future of Mobile and Embedded Application Input
TechWell
 
Into the next dimension
Ed Charbeneau
 
Freedom and Privacy in the Mobile World
Jessie Helfrich
 
Off the Page Into the Wild: Designing For the Internet of Things
frog
 
Wearable Computing - Part I: What is Wearable Computing?
Daniel Roggen
 
Current And Future Trends in Media and Information - Media and Information Li...
Mark Jhon Oxillo
 
Transforming instagram data into location intelligence
suresh sood
 
Bigdataforesight
suresh sood
 
Privacy, Emerging Technology, and Information Professionals
Centre for Advanced Management Education
 
IoT is Something to Figure Out
Peter Hoddie
 
Digital Britain
- Irv -
 
Interactivity Beyond the Screen
Ericsson
 
WUD2013_Quantified_Self_Talk_TBB1
Tina Bejian-Binnion
 
Augmented reality
Abdelrahman Ali
 
Thought: The Future of Mobile and Embedded Application Input
TechWell
 
Ad

Viewers also liked (7)

PDF
Open Foresight
Joerg Blumtritt
 
PDF
Credit crisis in a nutshell
Andrey Belyaev
 
PDF
Posthuman advertising
Joerg Blumtritt
 
PDF
Future mobility blumtritt_43pr
Joerg Blumtritt
 
PDF
Zwei jahrebigdata
Joerg Blumtritt
 
PPT
Miyawaki 03.2009
Андрей Ситко
 
PPTX
Ошибки входа. Или как заходить в интернет правильно.
Андрей Ситко
 
Open Foresight
Joerg Blumtritt
 
Credit crisis in a nutshell
Andrey Belyaev
 
Posthuman advertising
Joerg Blumtritt
 
Future mobility blumtritt_43pr
Joerg Blumtritt
 
Zwei jahrebigdata
Joerg Blumtritt
 
Miyawaki 03.2009
Андрей Ситко
 
Ошибки входа. Или как заходить в интернет правильно.
Андрей Ситко
 
Ad

Similar to Behavioral Analytics with Smartphone Data. Talk at Strata + Hadoop World 2014, Barcelona (20)

PDF
Contextual apps for Tizen
Shashwat Pradhan
 
PDF
The ethics of urban big data and smart cities
robkitchin
 
PPT
Intelligent Data Processing for the Internet of Things
PayamBarnaghi
 
PDF
PerCol 2012 - Presentation
Ville Antila
 
PPTX
Tizen apps with Context Awareness and Machine Learning
Shashwat Pradhan
 
PPTX
Paulo Canas Rodrigues - The role of Statistics in the Internet of Things - ...
Mindtrek
 
PPTX
Data Science Innovations : Democratisation of Data and Data Science
suresh sood
 
PPTX
SC4 Hangout 1: Big data europe transport webinar Philippe Crist
BigData_Europe
 
PPTX
Unstructured data to structured meaning for nyu itp camp - 6-22-12 ms
Marshall Sponder
 
PDF
The data we want
Elena Simperl
 
PPTX
How We Will Fail in Privacy and Ethics for the Emerging Internet of Things
Jason Hong
 
PDF
Mobile user experience conference 2009 - The rise of the mobile context
Florent Stroppa
 
PPTX
BUTLER project presentation
butler-iot
 
PPTX
Privacy, Ethics, and Big (Smartphone) Data, Keynote talk at ICISSP 2016
Jason Hong
 
PPTX
Analytic innovation transforming instagram data into predicitive analytics wi...
suresh sood
 
PPTX
Itri icl 0116_distribute
Fuming Shih
 
PPTX
Analyzing the Privacy of Smartphone Apps, for CMU Cylab Talk on April 2013
Jason Hong
 
PDF
Behaviometrics: Behavior Modeling from Heterogeneous Sensory Time-Series
Jiang Zhu
 
PDF
Dino pedreschi keynote ieee cist 2014 BIG DATA ANALYTICS & SOCIAL MINING
ieee-cist
 
PPTX
Privacy for Mobile Sensing Systems
Jason Hong
 
Contextual apps for Tizen
Shashwat Pradhan
 
The ethics of urban big data and smart cities
robkitchin
 
Intelligent Data Processing for the Internet of Things
PayamBarnaghi
 
PerCol 2012 - Presentation
Ville Antila
 
Tizen apps with Context Awareness and Machine Learning
Shashwat Pradhan
 
Paulo Canas Rodrigues - The role of Statistics in the Internet of Things - ...
Mindtrek
 
Data Science Innovations : Democratisation of Data and Data Science
suresh sood
 
SC4 Hangout 1: Big data europe transport webinar Philippe Crist
BigData_Europe
 
Unstructured data to structured meaning for nyu itp camp - 6-22-12 ms
Marshall Sponder
 
The data we want
Elena Simperl
 
How We Will Fail in Privacy and Ethics for the Emerging Internet of Things
Jason Hong
 
Mobile user experience conference 2009 - The rise of the mobile context
Florent Stroppa
 
BUTLER project presentation
butler-iot
 
Privacy, Ethics, and Big (Smartphone) Data, Keynote talk at ICISSP 2016
Jason Hong
 
Analytic innovation transforming instagram data into predicitive analytics wi...
suresh sood
 
Itri icl 0116_distribute
Fuming Shih
 
Analyzing the Privacy of Smartphone Apps, for CMU Cylab Talk on April 2013
Jason Hong
 
Behaviometrics: Behavior Modeling from Heterogeneous Sensory Time-Series
Jiang Zhu
 
Dino pedreschi keynote ieee cist 2014 BIG DATA ANALYTICS & SOCIAL MINING
ieee-cist
 
Privacy for Mobile Sensing Systems
Jason Hong
 

More from Joerg Blumtritt (11)

PDF
Algorithm Ethics
Joerg Blumtritt
 
PDF
BYOD - Bring your own data. Wearable Tech und Shared Data für die Medizin
Joerg Blumtritt
 
PDF
Big Data, Augmented Ubiquity, Quantified Self
Joerg Blumtritt
 
PDF
Streetfighting datascience
Joerg Blumtritt
 
PDF
Hacking the meme_code_ext
Joerg Blumtritt
 
PDF
Einfurung in media
Joerg Blumtritt
 
PDF
Einfuhrung datascience
Joerg Blumtritt
 
PDF
Algorithm ethics
Joerg Blumtritt
 
PDF
Quantified self
Joerg Blumtritt
 
PDF
Vom kriege
Joerg Blumtritt
 
PDF
Grüne Trends: Der Markt der Bio-Lebensmittel.
Joerg Blumtritt
 
Algorithm Ethics
Joerg Blumtritt
 
BYOD - Bring your own data. Wearable Tech und Shared Data für die Medizin
Joerg Blumtritt
 
Big Data, Augmented Ubiquity, Quantified Self
Joerg Blumtritt
 
Streetfighting datascience
Joerg Blumtritt
 
Hacking the meme_code_ext
Joerg Blumtritt
 
Einfurung in media
Joerg Blumtritt
 
Einfuhrung datascience
Joerg Blumtritt
 
Algorithm ethics
Joerg Blumtritt
 
Quantified self
Joerg Blumtritt
 
Vom kriege
Joerg Blumtritt
 
Grüne Trends: Der Markt der Bio-Lebensmittel.
Joerg Blumtritt
 

Recently uploaded (20)

PDF
The_Future_of_Data_Analytics_by_CA_Suvidha_Chaplot_UPDATED.pdf
CA Suvidha Chaplot
 
PDF
Mastering Financial Analysis Materials.pdf
SalamiAbdullahi
 
PPTX
Presentation (1) (1).pptx k8hhfftuiiigff
karthikjagath2005
 
PDF
WISE main accomplishments for ISQOLS award July 2025.pdf
StatsCommunications
 
PDF
717629748-Databricks-Certified-Data-Engineer-Professional-Dumps-by-Ball-21-03...
pedelli41
 
PPTX
Introduction to Data Analytics and Data Science
KavithaCIT
 
PPTX
lecture 13 mind test academy it skills.pptx
ggesjmrasoolpark
 
PPTX
Data-Driven Machine Learning for Rail Infrastructure Health Monitoring
Sione Palu
 
PDF
Key_Statistical_Techniques_in_Analytics_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
PDF
Practical Measurement Systems Analysis (Gage R&R) for design
Rob Schubert
 
PDF
blockchain123456789012345678901234567890
tanvikhunt1003
 
PPTX
short term internship project on Data visualization
JMJCollegeComputerde
 
PPTX
Fuzzy_Membership_Functions_Presentation.pptx
pythoncrazy2024
 
PDF
Blue Futuristic Cyber Security Presentation.pdf
tanvikhunt1003
 
PDF
D9110.pdfdsfvsdfvsdfvsdfvfvfsvfsvffsdfvsdfvsd
minhn6673
 
PDF
202501214233242351219 QASS Session 2.pdf
lauramejiamillan
 
PPTX
Databricks-DE-Associate Certification Questions-june-2024.pptx
pedelli41
 
PDF
202501214233242351219 QASS Session 2.pdf
lauramejiamillan
 
PPTX
Data-Users-in-Database-Management-Systems (1).pptx
dharmik832021
 
PPTX
Introduction to computer chapter one 2017.pptx
mensunmarley
 
The_Future_of_Data_Analytics_by_CA_Suvidha_Chaplot_UPDATED.pdf
CA Suvidha Chaplot
 
Mastering Financial Analysis Materials.pdf
SalamiAbdullahi
 
Presentation (1) (1).pptx k8hhfftuiiigff
karthikjagath2005
 
WISE main accomplishments for ISQOLS award July 2025.pdf
StatsCommunications
 
717629748-Databricks-Certified-Data-Engineer-Professional-Dumps-by-Ball-21-03...
pedelli41
 
Introduction to Data Analytics and Data Science
KavithaCIT
 
lecture 13 mind test academy it skills.pptx
ggesjmrasoolpark
 
Data-Driven Machine Learning for Rail Infrastructure Health Monitoring
Sione Palu
 
Key_Statistical_Techniques_in_Analytics_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
Practical Measurement Systems Analysis (Gage R&R) for design
Rob Schubert
 
blockchain123456789012345678901234567890
tanvikhunt1003
 
short term internship project on Data visualization
JMJCollegeComputerde
 
Fuzzy_Membership_Functions_Presentation.pptx
pythoncrazy2024
 
Blue Futuristic Cyber Security Presentation.pdf
tanvikhunt1003
 
D9110.pdfdsfvsdfvsdfvsdfvfvfsvfsvffsdfvsdfvsd
minhn6673
 
202501214233242351219 QASS Session 2.pdf
lauramejiamillan
 
Databricks-DE-Associate Certification Questions-june-2024.pptx
pedelli41
 
202501214233242351219 QASS Session 2.pdf
lauramejiamillan
 
Data-Users-in-Database-Management-Systems (1).pptx
dharmik832021
 
Introduction to computer chapter one 2017.pptx
mensunmarley
 

Behavioral Analytics with Smartphone Data. Talk at Strata + Hadoop World 2014, Barcelona

  • 1. Telling the Story of People's Behavior with Smartphone Data J oerg Blumtritt @jbenno 1
  • 3. Mobile 3 • Two billion people use smartphones (three times more than users of PCs) • Smartphones are far more than just „mobile computers“: they carry more than twenty sensors or probes, continuously monitoring our behavior and our environment. • Mobile is becoming the most important source of human generated data and surpasses social networks. • Apart from people using their phones, there are more than five billion mobile devices, connected to objects, like e.g. cars. These build the Internet of Things.
  • 4. 4
  • 5. 5 • Smartphones carry a phalanx of sensors and track all kind of environmental data. • Our movements and immediate surroundings are monitored by gyroscope, accelerometer, luminosity sensor in the camera, microphone etc. • The location is captured by satellite connection and mobile network. • Proximity can be trackt via bluetooth or Wifi signal (which just becomes systematically useable with the iBeacon)
  • 7. 7 Rich Context Simple Context Events Raw Sensor Data After Pei et.al.: "Human Behavior Cognition Using Smartphone Sensors", Sensors 2013, 13, 1402-1424; doi:10.3390/s130201402
  • 8. Our App: explore • We started our own app 'explore': • explore tracks all kinds of sensor data on the smartphone. The data can be collected for analysis, and it can trigger interactions (like asking questions or offering suggestions). • The open beta is available on Google Play Store; the iOS version should be ready by J uly 2014. 8 Datarella - Joerg Blumtritt
  • 10. Data • Sensor data is generated mostly in forms of tables, locally stored as SQL databases for each app. We transfer the data to analyze it, e.g. visualize geo-location on a map. 10 Datarella - Joerg Blumtritt
  • 11. • Not all data is telling streightforeward like geolocation. Gyroscope data e.g. is measured in three dimensions. • This plot shows typical artefacts: the spikes shooting out of the clutter in regular intervals. These are caused by hardware inaccuracies, or also by aliasing effects. • The artefacts are unique to each device, like a fingerprint, and can identify the source of the data. 11
  • 13. Events • To see what happens, we have to process the data. How people move arround is visible through the gyroscope - you see the turns, changes in directions ect. • With gyroscopic data in combination with acceleration and speed, also the means of transportation can be revealed: walking has a distinct signature, driving by car shows more changes in directions then sitting on a train, etc. • However: the data is noisy; artefacts emerge from different brands of the sensors, of glitches in the operating systems, and also can be caused by environmental influences. • Take e.g. the rhytmik spikes in the picture below: nobody would turn rhythmically and so fast. • So we have to preprocess the data in the app, to really see, what happens. 13 Datarella - Joerg Blumtritt
  • 14. What is behavior? • The normalized gyroscopic data on the right shows the movements of a person going from her desk into the kitchen, fixing a pot of tea, leaving the kitchen and returning to her desk. • Sampling rate was 10s, timeframe is 15min. • We notice episodes of different behavior: • turning sharply • walking • turning smoothly • walking again • entering the kitchen, preparing the pot • waiting for the water to boil • standing up, leaving the kitchen • sitting down again 14 0,0 1,0 2,0 3,0 4,0 5,0 6,0 7,0 8,0 9,0 1 4 7 1013161922252831343740434649525558616467707376798285 Datarella - Joerg Blumtritt
  • 15. Complex Event Processing • Simple events, like changing direction, entering an area of specific geo- coordinates, or having moved for a specific time span can be combined to complex events. • EPL (event processing language) offeres a way to listen to the data stream and detect the occurance of events. • EPL looks like SQL, but instead of tables, the search goes into the data stream. • For our app, we define events, boolean- combine these events in a GUI and parse the definition in the app via J SON doc. • The event processing itself takes place in the app - no network connection is needed. 15 SELECT ID AS sensorId FROM ExampleStream RETAIN 60 SECONDS WHERE Observation= '' Outlet" Datarella - Joerg Blumtritt
  • 17. Travel • Studying the means of transportation, they paths, people choose for their communte or travel, is a streightforward application of our data. • We work e.g. for airports to optimize the shops they would offer to passengers. Since many passengers come from other cultures, it is not an easy task for an airport (or in general for a shopping mall) to learn the preferences of potential clients - not consitent shopping data or market research is available. • So, e.g. we incentivize passengers from China to let us accompany their stay in Euorpe with our app 'explore'. So we can understand, what they wanted to buy, if they succeded and if they would have missed anything, that an airport could have offered to them. 17 Datarella - Joerg Blumtritt
  • 19. Usability • To avoid the shortcommings of the other social research apps, we focused on the user interface, to make it as "mobile" as possible. 19 Datarella - Joerg Blumtritt
  • 20. Interactions • explore can ask questions or offer suggestions, triggered by data. 20 Datarella - Joerg Blumtritt
  • 21. Quantified Self • explore offers clear and simple analytics of the data collected. People can also get their raw-data for their own purposes. • We want people to be aware what we (and other apps) do on the phone. So we do not only tell in advance, we also show what sensors are activated and give the opportunity to opt-out per sensor. • Since we reflect the results of our tracking as well as questionnairs and interactions in form of diagrams and sumaries, we hope, people will realize what we are doing and can act self-determined. • Of couse we respect take-down notices: if people ask for their data to be deleted, we follow their request (which btw is also required by German data protection laws); this is also a reason for us not to use common cloud storage and cloud computing platforms, since we would not have control over the back-up. Datarella - Joerg Blumtritt21
  • 22. 22 • Use case: Tracking Chinese Passengers. • An international airport wants to learn, what Chinese passengers buy, what they consider a pleasent shopping experience, and what they expect. • We recruited a panel of Chinese passengers before they left from China to Europe, and accompanied them with our app. • We learned where they went and could ask them about their experiences.
  • 23. 23 • Use case: Driver Timeline • We built an app that tracks the driving and computes more abstract events from its data, like „stuck in a traffic jam“. • The events are displayed in form of a timeline, and can be shared to others.
  • 24. Battery • Battery data is both interesting in itself, and also important to maintain the app usable. • Battery consumptions is telling a lot about the environment of the phone: temperature, moisture, even air pressure can be derived using the change in charge. 24 Datarella - Joerg Blumtritt
  • 25. Identify whereabouts by location-specific magnetic field • Every place has a distinct signature of the magnetic field (in strenght like shown on my own tracking data on the right as well as in bearing). • So even if someone decided to not-track geolocation, we might still get sufficient information on their whereabouts via other measurements. • That this is not hypothetical can be seen on the diagram: the field's signature of my home is different from other places, I stayed during that week. 25 Datarella - Joerg Blumtritt
  • 26. 26 • We try to help people understand what their phone does: • First, in our apps, we provide info on which probes are active and give the choice to switch a single probe off to the user. • Second, we also feedback visualizations on the data readings to the users in our apps (next slide). • Third, we give users access to their data via web api.
  • 27. 27
  • 28. 28
  • 29. Spooky Wifi Self-Tracking vs. Others-Tracking • You can't avoid tracking others involunarily, too. This is a problem. People might be aware, what they themselves are doing. But others might be tracked along without giving their consent. • Wifi is a good example of "others- tracking": all wifi signals within reach are tracked by the phone. It tells a lot about other people; not only about the devices they use. 29 Datarella - Joerg Blumtritt
  • 30. Postprivacy, and communalization of private life • In Neal Stephenson's "Snow Crash", we read about the 'Central Intelligence Corporation' - a commercialized version of today's NSA. Mobile health, computational social science, and mass measurement of environmental influences are obvious and benign applications of QS for the public good. With quantifying and making public, what European data protection law defines as "the most intimate personal data", however do we transform the current "knowledge-database" character of the Net along with its communication-networks to something new, something that might become similar to Stephenson's vision? • Could this even lead to Teilhard's (resp. McLuhan's) angelization of humans, not only connected via social media but bodily knit into the data? Would we rather end up in a rally bucolic global village with moral control by the panoptic community (and an inherent abelism that comes with a village life)? In both aspects, representative aggregates like society as well as the concept of the individual might be rendered obsolete. 30 Datarella - Joerg Blumtritt
  • 31. "Privacy" 31 Datarella - Joerg Blumtritt
  • 32. "Data Protection" 32 Datarella - Joerg Blumtritt
  • 33. Becoming cyborgs? • The bodily extension into the data squere - this is what cyborgism is really about. • People like Neal Harbison or Enno Park are pushing the discussion in that direction: How do we maintain posession of our bodies? What ethic framework has there to be set-up? How do we avoid technological extensions becoming "black boxes" that control us, rather than we do them? • So it is worthwhile to follow the proceedings of the Cyborg e.V that Enno founded. 33 Datarella - Joerg Blumtritt
  • 34. Some links • http://datarella.com/blog • http://beautifuldata.com my blogs. • http://twitter.com/jbenno/bigdata a data science related twitter list. 34 Datarella - Joerg Blumtritt
  • 35. J örg Blumtritt @jbenno Datarella GmbH Oskar-von-Miller-Ring 36 80333 München 089/44 23 69 99 [email protected] Datarella - Joerg Blumtritt35