dataviz on
                   bordeaux open data
                     + elasticsearch
                         + d3js
                      + angular.js + google refine

Tuesday 19 March 2013
goal
                   • full stack dataviz
                   • js-centric front end with angular.js and sass
                   • d3.js -> best lib for dataviz (a bit complex)
                   • elasticsearch: scalable search engine, accessed
                     from a js ajax client
                   • grab open data and reformat it with google
                     refine


philosophy

                   • smarter and smarter browsers will take over a
                     huge part of the stack
                   • scalable NoSQL solutions talk REST: super
                     easy access
                   • you can always enrich, refine, store, model
                     data from any source: yes you can !



go open data !
                   • go to http://opendata.bordeaux.fr/
                   • we focus on the capc, the contemporary art
                     museum of bordeaux
                   • http://opendata.bordeaux.fr/content/collections-du-capc-musee-dart-contemporain
                   • curl https://bdxconfigogdi.blob.core.windows.net/converteddata/capc.csv -o capc.csv
google refine
                   • data is often imperfect: human input,
                     unsuitable model...
                   • with google refine you can tidy data, enrich
                     it with web service calls and so on...
                   • i think you should never assume you don’t
                     have the right data for the job
                   • be smart and be creative: you have
                     everything you need, all the time !
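
To give an idea of what "tidying" human input can mean, here is a minimal sketch in Python. It is not google refine itself: it only imitates, in a simplified way, the kind of key-based clustering such tools offer. The names in the sample list are invented for illustration and are not taken from capc.csv.

```python
import re
import unicodedata
from collections import defaultdict


def fingerprint(value):
    """Build a normalized key: no accents, lower case, no punctuation, sorted unique tokens."""
    ascii_text = (
        unicodedata.normalize("NFKD", value).encode("ascii", "ignore").decode("ascii")
    )
    tokens = re.sub(r"[^\w\s]", " ", ascii_text.lower()).split()
    return " ".join(sorted(set(tokens)))


def cluster(values):
    """Group values that share a fingerprint; keep only groups with several spellings."""
    groups = defaultdict(list)
    for value in values:
        groups[fingerprint(value)].append(value)
    return [group for group in groups.values() if len(group) > 1]


# hypothetical sample values, typed by hand in slightly different ways
names = ["Dupont, Jean", "jean dupont", "Jean  Dupont ", "Marie Curie"]
clusters = cluster(names)
```

Each cluster is a list of spellings that probably refer to the same thing; a human still decides which spelling to keep.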
import data into elasticsearch
                   • the es bulk api is handy for imports
                   • http://www.elasticsearch.org/guide/reference/api/bulk.html
                   • we use templating in google refine to
                     export data in the correct format
                   • { "index" : { "_index" : "musees", "_type" :
                     "capc" } } [snip]

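The bulk format alternates one action line and one document line, each on its own line. The talk produces it with google refine templating; the sketch below does the same job in Python instead, as an illustration only. The column names `title` and `artist` are assumptions (the real capc.csv columns may differ), and `_type` reflects the elasticsearch versions of that time.

```python
import csv
import io
import json


def rows_to_bulk(csv_text, index="musees", doc_type="capc"):
    """Turn CSV text into a bulk request body: action line, then document line."""
    action = json.dumps({"index": {"_index": index, "_type": doc_type}})
    lines = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        lines.append(action)  # what to do with the next document
        lines.append(json.dumps(row, ensure_ascii=False))  # the document itself
    # the bulk api expects a newline after the last line
    return "\n".join(lines) + "\n"


# hypothetical two-row sample standing in for capc.csv
sample = "title,artist\nUntitled,Jane Doe\nBlue,John Roe\n"
body = rows_to_bulk(sample)
```

The resulting `body` can be saved to a file and sent to the `_bulk` endpoint described at the link above.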

elasticsearch

                   • scalable search engine
                   • adding more power == adding more nodes
                   • sharding
                   • replication
                   • fault tolerant
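
A rough sketch of the idea behind sharding: each document id is hashed and mapped to one of a fixed number of primary shards, so documents spread across nodes. This is an illustration in Python, not elasticsearch code; `zlib.crc32` is only a stand-in for the hash function elasticsearch really uses, and the ids and the shard count of 5 are assumptions.

```python
import zlib
from collections import Counter


def pick_shard(doc_id, num_primary_shards):
    """Route a document id to a shard: hash(id) modulo the number of primary shards."""
    return zlib.crc32(doc_id.encode("utf-8")) % num_primary_shards


# spread 1000 hypothetical document ids over 5 shards and count each shard's share
counts = Counter(pick_shard(f"doc-{i}", 5) for i in range(1000))
```

The same id always lands on the same shard, which is why a lookup by id only has to ask one shard.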

elasticsearch
                   • stores unstructured documents (json) in
                     indexes (the NoSQL way)
                   • talks REST (api)
                   • advanced query language
                   • multiple analyzers (tokens, languages, etc...)
                   • blazing fast !
                   • no alternative solution (in my opinion) (and
                     in kimchy's opinion too ;) )
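
To make "talks REST" and "query language" concrete, here is a small Python sketch that only builds the url and the json body of a search request; it does not send anything. The host `http://localhost:9200` and the field name `title` are assumptions, and the type in the url path matches the elasticsearch versions of that time.

```python
import json


def build_search(host, index, doc_type, field, text, size=10):
    """Return the url and json body for a simple full-text `match` query."""
    url = f"{host}/{index}/{doc_type}/_search"
    body = {"query": {"match": {field: text}}, "size": size}
    return url, json.dumps(body)


# hypothetical search for "bleu" in a "title" field of the capc documents
url, payload = build_search("http://localhost:9200", "musees", "capc", "title", "bleu")
```

Any http client, including a js ajax call from the browser, can post `payload` to `url` and read the json answer.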
yeoman
                   • perfect tool for the upcoming
                     browser-focused stack
                   • yeoman init angular
                   • yeoman init angular:route capc
                   • yeoman install d3
                   • yeoman install jquery
                   • yeoman server -> yeah !!!
angular.js

                   • mvc framework for the browser (js)
                   • by google
                   • leans more on the html document
                   • cleaner controller handling than backbone
                     (in my opinion)



d3.js

                   • best javascript lib for dataviz (in my opinion)
                   • steep learning curve
                   • based on the browser's svg markup
                   • data and dom oriented
                   • generic (go low level with svg easily)

dataviz
                   • makes information emerge from raw data
                   • you should explore, analyse and be creative
                     to get the most value possible from the
                     data: go as far as you can
                   • tables (the excel way) -> forget them
                   • visualization will reveal your data: data is
                     made to be known, in order to decide, manage,
                     understand
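
Before drawing anything, raw records usually have to be reduced to the few numbers a chart needs. The Python sketch below counts records per value of one field, giving the kind of list a bar chart could consume. The records and the field name `artist` are invented for illustration; they are not the real capc data.

```python
from collections import Counter


def count_by(records, field):
    """Count records per value of `field`, most frequent first; empty values become 'unknown'."""
    counts = Counter(record.get(field) or "unknown" for record in records)
    return [{"key": key, "count": n} for key, n in counts.most_common()]


# hypothetical records, shaped like rows of a museum collection
records = [
    {"title": "A", "artist": "Jane Doe"},
    {"title": "B", "artist": "John Roe"},
    {"title": "C", "artist": "Jane Doe"},
    {"title": "D", "artist": ""},
]
bars = count_by(records, "artist")
```

Each entry of `bars` maps to one bar: `key` is the label and `count` is the height.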

open data

                   • today, a wide range of data is coming into
                     the public domain
                   • again, without computing and visualization,
                     data has little value
                   • open data without data scientists and data
                     visualization has no future



future
                   • with low bandwidth, storage and computing
                     costs, you can grab open data from a lot of
                     sources
                   • you can put it in a big data store and
                     analyse it, find relations within it, with
                     open source technologies
                   • you can then share this with the world with
                     data visualization on your website, blogs...
                   • amazing, isn’t it ?? !!!!!
questions ? infos ?

                   • https://github.com/mathieue
                   • http://www.mathieu-elie.net/
                   • https://twitter.com/mathieuel
                   • https://www.facebook.com/mathieuelietech

