dataviz on
                   bordeaux open data
                     + elasticsearch
                         + d3.js
                      + angular.js + google refine

Tuesday 19 March 2013
goal
                   • full stack dataviz
                   • js-centric front end with angular.js and sass
                   • d3.js -> best lib for dataviz (a bit complex)
                   • elasticsearch: scalable search engine, accessed
                     from a js ajax client
                   • grab open data and reformat it with google
                     refine


philosophy

                   • browsers keep getting smarter and will take
                     over a large part of the stack
                   • scalable NoSQL solutions speak REST: very
                     easy access
                   • you can always enrich, refine, store and model
                     data from any source: yes you can!



go open data !
                   • go to http://opendata.bordeaux.fr/
                   • we focus on the capc, the contemporary art
                     museum of bordeaux
                   • http://opendata.bordeaux.fr/content/collections-du-capc-musee-dart-contemporain
                   • curl https://bdxconfigogdi.blob.core.windows.net/converteddata/capc.csv -o capc.csv
google refine
                   • data is often imperfect: human input,
                     unsuitable model...
                   • with google refine you can tidy data, enrich
                     it with web service calls and so on...
                   • never assume you don't have the right data
                     for the job
                   • be smart and be creative: you have
                     everything you need, all the time!
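
The kind of tidying mentioned above can be sketched in a few lines of javascript. This is only an illustration of the idea, not what google refine does internally. The field names `artist` and `year` and the sample values are assumptions; the slides do not list the columns of capc.csv.

```javascript
// Minimal sketch of "tidying" one record: trim whitespace, collapse
// repeated spaces, and turn a free-text year into a number (or null
// when no four-digit year can be found).
// Field names `artist` and `year` are hypothetical, not taken from capc.csv.
function tidyRecord(record) {
  const artist = String(record.artist || "").trim().replace(/\s+/g, " ");
  const match = String(record.year || "").match(/\d{4}/);
  return {
    artist: artist,
    year: match ? Number(match[0]) : null
  };
}

// Made-up input with typical human-entry noise.
console.log(tidyRecord({ artist: "  Jane   Doe ", year: "c. 1987" }));
```

In google refine itself the same cleanup would be done interactively, column by column, rather than in code.
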
import data into
                            elasticsearch
                   • the bulk import api of es is handy
                   • http://www.elasticsearch.org/guide/reference/api/bulk.html
                   • we use templating in google refine to
                     export data in the correct format
                   • { "index" : { "_index" : "musees", "_type" : "capc"} } [snip]
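
The bulk format pairs each document with an action line like the one on the slide. The talk generates it with a google refine template; as a rough equivalent, here is a minimal javascript sketch that builds the same kind of body. The index and type names come from the slide, the sample documents are invented, and `toBulkBody` is a hypothetical helper name. Note that recent elasticsearch versions no longer accept `_type`, so this reflects the 2013 format shown above.

```javascript
// Build a bulk request body: one action line, then one document line,
// each terminated by "\n" (the body must end with a newline).
// Index and type names come from the slide; the documents are made up.
function toBulkBody(docs, index, type) {
  const action = JSON.stringify({ index: { _index: index, _type: type } });
  return docs.map(function (doc) {
    return action + "\n" + JSON.stringify(doc) + "\n";
  }).join("");
}

const bulkBody = toBulkBody(
  [{ title: "untitled", year: 1987 }, { title: "sans titre", year: 1990 }],
  "musees",
  "capc"
);
console.log(bulkBody);
```

The resulting text would then be sent to the `_bulk` endpoint, for example with `curl --data-binary`, assuming a node reachable on the default port.
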


elasticsearch

                   • scalable search engine
                   • adding more power == adding more nodes
                   • sharding
                   • replication
                   • fault tolerant

elasticsearch
                   • stores unstructured documents (json) in
                     indexes (the NoSQL way)
                   • speaks REST (api)
                   • advanced query language
                   • multiple analyzers (tokens, languages, etc...)
                   • blazing fast!
                   • no real alternative (in my opinion, and in
                     kimchy's too ;) )
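
Since elasticsearch speaks REST and json, a browser-side client only has to build a json query and send it. A minimal sketch of such a request, using the `match` query of the query language, could look like this. The host `localhost:9200`, the `title` field and the helper name `buildSearchRequest` are assumptions for illustration; only the `musees` index name comes from the slides.

```javascript
// Sketch of a search request against the index from the import slide.
// Assumptions: elasticsearch listens on localhost:9200 and documents
// have a `title` field; neither is stated in the slides.
function buildSearchRequest(text, size) {
  return {
    method: "POST",
    url: "http://localhost:9200/musees/_search",
    body: JSON.stringify({
      query: { match: { title: text } }, // full-text match on one field
      size: size                         // maximum number of hits returned
    })
  };
}

const searchRequest = buildSearchRequest("paysage", 10);
console.log(searchRequest.url);
console.log(searchRequest.body);
```

An ajax client such as angular's `$http` service would send this request and read the matching documents from the json response.
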
yeoman
                   • perfect tool for the upcoming
                     browser-focused stack
                   • yeoman init angular
                   • yeoman init angular:route capc
                   • yeoman install d3
                   • yeoman install jquery
                   • yeoman server -> yeah !!!
angular.js

                   • mvc framework for the browser (js)
                   • by google
                   • builds more directly on the html document
                   • cleaner controller handling than backbone
                     (in my opinion)



d3.js

                   • best javascript lib for dataviz (in my opinion)
                   • steep learning curve
                   • based on the browser's svg markup
                   • data and dom oriented
                   • generic (go low level with svg easily)
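
To make "data and dom oriented" concrete, here is a plain-javascript sketch of the underlying idea: each data value becomes one svg element whose geometry is derived from the value. It deliberately does not use d3 and is not d3's api; d3 covers the same ground with its scales and data binding. The function name `barsToSvg` and the sample counts are invented for illustration.

```javascript
// Plain-javascript sketch of the "data -> svg" idea, without d3 itself.
// Each count becomes a <rect> whose height is proportional to its value.
function barsToSvg(counts, width, height) {
  const max = Math.max.apply(null, counts);
  const barWidth = width / counts.length;
  const rects = counts.map(function (value, i) {
    const h = max > 0 ? (value / max) * height : 0;
    // svg's y axis points down, so a bar starts at (height - h)
    return '<rect x="' + i * barWidth + '" y="' + (height - h) +
      '" width="' + barWidth + '" height="' + h + '"/>';
  });
  return '<svg xmlns="http://www.w3.org/2000/svg" width="' + width +
    '" height="' + height + '">' + rects.join("") + "</svg>";
}

// Made-up counts, e.g. number of works per decade.
console.log(barsToSvg([2, 4, 1], 300, 100));
```

The printed markup can be pasted into an html page to see the bars; with d3 the same rectangles would be created and updated directly in the dom.
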

dataviz
                   • make information emerge from raw data
                   • you should explore, analyse and be creative
                     to get the most value possible out of the
                     data: go as far as you can
                   • tables (the excel way) -> forget them
                   • visualization will reveal your data: data is
                     meant to be known, in order to decide,
                     manage, understand

open data

                   • today, a wide range of data is entering the
                     public domain
                   • again, without computing and visualization,
                     data has little value
                   • open data without data scientists and data
                     visualization has no future



future
                   • with cheap bandwidth, storage and computing,
                     you can grab open data from a lot of
                     sources
                   • you can put it in a big data store, analyse
                     it and relate datasets to each other, with
                     open source technologies
                   • you can then share this with the world through
                     data visualization on your website, blogs...
                   • amazing, isn't it?!

questions? more info?

                   • https://github.com/mathieue
                   • http://www.mathieu-elie.net/
                   • https://twitter.com/mathieuel
                   • https://www.facebook.com/mathieuelietech
