Agile Project Management in a Big Data context
By Jean-Louis Quéguiner
Senior Big Data Developer at Wajam
Great Minds Search Alike™
What is Wajam?
Wajam is a leader in social search and social advertising technology.
Wajam gives you recommendations from friends on Google, Bing, eBay, Amazon, Wikipedia, TripAdvisor and more.
© Wajam 2015
CONTENT
1. Context
2. Architecture
3. Proposed process
CONTEXT
● 350 GB of logs every day
● 7 to 10 million active users
● 20 million ads every day
● More than 200 specific events
● Around 1 to 2 events deployed every week
ARCHITECTURE
PROPOSED PROCESS
Backlog | Sync | Dev | Test | Eval Size | Adapt Storage | Backward Compatibility | Merge & Reload | Done
Sync:
1. Define with BI the output they need
2. Define with Dev the input we need
Eval Size:
1. Will it fit in the queue?
2. Will it fit in HDFS?
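
The two Eval Size questions can be approached with back-of-the-envelope arithmetic. The sketch below is only an illustration: the event name, record size, daily volume, peak factor, queue throughput, retention period and free HDFS space are all hypothetical numbers, and "fit in the queue" is interpreted here as a throughput check, which is an assumption. A replication factor of 3 is used because it is the HDFS default.

```python
from dataclasses import dataclass

GB = 10**9  # decimal gigabyte, in bytes


@dataclass
class NewEvent:
    """Hypothetical description of an event about to be deployed."""
    name: str
    bytes_per_record: int   # assumed average serialized size
    records_per_day: int    # assumed daily volume


def daily_bytes(event: NewEvent) -> int:
    """Raw bytes the event adds to the logs per day."""
    return event.bytes_per_record * event.records_per_day


def fits_in_queue(event: NewEvent, peak_factor: float,
                  queue_bytes_per_sec: float) -> bool:
    """Compare the assumed peak ingest rate with the queue throughput."""
    average_rate = daily_bytes(event) / 86_400  # seconds per day
    return average_rate * peak_factor <= queue_bytes_per_sec


def fits_in_hdfs(event: NewEvent, retention_days: int,
                 replication: int, free_bytes: int) -> bool:
    """Compare replicated storage over the retention window with free space."""
    needed = daily_bytes(event) * retention_days * replication
    return needed <= free_bytes


# All values below are made up for illustration.
event = NewEvent("ad_mouse_hover", bytes_per_record=500,
                 records_per_day=20_000_000)
print(daily_bytes(event) / GB, "GB/day")
print("queue ok:", fits_in_queue(event, peak_factor=5,
                                 queue_bytes_per_sec=1_000_000))
print("hdfs ok:", fits_in_hdfs(event, retention_days=90,
                               replication=3, free_bytes=5_000 * GB))
```

Comparing the daily figure with the 350 GB of logs already produced every day gives a quick sense of how much a single new event shifts the total.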
Adapt Storage:
1. File structure
2. Adapt MySQL columns / indexes / tables
Backward Compatibility:
If the output structure changed and output(t) is used as input for t+1, then adaptation is needed.
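
The Backward Compatibility rule is a conjunction of two conditions, which can be written down directly. This is a minimal sketch, assuming the output structure is described as a mapping from column name to type; the function names and the example columns are hypothetical and are not taken from the Wajam pipeline.

```python
def schema_diff(old: dict, new: dict) -> dict:
    """List columns that were added, removed, or changed type."""
    shared = set(old) & set(new)
    return {
        "added": sorted(set(new) - set(old)),
        "removed": sorted(set(old) - set(new)),
        "retyped": sorted(k for k in shared if old[k] != new[k]),
    }


def needs_adaptation(old: dict, new: dict, output_feeds_next_run: bool) -> bool:
    """Rule from the slide: structure changed AND output(t) is input for t+1."""
    structure_changed = old != new
    return structure_changed and output_feeds_next_run


# Hypothetical output structure before and after adding an event.
old_output = {"user_id": "bigint", "clicks": "int"}
new_output = {"user_id": "bigint", "clicks": "int", "hovers": "int"}

print(schema_diff(old_output, new_output))
print(needs_adaptation(old_output, new_output, output_feeds_next_run=True))
```

When the job does not read its own previous output, the structure can change freely and the step can be skipped.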
PROPOSED MARKER
Each marker carries: TASK NAME + DESCRIPTION, and NAME + PRIORITY
Example: "ADD MOUSE ON HOVER AD EVENT", "ALEX 1"
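
The marker can be thought of as a small record. The sketch below is one possible representation, not the format actually used by the team; the class name, field names and the convention that 1 is the highest priority are assumptions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Marker:
    """One card on the board (hypothetical representation)."""
    owner: str
    priority: int          # assumed: 1 is the most urgent
    task: str
    description: str = ""

    def label(self) -> str:
        """Short tag shown on the card: NAME + PRIORITY."""
        return f"{self.owner.upper()} {self.priority}"


marker = Marker(owner="Alex", priority=1, task="Add mouse on hover ad event")
print(marker.task.upper())
print(marker.label())  # prints "ALEX 1"
```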
PROPOSED PROCESS
Backlog | Sync | Dev | Test | Eval Size | Adapt Storage | Backward Compatibility | Merge & Reload | Done
Tasks: TODO 1, TODO 2, TODO 3
Markers: A 1, A 2, A 3, J 1, J 2, J 3
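
A board like this one can be modelled as an ordered list of steps plus the current step of each marker. The sketch below is an illustration under assumptions: the steps are taken in the order they appear on the slide, every marker starts in Backlog, and the `Board` class with its methods is hypothetical.

```python
STEPS = [
    "Backlog", "Sync", "Dev", "Test", "Eval Size", "Adapt Storage",
    "Backward Compatibility", "Merge & Reload", "Done",
]


class Board:
    """Tracks which step each marker is in (hypothetical helper)."""

    def __init__(self, steps):
        self.steps = list(steps)
        self.position = {}  # marker label -> index into self.steps

    def add(self, label: str) -> None:
        """New markers start in the first step."""
        self.position[label] = 0

    def advance(self, label: str) -> str:
        """Move a marker one step forward; it stays put once in the last step."""
        index = self.position[label]
        if index < len(self.steps) - 1:
            self.position[label] = index + 1
        return self.steps[self.position[label]]

    def column(self, step: str) -> list:
        """Markers currently in the given step, sorted by label."""
        index = self.steps.index(step)
        return sorted(l for l, i in self.position.items() if i == index)


board = Board(STEPS)
for label in ["A 1", "A 2", "A 3", "J 1", "J 2", "J 3"]:
    board.add(label)

board.advance("A 1")
board.advance("J 1")
board.advance("J 1")
for step in STEPS:
    print(f"{step}: {board.column(step)}")
```

Because each marker keeps its own position, several people can progress through different steps in parallel while the board still shows everything at a glance.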
CONTINUOUS IMPROVEMENT & CONCLUSIONS
● Re-evaluate priorities every morning
● Re-communicate priorities every morning
● Custom Steps
● Parallel task tracking
● Helps focus
● Easy to track
● More detailed and flexible than the classic Scrum approach
THANK YOU
