SlideShare a Scribd company logo
• More than 17 years experiences in IT industry with
theoretical physics background. He started as
scientific programmer at University of Indonesia’s
semiconductor lab then later worked as software
engineer and architect in various software companies.
He joined SRIN in 2014 to lead development of
various mobile apps and middleware platforms, as
well as to conduct research projects on predictive data
analytic using deep machine learning technologies.
Prior to SRIN, he spent 10 years at Microsoft
Indonesia as Director of Developer Ecosystem (DX)
division.
insert photo
SymEx 2015 - Agile Process for Big Data Analytic
 Context of “Big Data” Science
 Scope of Data Analytic
 Project Management Complexities
 Team Structure and R&R
 Agile Principles and Process Model
 Common Execution Issues
 Q&A
Volume
Exceeds physical limits of vertical scalability
Velocity
Decision window small compared to data
change rate
Variety
Many different formats makes integration
expensive
Variability
Many options or variable interpretations
confound analysis
By 2015, organizations that build a modern information
management system will outperform their peers financially
by 20 percent.
– – Gartner, Mark Beyer
“Information Management in the 21st Century”
 Data, Data, .. Everywhere
 New Data Sources
 Larger Data Volumes
 New Data Management Technologies
 Hadoop + Spark + Tool Ecosystem
 New Era of Data Analytic
 Descriptive, Predictive & Prescriptive
 Data-Driven Organization
10x
increase every
five years
85%from
new data types
Volume
Velocity
Variety
SymEx 2015 - Agile Process for Big Data Analytic
2013
1940 1945 1950 1955 1960 1965 1970 1975 1980 1985 1990 1995 2000 2005 2010 2015
Cloud Data Storage is Unlimited
Quincy, WA Chicago, IL San Antonio, TX Dublin, Ireland Generation 4 DCs
2015
1940 1945 1950 1955 1960 1965 1970 1975 1980 1985 1990 1995 2000 2005 2010 2015
SymEx 2015 - Agile Process for Big Data Analytic
SymEx 2015 - Agile Process for Big Data Analytic
SymEx 2015 - Agile Process for Big Data Analytic
Generic Tasks
1. Define Analytic
Requirement
2. Setup
Infrastructure
3. Collect Data
4. Data Modeling
5. Data Processing
6. Model Deployment
7. Monitoring
8. Evaluation
9. Etc….
Complexity
Value
Descriptive
Analytics
Diagnostic
Analytics
Predictive
Analytics
Prescriptive
Analytics
What happened?
Why did it happen?
What will happen?
How can we
make it happen?
SymEx 2015 - Agile Process for Big Data Analytic
Vision
Analytics
Recommenda-
tion engines
Advertising
analysis
Weather
forecasting for
business
planning
Social network
analysis
Legal
discovery and
document
archiving
Pricing
analysis
Fraud
detection
Churn
analysis
Equipment
monitoring
Location-
based tracking
and services
Personalized
Insurance
Advance computation
based on machine
learning & predictive
analytics are core
capabilities that are
needed throughout
future business
SymEx 2015 - Agile Process for Big Data Analytic
Pull-based
Batch Loads
Enterprise
Data Models
Complex ETL
Logic
Poorly
Suited to
Non-Relational Data
Emergent design is difficult
SymEx 2015 - Agile Process for Big Data Analytic
SymEx 2015 - Agile Process for Big Data Analytic
Much More than Technologies
people process
New Roles:
1. Data Engineer
2. Data Scientists
SymEx 2015 - Agile Process for Big Data Analytic
CRISP-DM - Cross Industry Standard
Process for Data Mining.
 Framework for Guidance
 Process Model
 Non-proprietary
 Experience Base
 Application/Industry neutral
 Tool neutral
 Focus on business issues
 As well as technical analysis
SymEx 2015 - Agile Process for Big Data Analytic
SymEx 2015 - Agile Process for Big Data Analytic
SymEx 2015 - Agile Process for Big Data Analytic
Business
Understanding
Data
Understanding
Data
Preparation
Modeling DeploymentEvaluation
Format
Data
Integrate
Data
Construct
Data
Clean
Data
Select
Data
Determine
Business
Objectives
Review
Project
Produce
Final
Report
Plan Monitoring
&
Maintenance
Plan
Deployment
Determine
Next Steps
Review
Process
Evaluate
Results
Assess
Model
Build
Model
Generate
Test Design
Select
Modeling
Technique
Assess
Situation
Explore
Data
Describe
Data
Collect
Initial
Data
Determine
Data Mining
Goals
Verify
Data
Quality
Produce
Project Plan
Common Issues
 Learning curve for data science & data engineer.
 We can’t design insights, we discover it through exploring
 Low data quality .. Less insights from the data.
 The result is not good enough.
Key Strategies
 Extra dedicated time to learn before project sprints (Eq. MOOC).
 Add capabilities to explore data, iterate and publish intermediate results.
 Improve data quality based on feedbacks.
 Build-Measure-Release iteration.
SymEx 2015 - Agile Process for Big Data Analytic

More Related Content

PPTX
Innovating in Big Data
imec
 
PPTX
Guide to big data analytics
Gahya Pandian
 
PDF
Data analyst vs data scientist
AbhaySharma786746
 
PDF
How I Learned to Stop Worrying and Love Linked Data
Domino Data Lab
 
PPTX
IoT 2018
Richard Marshall
 
PPTX
Augment the Human
Richard Marshall
 
PDF
From Science to Data: Following a principled path to Data Science
Institute of Contemporary Sciences
 
PDF
Latest Trends in Computer Science
Techsparks
 
Innovating in Big Data
imec
 
Guide to big data analytics
Gahya Pandian
 
Data analyst vs data scientist
AbhaySharma786746
 
How I Learned to Stop Worrying and Love Linked Data
Domino Data Lab
 
Augment the Human
Richard Marshall
 
From Science to Data: Following a principled path to Data Science
Institute of Contemporary Sciences
 
Latest Trends in Computer Science
Techsparks
 

What's hot (20)

PPTX
IoT and Big Data
Musa Kalimullah
 
PPTX
Big data Introduction
Musa Kalimullah
 
PDF
Simplifying Building Automation: Leveraging Semantic Tagging with a New Breed...
Memoori
 
PDF
Building up a Data Science Team from Scratch
Institute of Contemporary Sciences
 
PDF
5 Questions To Ask Before Getting Started With Data Annotation
Innodata, Inc
 
PPTX
Charles Slicer-Watkinson - Subject Matter Expert, Seerene
Global Business Intelligence
 
PPTX
Big Data
ipower softwares
 
PPTX
EPR Annual Conference 2020 Workshop 1 - Simon Uytterhoeven
EPR1
 
PPTX
Nonprofits + Data: Pathway to Innovation
Tim Sarrantonio
 
PPTX
The Role of Artificial Intelligence in Corporate Innovation
Dickson Lukose
 
PPTX
5 Key Areas in the Construction Industry, where Big Data Solutions Play a Piv...
SPEC INDIA
 
PDF
Big Data
Seminar Links
 
PPTX
Data Scientist:The Sexiest Job of 21st Century
Rishabh Singh
 
PPTX
Big Data, Data Visualization, Machine Learning & Artificial Intelligence by...
VIVEK PHALKE
 
PDF
Using Data Riches A tale of two projects - Ajay Vinze
Institute of Contemporary Sciences
 
PPTX
"Social innovation with (big) data" - Maurice Fransen, Analytics Lead Public ...
Dataconomy Media
 
PPTX
Evolution of big data technology
Market Analyzer
 
PDF
ADV Slides: The World in 2045 – What Has Artificial Intelligence Created?
DATAVERSITY
 
DOCX
Semantic Computing will make the Internet of Things 2
Bob Connell
 
PPTX
Customer Driven Products
Karl Seiler
 
IoT and Big Data
Musa Kalimullah
 
Big data Introduction
Musa Kalimullah
 
Simplifying Building Automation: Leveraging Semantic Tagging with a New Breed...
Memoori
 
Building up a Data Science Team from Scratch
Institute of Contemporary Sciences
 
5 Questions To Ask Before Getting Started With Data Annotation
Innodata, Inc
 
Charles Slicer-Watkinson - Subject Matter Expert, Seerene
Global Business Intelligence
 
EPR Annual Conference 2020 Workshop 1 - Simon Uytterhoeven
EPR1
 
Nonprofits + Data: Pathway to Innovation
Tim Sarrantonio
 
The Role of Artificial Intelligence in Corporate Innovation
Dickson Lukose
 
5 Key Areas in the Construction Industry, where Big Data Solutions Play a Piv...
SPEC INDIA
 
Big Data
Seminar Links
 
Data Scientist:The Sexiest Job of 21st Century
Rishabh Singh
 
Big Data, Data Visualization, Machine Learning & Artificial Intelligence by...
VIVEK PHALKE
 
Using Data Riches A tale of two projects - Ajay Vinze
Institute of Contemporary Sciences
 
"Social innovation with (big) data" - Maurice Fransen, Analytics Lead Public ...
Dataconomy Media
 
Evolution of big data technology
Market Analyzer
 
ADV Slides: The World in 2045 – What Has Artificial Intelligence Created?
DATAVERSITY
 
Semantic Computing will make the Internet of Things 2
Bob Connell
 
Customer Driven Products
Karl Seiler
 
Ad

Similar to SymEx 2015 - Agile Process for Big Data Analytic (20)

PPTX
Big data unit 2
RojaT4
 
PPTX
Usama Fayyad talk in South Africa: From BigData to Data Science
Usama Fayyad
 
PDF
Big Data Analytics - Best of the Worst : Anti-patterns & Antidotes
Krishna Sankar
 
PPTX
Big data ppt
Deepika ParthaSarathy
 
PPTX
Big data
Mani Gandan
 
PPTX
Big data
Prince Barai
 
PPTX
Explorasi Data untuk Peluang Bisnis dan Pengembangan Karir.pptx
windu19
 
PPTX
Chapter 1 Introduction to Data Science (Computing)
jayashirymorgan
 
PPTX
Big Data_Big Data_Big Data-Big Data_Big Data
Harish Khodke
 
PPTX
000 introduction to big data analytics 2021
Dendej Sawarnkatat
 
PPTX
Mtech First_Year Data Analytics in Industry with power bI
SachinDhavane
 
PPTX
Big Data Analytics
Ghulam Imaduddin
 
PDF
Bigdataanalytics
Haroon Karim
 
PDF
Big data Analytics
ShivanandaVSeeri
 
PDF
Emcien overview v6 01282013
WCJones6348
 
PDF
Introduction to Data Science - Fundamentals
jayashirymorgan
 
PPTX
Big Data & Business Analytics: Understanding the Marketspace
Bala Iyer
 
PPTX
Trends in data analytics
Ramakrishnan Venkataramanan
 
PPTX
Turning information chaos into reliable data: Tools and techniques to interpr...
Career Communications Group
 
PPTX
Big Data Analytics
Global Business Solutions SME
 
Big data unit 2
RojaT4
 
Usama Fayyad talk in South Africa: From BigData to Data Science
Usama Fayyad
 
Big Data Analytics - Best of the Worst : Anti-patterns & Antidotes
Krishna Sankar
 
Big data ppt
Deepika ParthaSarathy
 
Big data
Mani Gandan
 
Big data
Prince Barai
 
Explorasi Data untuk Peluang Bisnis dan Pengembangan Karir.pptx
windu19
 
Chapter 1 Introduction to Data Science (Computing)
jayashirymorgan
 
Big Data_Big Data_Big Data-Big Data_Big Data
Harish Khodke
 
000 introduction to big data analytics 2021
Dendej Sawarnkatat
 
Mtech First_Year Data Analytics in Industry with power bI
SachinDhavane
 
Big Data Analytics
Ghulam Imaduddin
 
Bigdataanalytics
Haroon Karim
 
Big data Analytics
ShivanandaVSeeri
 
Emcien overview v6 01282013
WCJones6348
 
Introduction to Data Science - Fundamentals
jayashirymorgan
 
Big Data & Business Analytics: Understanding the Marketspace
Bala Iyer
 
Trends in data analytics
Ramakrishnan Venkataramanan
 
Turning information chaos into reliable data: Tools and techniques to interpr...
Career Communications Group
 
Big Data Analytics
Global Business Solutions SME
 
Ad

More from PMI Indonesia Chapter (8)

PDF
SymEx 2015 - Troubled Project Recovery, The Story of Firefighter & Hero
PMI Indonesia Chapter
 
PDF
SymEx 2015 - Business Transformation, Change Management And Organization Rest...
PMI Indonesia Chapter
 
PDF
SymEx 2015 - Delivering Transformation in Infrastructure Portfolio/Business t...
PMI Indonesia Chapter
 
PDF
SymEx 2015 - Faster Projects, High Performance and Team Harmony with Critical...
PMI Indonesia Chapter
 
PDF
SymEx 2015 - Turning Risks Into Results, A Wider Perspective to Understand P...
PMI Indonesia Chapter
 
PDF
SymEx 2015 - How to Make Your Major IT Projects Fly with the Help of IT Gove...
PMI Indonesia Chapter
 
PDF
SymEx 2015 - Global Trends in Project Management
PMI Indonesia Chapter
 
PDF
PMI Indonesia Chapter Profile
PMI Indonesia Chapter
 
SymEx 2015 - Troubled Project Recovery, The Story of Firefighter & Hero
PMI Indonesia Chapter
 
SymEx 2015 - Business Transformation, Change Management And Organization Rest...
PMI Indonesia Chapter
 
SymEx 2015 - Delivering Transformation in Infrastructure Portfolio/Business t...
PMI Indonesia Chapter
 
SymEx 2015 - Faster Projects, High Performance and Team Harmony with Critical...
PMI Indonesia Chapter
 
SymEx 2015 - Turning Risks Into Results, A Wider Perspective to Understand P...
PMI Indonesia Chapter
 
SymEx 2015 - How to Make Your Major IT Projects Fly with the Help of IT Gove...
PMI Indonesia Chapter
 
SymEx 2015 - Global Trends in Project Management
PMI Indonesia Chapter
 
PMI Indonesia Chapter Profile
PMI Indonesia Chapter
 

Recently uploaded (20)

PDF
Agile Chennai 18-19 July 2025 | Workshop - Leadership in an Uncertain World: ...
AgileNetwork
 
PDF
Branding Potentials of Keyword Search Ads The Effects of Ad Rankings on Bran...
hritikamishra2k
 
PPTX
Itc market and how ITC shift form cigarette market to all other market like w...
sanu1902singh
 
PDF
Intro to Org Topologies by Rowan Bunning.pdf
Rowan Bunning
 
PPTX
Agile Chennai 18-19 July 2025 | Leading with Integrity in the Age of AI – A C...
AgileNetwork
 
PPTX
Empowering Women Achieving Dreams Setting and Reaching Your Personal Profess...
Muhammad Musawar Ali
 
PPTX
Multicolor leadership kepemimpinan untuk organisasi
GusTri5
 
PDF
Geopolitical Uncertainties, Dynamic Capabilities, and Technology Management
David Teece
 
PPTX
Sardar Vallabhbhai Patel ironman of india.pptx
pruthvi07899
 
PPTX
english presenation on professional writing and its types.pptx
WajahatAli434864
 
PPTX
Leadership Meaning and Styles- Autocratic, Paternalis--
PoojaShetty805509
 
PDF
2019_10 The changing world of the Law Firm CFO
tanbir16
 
PDF
250628-Challenges of Field Offices in Pharmacovigilance-CQS.pdf
Obaid Ali / Roohi B. Obaid
 
PPTX
Project Management with Knowledge Areas and AI
Usman Zafar Malik
 
PPTX
MFJDJSJSNXJCJJDJSNSKSDJNJCJSKSJAJSJDJKDKSJS
MaryanneRoseElder
 
PDF
250628-Training of Field Offices-CQS.pdf
Obaid Ali / Roohi B. Obaid
 
PDF
Asia’s Healthcare Power Players - The Visionary CEOs Reshaping Medicine for 4...
Gorman Bain Capital
 
PDF
SpatzAI is a self-managed micro-conflict toolkit that helps teams resolve one...
Desmond Sherlock
 
PDF
250621-Medical Review in Pharmacovigilance-CQS.pdf
Obaid Ali / Roohi B. Obaid
 
PDF
Asia’s Health Titans - Meet the Hospital CEOs Revolutionizing Care Across the...
Gorman Bain Capital
 
Agile Chennai 18-19 July 2025 | Workshop - Leadership in an Uncertain World: ...
AgileNetwork
 
Branding Potentials of Keyword Search Ads The Effects of Ad Rankings on Bran...
hritikamishra2k
 
Itc market and how ITC shift form cigarette market to all other market like w...
sanu1902singh
 
Intro to Org Topologies by Rowan Bunning.pdf
Rowan Bunning
 
Agile Chennai 18-19 July 2025 | Leading with Integrity in the Age of AI – A C...
AgileNetwork
 
Empowering Women Achieving Dreams Setting and Reaching Your Personal Profess...
Muhammad Musawar Ali
 
Multicolor leadership kepemimpinan untuk organisasi
GusTri5
 
Geopolitical Uncertainties, Dynamic Capabilities, and Technology Management
David Teece
 
Sardar Vallabhbhai Patel ironman of india.pptx
pruthvi07899
 
english presenation on professional writing and its types.pptx
WajahatAli434864
 
Leadership Meaning and Styles- Autocratic, Paternalis--
PoojaShetty805509
 
2019_10 The changing world of the Law Firm CFO
tanbir16
 
250628-Challenges of Field Offices in Pharmacovigilance-CQS.pdf
Obaid Ali / Roohi B. Obaid
 
Project Management with Knowledge Areas and AI
Usman Zafar Malik
 
MFJDJSJSNXJCJJDJSNSKSDJNJCJSKSJAJSJDJKDKSJS
MaryanneRoseElder
 
250628-Training of Field Offices-CQS.pdf
Obaid Ali / Roohi B. Obaid
 
Asia’s Healthcare Power Players - The Visionary CEOs Reshaping Medicine for 4...
Gorman Bain Capital
 
SpatzAI is a self-managed micro-conflict toolkit that helps teams resolve one...
Desmond Sherlock
 
250621-Medical Review in Pharmacovigilance-CQS.pdf
Obaid Ali / Roohi B. Obaid
 
Asia’s Health Titans - Meet the Hospital CEOs Revolutionizing Care Across the...
Gorman Bain Capital
 

SymEx 2015 - Agile Process for Big Data Analytic

  • 1. • More than 17 years experiences in IT industry with theoretical physics background. He started as scientific programmer at University of Indonesia’s semiconductor lab then later worked as software engineer and architect in various software companies. He joined SRIN in 2014 to lead development of various mobile apps and middleware platforms, as well as to conduct research projects on predictive data analytic using deep machine learning technologies. Prior to SRIN, he spent 10 years at Microsoft Indonesia as Director of Developer Ecosystem (DX) division. insert photo
  • 3.  Context of “Big Data” Science  Scope of Data Analytic  Project Management Complexities  Team Structure and R&R  Agile Principles and Process Model  Common Execution Issues  Q&A
  • 4. Volume Exceeds physical limits of vertical scalability Velocity Decision window small compared to data change rate Variety Many different formats makes integration expensive Variability Many options or variable interpretations confound analysis
  • 5. By 2015, organizations that build a modern information management system will outperform their peers financially by 20 percent. – – Gartner, Mark Beyer “Information Management in the 21st Century”  Data, Data, .. Everywhere  New Data Sources  Larger Data Volumes  New Data Management Technologies  Hadoop + Spark + Tool Ecosystem  New Era of Data Analytic  Descriptive, Predictive & Prescriptive  Data-Driven Organization 10x increase every five years 85%from new data types Volume Velocity Variety
  • 7. 2013 1940 1945 1950 1955 1960 1965 1970 1975 1980 1985 1990 1995 2000 2005 2010 2015 Cloud Data Storage is Unlimited Quincy, WA Chicago, IL San Antonio, TX Dublin, Ireland Generation 4 DCs
  • 8. 2015 1940 1945 1950 1955 1960 1965 1970 1975 1980 1985 1990 1995 2000 2005 2010 2015
  • 12. Generic Tasks 1. Define Analytic Requirement 2. Setup Infrastructure 3. Collect Data 4. Data Modeling 5. Data Processing 6. Model Deployment 7. Monitoring 8. Evaluation 9. Etc…. Complexity Value Descriptive Analytics Diagnostic Analytics Predictive Analytics Prescriptive Analytics What happened? Why did it happen? What will happen? How can we make it happen?
  • 14. Vision Analytics Recommenda- tion engines Advertising analysis Weather forecasting for business planning Social network analysis Legal discovery and document archiving Pricing analysis Fraud detection Churn analysis Equipment monitoring Location- based tracking and services Personalized Insurance Advance computation based on machine learning & predictive analytics are core capabilities that are needed throughout future business
  • 16. Pull-based Batch Loads Enterprise Data Models Complex ETL Logic Poorly Suited to Non-Relational Data Emergent design is difficult
  • 19. Much More than Technologies people process
  • 20. New Roles: 1. Data Engineer 2. Data Scientists
  • 22. CRISP-DM - Cross Industry Standard Process for Data Mining.  Framework for Guidance  Process Model  Non-proprietary  Experience Base  Application/Industry neutral  Tool neutral  Focus on business issues  As well as technical analysis
  • 26. Business Understanding Data Understanding Data Preparation Modeling DeploymentEvaluation Format Data Integrate Data Construct Data Clean Data Select Data Determine Business Objectives Review Project Produce Final Report Plan Monitoring & Maintenance Plan Deployment Determine Next Steps Review Process Evaluate Results Assess Model Build Model Generate Test Design Select Modeling Technique Assess Situation Explore Data Describe Data Collect Initial Data Determine Data Mining Goals Verify Data Quality Produce Project Plan
  • 27. Common Issues  Learning curve for data science & data engineer.  We can’t design insights, we discover it through exploring  Low data quality .. Less insights from the data.  The result is not good enough. Key Strategies  Extra dedicated time to learn before project sprints (Eq. MOOC).  Add capabilities to explore data, iterate and publish intermediate results.  Improve data quality based on feedbacks.  Build-Measure-Release iteration.