SlideShare a Scribd company logo
Bridging the Gap Between Data Science
& Engineering:
Building High-Performing Teams
How do I hire a data scientist?
Software Engineer Data Engineer Data Scientist Applied Scientist Research Scientist
Continuum of Skills
Software Engineer Data Engineer Data Scientist Applied Scientist Research Scientist
Continuum of Skills
Math &
Stats
Computer
Science
Domain
Expertise
Machine
Learning
Software
Engineering Research
Unicorn
Data Science
Many companies try to find all of these skills in a
single person.
Which leads to job requirements like this…
• MSc/PhD in Computer Science, Electrical Engineering, Math or Statistics
• At least 5 years of experience in solving real-world practical problems using Machine Learning
• At least 5 years of experience on mining and modeling large-scale data (hundreds of terabytes)
• Extensive in-depth knowledge of Data Mining, Machine Learning, Algorithms
• Knowledge of at least one high-level programming language (C++, Java)
• Knowledge of at least one scripting language (Perl, Python, Ruby)
• Knowledge of SQL and experience with large relational databases
• Knowledge of at least one ML toolset (R, Weka, KNIME, Octave, Mahout, scikit-learn)
• Strong ability to formalize and provide practical solutions to research problems
• Strong communication skills and ability to work independently to get an idea from inception to
implementation.
• Knowledge of the state of the art in at least one of Bayesian Optimization, Recommendation
Systems, Social Network Analysis, Information Retrieval
• At least 5 years of experience with storing, sampling, querying large-scale data (hundreds of
terabytes) and experimentation frameworks
• At least 5 years of experience with Hadoop, Spark, Mahout or Giraph
Data Science Unicorn
These people do exist, but they are often already
well-compensated, and only want to work on
interesting problems.
What can you do?
Build a team instead.
Bridging the Gap Between Data Science & Engineer: Building High-Performance Teams
Broad-range generalist
Deepexpertise
Look for T-shaped people
Machine Learning,
Statistics, Domain Knowledge
Softw
are
Engineering
Business
Acum
en
Distributed
Com
puting
Com
m
unication
Look for T-shaped people
• Compose teams of individuals who
have overlapping skill-sets and
deep expertise in one area
(machine learning, statistics,
engineering, business, etc.)
• The overlap allows them to speak
the same language and work
collaboratively on solving problems
How do I structure my data science team within
my organization?
Data Science Team Structures
CentralizedEmbeddedHub & Spoke
Centralized
Data Scientists sit on a team that
acts as internal consultants, fielding
and answering questions from
multiple teams within the
organization, defining tools for the
organization, and acting as highly
powered consultants.
Embedded
• Data Scientists are almost wholly
embedded within one particular team
and focus on solving problems for that
team.
• Teams are assigned to one particular
product or function within the company
and define and answer questions for
that product or function.
Hub & Spoke
• The data science team sits
together physically and works
collaboratively to solve problems.
• However, each data scientist (or
a combination of them) gets
deployed to work on problems
within the organization.
• Tends to apply to companies
who have a lot of users.
Data Science Team Structure
CentralizedEmbeddedHub & Spoke
> >
How do I get my data scientists to work with
engineering?
Data Science
Python R
modeling & prototyping production
Software Engineering
Java/C++ RoR/Javascript
Data Science Software Engineering
Python R Java/C++ RoR/Javascript
modeling & prototyping production
Data scientists learn
to write prototypes
in production
languages
Engineers learn the
basics of data
science so they can
understand how
the models work
Goal is to have both teams speak
the same language and engender
trust through communication
Data Science Data Engineering
Common Core
Data Science
Curriculum
Data Engineering
Curriculum
Data Science Data Engineering
Projects
Data Science Engineering
Initial Planning
Data Science Engineering
Data Science Engineering
Production
• Don’t look for unicorns, build collaborative
teams of T-shaped people
• Pay attention to how your data science team is
structured within your organization
• Get your data science and engineering teams to
speak the same language, allowing them to build
trust and work collaboratively
Summary
We believe an opportunity belongs 

to anyone with aptitude and ambition.
29Galvanize 2015
NODES ON THE NETWORK
COLORADO (BOULDER, DENVER, FORT COLLINS)
SEATTLE, WA
SAN FRANCISCO, CA
AUSTIN, TX (OPENING Q1 2016)
Programs: Full Stack Immersive, Data Science Immersive,
Entrepreneurship
Programs: Full Stack Immersive, Data Science Immersive,
Entrepreneurship
Programs: Full Stack Immersive, Data Science Immersive, Data
Engineering Immersive, Masters of Science in Data Science,
Entrepreneurship
Programs: Full Stack Immersive, Data Science Immersive,
Entrepreneurship
[Explanation Text]
30Galvanize 2015
PLACEMENT STATS
FULL STACK IMMERSIVE DATA SCIENCE IMMERSIVE
$43K $77KPre-program Salary
Average Starting Salary
97% Placement
Rate*
*Galvanize is a founder member of NESTA (New Economy Skills Training Association), a trade organization founded to regulate the new “bootcamp” market.
This place rate is more rigorous than that requested by state licensure agencies. The placement rate is calculated 6 months after graduation.
$72K $114KPre-program Salary
94%Placement
Rate*
Average Starting Salary
31Galvanize 2015
5 PROGRAMS
• Full Stack Immersive
• Data Science Immersive
• Data Engineering Immersive
Project over 500 Student Member Graduates in 2015
Currently over 1500 Members
• Master of Science in Data Science 

(University of New Haven)
• Startup Membership
32Galvanize 2015
FULL STACK IMMERSIVE
• 97% Placement Rate 

within 6 months
• $77K Average Starting Salary
• 6 Month Program
33Galvanize 2015
FULL STACK IMMERSIVE
34Galvanize 2015
DATA SCIENCE IMMERSIVE
• 94% Placement Rate 

within 6 months
• $114K Average Starting Salary
• 3 Month Program
35Galvanize 2015
DATA SCIENCE IMMERSIVE
Week 1 - Exploratory Data Analysis and Software Engineering Best Practices
Week 2 - Statistical Inference, Bayesian Methods, A/B Testing, Multi-Armed Bandit
Week 3 - Regression, Regularization, Gradient Descent
Week 4 - Supervised Machine Learning: Classification, Validation, Ensemble Methods
Week 5 - Clustering, Topic Modeling (NMF, LDA), NLP
Week 6 - Network Analysis, Matrix Factorization, and Time Series
Week 7 - Hadoop, Hive, and MapReduce
Week 8 - Data Visualization with D3.js, Data Products, and Fraud Detection Case Study
Weeks 9-10 - Capstone Projects
Week 12 - Onsite Interviews
36Galvanize 2015
DATA SCIENCE IMMERSIVE
37Galvanize 2015
DATA ENGINEERING IMMERSIVE
• Launched Oct. 2015
• Built in partnership with Nvent and
Concurrent
• 3 Month Program
THANK YOU
RYAN ORBAN | EVP OF PRODUCT & STRATEGY
ryan.orban@galvanize.com
@ryanorban
www.galvanize.com

More Related Content

What's hot (20)

PDF
10x THINKING: innovation mindset from google
Annova Studio
 
PDF
10 Productivity Hacks Backed By Science
When I Work
 
PDF
Top Tips For Working Smarter
InterQuest Group
 
PDF
Habits at Work - Merci Victoria Grace, Growth, Slack - 2016 Habit Summit
Habit Summit
 
PDF
Visualising Data with Code
Ri Liu
 
PPT
Digital Transformation: What it is and how to get there
Econsultancy
 
PPTX
Discover The Top 10 Types Of Colleagues Around You
Ankur Tandon
 
PPTX
Top 10 Learnings Growing to (Almost) $10 Million ARR: Leo's presentation at S...
Buffer
 
PDF
DesignOps Handbook Condensed
Peter Weibrecht
 
PPTX
The Architect's Clue Bucket
Ruth Malan
 
PPTX
Company Portfolio Example by i-pointing ltd.
i-pointing ltd.
 
PDF
17 Things Powerful People Say
GetSmarter
 
PDF
Apply Design Thinking (Design Thinking Action Lab - Stanford University)
Esfandiar Khaleghi
 
PDF
Building the Metaverse
Jon Radoff
 
PDF
TEDx Manchester: AI & The Future of Work
Volker Hirsch
 
PDF
10 Steps great leaders take when things go wrong
GetSmarter
 
PDF
Montreal Girl Geeks: Building the Modern Web
Rachel Andrew
 
PDF
ChatGPT OpenAI Primer for Business
Dion Hinchcliffe
 
PDF
The Platform Manifesto - 16 principles for digital transformation
Sangeet Paul Choudary
 
PPTX
DesignOps and the design of efficient teams: the metrics and the processes th...
Patrizia Bertini
 
10x THINKING: innovation mindset from google
Annova Studio
 
10 Productivity Hacks Backed By Science
When I Work
 
Top Tips For Working Smarter
InterQuest Group
 
Habits at Work - Merci Victoria Grace, Growth, Slack - 2016 Habit Summit
Habit Summit
 
Visualising Data with Code
Ri Liu
 
Digital Transformation: What it is and how to get there
Econsultancy
 
Discover The Top 10 Types Of Colleagues Around You
Ankur Tandon
 
Top 10 Learnings Growing to (Almost) $10 Million ARR: Leo's presentation at S...
Buffer
 
DesignOps Handbook Condensed
Peter Weibrecht
 
The Architect's Clue Bucket
Ruth Malan
 
Company Portfolio Example by i-pointing ltd.
i-pointing ltd.
 
17 Things Powerful People Say
GetSmarter
 
Apply Design Thinking (Design Thinking Action Lab - Stanford University)
Esfandiar Khaleghi
 
Building the Metaverse
Jon Radoff
 
TEDx Manchester: AI & The Future of Work
Volker Hirsch
 
10 Steps great leaders take when things go wrong
GetSmarter
 
Montreal Girl Geeks: Building the Modern Web
Rachel Andrew
 
ChatGPT OpenAI Primer for Business
Dion Hinchcliffe
 
The Platform Manifesto - 16 principles for digital transformation
Sangeet Paul Choudary
 
DesignOps and the design of efficient teams: the metrics and the processes th...
Patrizia Bertini
 

Similar to Bridging the Gap Between Data Science & Engineer: Building High-Performance Teams (20)

PDF
Building successful data science teams
Venkatesh Umaashankar
 
PDF
How to become a data scientist
Manjunath Sindagi
 
PDF
Untitled document.pdf
MuhammadTahiriqbal13
 
PPTX
Data science as a professional career
David Rostcheck
 
PDF
Data science - An Introduction
Ravishankar Rajagopalan
 
PDF
Introduction to Data Science.pdf
University of Sindh
 
PPTX
introductiontodatascience-230122140841-b90a0856 (1).pptx
urvashipundir04
 
PPTX
Introduction to Data Science.pptx
Vrishit Saraswat
 
PDF
Data science a practitioner's perspective
Amir Ziai
 
PPTX
Introduction to Data Science.pptx
Dr.Shweta
 
PDF
Decoding Data Science
Matt Fornito
 
PDF
Enabling Your Data Science Team with Modern Data Engineering
James Densmore
 
PPTX
Data science presentation - Management career institute
PoojaPatidar11
 
PDF
Data Science Training and Placement
AkhilGGM
 
PDF
Guide for a Data Scientist
Rohit Dubey
 
PPTX
INTRODUCTION TO DATA SCIENCE -CONCEPTS.pptx
Madhumitha N
 
PDF
Data science presentation
MSDEVMTL
 
PDF
Data science training in hyd pdf converted (1)
SayyedYusufali
 
PDF
Data science training in hydpdf converted (1)
SayyedYusufali
 
PDF
Data science training in hyd ppt converted (1)
SayyedYusufali
 
Building successful data science teams
Venkatesh Umaashankar
 
How to become a data scientist
Manjunath Sindagi
 
Untitled document.pdf
MuhammadTahiriqbal13
 
Data science as a professional career
David Rostcheck
 
Data science - An Introduction
Ravishankar Rajagopalan
 
Introduction to Data Science.pdf
University of Sindh
 
introductiontodatascience-230122140841-b90a0856 (1).pptx
urvashipundir04
 
Introduction to Data Science.pptx
Vrishit Saraswat
 
Data science a practitioner's perspective
Amir Ziai
 
Introduction to Data Science.pptx
Dr.Shweta
 
Decoding Data Science
Matt Fornito
 
Enabling Your Data Science Team with Modern Data Engineering
James Densmore
 
Data science presentation - Management career institute
PoojaPatidar11
 
Data Science Training and Placement
AkhilGGM
 
Guide for a Data Scientist
Rohit Dubey
 
INTRODUCTION TO DATA SCIENCE -CONCEPTS.pptx
Madhumitha N
 
Data science presentation
MSDEVMTL
 
Data science training in hyd pdf converted (1)
SayyedYusufali
 
Data science training in hydpdf converted (1)
SayyedYusufali
 
Data science training in hyd ppt converted (1)
SayyedYusufali
 
Ad

Recently uploaded (20)

PPTX
GenAI-Introduction-to-Copilot-for-Bing-March-2025-FOR-HUB.pptx
cleydsonborges1
 
PPT
Lecture 2-1.ppt at a higher learning institution such as the university of Za...
rachealhantukumane52
 
PDF
OPPOTUS - Malaysias on Malaysia 1Q2025.pdf
Oppotus
 
PPT
01 presentation finyyyal معهد معايره.ppt
eltohamym057
 
PDF
AUDITABILITY & COMPLIANCE OF AI SYSTEMS IN HEALTHCARE
GAHI Youssef
 
PPTX
The _Operations_on_Functions_Addition subtruction Multiplication and Division...
mdregaspi24
 
PPTX
加拿大尼亚加拉学院毕业证书{Niagara在读证明信Niagara成绩单修改}复刻
Taqyea
 
PPT
Data base management system Transactions.ppt
gandhamcharan2006
 
PDF
Product Management in HealthTech (Case Studies from SnappDoctor)
Hamed Shams
 
PDF
List of all the AI prompt cheat codes.pdf
Avijit Kumar Roy
 
PDF
Incident Response and Digital Forensics Certificate
VICTOR MAESTRE RAMIREZ
 
PPTX
apidays Helsinki & North 2025 - Vero APIs - Experiences of API development in...
apidays
 
PDF
Data Chunking Strategies for RAG in 2025.pdf
Tamanna
 
PPTX
AI Presentation Tool Pitch Deck Presentation.pptx
ShyamPanthavoor1
 
PDF
apidays Helsinki & North 2025 - REST in Peace? Hunting the Dominant Design fo...
apidays
 
PDF
What does good look like - CRAP Brighton 8 July 2025
Jan Kierzyk
 
PDF
Context Engineering vs. Prompt Engineering, A Comprehensive Guide.pdf
Tamanna
 
PPTX
Exploring Multilingual Embeddings for Italian Semantic Search: A Pretrained a...
Sease
 
PDF
2_Management_of_patients_with_Reproductive_System_Disorders.pdf
motbayhonewunetu
 
PDF
MusicVideoProjectRubric Animation production music video.pdf
ALBERTIANCASUGA
 
GenAI-Introduction-to-Copilot-for-Bing-March-2025-FOR-HUB.pptx
cleydsonborges1
 
Lecture 2-1.ppt at a higher learning institution such as the university of Za...
rachealhantukumane52
 
OPPOTUS - Malaysias on Malaysia 1Q2025.pdf
Oppotus
 
01 presentation finyyyal معهد معايره.ppt
eltohamym057
 
AUDITABILITY & COMPLIANCE OF AI SYSTEMS IN HEALTHCARE
GAHI Youssef
 
The _Operations_on_Functions_Addition subtruction Multiplication and Division...
mdregaspi24
 
加拿大尼亚加拉学院毕业证书{Niagara在读证明信Niagara成绩单修改}复刻
Taqyea
 
Data base management system Transactions.ppt
gandhamcharan2006
 
Product Management in HealthTech (Case Studies from SnappDoctor)
Hamed Shams
 
List of all the AI prompt cheat codes.pdf
Avijit Kumar Roy
 
Incident Response and Digital Forensics Certificate
VICTOR MAESTRE RAMIREZ
 
apidays Helsinki & North 2025 - Vero APIs - Experiences of API development in...
apidays
 
Data Chunking Strategies for RAG in 2025.pdf
Tamanna
 
AI Presentation Tool Pitch Deck Presentation.pptx
ShyamPanthavoor1
 
apidays Helsinki & North 2025 - REST in Peace? Hunting the Dominant Design fo...
apidays
 
What does good look like - CRAP Brighton 8 July 2025
Jan Kierzyk
 
Context Engineering vs. Prompt Engineering, A Comprehensive Guide.pdf
Tamanna
 
Exploring Multilingual Embeddings for Italian Semantic Search: A Pretrained a...
Sease
 
2_Management_of_patients_with_Reproductive_System_Disorders.pdf
motbayhonewunetu
 
MusicVideoProjectRubric Animation production music video.pdf
ALBERTIANCASUGA
 
Ad

Bridging the Gap Between Data Science & Engineer: Building High-Performance Teams

  • 1. Bridging the Gap Between Data Science & Engineering: Building High-Performing Teams
  • 2. How do I hire a data scientist?
  • 3. Software Engineer Data Engineer Data Scientist Applied Scientist Research Scientist Continuum of Skills
  • 4. Software Engineer Data Engineer Data Scientist Applied Scientist Research Scientist Continuum of Skills
  • 6. Many companies try to find all of these skills in a single person.
  • 7. Which leads to job requirements like this… • MSc/PhD in Computer Science, Electrical Engineering, Math or Statistics • At least 5 years of experience in solving real-world practical problems using Machine Learning • At least 5 years of experience on mining and modeling large-scale data (hundreds of terabytes) • Extensive in-depth knowledge of Data Mining, Machine Learning, Algorithms • Knowledge of at least one high-level programming language (C++, Java) • Knowledge of at least one scripting language (Perl, Python, Ruby) • Knowledge of SQL and experience with large relational databases • Knowledge of at least one ML toolset (R, Weka, KNIME, Octave, Mahout, scikit-learn) • Strong ability to formalize and provide practical solutions to research problems • Strong communication skills and ability to work independently to get an idea from inception to implementation. • Knowledge of the state of the art in at least one of Bayesian Optimization, Recommendation Systems, Social Network Analysis, Information Retrieval • At least 5 years of experience with storing, sampling, querying large-scale data (hundreds of terabytes) and experimentation frameworks • At least 5 years of experience with Hadoop, Spark, Mahout or Giraph
  • 9. These people do exist, but they are often already well-compensated, and only want to work on interesting problems.
  • 10. What can you do? Build a team instead.
  • 13. Machine Learning, Statistics, Domain Knowledge Softw are Engineering Business Acum en Distributed Com puting Com m unication Look for T-shaped people
  • 14. • Compose teams of individuals who have overlapping skill-sets and deep expertise in one area (machine learning, statistics, engineering, business, etc.) • The overlap allows them to speak the same language and work collaboratively on solving problems
  • 15. How do I structure my data science team within my organization?
  • 16. Data Science Team Structures CentralizedEmbeddedHub & Spoke
  • 17. Centralized Data Scientists sit on a team that acts as internal consultants, fielding and answering questions from multiple teams within the organization, defining tools for the organization, and acting as highly powered consultants.
  • 18. Embedded • Data Scientists are almost wholly embedded within one particular team and focus on solving problems for that team. • Teams are assigned to one particular product or function within the company and define and answer questions for that product or function.
  • 19. Hub & Spoke • The data science team sits together physically and works collaboratively to solve problems. • However, each data scientist (or a combination of them) gets deployed to work on problems within the organization. • Tends to apply to companies who have a lot of users.
  • 20. Data Science Team Structure CentralizedEmbeddedHub & Spoke > >
  • 21. How do I get my data scientists to work with engineering?
  • 22. Data Science Python R modeling & prototyping production Software Engineering Java/C++ RoR/Javascript
  • 23. Data Science Software Engineering Python R Java/C++ RoR/Javascript modeling & prototyping production
  • 24. Data scientists learn to write prototypes in production languages Engineers learn the basics of data science so they can understand how the models work Goal is to have both teams speak the same language and engender trust through communication
  • 25. Data Science Data Engineering Common Core Data Science Curriculum Data Engineering Curriculum Data Science Data Engineering Projects
  • 26. Data Science Engineering Initial Planning Data Science Engineering Data Science Engineering Production
  • 27. • Don’t look for unicorns, build collaborative teams of T-shaped people • Pay attention to how your data science team is structured within your organization • Get your data science and engineering teams to speak the same language, allowing them to build trust and work collaboratively Summary
  • 28. We believe an opportunity belongs 
 to anyone with aptitude and ambition.
  • 29. 29Galvanize 2015 NODES ON THE NETWORK COLORADO (BOULDER, DENVER, FORT COLLINS) SEATTLE, WA SAN FRANCISCO, CA AUSTIN, TX (OPENING Q1 2016) Programs: Full Stack Immersive, Data Science Immersive, Entrepreneurship Programs: Full Stack Immersive, Data Science Immersive, Entrepreneurship Programs: Full Stack Immersive, Data Science Immersive, Data Engineering Immersive, Masters of Science in Data Science, Entrepreneurship Programs: Full Stack Immersive, Data Science Immersive, Entrepreneurship [Explanation Text]
  • 30. 30Galvanize 2015 PLACEMENT STATS FULL STACK IMMERSIVE DATA SCIENCE IMMERSIVE $43K $77KPre-program Salary Average Starting Salary 97% Placement Rate* *Galvanize is a founder member of NESTA (New Economy Skills Training Association), a trade organization founded to regulate the new “bootcamp” market. This place rate is more rigorous than that requested by state licensure agencies. The placement rate is calculated 6 months after graduation. $72K $114KPre-program Salary 94%Placement Rate* Average Starting Salary
  • 31. 31Galvanize 2015 5 PROGRAMS • Full Stack Immersive • Data Science Immersive • Data Engineering Immersive Project over 500 Student Member Graduates in 2015 Currently over 1500 Members • Master of Science in Data Science 
 (University of New Haven) • Startup Membership
  • 32. 32Galvanize 2015 FULL STACK IMMERSIVE • 97% Placement Rate 
 within 6 months • $77K Average Starting Salary • 6 Month Program
  • 34. 34Galvanize 2015 DATA SCIENCE IMMERSIVE • 94% Placement Rate 
 within 6 months • $114K Average Starting Salary • 3 Month Program
  • 35. 35Galvanize 2015 DATA SCIENCE IMMERSIVE Week 1 - Exploratory Data Analysis and Software Engineering Best Practices Week 2 - Statistical Inference, Bayesian Methods, A/B Testing, Multi-Armed Bandit Week 3 - Regression, Regularization, Gradient Descent Week 4 - Supervised Machine Learning: Classification, Validation, Ensemble Methods Week 5 - Clustering, Topic Modeling (NMF, LDA), NLP Week 6 - Network Analysis, Matrix Factorization, and Time Series Week 7 - Hadoop, Hive, and MapReduce Week 8 - Data Visualization with D3.js, Data Products, and Fraud Detection Case Study Weeks 9-10 - Capstone Projects Week 12 - Onsite Interviews
  • 37. 37Galvanize 2015 DATA ENGINEERING IMMERSIVE • Launched Oct. 2015 • Built in partnership with Nvent and Concurrent • 3 Month Program
  • 38. THANK YOU RYAN ORBAN | EVP OF PRODUCT & STRATEGY [email protected] @ryanorban www.galvanize.com