SlideShare a Scribd company logo
2
Most read
4
Most read
KDD 2022, W26 Half Day Workshop – Tuesday August 16
3rd IADSS Workshop on Data Science Standards – Hiring, Assessing and Upskilling Data Science Talent
IADSS link https://www.iadss.org/
Workshop link https://www.iadss.org/kdd2022
One Hiring Process – Quiz for Learning and Reasoning
Greg Makowski
Head of Data Science Solutions
greg.Makowski@jci.com
www.LinkedIn.com/in/GregMakowski
Building Technologies & Solutions
2
© Copyright Johnson Controls. All rights reserved.
Johnson Controls—Public. Any unauthorized use, copying or distribution is strictly prohibited. Johnson Controls Confidential
Context
 My background
• I have been deploying data mining and data science models since 1992
• I have been hiring and building DS teams since 2010
 Assumptions driving requirements and design
• You can have candidates describe their past models deployed and development
process. That is not hard.
– However, that may not be a close match for upcoming work
• The types of future algorithms and architectures will likely be constantly changing,
between the project requirements and what is developing as State Of The Art (SOTA)
• Therefore, it would be good to get a sense of how a candidate would do on future,
upcoming projects and new algorithms
• There is a large variety of new things to learn, of new algorithms or tweaks at a
conference – but you shouldn’t expect the candidate to know all
•  Expect that constant learning is needed in the future
•  interview for the candidates learning process
3
© Copyright Johnson Controls. All rights reserved.
Johnson Controls—Public. Any unauthorized use, copying or distribution is strictly prohibited. Johnson Controls Confidential
Context – other parts of the interview process
• Job description – looking for a person that has ”deployed 3+ models”, like a ”medical residency”. 3-12 yrs
• Other parts of the interview process already cover common basics
– basic coding or software development skills
– algorithm skills on algorithms they know well or have deployed
• Treat the interview process as a “dating game” to get a good match for the candidate as well as the
employer
– What the company has to offer, existing enterprise software, tools, past projects
– Career path in Data Science
– “A day in the life of”, a “week in the life of” the candidate in the role
– Personal responsibility, ownership, how the they can contribute to the employers OKR’s (Objectives and Key
Results)
– Intellectual variety – vertical markets, customer personas, algorithm families
– Opportunities to intellectually grow, learn and adapt
• In final round, have candidate present on past project for 45 min, which includes Q&A
– Focus on their DS “value add”, avoid anything confidential to past client or co
– Like an internal DS presentation or to DS contacts at a client site, to make it easier
– Can be a reused conference or meetup presentation
– A chance for all the DS staff in a compatible time zone to understand the past project and ask questions
Exciting Mission
• Save the world
• Save people
• Save machines
4
© Copyright Johnson Controls. All rights reserved.
Johnson Controls—Public. Any unauthorized use, copying or distribution is strictly prohibited. Johnson Controls Confidential
Design of one part of the DS phone interview process
 ”Teach” for 15 minutes, going over content in slides
• Let the candidate ask questions, it is fine to ask to go back and forth in the slides
• Give a problem context, a client role in a vertical domain, their values and KPIs
• Pick something that goes deep into an algorithm, not at the “surface level” of a Python
function call, but on how the algorithm interacts with the data
• Every few slides, ask if they understand or have questions
• Describe in terms of different modalities (math, geometry, process)
 “ask reasoning questions” that may take 15-30 min
5
© Copyright Johnson Controls. All rights reserved.
Johnson Controls—Public. Any unauthorized use, copying or distribution is strictly prohibited. Johnson Controls Confidential
Design of one part of the DS phone interview process
 “ask reasoning questions” that may take 15-30 min
• One question is more algorithmic or geometrical
• One question is more about “fiduciary responsibility to the values and KPIs of the
client”, for the DS to think as a consultant or in the business deployment terms
• Tell the candidate they can ask to go back and forth over any of the training slides
• They can ask questions of the client contact. Then roleplay as a client (not knowing
DS).
• Track the time it takes per question, as one metric to compare candidates. It is mostly
used for comparisons of large differences
• For any ”binary” answers, the candidate has to explain “why”. Ask probing questions
• Ask the candidate to declare a “final answer state”. Don’t stop them if you hear
something close. They should be confident and describe their reasoning
– When deploying in the future, they have to be confident to know their “final answer state”
6
© Copyright Johnson Controls. All rights reserved.
Johnson Controls—Public. Any unauthorized use, copying or distribution is strictly prohibited. Johnson Controls Confidential
Design of one part of the “on-site” interview process
 Go over a short 2 week POC project in slides (something I know well and
worked on). Provide background slides w text and diagrams for 15 min
• Client requirements
• Data, it’s noise, what it represents in terms of a physical process or human decisions,
sample tables of different structures
• Initial EDA that you could take for granted that most people would do
• Offer to “role play” or be a “holodeck simulator” to answer questions
– Of the client, who knows objectives and data at a business process level
– Of the data, for a specified EDA, program or model, get a high level answer
– A fellow DS on the team, who doesn’t know the problem – may trigger questions
– Can go back and forth to any of the background slides
• Questions for ~30 min:
– How to structure the data to fit some algorithm family (very involved for this project)
– How to train the models, model development process (less time, maybe 10 min)
– Once you have an algorithm result, so what? What business process do you change to get
value or a business KPI metric? How to estimate the value? (not obvious for this project)
7
© Copyright Johnson Controls. All rights reserved.
Johnson Controls—Public. Any unauthorized use, copying or distribution is strictly prohibited. Johnson Controls Confidential
When designing questions, consider:
 Pick an algorithm or algorithm feature that is intentionally obscure
• Want to see a candidate learning and adapting experience, how they ask questions
• Want to avoid 1 in 10 candidates who already know the answer – then you can’t
compare how they “would learn X the first time”
 In the age of remote interviews, a person can web search easily
• Check common web searches in advance, what they return and how that could be
used. Consider how you may tweak your question or setup
• Maybe don’t use an algorithm or feature name, just call it “alg X”
8
© Copyright Johnson Controls. All rights reserved.
Johnson Controls—Public. Any unauthorized use, copying or distribution is strictly prohibited. Johnson Controls Confidential
Questions, discussions, alternate approaches, advice?
 Since 2010, candidates hired this way have been good at
• adapting to new projects and requirements
• No retention surprises or mis-hires

More Related Content

Similar to A Successful Hiring Process for Data Scientists (20)

PDF
Ace ux hiring with applied design thinking
Malini Rao
 
PPTX
Cracking The Technical Interview Uw
careercup
 
DOC
55 interview questions
svidhyaghantasala
 
PPTX
Cracking the coding interview columbia - march 23 2011
careercup
 
PPTX
Röck Yoür Technical Interview - V3
David McCarter
 
PPTX
Lessons after working as a data scientist for 1 year
Yao Yao
 
PDF
Karat at CMU
Anthony P. Rotoli
 
PPTX
Interview Skills (Presented for the Penbrooke Community Association)
Jeremy Dutton
 
PDF
92 electrical interview questions and answers
ElectricalCareer66
 
PPTX
Marc perillo
MarcPerillo1
 
PDF
Lessons Learned from Hiring and Retaining Data Practitioners
Tereza Iofciu
 
PPT
Milestones achievedrd catiller
Robert D Catiller
 
PPTX
Interviewing for Results - Eaton Corporation
Women in Electrical and Computer Engineering at Georgia Tech
 
DOCX
Trudell_Resume_2015
Bill Trudell
 
PDF
System Design Interview - from both sides of the table.pdf
Dejan Vukmirovic
 
PPTX
Brit_HUG_CurrrentStateof JDE
Brittany Raysin
 
PPTX
Brit_HUG_CurrrentStateof JDE
Brittany Raysin
 
PPTX
Gregory d sleeper profile
Greg Sleeper
 
PPTX
Case study analysis - Muffler magic
←ครђเรђ Batra
 
PDF
Milestones Achieved Rd Catiller
Robert D Catiller
 
Ace ux hiring with applied design thinking
Malini Rao
 
Cracking The Technical Interview Uw
careercup
 
55 interview questions
svidhyaghantasala
 
Cracking the coding interview columbia - march 23 2011
careercup
 
Röck Yoür Technical Interview - V3
David McCarter
 
Lessons after working as a data scientist for 1 year
Yao Yao
 
Karat at CMU
Anthony P. Rotoli
 
Interview Skills (Presented for the Penbrooke Community Association)
Jeremy Dutton
 
92 electrical interview questions and answers
ElectricalCareer66
 
Marc perillo
MarcPerillo1
 
Lessons Learned from Hiring and Retaining Data Practitioners
Tereza Iofciu
 
Milestones achievedrd catiller
Robert D Catiller
 
Interviewing for Results - Eaton Corporation
Women in Electrical and Computer Engineering at Georgia Tech
 
Trudell_Resume_2015
Bill Trudell
 
System Design Interview - from both sides of the table.pdf
Dejan Vukmirovic
 
Brit_HUG_CurrrentStateof JDE
Brittany Raysin
 
Brit_HUG_CurrrentStateof JDE
Brittany Raysin
 
Gregory d sleeper profile
Greg Sleeper
 
Case study analysis - Muffler magic
←ครђเรђ Batra
 
Milestones Achieved Rd Catiller
Robert D Catiller
 

More from Greg Makowski (18)

PPTX
Defense Against LLM Scheming 2025_04_28.pptx
Greg Makowski
 
PPTX
Understanding Hallucinations in LLMs - 2023 09 29.pptx
Greg Makowski
 
PPTX
Future of AI - 2023 07 25.pptx
Greg Makowski
 
PDF
Kdd 2019: Standardizing Data Science to Help Hiring
Greg Makowski
 
PPTX
Tales from an ip worker in consulting and software
Greg Makowski
 
PPTX
Predictive Model and Record Description with Segmented Sensitivity Analysis (...
Greg Makowski
 
PDF
Production model lifecycle management 2016 09
Greg Makowski
 
PDF
Using Deep Learning to do Real-Time Scoring in Practical Applications
Greg Makowski
 
PDF
Using Deep Learning to do Real-Time Scoring in Practical Applications - 2015-...
Greg Makowski
 
PDF
SFbayACM ACM Data Science Camp 2015 10 24
Greg Makowski
 
PDF
How to Create 80% of a Big Data Pilot Project
Greg Makowski
 
PDF
Powering Real­time Decision Engines in Finance and Healthcare using Open Sour...
Greg Makowski
 
PDF
Kamanja: Driving Business Value through Real-Time Decisioning Solutions
Greg Makowski
 
PDF
Heuristic design of experiments w meta gradient search
Greg Makowski
 
PDF
Three case studies deploying cluster analysis
Greg Makowski
 
PDF
Linked In Slides 2009 02 24 B
Greg Makowski
 
PDF
The 360º Leader (Section 2 of 6)
Greg Makowski
 
PPT
The 360º Leader (Section 1 of 6)
Greg Makowski
 
Defense Against LLM Scheming 2025_04_28.pptx
Greg Makowski
 
Understanding Hallucinations in LLMs - 2023 09 29.pptx
Greg Makowski
 
Future of AI - 2023 07 25.pptx
Greg Makowski
 
Kdd 2019: Standardizing Data Science to Help Hiring
Greg Makowski
 
Tales from an ip worker in consulting and software
Greg Makowski
 
Predictive Model and Record Description with Segmented Sensitivity Analysis (...
Greg Makowski
 
Production model lifecycle management 2016 09
Greg Makowski
 
Using Deep Learning to do Real-Time Scoring in Practical Applications
Greg Makowski
 
Using Deep Learning to do Real-Time Scoring in Practical Applications - 2015-...
Greg Makowski
 
SFbayACM ACM Data Science Camp 2015 10 24
Greg Makowski
 
How to Create 80% of a Big Data Pilot Project
Greg Makowski
 
Powering Real­time Decision Engines in Finance and Healthcare using Open Sour...
Greg Makowski
 
Kamanja: Driving Business Value through Real-Time Decisioning Solutions
Greg Makowski
 
Heuristic design of experiments w meta gradient search
Greg Makowski
 
Three case studies deploying cluster analysis
Greg Makowski
 
Linked In Slides 2009 02 24 B
Greg Makowski
 
The 360º Leader (Section 2 of 6)
Greg Makowski
 
The 360º Leader (Section 1 of 6)
Greg Makowski
 
Ad

Recently uploaded (20)

PDF
717629748-Databricks-Certified-Data-Engineer-Professional-Dumps-by-Ball-21-03...
pedelli41
 
PPTX
The whitetiger novel review for collegeassignment.pptx
DhruvPatel754154
 
PPTX
Introduction to Data Analytics and Data Science
KavithaCIT
 
PDF
Blitz Campinas - Dia 24 de maio - Piettro.pdf
fabigreek
 
PPTX
7 Easy Ways to Improve Clarity in Your BI Reports
sophiegracewriter
 
PPTX
Future_of_AI_Presentation for everyone.pptx
boranamanju07
 
PPT
Real Life Application of Set theory, Relations and Functions
manavparmar205
 
PDF
apidays Munich 2025 - Developer Portals, API Catalogs, and Marketplaces, Miri...
apidays
 
PPTX
short term project on AI Driven Data Analytics
JMJCollegeComputerde
 
PDF
Top Civil Engineer Canada Services111111
nengineeringfirms
 
PDF
SUMMER INTERNSHIP REPORT[1] (AutoRecovered) (6) (1).pdf
pandeydiksha814
 
PPTX
Customer Segmentation: Seeing the Trees and the Forest Simultaneously
Sione Palu
 
PDF
apidays Munich 2025 - Making Sense of AI-Ready APIs in a Buzzword World, Andr...
apidays
 
PPT
From Vision to Reality: The Digital India Revolution
Harsh Bharvadiya
 
PPTX
World-population.pptx fire bunberbpeople
umutunsalnsl4402
 
PDF
An Uncut Conversation With Grok | PDF Document
Mike Hydes
 
PPTX
Pipeline Automatic Leak Detection for Water Distribution Systems
Sione Palu
 
PPTX
Solution+Architecture+Review+-+Sample.pptx
manuvratsingh1
 
PPTX
Data Security Breach: Immediate Action Plan
varmabhuvan266
 
PPTX
short term internship project on Data visualization
JMJCollegeComputerde
 
717629748-Databricks-Certified-Data-Engineer-Professional-Dumps-by-Ball-21-03...
pedelli41
 
The whitetiger novel review for collegeassignment.pptx
DhruvPatel754154
 
Introduction to Data Analytics and Data Science
KavithaCIT
 
Blitz Campinas - Dia 24 de maio - Piettro.pdf
fabigreek
 
7 Easy Ways to Improve Clarity in Your BI Reports
sophiegracewriter
 
Future_of_AI_Presentation for everyone.pptx
boranamanju07
 
Real Life Application of Set theory, Relations and Functions
manavparmar205
 
apidays Munich 2025 - Developer Portals, API Catalogs, and Marketplaces, Miri...
apidays
 
short term project on AI Driven Data Analytics
JMJCollegeComputerde
 
Top Civil Engineer Canada Services111111
nengineeringfirms
 
SUMMER INTERNSHIP REPORT[1] (AutoRecovered) (6) (1).pdf
pandeydiksha814
 
Customer Segmentation: Seeing the Trees and the Forest Simultaneously
Sione Palu
 
apidays Munich 2025 - Making Sense of AI-Ready APIs in a Buzzword World, Andr...
apidays
 
From Vision to Reality: The Digital India Revolution
Harsh Bharvadiya
 
World-population.pptx fire bunberbpeople
umutunsalnsl4402
 
An Uncut Conversation With Grok | PDF Document
Mike Hydes
 
Pipeline Automatic Leak Detection for Water Distribution Systems
Sione Palu
 
Solution+Architecture+Review+-+Sample.pptx
manuvratsingh1
 
Data Security Breach: Immediate Action Plan
varmabhuvan266
 
short term internship project on Data visualization
JMJCollegeComputerde
 
Ad

A Successful Hiring Process for Data Scientists

  • 1. KDD 2022, W26 Half Day Workshop – Tuesday August 16 3rd IADSS Workshop on Data Science Standards – Hiring, Assessing and Upskilling Data Science Talent IADSS link https://www.iadss.org/ Workshop link https://www.iadss.org/kdd2022 One Hiring Process – Quiz for Learning and Reasoning Greg Makowski Head of Data Science Solutions [email protected] www.LinkedIn.com/in/GregMakowski Building Technologies & Solutions
  • 2. 2 © Copyright Johnson Controls. All rights reserved. Johnson Controls—Public. Any unauthorized use, copying or distribution is strictly prohibited. Johnson Controls Confidential Context  My background • I have been deploying data mining and data science models since 1992 • I have been hiring and building DS teams since 2010  Assumptions driving requirements and design • You can have candidates describe their past models deployed and development process. That is not hard. – However, that may not be a close match for upcoming work • The types of future algorithms and architectures will likely be constantly changing, between the project requirements and what is developing as State Of The Art (SOTA) • Therefore, it would be good to get a sense of how a candidate would do on future, upcoming projects and new algorithms • There is a large variety of new things to learn, of new algorithms or tweaks at a conference – but you shouldn’t expect the candidate to know all •  Expect that constant learning is needed in the future •  interview for the candidates learning process
  • 3. 3 © Copyright Johnson Controls. All rights reserved. Johnson Controls—Public. Any unauthorized use, copying or distribution is strictly prohibited. Johnson Controls Confidential Context – other parts of the interview process • Job description – looking for a person that has ”deployed 3+ models”, like a ”medical residency”. 3-12 yrs • Other parts of the interview process already cover common basics – basic coding or software development skills – algorithm skills on algorithms they know well or have deployed • Treat the interview process as a “dating game” to get a good match for the candidate as well as the employer – What the company has to offer, existing enterprise software, tools, past projects – Career path in Data Science – “A day in the life of”, a “week in the life of” the candidate in the role – Personal responsibility, ownership, how the they can contribute to the employers OKR’s (Objectives and Key Results) – Intellectual variety – vertical markets, customer personas, algorithm families – Opportunities to intellectually grow, learn and adapt • In final round, have candidate present on past project for 45 min, which includes Q&A – Focus on their DS “value add”, avoid anything confidential to past client or co – Like an internal DS presentation or to DS contacts at a client site, to make it easier – Can be a reused conference or meetup presentation – A chance for all the DS staff in a compatible time zone to understand the past project and ask questions Exciting Mission • Save the world • Save people • Save machines
  • 4. 4 © Copyright Johnson Controls. All rights reserved. Johnson Controls—Public. Any unauthorized use, copying or distribution is strictly prohibited. Johnson Controls Confidential Design of one part of the DS phone interview process  ”Teach” for 15 minutes, going over content in slides • Let the candidate ask questions, it is fine to ask to go back and forth in the slides • Give a problem context, a client role in a vertical domain, their values and KPIs • Pick something that goes deep into an algorithm, not at the “surface level” of a Python function call, but on how the algorithm interacts with the data • Every few slides, ask if they understand or have questions • Describe in terms of different modalities (math, geometry, process)  “ask reasoning questions” that may take 15-30 min
  • 5. 5 © Copyright Johnson Controls. All rights reserved. Johnson Controls—Public. Any unauthorized use, copying or distribution is strictly prohibited. Johnson Controls Confidential Design of one part of the DS phone interview process  “ask reasoning questions” that may take 15-30 min • One question is more algorithmic or geometrical • One question is more about “fiduciary responsibility to the values and KPIs of the client”, for the DS to think as a consultant or in the business deployment terms • Tell the candidate they can ask to go back and forth over any of the training slides • They can ask questions of the client contact. Then roleplay as a client (not knowing DS). • Track the time it takes per question, as one metric to compare candidates. It is mostly used for comparisons of large differences • For any ”binary” answers, the candidate has to explain “why”. Ask probing questions • Ask the candidate to declare a “final answer state”. Don’t stop them if you hear something close. They should be confident and describe their reasoning – When deploying in the future, they have to be confident to know their “final answer state”
  • 6. 6 © Copyright Johnson Controls. All rights reserved. Johnson Controls—Public. Any unauthorized use, copying or distribution is strictly prohibited. Johnson Controls Confidential Design of one part of the “on-site” interview process  Go over a short 2 week POC project in slides (something I know well and worked on). Provide background slides w text and diagrams for 15 min • Client requirements • Data, it’s noise, what it represents in terms of a physical process or human decisions, sample tables of different structures • Initial EDA that you could take for granted that most people would do • Offer to “role play” or be a “holodeck simulator” to answer questions – Of the client, who knows objectives and data at a business process level – Of the data, for a specified EDA, program or model, get a high level answer – A fellow DS on the team, who doesn’t know the problem – may trigger questions – Can go back and forth to any of the background slides • Questions for ~30 min: – How to structure the data to fit some algorithm family (very involved for this project) – How to train the models, model development process (less time, maybe 10 min) – Once you have an algorithm result, so what? What business process do you change to get value or a business KPI metric? How to estimate the value? (not obvious for this project)
  • 7. 7 © Copyright Johnson Controls. All rights reserved. Johnson Controls—Public. Any unauthorized use, copying or distribution is strictly prohibited. Johnson Controls Confidential When designing questions, consider:  Pick an algorithm or algorithm feature that is intentionally obscure • Want to see a candidate learning and adapting experience, how they ask questions • Want to avoid 1 in 10 candidates who already know the answer – then you can’t compare how they “would learn X the first time”  In the age of remote interviews, a person can web search easily • Check common web searches in advance, what they return and how that could be used. Consider how you may tweak your question or setup • Maybe don’t use an algorithm or feature name, just call it “alg X”
  • 8. 8 © Copyright Johnson Controls. All rights reserved. Johnson Controls—Public. Any unauthorized use, copying or distribution is strictly prohibited. Johnson Controls Confidential Questions, discussions, alternate approaches, advice?  Since 2010, candidates hired this way have been good at • adapting to new projects and requirements • No retention surprises or mis-hires