SlideShare a Scribd company logo
DATA MODELING FOR
SOFTWARE ENGINEERS
G E T T I N G Y O U R E N G I N E E R I N G L E A D E R S T O S M I L E A G A I N
G E T T I N G Y O U R D A T A L E A D E R S T O S M I L E A G A I N
G E T T I N G Y O U R A R C H I T E C T S T O S M I L E A G A I N
G E T T I N G Y O U R P R O D U C T L E A D E R S T O S M I L E A G A I N
G E T T I N G Y O U R O R G A N I Z A T I O N T O S M I L E A G A I N
© 2025 Scott C Sosna scott-sosna
https://www.ibm.com/think/topics/data-modeling
Data Architect
Product
Software Engineer
Leadership
Data Governance
Legal
Database Admin
Data Modelers
Data Modelling For Software Engineers (Full).key.pdf
7
What Possible
Could Go Wrong?
Understanding

Completeness

Quality

Usability

No time to de
fi
ne end-to-end
data vision, get out of my way
and let me start coding!
S O F T W A R E E N G I N E E R X
“
9
10
DATA IS FOREVER
…but your software is not…
11
12
Architect
Enterprise, Solution, Application, Integration, Data
Who Am I?
Life-long technologist with professional experience
across multiple business domains, tech stacks, and
responsibilities.
Engineer
Most recent position returned me to hands-on
software engineering after long hiatus.
Speaker/Writer/Mentor
Enjoy sharing experiences, insights, expertise, war
stories of my career, both good and bad.
Traveller
Too many places on my bucket list!
C o n t a c t I n f o
me@scottsosna.dev
LinkedIn: /in/scott-sosna
DZone: /authors/scsosna
13
14 TODAY’S GOALS
Becoming Data First
Driven top-down not bottom-up, requires senior leaders’
buy-in, replaces existing processes, changes culture.
De
fi
ning Career Path
Impacted by business goals, role diversity, personal
goals, no simple answer for any one person.
Data Modeling Tutorial
Industry, technology, non-functional requirements, culture
impact organizational approach.
Data Modeling 101
Design/implementation decisions impact your solution’s
overall viability … and your organization’s success.
Decouple, Please!
How outsiders consume your data may not match its
internal representation …. and that’s not necessarily bad.
Wrap Up
Other considerations, final thoughts, Q&A.
15
16
Definitions How is data used in your solutions?
17
CATEGORIZING DATA
Some obvious, some not
Messaging
APIs
Persisted Hardware Generated
18
HOW IS DATA USED?
Application
Data maintained and shared in your solution supports day-to-day business activity.
Reporting
Aggregated data provides view of state of business and identifies new opportunities.
Integration
You and outside providers exchange data for mutually beneficial reasons.
Intellectual Property
Your data differentiates you from competitors and is your company’s raison d’état.
Data is like garbage, you’d
better know what you are
going to do with it before you
collect it.
M A R K T W A I N ( M A Y B E ? )
“
20
Data Modeling 101
Data Modeling screams waterfall: data
models always preceded code. Not today.
Today, code changes to implemented data
structures in whatever form is modeling
data: persisted, published, exchanged,
cached, measured.
21
Need To Know
Key points to realize before you start….
Never Means Maybe
Certainty is fleeting when business requirements change.
All Changes Impactful
No change inconsequential, everything visible to everyone.
What Code Comments?
No IDE to open, no inline comments to review.
Formal Documentation
Few created, fewer reviewed, none accurate.
22
CONSISTENCY
Naming
Data naming more important than code naming: name visible to everyone.
Data Type
Choose a data type and stick with it. Epoch or structured date/time for timestamps? String, number,
native for booleans? Decimal points on a percent? Enums to represent closed set of values?
Structure
Consistency reduces cognitive complexity, increases productivity, reduces bugs.
Validation
Validate once, validate always: database constraints, data type limitations, common code libraries.
23
The good thing about
standards is that there are so
many to choose from.
A N D R E W T A N E N B A U M
“
25
Standards
Rarely need to create your own, so don’t!
International
Industry
Corporate
Business
De facto
26
Inspiration
Don’t recreate the wheel if you don’t need to.
https://github.com/Microsoft/CDM/blob/master/docs/CDMPoster_a3.pdf
27
Don’t …
Overuse Optionality
Identify By Business Keys
Codify ALL or NONE
Inject Compliance Problems
Store Localized Data
Don’t Repeat Yourself
When in doubt, talk it out.
S C O T T S O S N A
“
29
30
Decouple, Please!
Consumers whom know - or think they know - your data often implement in ways that
make it difficult to evolve your data later.
Keep Data Behind Closed Doors
The ability to evolve your data
implementation is inversely
proportional to outsiders’ knowledge of
current state.
S C O T T S O S N A
“
32
KEEP YOUR SECRETS
Again, some obvious, some not.
Structure
Identity Validation
Flow
33
34
INTERNAL
EXTERNAL
BALANCING ACT
Are internal and external consumers handled differently?

35
Wrap Up Other considerations, final thoughts, Q&A.
https://www.ibm.com/think/topics/data-modeling
https://www.erwin.com/solutions/data-modeling/benefits-of-data-modeling.aspx
38
FINAL THOUGHTS
Understand your organization
Business domain, target audience, core competencies, processes, definition of success
Know Your Platform
Leverage capabilities, improve maintainability, reduce costs.
Accept Problem Statement, Not Solution
You own the tech domain, someone’s proposed solution will become your future problem.
Anticipate Future But Implement Selectively
Listen to functionality discussions carefully to anticipate and (perhaps) future-proof your data design.
Data Modelling For Software Engineers (Full).key.pdf

More Related Content

PDF
Data Modelling For Software Engineers (Poland).pdf
Scott Sosna
 
PDF
Data Modelling For Software Engineers (Devoxx GR 2025).pdf
Scott Sosna
 
PDF
Demystifying ML/AI
Matthew Reynolds
 
PDF
Architecting a Data Platform For Enterprise Use (Strata NY 2018)
mark madsen
 
PDF
The coding portion of Data Science
Institute of Contemporary Sciences
 
PDF
Analytics-Enabled Experiences: The New Secret Weapon
Databricks
 
PDF
From Data to Decisions_ A Complete Guide for New-Age Data Scientists.pdf
khushnuma khan
 
PPTX
Why do most machine learning projects never make it to production
Cameron Vetter
 
Data Modelling For Software Engineers (Poland).pdf
Scott Sosna
 
Data Modelling For Software Engineers (Devoxx GR 2025).pdf
Scott Sosna
 
Demystifying ML/AI
Matthew Reynolds
 
Architecting a Data Platform For Enterprise Use (Strata NY 2018)
mark madsen
 
The coding portion of Data Science
Institute of Contemporary Sciences
 
Analytics-Enabled Experiences: The New Secret Weapon
Databricks
 
From Data to Decisions_ A Complete Guide for New-Age Data Scientists.pdf
khushnuma khan
 
Why do most machine learning projects never make it to production
Cameron Vetter
 

Similar to Data Modelling For Software Engineers (Full).key.pdf (20)

PPTX
Open Web Technologies and You - Durham College Student Integration Presentation
darryl_lehmann
 
PDF
10 Things Competencies
jothisekaran
 
PDF
Building digital product masters to prevail in the age of accelerations parts...
Jeffrey Stewart
 
PDF
Putting data science in your business a first utility feedback
Peculium Crypto
 
PPTX
MeasureCamp Belgrade 2025 - Yasen Lilov - Past - Present - Prompt
Yasen Lilov
 
PDF
Data and data scientists are not equal to money david hoyle
Institute of Contemporary Sciences
 
PDF
Forget the A to Z of why it projects fail, here’s the S to L of successful!
Stoneseed Ltd
 
PDF
Winning the right to deploy AI: Dedication to craft, designing the right expe...
JoshuaM27
 
PDF
Technophile CTOs of the Year 2022.pdf
InsightsSuccess4
 
PDF
Agile & Data Modeling – How Can They Work Together?
DATAVERSITY
 
PDF
Salesforce Architect Group, Frederick, United States July 2023 - Generative A...
NadinaLisbon1
 
PDF
Top Takeaways from Validate 2019
ObservePoint
 
PDF
Harnessing Data Growth
Embarcadero Technologies
 
PDF
Harnessing Data Growth
Michael Findling
 
PDF
Lead AI incubations as a Product manager
Debapriya Basu
 
PPTX
Future of IT preso
Lorna Garey
 
PDF
201308 Deloitte Tech Trends 2013 - Elements of Post Digital.pdf
Francisco Calzado
 
PDF
UX STRAT USA Presentation: Joe Lamantia, Bottomline Technologies
UX STRAT
 
PDF
UX STRAT 2018 | Flying Blind On a Rocket Cycle: Pioneering Experience Centere...
Joe Lamantia
 
PDF
How Semantics Simplifies Data Sharing at an Enterprise Level
Denodo
 
Open Web Technologies and You - Durham College Student Integration Presentation
darryl_lehmann
 
10 Things Competencies
jothisekaran
 
Building digital product masters to prevail in the age of accelerations parts...
Jeffrey Stewart
 
Putting data science in your business a first utility feedback
Peculium Crypto
 
MeasureCamp Belgrade 2025 - Yasen Lilov - Past - Present - Prompt
Yasen Lilov
 
Data and data scientists are not equal to money david hoyle
Institute of Contemporary Sciences
 
Forget the A to Z of why it projects fail, here’s the S to L of successful!
Stoneseed Ltd
 
Winning the right to deploy AI: Dedication to craft, designing the right expe...
JoshuaM27
 
Technophile CTOs of the Year 2022.pdf
InsightsSuccess4
 
Agile & Data Modeling – How Can They Work Together?
DATAVERSITY
 
Salesforce Architect Group, Frederick, United States July 2023 - Generative A...
NadinaLisbon1
 
Top Takeaways from Validate 2019
ObservePoint
 
Harnessing Data Growth
Embarcadero Technologies
 
Harnessing Data Growth
Michael Findling
 
Lead AI incubations as a Product manager
Debapriya Basu
 
Future of IT preso
Lorna Garey
 
201308 Deloitte Tech Trends 2013 - Elements of Post Digital.pdf
Francisco Calzado
 
UX STRAT USA Presentation: Joe Lamantia, Bottomline Technologies
UX STRAT
 
UX STRAT 2018 | Flying Blind On a Rocket Cycle: Pioneering Experience Centere...
Joe Lamantia
 
How Semantics Simplifies Data Sharing at an Enterprise Level
Denodo
 
Ad

Recently uploaded (20)

PDF
How-Cloud-Computing-Impacts-Businesses-in-2025-and-Beyond.pdf
Artjoker Software Development Company
 
PDF
The Evolution of KM Roles (Presented at Knowledge Summit Dublin 2025)
Enterprise Knowledge
 
PDF
A Day in the Life of Location Data - Turning Where into How.pdf
Precisely
 
PDF
Get More from Fiori Automation - What’s New, What Works, and What’s Next.pdf
Precisely
 
PDF
REPORT: Heating appliances market in Poland 2024
SPIUG
 
PDF
The Future of Artificial Intelligence (AI)
Mukul
 
PDF
Event Presentation Google Cloud Next Extended 2025
minhtrietgect
 
PDF
Oracle AI Vector Search- Getting Started and what's new in 2025- AIOUG Yatra ...
Sandesh Rao
 
PDF
OFFOFFBOX™ – A New Era for African Film | Startup Presentation
ambaicciwalkerbrian
 
PDF
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
Neo4j
 
PDF
A Strategic Analysis of the MVNO Wave in Emerging Markets.pdf
IPLOOK Networks
 
PDF
Advances in Ultra High Voltage (UHV) Transmission and Distribution Systems.pdf
Nabajyoti Banik
 
PDF
Trying to figure out MCP by actually building an app from scratch with open s...
Julien SIMON
 
PPTX
cloud computing vai.pptx for the project
vaibhavdobariyal79
 
PDF
Unlocking the Future- AI Agents Meet Oracle Database 23ai - AIOUG Yatra 2025.pdf
Sandesh Rao
 
PPTX
Dev Dives: Automate, test, and deploy in one place—with Unified Developer Exp...
AndreeaTom
 
PDF
Using Anchore and DefectDojo to Stand Up Your DevSecOps Function
Anchore
 
PPTX
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
PDF
Data_Analytics_vs_Data_Science_vs_BI_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
PPTX
AI and Robotics for Human Well-being.pptx
JAYMIN SUTHAR
 
How-Cloud-Computing-Impacts-Businesses-in-2025-and-Beyond.pdf
Artjoker Software Development Company
 
The Evolution of KM Roles (Presented at Knowledge Summit Dublin 2025)
Enterprise Knowledge
 
A Day in the Life of Location Data - Turning Where into How.pdf
Precisely
 
Get More from Fiori Automation - What’s New, What Works, and What’s Next.pdf
Precisely
 
REPORT: Heating appliances market in Poland 2024
SPIUG
 
The Future of Artificial Intelligence (AI)
Mukul
 
Event Presentation Google Cloud Next Extended 2025
minhtrietgect
 
Oracle AI Vector Search- Getting Started and what's new in 2025- AIOUG Yatra ...
Sandesh Rao
 
OFFOFFBOX™ – A New Era for African Film | Startup Presentation
ambaicciwalkerbrian
 
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
Neo4j
 
A Strategic Analysis of the MVNO Wave in Emerging Markets.pdf
IPLOOK Networks
 
Advances in Ultra High Voltage (UHV) Transmission and Distribution Systems.pdf
Nabajyoti Banik
 
Trying to figure out MCP by actually building an app from scratch with open s...
Julien SIMON
 
cloud computing vai.pptx for the project
vaibhavdobariyal79
 
Unlocking the Future- AI Agents Meet Oracle Database 23ai - AIOUG Yatra 2025.pdf
Sandesh Rao
 
Dev Dives: Automate, test, and deploy in one place—with Unified Developer Exp...
AndreeaTom
 
Using Anchore and DefectDojo to Stand Up Your DevSecOps Function
Anchore
 
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
Data_Analytics_vs_Data_Science_vs_BI_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
AI and Robotics for Human Well-being.pptx
JAYMIN SUTHAR
 
Ad

Data Modelling For Software Engineers (Full).key.pdf

  • 1. DATA MODELING FOR SOFTWARE ENGINEERS G E T T I N G Y O U R E N G I N E E R I N G L E A D E R S T O S M I L E A G A I N G E T T I N G Y O U R D A T A L E A D E R S T O S M I L E A G A I N G E T T I N G Y O U R A R C H I T E C T S T O S M I L E A G A I N G E T T I N G Y O U R P R O D U C T L E A D E R S T O S M I L E A G A I N G E T T I N G Y O U R O R G A N I Z A T I O N T O S M I L E A G A I N © 2025 Scott C Sosna scott-sosna
  • 3. Data Architect Product Software Engineer Leadership Data Governance Legal Database Admin Data Modelers
  • 5. 7 What Possible Could Go Wrong? Understanding  Completeness  Quality  Usability 
  • 6. No time to de fi ne end-to-end data vision, get out of my way and let me start coding! S O F T W A R E E N G I N E E R X “
  • 7. 9
  • 8. 10 DATA IS FOREVER …but your software is not…
  • 9. 11
  • 10. 12 Architect Enterprise, Solution, Application, Integration, Data Who Am I? Life-long technologist with professional experience across multiple business domains, tech stacks, and responsibilities. Engineer Most recent position returned me to hands-on software engineering after long hiatus. Speaker/Writer/Mentor Enjoy sharing experiences, insights, expertise, war stories of my career, both good and bad. Traveller Too many places on my bucket list! C o n t a c t I n f o [email protected] LinkedIn: /in/scott-sosna DZone: /authors/scsosna
  • 11. 13
  • 12. 14 TODAY’S GOALS Becoming Data First Driven top-down not bottom-up, requires senior leaders’ buy-in, replaces existing processes, changes culture. De fi ning Career Path Impacted by business goals, role diversity, personal goals, no simple answer for any one person. Data Modeling Tutorial Industry, technology, non-functional requirements, culture impact organizational approach. Data Modeling 101 Design/implementation decisions impact your solution’s overall viability … and your organization’s success. Decouple, Please! How outsiders consume your data may not match its internal representation …. and that’s not necessarily bad. Wrap Up Other considerations, final thoughts, Q&A.
  • 13. 15
  • 14. 16 Definitions How is data used in your solutions?
  • 15. 17 CATEGORIZING DATA Some obvious, some not Messaging APIs Persisted Hardware Generated
  • 16. 18 HOW IS DATA USED? Application Data maintained and shared in your solution supports day-to-day business activity. Reporting Aggregated data provides view of state of business and identifies new opportunities. Integration You and outside providers exchange data for mutually beneficial reasons. Intellectual Property Your data differentiates you from competitors and is your company’s raison d’état.
  • 17. Data is like garbage, you’d better know what you are going to do with it before you collect it. M A R K T W A I N ( M A Y B E ? ) “
  • 18. 20 Data Modeling 101 Data Modeling screams waterfall: data models always preceded code. Not today. Today, code changes to implemented data structures in whatever form is modeling data: persisted, published, exchanged, cached, measured.
  • 19. 21 Need To Know Key points to realize before you start…. Never Means Maybe Certainty is fleeting when business requirements change. All Changes Impactful No change inconsequential, everything visible to everyone. What Code Comments? No IDE to open, no inline comments to review. Formal Documentation Few created, fewer reviewed, none accurate.
  • 20. 22 CONSISTENCY Naming Data naming more important than code naming: name visible to everyone. Data Type Choose a data type and stick with it. Epoch or structured date/time for timestamps? String, number, native for booleans? Decimal points on a percent? Enums to represent closed set of values? Structure Consistency reduces cognitive complexity, increases productivity, reduces bugs. Validation Validate once, validate always: database constraints, data type limitations, common code libraries.
  • 21. 23
  • 22. The good thing about standards is that there are so many to choose from. A N D R E W T A N E N B A U M “
  • 23. 25 Standards Rarely need to create your own, so don’t! International Industry Corporate Business De facto
  • 24. 26 Inspiration Don’t recreate the wheel if you don’t need to. https://github.com/Microsoft/CDM/blob/master/docs/CDMPoster_a3.pdf
  • 25. 27 Don’t … Overuse Optionality Identify By Business Keys Codify ALL or NONE Inject Compliance Problems Store Localized Data Don’t Repeat Yourself
  • 26. When in doubt, talk it out. S C O T T S O S N A “
  • 27. 29
  • 28. 30 Decouple, Please! Consumers whom know - or think they know - your data often implement in ways that make it difficult to evolve your data later. Keep Data Behind Closed Doors
  • 29. The ability to evolve your data implementation is inversely proportional to outsiders’ knowledge of current state. S C O T T S O S N A “
  • 30. 32 KEEP YOUR SECRETS Again, some obvious, some not. Structure Identity Validation Flow
  • 31. 33
  • 32. 34 INTERNAL EXTERNAL BALANCING ACT Are internal and external consumers handled differently? 
  • 33. 35 Wrap Up Other considerations, final thoughts, Q&A.
  • 36. 38 FINAL THOUGHTS Understand your organization Business domain, target audience, core competencies, processes, definition of success Know Your Platform Leverage capabilities, improve maintainability, reduce costs. Accept Problem Statement, Not Solution You own the tech domain, someone’s proposed solution will become your future problem. Anticipate Future But Implement Selectively Listen to functionality discussions carefully to anticipate and (perhaps) future-proof your data design.