SlideShare a Scribd company logo
3
Most read
4
Most read
10
Most read
Hello…
Welcome
To the Talk on
Data Science Applications and Use cases
Agenda…
• What is Data Science?
• Big Data Challenges
• Data Science vs Software Engineering
• Data Science Applications & Use cases
• Conclusion
What is Data Science?
Data Science is the science which uses computer science, statistics and machine
learning, visualization and human-computer interactions to collect, clean, integrate,
analyze, visualize, interact with data to create data products.
“Using data to make better decisions, optimize processes and improve products
and services.”
“What distinguishes data science itself from the tools and techniques
is the central goal of deploying effective decision-making models to a
production environment. “
– John Mount & Nina Zumel, Practical Data Science with R
Big Data Challenges
• Dealing with Data Growth
• Generating insights in a timely manner
• Integrating disparate data sources
• Validating Data
• Securing Bigdata
• Organizational resistance
Data science applications and usecases
‘Data science’ is “Data-Driven Decision” making, to help the business to
make good choices, whereas software engineering is the methodology
for software product development without any confusions about the
requirements.
Data Science vs Software Engineering
Data Science Competence Groups - Research
Data Science Competence includes 5
areas/groups
• Data Analytics
• Data Science Engineering
• Domain Expertise
• Data Management
• Scientific Methods (or Business Process
Management)
Scientific Methods
• Design Experiment
• Collect Data
• Analyse Data
• Identify Patterns
• Hypothesise Explanation
• Test Hypothesis
Business Operations
• Operations Strategy
• Plan
• Design & Deploy
• Monitor & Control
• Improve & Re-design
Data Science Competence includes 5
areas/groups
• Data Analytics
• Data Science Engineering
• Domain Expertise
• Data Management
• Scientific Methods (or Business Process
Management)
Scientific Methods
• Design Experiment
• Collect Data
• Analyse Data
• Identify Patterns
• Hypothesise Explanation
• Test Hypothesis
Business Process
Operations/Stages
• Design
• Model/Plan
• Deploy & Execute
• Monitor & Control
• Optimise & Re-design
Data Science Competences Groups – Business
Design
Modelling
Execution
Monitoring
Optimisation
RESEARCH
DATA
ANALYTICS
ALGORITHMSANALYTIC
SYSTEMS
ENGINEERING
COMPETENCES
DOMAIN
EXPERTISE DATA
SCIENCE
Data
Management
Scientific
Methods
Business Process
Management
Data Science Applications & Use cases
• RECOMMENDER SYSTEMS
• CREDIT SCORING
• DYNAMIC PRICING
• CUSTOMER CHURN
• FRAUD DETECTION
RECOMMENDER SYSTEMS
WHAT IS A RECOMMENDER SYSTEM?
A model that filters information to present users with a curated subset
of options they’re likely to find appealing
HOW DOES IT WORK?
Generally via a collaborative approach (considering user’s previous
behavior) or content-based approach (based on discrete assigned
characteristics)
WHAT IS A REAL USE CASE?
Tendril uses recommendation models to match eligible customers with
new or existing energy products
CREDIT SCORING
WHAT IS CREDIT SCORING?
A model that determines an applicant’s creditworthiness for a mortgage,
loan or credit card
HOW DOES IT WORK?
A set of decision management rules evaluates how likely an applicant is to
repay debts
WHAT IS A REAL USE CASE?
Ferratum Bank uses machine learning models to reach prospective
customers that may have been overlooked by traditional banking
institutions
DYNAMIC PRICING
WHAT IS DYNAMIC PRICING?
Modeling price as a function of supply, demand, competitor pricing and
exogenous factors
HOW DOES IT WORK?
Generalized linear models and classification trees are popular
techniques for estimating the “right” price to maximize expected
revenue.
WHAT IS A REAL USE CASE?
Turo uses dynamic pricing models to suggest prices to the people who
list and rent out cars
CUSTOMER CHURN
WHAT IS CUSTOMER CHURN?
Predicting which customers are going to abandon a product or service
HOW DOES IT WORK?
Data scientists may consider using support vector machines, random
forest or k-nearest-neighbors algorithms
WHAT IS A REAL USE CASE?
EAB combines data from transcripts, standardized test scores,
demographics and more to identify students at risk of not graduating.
FRAUD DETECTION
WHAT IS FRAUD DETECTION?
Detecting and preventing fraudulent financial transactions from being
processed
HOW DOES IT WORK?
Fraud detection is a binary classification problem: “is this transaction
legitimate or not?”
WHAT IS A REAL USE CASE?
Via SMS Group uses a combination of complex data lookups and
decision algorithms written in R and implemented in PHP to assess
whether a loan applicant is fraudulent
Works Cited
• https://www.yhat.com/whitepapers/data-science-in-practice
• http://wikibon.org/blog/role-of-the-data-scientist/
• https://www.cyfronet.krakow.pl/cgw16/presentations/S8_02_present
ation-Edison-CGW-26-10-2016.pdf
Thank You
Sreenatha Reddy K R
krsreenatha@gmail.com
https://in.linkedin.com/in/sreenathaa

More Related Content

What's hot (20)

PDF
Data Visualization in Data Science
Maloy Manna, PMP®
 
PDF
Introduction to data analytics
SSaudia
 
PDF
Introduction to data science
Tharushi Ruwandika
 
DOCX
Big data lecture notes
Mohit Saini
 
PPTX
Data Science
Amit Singh
 
PPTX
Introduction to Data Mining
DataminingTools Inc
 
PPTX
1. Data Analytics-introduction
krishna singh
 
PDF
Data science presentation
MSDEVMTL
 
PPTX
Data cleansing
kunaljain1701
 
PPTX
Data science life cycle
Manoj Mishra
 
PDF
Data visualization in Python
Marc Garcia
 
PPTX
Data science
SwapnilDahake2
 
PPTX
Decision Trees
Student
 
PPTX
Data Science Training | Data Science For Beginners | Data Science With Python...
Simplilearn
 
PPTX
Introduction to ML (Machine Learning)
SwatiTripathi44
 
PPT
Clustering
M Rizwan Aqeel
 
PPTX
03. Data Exploration.pptx
Sarojkumari55
 
PPTX
Data Science
Prakhyath Rai
 
PDF
Exploring the Data science Process
Vishal Patel
 
PPTX
Statistics for data science
zekeLabs Technologies
 
Data Visualization in Data Science
Maloy Manna, PMP®
 
Introduction to data analytics
SSaudia
 
Introduction to data science
Tharushi Ruwandika
 
Big data lecture notes
Mohit Saini
 
Data Science
Amit Singh
 
Introduction to Data Mining
DataminingTools Inc
 
1. Data Analytics-introduction
krishna singh
 
Data science presentation
MSDEVMTL
 
Data cleansing
kunaljain1701
 
Data science life cycle
Manoj Mishra
 
Data visualization in Python
Marc Garcia
 
Data science
SwapnilDahake2
 
Decision Trees
Student
 
Data Science Training | Data Science For Beginners | Data Science With Python...
Simplilearn
 
Introduction to ML (Machine Learning)
SwatiTripathi44
 
Clustering
M Rizwan Aqeel
 
03. Data Exploration.pptx
Sarojkumari55
 
Data Science
Prakhyath Rai
 
Exploring the Data science Process
Vishal Patel
 
Statistics for data science
zekeLabs Technologies
 

Similar to Data science applications and usecases (20)

PPTX
Business Analytics Unit III: Developing analytical talent
Rani Channamma University, Sangolli Rayanna First Grade Constituent College, Belagavi
 
PPTX
Lecture 1.13 & 1.14 &1.15_Business Profiles in Big Data.pptx
RATISHKUMAR32
 
PDF
Data Science: Unlocking Insights and Transforming Industries
Institute
 
PPTX
Big Data Courses In Mumbai
faizrashid1995
 
PPTX
Data Science Training in Chandigarh h
asmeerana605
 
PPTX
INTRODUCTION TO DATA SCIENCE -CONCEPTS.pptx
Madhumitha N
 
PPTX
The Power of Data Science by DICS INNOVATIVE.pptx
gs5545791
 
PDF
DataScience_introduction.pdf
SouravBiswas747273
 
PPTX
Data Science PPT _basics of data science.pptx
KuldeepSinghBrar3
 
PDF
Dr. Chadd Winterburg’s Impact on Modern Analytics
chaddwinterburg
 
PPTX
Data science in business Administration Nagarajan.pptx
NagarajanG35
 
PDF
Top 10 areas of expertise in data science
GlobalTechCouncil
 
PPTX
Intoduction to Data Science By Sulav Acharya
achsulav100
 
PDF
Building successful data science teams
Venkatesh Umaashankar
 
PPTX
intro to data science Clustering and visualization of data science subfields ...
jybufgofasfbkpoovh
 
PPTX
Unit 1-FDS. .pptx
kavalishiva33
 
PPTX
Data science and business analytics
Inbavalli Valli
 
PPTX
Impact of Data Science
kumari36
 
PDF
Guide for a Data Scientist
Rohit Dubey
 
PDF
Introduction to data science.pdf-Definition,types and application of Data Sci...
DrSumathyV
 
Business Analytics Unit III: Developing analytical talent
Rani Channamma University, Sangolli Rayanna First Grade Constituent College, Belagavi
 
Lecture 1.13 & 1.14 &1.15_Business Profiles in Big Data.pptx
RATISHKUMAR32
 
Data Science: Unlocking Insights and Transforming Industries
Institute
 
Big Data Courses In Mumbai
faizrashid1995
 
Data Science Training in Chandigarh h
asmeerana605
 
INTRODUCTION TO DATA SCIENCE -CONCEPTS.pptx
Madhumitha N
 
The Power of Data Science by DICS INNOVATIVE.pptx
gs5545791
 
DataScience_introduction.pdf
SouravBiswas747273
 
Data Science PPT _basics of data science.pptx
KuldeepSinghBrar3
 
Dr. Chadd Winterburg’s Impact on Modern Analytics
chaddwinterburg
 
Data science in business Administration Nagarajan.pptx
NagarajanG35
 
Top 10 areas of expertise in data science
GlobalTechCouncil
 
Intoduction to Data Science By Sulav Acharya
achsulav100
 
Building successful data science teams
Venkatesh Umaashankar
 
intro to data science Clustering and visualization of data science subfields ...
jybufgofasfbkpoovh
 
Unit 1-FDS. .pptx
kavalishiva33
 
Data science and business analytics
Inbavalli Valli
 
Impact of Data Science
kumari36
 
Guide for a Data Scientist
Rohit Dubey
 
Introduction to data science.pdf-Definition,types and application of Data Sci...
DrSumathyV
 
Ad

More from Sreenatha Reddy K R (10)

PPT
Linux security firewall and SELinux
Sreenatha Reddy K R
 
PPT
Mail server setup
Sreenatha Reddy K R
 
PPT
Linux System Administration - Web Server and squid setup
Sreenatha Reddy K R
 
PPTX
Linux System Administration - NFS Server
Sreenatha Reddy K R
 
PPTX
Linux System Administration - DNS
Sreenatha Reddy K R
 
PPTX
DHCP and NIS
Sreenatha Reddy K R
 
PPT
Linux commands and file structure
Sreenatha Reddy K R
 
PPTX
Linux booting process - Linux System Administration
Sreenatha Reddy K R
 
PPTX
Introduction to tcp ip linux networking
Sreenatha Reddy K R
 
PPTX
Access control list acl - permissions in linux
Sreenatha Reddy K R
 
Linux security firewall and SELinux
Sreenatha Reddy K R
 
Mail server setup
Sreenatha Reddy K R
 
Linux System Administration - Web Server and squid setup
Sreenatha Reddy K R
 
Linux System Administration - NFS Server
Sreenatha Reddy K R
 
Linux System Administration - DNS
Sreenatha Reddy K R
 
DHCP and NIS
Sreenatha Reddy K R
 
Linux commands and file structure
Sreenatha Reddy K R
 
Linux booting process - Linux System Administration
Sreenatha Reddy K R
 
Introduction to tcp ip linux networking
Sreenatha Reddy K R
 
Access control list acl - permissions in linux
Sreenatha Reddy K R
 
Ad

Recently uploaded (20)

PDF
apidays Singapore 2025 - The API Playbook for AI by Shin Wee Chuang (PAND AI)
apidays
 
PPTX
apidays Munich 2025 - Building Telco-Aware Apps with Open Gateway APIs, Subhr...
apidays
 
PPTX
apidays Helsinki & North 2025 - From Chaos to Clarity: Designing (AI-Ready) A...
apidays
 
PDF
OPPOTUS - Malaysias on Malaysia 1Q2025.pdf
Oppotus
 
PPTX
apidays Singapore 2025 - The Quest for the Greenest LLM , Jean Philippe Ehre...
apidays
 
PPTX
apidays Helsinki & North 2025 - API access control strategies beyond JWT bear...
apidays
 
PPT
Growth of Public Expendituuure_55423.ppt
NavyaDeora
 
PDF
Avatar for apidays apidays PRO June 07, 2025 0 5 apidays Helsinki & North 2...
apidays
 
PPTX
apidays Munich 2025 - Building an AWS Serverless Application with Terraform, ...
apidays
 
PPT
tuberculosiship-2106031cyyfuftufufufivifviviv
AkshaiRam
 
PPTX
SlideEgg_501298-Agentic AI.pptx agentic ai
530BYManoj
 
PPTX
Aict presentation on dpplppp sjdhfh.pptx
vabaso5932
 
PDF
apidays Singapore 2025 - How APIs can make - or break - trust in your AI by S...
apidays
 
PPTX
Exploring Multilingual Embeddings for Italian Semantic Search: A Pretrained a...
Sease
 
PDF
apidays Helsinki & North 2025 - API-Powered Journeys: Mobility in an API-Driv...
apidays
 
PDF
apidays Helsinki & North 2025 - Monetizing AI APIs: The New API Economy, Alla...
apidays
 
PDF
Driving Employee Engagement in a Hybrid World.pdf
Mia scott
 
PPTX
apidays Helsinki & North 2025 - Agentic AI: A Friend or Foe?, Merja Kajava (A...
apidays
 
PPTX
ER_Model_with_Diagrams_Presentation.pptx
dharaadhvaryu1992
 
PPTX
Module-5-Measures-of-Central-Tendency-Grouped-Data-1.pptx
lacsonjhoma0407
 
apidays Singapore 2025 - The API Playbook for AI by Shin Wee Chuang (PAND AI)
apidays
 
apidays Munich 2025 - Building Telco-Aware Apps with Open Gateway APIs, Subhr...
apidays
 
apidays Helsinki & North 2025 - From Chaos to Clarity: Designing (AI-Ready) A...
apidays
 
OPPOTUS - Malaysias on Malaysia 1Q2025.pdf
Oppotus
 
apidays Singapore 2025 - The Quest for the Greenest LLM , Jean Philippe Ehre...
apidays
 
apidays Helsinki & North 2025 - API access control strategies beyond JWT bear...
apidays
 
Growth of Public Expendituuure_55423.ppt
NavyaDeora
 
Avatar for apidays apidays PRO June 07, 2025 0 5 apidays Helsinki & North 2...
apidays
 
apidays Munich 2025 - Building an AWS Serverless Application with Terraform, ...
apidays
 
tuberculosiship-2106031cyyfuftufufufivifviviv
AkshaiRam
 
SlideEgg_501298-Agentic AI.pptx agentic ai
530BYManoj
 
Aict presentation on dpplppp sjdhfh.pptx
vabaso5932
 
apidays Singapore 2025 - How APIs can make - or break - trust in your AI by S...
apidays
 
Exploring Multilingual Embeddings for Italian Semantic Search: A Pretrained a...
Sease
 
apidays Helsinki & North 2025 - API-Powered Journeys: Mobility in an API-Driv...
apidays
 
apidays Helsinki & North 2025 - Monetizing AI APIs: The New API Economy, Alla...
apidays
 
Driving Employee Engagement in a Hybrid World.pdf
Mia scott
 
apidays Helsinki & North 2025 - Agentic AI: A Friend or Foe?, Merja Kajava (A...
apidays
 
ER_Model_with_Diagrams_Presentation.pptx
dharaadhvaryu1992
 
Module-5-Measures-of-Central-Tendency-Grouped-Data-1.pptx
lacsonjhoma0407
 

Data science applications and usecases

  • 1. Hello… Welcome To the Talk on Data Science Applications and Use cases
  • 2. Agenda… • What is Data Science? • Big Data Challenges • Data Science vs Software Engineering • Data Science Applications & Use cases • Conclusion
  • 3. What is Data Science? Data Science is the science which uses computer science, statistics and machine learning, visualization and human-computer interactions to collect, clean, integrate, analyze, visualize, interact with data to create data products. “Using data to make better decisions, optimize processes and improve products and services.” “What distinguishes data science itself from the tools and techniques is the central goal of deploying effective decision-making models to a production environment. “ – John Mount & Nina Zumel, Practical Data Science with R
  • 4. Big Data Challenges • Dealing with Data Growth • Generating insights in a timely manner • Integrating disparate data sources • Validating Data • Securing Bigdata • Organizational resistance
  • 6. ‘Data science’ is “Data-Driven Decision” making, to help the business to make good choices, whereas software engineering is the methodology for software product development without any confusions about the requirements. Data Science vs Software Engineering
  • 7. Data Science Competence Groups - Research Data Science Competence includes 5 areas/groups • Data Analytics • Data Science Engineering • Domain Expertise • Data Management • Scientific Methods (or Business Process Management) Scientific Methods • Design Experiment • Collect Data • Analyse Data • Identify Patterns • Hypothesise Explanation • Test Hypothesis Business Operations • Operations Strategy • Plan • Design & Deploy • Monitor & Control • Improve & Re-design
  • 8. Data Science Competence includes 5 areas/groups • Data Analytics • Data Science Engineering • Domain Expertise • Data Management • Scientific Methods (or Business Process Management) Scientific Methods • Design Experiment • Collect Data • Analyse Data • Identify Patterns • Hypothesise Explanation • Test Hypothesis Business Process Operations/Stages • Design • Model/Plan • Deploy & Execute • Monitor & Control • Optimise & Re-design Data Science Competences Groups – Business Design Modelling Execution Monitoring Optimisation RESEARCH DATA ANALYTICS ALGORITHMSANALYTIC SYSTEMS ENGINEERING COMPETENCES DOMAIN EXPERTISE DATA SCIENCE Data Management Scientific Methods Business Process Management
  • 9. Data Science Applications & Use cases • RECOMMENDER SYSTEMS • CREDIT SCORING • DYNAMIC PRICING • CUSTOMER CHURN • FRAUD DETECTION
  • 10. RECOMMENDER SYSTEMS WHAT IS A RECOMMENDER SYSTEM? A model that filters information to present users with a curated subset of options they’re likely to find appealing HOW DOES IT WORK? Generally via a collaborative approach (considering user’s previous behavior) or content-based approach (based on discrete assigned characteristics) WHAT IS A REAL USE CASE? Tendril uses recommendation models to match eligible customers with new or existing energy products
  • 11. CREDIT SCORING WHAT IS CREDIT SCORING? A model that determines an applicant’s creditworthiness for a mortgage, loan or credit card HOW DOES IT WORK? A set of decision management rules evaluates how likely an applicant is to repay debts WHAT IS A REAL USE CASE? Ferratum Bank uses machine learning models to reach prospective customers that may have been overlooked by traditional banking institutions
  • 12. DYNAMIC PRICING WHAT IS DYNAMIC PRICING? Modeling price as a function of supply, demand, competitor pricing and exogenous factors HOW DOES IT WORK? Generalized linear models and classification trees are popular techniques for estimating the “right” price to maximize expected revenue. WHAT IS A REAL USE CASE? Turo uses dynamic pricing models to suggest prices to the people who list and rent out cars
  • 13. CUSTOMER CHURN WHAT IS CUSTOMER CHURN? Predicting which customers are going to abandon a product or service HOW DOES IT WORK? Data scientists may consider using support vector machines, random forest or k-nearest-neighbors algorithms WHAT IS A REAL USE CASE? EAB combines data from transcripts, standardized test scores, demographics and more to identify students at risk of not graduating.
  • 14. FRAUD DETECTION WHAT IS FRAUD DETECTION? Detecting and preventing fraudulent financial transactions from being processed HOW DOES IT WORK? Fraud detection is a binary classification problem: “is this transaction legitimate or not?” WHAT IS A REAL USE CASE? Via SMS Group uses a combination of complex data lookups and decision algorithms written in R and implemented in PHP to assess whether a loan applicant is fraudulent
  • 15. Works Cited • https://www.yhat.com/whitepapers/data-science-in-practice • http://wikibon.org/blog/role-of-the-data-scientist/ • https://www.cyfronet.krakow.pl/cgw16/presentations/S8_02_present ation-Edison-CGW-26-10-2016.pdf
  • 16. Thank You Sreenatha Reddy K R [email protected] https://in.linkedin.com/in/sreenathaa

Editor's Notes

  • #14: Churn rate describes the rate at which customers abandon a product or service. Understanding customers’ likelihood to churn is particularly important for subscription-based models, everything ranging from traditional cable or gym memberships to recently popularized monthly subscription boxes.