SlideShare a Scribd company logo
DATASCIENCE
IntroductiontoDataScience
Presented by Prof.Priyanka Jadhav
Jens Martensson
1.1WhatisDataScience,importanceofdatascience,
1.2BigdataanddataScience,thecurrentScenario,
1.3IndustryPerspectiveTypesofData:Structuredvs.
UnstructuredData,
1.4Quantitativevs.CategoricalData,
1.5BigDatavs.LittleData,Datascienceprocess
1.6RoleofDataScientist
2
Jens Martensson
What is Data Science
The study of data to extract meaningful insights
for business.
Data Science is a multidisciplinary field that
uses scientific methods, processes, algorithms,
and systems to extract knowledge and insights
from structured and unstructured data. It
combines aspects of mathematics, statistics,
computer science, and domain knowledge to
interpret data for decision-making and problem-
solving.
3
Jens Martensson
Importance of Data Science
Data science is important because it combines tools, methods, and
technology to generate meaning from data
1 2
3 4
5 6
4
Jens Martensson
Comparison
5
1.Structured data –
Structured data is data whose elements are addressable for effective analysis. It has been organized into a
formatted repository that is typically a database. It concerns all data which can be stored in database SQL in
a table with rows and columns. They have relational keys and can easily be mapped into pre-designed
fields. Today, those data are most processed in the development and simplest way to manage
information. Example: Relational data.
2.Semi-Structured data –
Semi-structured data is information that does not reside in a relational database but that has some
organizational properties that make it easier to analyze. With some processes, you can store them in the
relation database (it could be very hard for some kind of semi-structured data), but Semi-structured exist to
ease space. Example: XML data.
3.Unstructured data –
Unstructured data is a data which is not organized in a predefined manner or does not have a predefined
data model, thus it is not a good fit for a mainstream relational database. So for Unstructured data, there are
alternative platforms for storing and managing, it is increasingly prevalent in IT systems and is used by
organizations in a variety of business intelligence and analytics applications. Example: Word, PDF, Text,
Media logs.
Jens Martensson 6
Jens Martensson 7
Big Data :
The definition of big data is data that contains greater variety, arriving in increasing
volumes and with more velocity.
Big data is a term that describes large, hard-to-manage volumes of data – both
structured and unstructured – that inundate businesses on a day-to-day basis. But it's
not just the type or amount of data that's important, it's what organisations do with the
data that matters.
Big Data Examples to Know
Transportation: assist in GPS navigation, traffic and weather alerts
Big data examples:
Tracking consumer behavior and shopping habits to deliver hyper-personalized retail
product recommendations tailored to individual customers. Monitoring payment
patterns and analyzing them against historical customer activity to detect fraud in real
time.
Jens Martensson 8
Jens Martensson 9
Jens Martensson 10
Why is big data needed in current scenario?
Large data sets are meant to be comprehensive and encompass as much
information as the organization needs to make better decisions. Big data
insights let business leaders quickly make data-driven decisions that
impact their organizations. Better customer and market insights.
Quantitativevariablesareanyvariableswherethedatarepresent
amounts(e.g.height,weight,orage).
Categoricalvariablesareanyvariableswherethedatarepresent
groups.Thisincludesrankings(e.g.finishingplacesinarace),
classifications(e.g.brandsofcereal),andbinaryoutcomes(e.g.coin
flips).
Youneedtoknowwhattypeofvariablesyouareworkingwithto
choosetherightstatisticaltestforyourdataandinterpret
yourresults.
Jens Martensson 12
Jens Martensson 13
Jens Martensson 14
Jens Martensson 15
Role of Data Scientist :
A data scientist is a tech professional
that collects, analyzes, and interprets vast
amounts of data using analytical, statistical,
and programming skills. They are responsible
for mining valuable information from various
sources and transforming it into actionable
insights that can drive business growth.
Thank
You

More Related Content

PPTX
1.Introduction to Blockchain Technology.pptx
Modern College Shivajinagar, Pune-5
 
PPTX
3_Blockchain's_SHA_algorithm_Immutable ledger_Distributed p2p Network.pptx
Modern College Shivajinagar, Pune-5
 
PPTX
Types, Components Architecture of blockchain.pptx
Modern College Shivajinagar, Pune-5
 
PPTX
Chapter 4 layers & monetory policy part1.pptx
Modern College Shivajinagar, Pune-5
 
PPTX
Introduction to Ethereum,accounts, smart contract.pptx
Modern College Shivajinagar, Pune-5
 
PPTX
Basic introduction in blockchain, smart contracts, permissioned ledgers
Koen Vingerhoets
 
PPTX
Chapter 4 UTXo transaction fees part 2.pptx
Modern College Shivajinagar, Pune-5
 
PPTX
3_How mining works _Byzantine_Fault_ToleranceBFT.pptx
Modern College Shivajinagar, Pune-5
 
1.Introduction to Blockchain Technology.pptx
Modern College Shivajinagar, Pune-5
 
3_Blockchain's_SHA_algorithm_Immutable ledger_Distributed p2p Network.pptx
Modern College Shivajinagar, Pune-5
 
Types, Components Architecture of blockchain.pptx
Modern College Shivajinagar, Pune-5
 
Chapter 4 layers & monetory policy part1.pptx
Modern College Shivajinagar, Pune-5
 
Introduction to Ethereum,accounts, smart contract.pptx
Modern College Shivajinagar, Pune-5
 
Basic introduction in blockchain, smart contracts, permissioned ledgers
Koen Vingerhoets
 
Chapter 4 UTXo transaction fees part 2.pptx
Modern College Shivajinagar, Pune-5
 
3_How mining works _Byzantine_Fault_ToleranceBFT.pptx
Modern College Shivajinagar, Pune-5
 

What's hot (20)

PPTX
Overview-of-Blockchain-Technology-and-Ethereum.pptx
xhamm1994
 
PPTX
Blockchain 101 presentation by fstream.io
Baiju Devani
 
PDF
Introduction to Blockchain
Malak Abu Hammad
 
PDF
Blockchain and Decentralization
Priyab Satoshi
 
PDF
Blockchain Presentation
Zied GUESMI
 
PDF
Blockchain
Pawan Ghewande
 
PDF
What is a blockchain?
Kevin Koo
 
PPTX
BLOCKCHAIN, DIGITAL WALLET And CRYPTOCURRENCY
sanidulsattar
 
PPTX
Blockchain Consensus Protocols
Melanie Swan
 
PPTX
Blockchain
Amit Kumar
 
PPTX
Bitcoin data mining
malathieswaran29
 
PDF
How does blockchain work
Shishir Aryal
 
PPTX
Blockchain and Cryptocurrencies
nimeshQ
 
PPTX
An introduction to block chain technology
yaminisindhurabandar
 
PPTX
Smart contract
Akhmad Daniel Sembiring
 
PPTX
Blockchain
Wael Othmani
 
PPTX
Types of Blockchains
Vikram Khanna
 
PPTX
IoT and Blockchain Challenges and Risks
Ahmed Banafa
 
PPTX
Blockchain 101
Jithin Babu
 
PPTX
Cơ bản về blockchain, bitcoin và ethereum
Long Le
 
Overview-of-Blockchain-Technology-and-Ethereum.pptx
xhamm1994
 
Blockchain 101 presentation by fstream.io
Baiju Devani
 
Introduction to Blockchain
Malak Abu Hammad
 
Blockchain and Decentralization
Priyab Satoshi
 
Blockchain Presentation
Zied GUESMI
 
Blockchain
Pawan Ghewande
 
What is a blockchain?
Kevin Koo
 
BLOCKCHAIN, DIGITAL WALLET And CRYPTOCURRENCY
sanidulsattar
 
Blockchain Consensus Protocols
Melanie Swan
 
Blockchain
Amit Kumar
 
Bitcoin data mining
malathieswaran29
 
How does blockchain work
Shishir Aryal
 
Blockchain and Cryptocurrencies
nimeshQ
 
An introduction to block chain technology
yaminisindhurabandar
 
Smart contract
Akhmad Daniel Sembiring
 
Blockchain
Wael Othmani
 
Types of Blockchains
Vikram Khanna
 
IoT and Blockchain Challenges and Risks
Ahmed Banafa
 
Blockchain 101
Jithin Babu
 
Cơ bản về blockchain, bitcoin và ethereum
Long Le
 
Ad

Similar to Unit 1 Introduction to DATA SCIENCE .pptx (20)

PDF
Untitled document.pdf
MuhammadTahiriqbal13
 
PPTX
Data_Analytics for m tech min iit bhu.pptx
ShaktikantGiri1
 
PPTX
The Power of Data Science by DICS INNOVATIVE.pptx
gs5545791
 
PDF
[IJET-V1I3P10] Authors : Kalaignanam.K, Aishwarya.M, Vasantharaj.K, Kumaresan...
IJET - International Journal of Engineering and Techniques
 
PPTX
introduction to data science
Johnson Ubah
 
PDF
Unveiling the Power of Data Analytics Transforming Insights into Action.pdf
Kajal Digital
 
PDF
Odgers Berndtson and Unico Big Data White Paper
Robertson Executive Search
 
PDF
Introduction to Data Science
ANOOP V S
 
DOCX
What is data science artical
kavyapandala
 
PPTX
INTRODUCTION TO DATA SCIENCE -CONCEPTS.pptx
Madhumitha N
 
PDF
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...
ijdpsjournal
 
PDF
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...
ijdpsjournal
 
PDF
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANC...
ijdpsjournal
 
PDF
Big Data Analytics: Challenges And Applications For Text, Audio, Video, And S...
IJSCAI Journal
 
PDF
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
gerogepatton
 
PDF
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
ijscai
 
PDF
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
gerogepatton
 
PDF
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
gerogepatton
 
PDF
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
gerogepatton
 
PDF
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
ijscai
 
Untitled document.pdf
MuhammadTahiriqbal13
 
Data_Analytics for m tech min iit bhu.pptx
ShaktikantGiri1
 
The Power of Data Science by DICS INNOVATIVE.pptx
gs5545791
 
[IJET-V1I3P10] Authors : Kalaignanam.K, Aishwarya.M, Vasantharaj.K, Kumaresan...
IJET - International Journal of Engineering and Techniques
 
introduction to data science
Johnson Ubah
 
Unveiling the Power of Data Analytics Transforming Insights into Action.pdf
Kajal Digital
 
Odgers Berndtson and Unico Big Data White Paper
Robertson Executive Search
 
Introduction to Data Science
ANOOP V S
 
What is data science artical
kavyapandala
 
INTRODUCTION TO DATA SCIENCE -CONCEPTS.pptx
Madhumitha N
 
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...
ijdpsjournal
 
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...
ijdpsjournal
 
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANC...
ijdpsjournal
 
Big Data Analytics: Challenges And Applications For Text, Audio, Video, And S...
IJSCAI Journal
 
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
gerogepatton
 
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
ijscai
 
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
gerogepatton
 
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
gerogepatton
 
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
gerogepatton
 
BIG DATA ANALYTICS: CHALLENGES AND APPLICATIONS FOR TEXT, AUDIO, VIDEO, AND S...
ijscai
 
Ad

More from Priyanka Jadhav (7)

PPTX
Unit4.pptx Data Access File System Data
Priyanka Jadhav
 
PPTX
Unit2.pptx Statistical Interference and Exploratory Data Analysis
Priyanka Jadhav
 
PPTX
Unit2_4.pptx Object Oriented Concepts .
Priyanka Jadhav
 
PPTX
Unit2_3.pptx Chapter 2 Introduction to C#
Priyanka Jadhav
 
PPTX
Unit2_2.pptx Chapter 2 Introduction to C#
Priyanka Jadhav
 
PPTX
Unit2_1.pptx Introduction to C# Language features
Priyanka Jadhav
 
PPTX
Unit 1:DOT NET Framework CLR(Common Language Runtime )
Priyanka Jadhav
 
Unit4.pptx Data Access File System Data
Priyanka Jadhav
 
Unit2.pptx Statistical Interference and Exploratory Data Analysis
Priyanka Jadhav
 
Unit2_4.pptx Object Oriented Concepts .
Priyanka Jadhav
 
Unit2_3.pptx Chapter 2 Introduction to C#
Priyanka Jadhav
 
Unit2_2.pptx Chapter 2 Introduction to C#
Priyanka Jadhav
 
Unit2_1.pptx Introduction to C# Language features
Priyanka Jadhav
 
Unit 1:DOT NET Framework CLR(Common Language Runtime )
Priyanka Jadhav
 

Recently uploaded (20)

PPTX
Fluvial_Civilizations_Presentation (1).pptx
alisslovemendoza7
 
PDF
717629748-Databricks-Certified-Data-Engineer-Professional-Dumps-by-Ball-21-03...
pedelli41
 
PPTX
Probability systematic sampling methods.pptx
PrakashRajput19
 
PPTX
short term internship project on Data visualization
JMJCollegeComputerde
 
PPTX
Databricks-DE-Associate Certification Questions-june-2024.pptx
pedelli41
 
PPTX
M1-T1.pptxM1-T1.pptxM1-T1.pptxM1-T1.pptx
teodoroferiarevanojr
 
PDF
Technical Writing Module-I Complete Notes.pdf
VedprakashArya13
 
PPTX
Pipeline Automatic Leak Detection for Water Distribution Systems
Sione Palu
 
PDF
TIC ACTIVIDAD 1geeeeeeeeeeeeeeeeeeeeeeeeeeeeeer3.pdf
Thais Ruiz
 
PDF
202501214233242351219 QASS Session 2.pdf
lauramejiamillan
 
PPTX
Data Security Breach: Immediate Action Plan
varmabhuvan266
 
PPTX
short term project on AI Driven Data Analytics
JMJCollegeComputerde
 
PPTX
Web dev -ppt that helps us understand web technology
shubhragoyal12
 
PPTX
White Blue Simple Modern Enhancing Sales Strategy Presentation_20250724_21093...
RamNeymarjr
 
PDF
WISE main accomplishments for ISQOLS award July 2025.pdf
StatsCommunications
 
PPT
Real Life Application of Set theory, Relations and Functions
manavparmar205
 
PPTX
Presentation on animal welfare a good topic
kidscream385
 
PDF
Blue Futuristic Cyber Security Presentation.pdf
tanvikhunt1003
 
PDF
blockchain123456789012345678901234567890
tanvikhunt1003
 
PPTX
lecture 13 mind test academy it skills.pptx
ggesjmrasoolpark
 
Fluvial_Civilizations_Presentation (1).pptx
alisslovemendoza7
 
717629748-Databricks-Certified-Data-Engineer-Professional-Dumps-by-Ball-21-03...
pedelli41
 
Probability systematic sampling methods.pptx
PrakashRajput19
 
short term internship project on Data visualization
JMJCollegeComputerde
 
Databricks-DE-Associate Certification Questions-june-2024.pptx
pedelli41
 
M1-T1.pptxM1-T1.pptxM1-T1.pptxM1-T1.pptx
teodoroferiarevanojr
 
Technical Writing Module-I Complete Notes.pdf
VedprakashArya13
 
Pipeline Automatic Leak Detection for Water Distribution Systems
Sione Palu
 
TIC ACTIVIDAD 1geeeeeeeeeeeeeeeeeeeeeeeeeeeeeer3.pdf
Thais Ruiz
 
202501214233242351219 QASS Session 2.pdf
lauramejiamillan
 
Data Security Breach: Immediate Action Plan
varmabhuvan266
 
short term project on AI Driven Data Analytics
JMJCollegeComputerde
 
Web dev -ppt that helps us understand web technology
shubhragoyal12
 
White Blue Simple Modern Enhancing Sales Strategy Presentation_20250724_21093...
RamNeymarjr
 
WISE main accomplishments for ISQOLS award July 2025.pdf
StatsCommunications
 
Real Life Application of Set theory, Relations and Functions
manavparmar205
 
Presentation on animal welfare a good topic
kidscream385
 
Blue Futuristic Cyber Security Presentation.pdf
tanvikhunt1003
 
blockchain123456789012345678901234567890
tanvikhunt1003
 
lecture 13 mind test academy it skills.pptx
ggesjmrasoolpark
 

Unit 1 Introduction to DATA SCIENCE .pptx