SlideShare a Scribd company logo
BIG DATA
ANALYTICS
DR. OWAIS BHAT
DS-Visualization-Unit-4 COMPUTER SCIENCE.pdf
DATA-SCIENCE APPLICATIONS
Data science is a rapidly growing field that is being used in a wide variety of
applications.
• Fraud detection: Data science can be used to identify and prevent fraudulent
transactions. For example, banks use data science to identify suspicious
activity in credit card transactions.
• Customer segmentation: Data science can be used to segment customers into
groups based on their characteristics. This information can be used to target
customers with specific marketing messages.
• Product recommendations: Data science can be used to recommend products
to customers based on their past purchases or browsing history. This
information can be used to increase sales and improve customer satisfaction.
• Risk assessment: Data science can be used to assess the risk of an event
happening. For example, insurance companies use data science to assess the
risk of a customer filing a claim.
• Personalized medicine: Data science can be used to personalize medical
treatment for patients. For example, doctors can use data science to identify
the best treatment for a patient based on their individual characteristics.
RECENT TRENDS IN DATA COLLECTION & ANALYSIS
• The rise of real-time data collection and analysis.
• The increasing use of artificial intelligence (AI) and machine learning (ML) for
data analysis.
• The growing popularity of cloud-based data collection and analysis tools.
• The increasing focus on data privacy and security
Some of the most popular technologies for data visualization in data science:
• Matplotlib: Matplotlib is a Python library for creating static, animated, and
interactive visualizations. It is a popular choice for data scientists because it is
easy to use and versatile.
• Seaborn: Seaborn is a Python library that builds on Matplotlib to provide a
high-level interface for creating attractive and informative visualizations. It is a
good choice for data scientists who want to create visualizations that are both
visually appealing and easy to understand.
Plotly: Plotly is a Python library for creating interactive visualizations that can
be embedded in web pages or documents. It is a good choice for data scientists
who want to create visualizations that can be shared and explored online.
Tableau: Tableau is a commercial data visualization software that is known for
its ease of use and interactive capabilities. It is a good choice for businesses
and organizations that need to create data-driven visualizations for non-
technical audiences.
Qlik Sense: Qlik Sense is another commercial data visualization software that is
known for its speed and scalability. It is a good choice for businesses that need
to process large amounts of data quickly and create interactive visualizations.
DATA-SCIENCE – R LANGUAGE
R is a powerful programming language for data science, and it can be used to
develop a variety of applications. Some of the most common application
development methods in data science using R include:
Shiny:
• Shiny is an R package that facilitates the creation of interactive web
applications directly from R scripts.
• It allows data scientists to build dynamic dashboards, visualizations, and data-
driven web interfaces without extensive web development knowledge.
• Shiny apps can be hosted online or deployed on local servers.
R Markdown:
• R Markdown is a versatile tool for creating reproducible reports, documents,
and presentations that integrate R code, visualizations, and narrative text.
• It enables data scientists to weave code, output, and text into a single
document, making it easy to share insights and analysis.
Plumber:
• Plumber is an R package for building APIs (Application Programming
Interfaces) using R code.
• Data scientists can create RESTful APIs to expose R models, functions, or data
processing pipelines for integration with other applications.
RStudio Connect:
• RStudio Connect is a platform that allows you to publish and share Shiny apps,
R Markdown documents, and Plumber APIs securely within your organization.
• It simplifies the deployment and management of R-based applications.
R Packages:
• R allows you to develop custom R packages that encapsulate functions, data,
and documentation for specific data science tasks.
• Packages can be shared and reused across projects, enhancing code
modularity and reusability.
R with SQL Databases:
• R can be integrated with SQL databases using packages like RMySQL or DBI,
enabling data retrieval, analysis, and visualization directly from databases.
• This is useful for building data-driven applications that fetch and process data
from relational databases.
DS-Visualization-Unit-4 COMPUTER SCIENCE.pdf
THANK YOU
owais@iust.ac.in

More Related Content

Similar to DS-Visualization-Unit-4 COMPUTER SCIENCE.pdf (20)

PPTX
DA DS traning.pptx. Data Science is marking its graph on a high note by expan...
sureshchandran711
 
PPSX
Platform for Big Data Analytics and Visual Analytics: CSIRO use cases. Februa...
Tomasz Bednarz
 
PPTX
DATA SCIENCE IS CATALYZING BUSINESS AND INNOVATION
Elvis Muyanja
 
PPTX
Data Analytic s (Unit -1).pRESENTATION .PPT
GayathriShiva4
 
PPTX
Unit-I_Big data life cycle.pptx, sources of Big Data
RajendraKankrale1
 
PPTX
BDA UNIT 1big data – web analytics – big data applications– big data technolo...
BalachandarJ5
 
PPTX
March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...
Experfy
 
PPTX
BIG DATA ANALYTICS USING R
Umair Shafique
 
PDF
Data Analytics with Python: A Comprehensive Approach - CETPA Infotech
Cetpa Infotech Pvt Ltd
 
PPTX
Data science using r multisoft systems
Multisoft Systems
 
PPTX
Abhishek Training PPT.pptx
KashishKashish22
 
PDF
Agile data science
Joel Horwitz
 
PDF
Python para Manual de Ciência de Dados
Rafael Oliveira Bitcoin
 
PDF
How to Identify, Train or Become a Data Scientist
Inside Analysis
 
PPTX
2019 DSA 105 Introduction to Data Science Week 4
Ferdin Joe John Joseph PhD
 
PPTX
Introduction to Data Visualization, Importance and types
grsssyw24
 
PDF
Big Data Analytics M1.pdf big data analytics
nithishlkumar9194
 
PDF
Big Data & DS Analytics for PAARL
Philippine Association of Academic/Research Librarians
 
PPTX
Choosing a Data Visualization Tool for Data Scientists_Final
Heather Choi
 
DA DS traning.pptx. Data Science is marking its graph on a high note by expan...
sureshchandran711
 
Platform for Big Data Analytics and Visual Analytics: CSIRO use cases. Februa...
Tomasz Bednarz
 
DATA SCIENCE IS CATALYZING BUSINESS AND INNOVATION
Elvis Muyanja
 
Data Analytic s (Unit -1).pRESENTATION .PPT
GayathriShiva4
 
Unit-I_Big data life cycle.pptx, sources of Big Data
RajendraKankrale1
 
BDA UNIT 1big data – web analytics – big data applications– big data technolo...
BalachandarJ5
 
March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...
Experfy
 
BIG DATA ANALYTICS USING R
Umair Shafique
 
Data Analytics with Python: A Comprehensive Approach - CETPA Infotech
Cetpa Infotech Pvt Ltd
 
Data science using r multisoft systems
Multisoft Systems
 
Abhishek Training PPT.pptx
KashishKashish22
 
Agile data science
Joel Horwitz
 
Python para Manual de Ciência de Dados
Rafael Oliveira Bitcoin
 
How to Identify, Train or Become a Data Scientist
Inside Analysis
 
2019 DSA 105 Introduction to Data Science Week 4
Ferdin Joe John Joseph PhD
 
Introduction to Data Visualization, Importance and types
grsssyw24
 
Big Data Analytics M1.pdf big data analytics
nithishlkumar9194
 
Choosing a Data Visualization Tool for Data Scientists_Final
Heather Choi
 

More from coreyanderson7866 (6)

PPTX
2-The unified process in computer Science.pptx
coreyanderson7866
 
PDF
Cloud and Grid Computing PPT computer science.pdf
coreyanderson7866
 
PPTX
Learn various loops and Iterations in Python
coreyanderson7866
 
PPTX
Presentation_ON _HTML Irfan Rashid .pptx
coreyanderson7866
 
PPTX
Contingency arguments for Existence Of Allah.edd.pptx
coreyanderson7866
 
PDF
Microbiology report card test smims .pdf
coreyanderson7866
 
2-The unified process in computer Science.pptx
coreyanderson7866
 
Cloud and Grid Computing PPT computer science.pdf
coreyanderson7866
 
Learn various loops and Iterations in Python
coreyanderson7866
 
Presentation_ON _HTML Irfan Rashid .pptx
coreyanderson7866
 
Contingency arguments for Existence Of Allah.edd.pptx
coreyanderson7866
 
Microbiology report card test smims .pdf
coreyanderson7866
 
Ad

Recently uploaded (20)

PPTX
How to Set Maximum Difference Odoo 18 POS
Celine George
 
PDF
0725.WHITEPAPER-UNIQUEWAYSOFPROTOTYPINGANDUXNOW.pdf
Thomas GIRARD, MA, CDP
 
PPTX
Quarter1-English3-W4-Identifying Elements of the Story
FLORRACHELSANTOS
 
PPT
Talk on Critical Theory, Part One, Philosophy of Social Sciences
Soraj Hongladarom
 
PPTX
2025 Winter SWAYAM NPTEL & A Student.pptx
Utsav Yagnik
 
PDF
The Constitution Review Committee (CRC) has released an updated schedule for ...
nservice241
 
PPTX
Unit 2 COMMERCIAL BANKING, Corporate banking.pptx
AnubalaSuresh1
 
PPTX
Cultivation practice of Litchi in Nepal.pptx
UmeshTimilsina1
 
PDF
Dimensions of Societal Planning in Commonism
StefanMz
 
PPTX
Growth and development and milestones, factors
BHUVANESHWARI BADIGER
 
PPTX
BANDHA (BANDAGES) PPT.pptx ayurveda shalya tantra
rakhan78619
 
PPTX
How to Manage Large Scrollbar in Odoo 18 POS
Celine George
 
PPTX
ASRB NET 2023 PREVIOUS YEAR QUESTION PAPER GENETICS AND PLANT BREEDING BY SAT...
Krashi Coaching
 
PPTX
HYDROCEPHALUS: NURSING MANAGEMENT .pptx
PRADEEP ABOTHU
 
PPTX
How to Create a PDF Report in Odoo 18 - Odoo Slides
Celine George
 
PDF
LAW OF CONTRACT (5 YEAR LLB & UNITARY LLB )- MODULE - 1.& 2 - LEARN THROUGH P...
APARNA T SHAIL KUMAR
 
PDF
CEREBRAL PALSY: NURSING MANAGEMENT .pdf
PRADEEP ABOTHU
 
PPSX
HEALTH ASSESSMENT (Community Health Nursing) - GNM 1st Year
Priyanshu Anand
 
PPTX
A PPT on Alfred Lord Tennyson's Ulysses.
Beena E S
 
PPTX
How to Convert an Opportunity into a Quotation in Odoo 18 CRM
Celine George
 
How to Set Maximum Difference Odoo 18 POS
Celine George
 
0725.WHITEPAPER-UNIQUEWAYSOFPROTOTYPINGANDUXNOW.pdf
Thomas GIRARD, MA, CDP
 
Quarter1-English3-W4-Identifying Elements of the Story
FLORRACHELSANTOS
 
Talk on Critical Theory, Part One, Philosophy of Social Sciences
Soraj Hongladarom
 
2025 Winter SWAYAM NPTEL & A Student.pptx
Utsav Yagnik
 
The Constitution Review Committee (CRC) has released an updated schedule for ...
nservice241
 
Unit 2 COMMERCIAL BANKING, Corporate banking.pptx
AnubalaSuresh1
 
Cultivation practice of Litchi in Nepal.pptx
UmeshTimilsina1
 
Dimensions of Societal Planning in Commonism
StefanMz
 
Growth and development and milestones, factors
BHUVANESHWARI BADIGER
 
BANDHA (BANDAGES) PPT.pptx ayurveda shalya tantra
rakhan78619
 
How to Manage Large Scrollbar in Odoo 18 POS
Celine George
 
ASRB NET 2023 PREVIOUS YEAR QUESTION PAPER GENETICS AND PLANT BREEDING BY SAT...
Krashi Coaching
 
HYDROCEPHALUS: NURSING MANAGEMENT .pptx
PRADEEP ABOTHU
 
How to Create a PDF Report in Odoo 18 - Odoo Slides
Celine George
 
LAW OF CONTRACT (5 YEAR LLB & UNITARY LLB )- MODULE - 1.& 2 - LEARN THROUGH P...
APARNA T SHAIL KUMAR
 
CEREBRAL PALSY: NURSING MANAGEMENT .pdf
PRADEEP ABOTHU
 
HEALTH ASSESSMENT (Community Health Nursing) - GNM 1st Year
Priyanshu Anand
 
A PPT on Alfred Lord Tennyson's Ulysses.
Beena E S
 
How to Convert an Opportunity into a Quotation in Odoo 18 CRM
Celine George
 
Ad

DS-Visualization-Unit-4 COMPUTER SCIENCE.pdf

  • 3. DATA-SCIENCE APPLICATIONS Data science is a rapidly growing field that is being used in a wide variety of applications. • Fraud detection: Data science can be used to identify and prevent fraudulent transactions. For example, banks use data science to identify suspicious activity in credit card transactions.
  • 4. • Customer segmentation: Data science can be used to segment customers into groups based on their characteristics. This information can be used to target customers with specific marketing messages. • Product recommendations: Data science can be used to recommend products to customers based on their past purchases or browsing history. This information can be used to increase sales and improve customer satisfaction. • Risk assessment: Data science can be used to assess the risk of an event happening. For example, insurance companies use data science to assess the risk of a customer filing a claim.
  • 5. • Personalized medicine: Data science can be used to personalize medical treatment for patients. For example, doctors can use data science to identify the best treatment for a patient based on their individual characteristics.
  • 6. RECENT TRENDS IN DATA COLLECTION & ANALYSIS • The rise of real-time data collection and analysis. • The increasing use of artificial intelligence (AI) and machine learning (ML) for data analysis. • The growing popularity of cloud-based data collection and analysis tools. • The increasing focus on data privacy and security
  • 7. Some of the most popular technologies for data visualization in data science: • Matplotlib: Matplotlib is a Python library for creating static, animated, and interactive visualizations. It is a popular choice for data scientists because it is easy to use and versatile. • Seaborn: Seaborn is a Python library that builds on Matplotlib to provide a high-level interface for creating attractive and informative visualizations. It is a good choice for data scientists who want to create visualizations that are both visually appealing and easy to understand.
  • 8. Plotly: Plotly is a Python library for creating interactive visualizations that can be embedded in web pages or documents. It is a good choice for data scientists who want to create visualizations that can be shared and explored online. Tableau: Tableau is a commercial data visualization software that is known for its ease of use and interactive capabilities. It is a good choice for businesses and organizations that need to create data-driven visualizations for non- technical audiences. Qlik Sense: Qlik Sense is another commercial data visualization software that is known for its speed and scalability. It is a good choice for businesses that need to process large amounts of data quickly and create interactive visualizations.
  • 9. DATA-SCIENCE – R LANGUAGE R is a powerful programming language for data science, and it can be used to develop a variety of applications. Some of the most common application development methods in data science using R include: Shiny: • Shiny is an R package that facilitates the creation of interactive web applications directly from R scripts. • It allows data scientists to build dynamic dashboards, visualizations, and data- driven web interfaces without extensive web development knowledge. • Shiny apps can be hosted online or deployed on local servers.
  • 10. R Markdown: • R Markdown is a versatile tool for creating reproducible reports, documents, and presentations that integrate R code, visualizations, and narrative text. • It enables data scientists to weave code, output, and text into a single document, making it easy to share insights and analysis.
  • 11. Plumber: • Plumber is an R package for building APIs (Application Programming Interfaces) using R code. • Data scientists can create RESTful APIs to expose R models, functions, or data processing pipelines for integration with other applications. RStudio Connect: • RStudio Connect is a platform that allows you to publish and share Shiny apps, R Markdown documents, and Plumber APIs securely within your organization. • It simplifies the deployment and management of R-based applications.
  • 12. R Packages: • R allows you to develop custom R packages that encapsulate functions, data, and documentation for specific data science tasks. • Packages can be shared and reused across projects, enhancing code modularity and reusability. R with SQL Databases: • R can be integrated with SQL databases using packages like RMySQL or DBI, enabling data retrieval, analysis, and visualization directly from databases. • This is useful for building data-driven applications that fetch and process data from relational databases.