SlideShare a Scribd company logo
8
Most read
9
Most read
11
Most read
ChatGPT for Data
Science Projects
https://vitalflux.com
5/6/2023 https://vitalflux.com 1
Topics
Setting up ChatGPT for Data Analysis
Data Exploration and Analysis with
ChatGPT
Building Predictive Models with ChatGPT
Model Evaluation and Selection with
ChatGPT
5/6/2023 https://vitalflux.com 2
Setting Up ChatGPT for Data
Analysis
5/6/2023 https://vitalflux.com 3
Setting Up
ChatGPT for
Data
Analysis
Execute the following prompt for ChatGPT to get set
up for data analysis
Be an expert data scientists. Help me extract
insights from the data.
Here is the X no. of records as dataset.
18.0 8. 307.0 130.0 3504. 12.0 70. 1. "chevrolet
chevelle malibu" 15.0 8. 350.0 165.0 3693. 11.5
70. 1. "buick skylark 320“
Have you understood the dataset and related
information?
5/6/2023 https://vitalflux.com 4
Data Exploration and Analysis
with ChatGPT
5/6/2023 https://vitalflux.com 5
Data
Exploration
and Analysis
with
ChatGPT
• Give me top 3 insights from the dataset
• What are the most common values for each
attribute?
• Are there any trends or patterns in the data?
Identify Insights
• Identify any outliers in the dataset and decide
on a strategy for handling them, such as
removing them or replacing them with a more
reasonable value.
• How many outliers are there in the data?
• What is the range of values for each attribute?
Find Outliers
5/6/2023 https://vitalflux.com 6
Data
Exploration
and Analysis
with
ChatGPT
• What are the correlations between the attributes in
the dataset?
• Which attributes are most strongly correlated with
the target variable?
• Are there any correlations between the categorical
variables?
• Write Python code for visualizing correlations
Identify Correlations
• What is the distribution of values for each attribute?
• Are the distributions skewed or symmetric?
• Are there any outliers in the distribution?
• Write Python code for visualizing distributions
Discover Distributions
5/6/2023 https://vitalflux.com 7
Data
Exploration
and Analysis
with
ChatGPT
• What hypothesis do you think can be
tested from the data given earlier?
• Write Python code for performing
hypothesis test
Hypothesis Testing
• Write Python code that can help
visualize the relationships existing in
the dataset?
•Data Visualization
5/6/2023 https://vitalflux.com 8
Data
Exploration
and Analysis
with
ChatGPT
• Which attributes are most
relevant to the target variable?
• Can we create new features by
combining existing attributes?
• Which features are redundant or
irrelevant and can be removed?
• Write Python code for extracting
most relevant features
Extract Features
5/6/2023 https://vitalflux.com 9
Building Predictive Models
with ChatGPT
5/6/2023 https://vitalflux.com 10
Building Predictive Models with ChatGPT
Can I build a predictive model using this data? What can I predict?
What is the distribution of the target variable?
Are there any outliers in the target variable?
Identify
Target
Variables
Which attributes are most relevant to the target variable?
Can we create new features by combining existing attributes?
Which features are redundant or irrelevant and can be removed?
Select
Predictors
5/6/2023 https://vitalflux.com 11
Building Predictive Models with ChatGPT
Which algorithm is most appropriate for the problem and the data?
•Should we use a linear regression, decision tree, or neural network model?
•What are the pros and cons of each algorithm?
Choose
Algorithm
Create Python code for training the model using {algorithm}
Create Python code for training the model using {algorithm} while also performing
hyperparameter tuning
This prompt can be used repeatedly for different algorithms.
Train Model
5/6/2023 https://vitalflux.com 12
Model Evaluation & Selection
with ChatGPT
5/6/2023 https://vitalflux.com 13
Model Evaluation and Selection with
ChatGPT
Evaluate Model Performance
• How can I evaluate the performance of the model trained using
{algorithm}?
• What are the metrics for evaluating models trained using {algorithm}?
• Rewrite the python code while including evaluation metrics and
printing them
Hyperparameter tuning
• What are different hyperparameters which can be tuned for the model
trained using {algorithm}?
• How can I fine-tune the hyperparameters of the model trained using
{algorithm} in order to improve performance?
• Rewrite the model training python code with hyperparameters tuning
5/6/2023 https://vitalflux.com 14
Model Evaluation and Selection with
ChatGPT
Model Selection
• How can I compare the
performance of different
models trained using
{algorithm1}, {algorithm2}?
• What are the advantages and
disadvantages of each model?
5/6/2023 https://vitalflux.com 15
Thank You
https://vitalflux.com
5/6/2023 https://vitalflux.com 16

More Related Content

What's hot (20)

PDF
GENERATIVE AI, THE FUTURE OF PRODUCTIVITY
Andre Muscat
 
PDF
Exploring Opportunities in the Generative AI Value Chain.pdf
Dung Hoang
 
PDF
𝐆𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐯𝐞 𝐀𝐈: 𝐂𝐡𝐚𝐧𝐠𝐢𝐧𝐠 𝐇𝐨𝐰 𝐁𝐮𝐬𝐢𝐧𝐞𝐬𝐬 𝐈𝐧𝐧𝐨𝐯𝐚𝐭𝐞𝐬 𝐚𝐧𝐝 𝐎𝐩𝐞𝐫𝐚𝐭𝐞𝐬
VINCI Digital - Industrial IoT (IIoT) Strategic Advisory
 
PDF
Generative-AI-in-enterprise-20230615.pdf
Liming Zhu
 
PDF
Unleashing the Power of OpenAI GPT-3 in FME Data Integration Workflows
Safe Software
 
PPTX
Episode 2: The LLM / GPT / AI Prompt / Data Engineer Roadmap
Anant Corporation
 
PDF
An Introduction to Generative AI
Cori Faklaris
 
PDF
Leveraging Generative AI & Best practices
DianaGray10
 
PPTX
Data Mesh using Microsoft Fabric
Nathan Bijnens
 
PDF
Databricks Overview for MLOps
Databricks
 
PPTX
Generative AI Risks & Concerns
Ajitesh Kumar
 
PDF
BI Consultancy - Data, Analytics and Strategy
Shivam Dhawan
 
PDF
How Does Generative AI Actually Work? (a quick semi-technical introduction to...
ssuser4edc93
 
PDF
Generative AI
All Things Open
 
PDF
Machine Learning Pipelines
jeykottalam
 
PPTX
Power BI Overview
James Serra
 
PDF
Introducing Databricks Delta
Databricks
 
PDF
Let's talk about GPT: A crash course in Generative AI for researchers
Steven Van Vaerenbergh
 
PDF
Understanding GenAI/LLM and What is Google Offering - Felix Goh
NUS-ISS
 
PDF
Microsoft + OpenAI: Recent Updates (Machine Learning 15minutes! Broadcast #74)
Naoki (Neo) SATO
 
GENERATIVE AI, THE FUTURE OF PRODUCTIVITY
Andre Muscat
 
Exploring Opportunities in the Generative AI Value Chain.pdf
Dung Hoang
 
𝐆𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐯𝐞 𝐀𝐈: 𝐂𝐡𝐚𝐧𝐠𝐢𝐧𝐠 𝐇𝐨𝐰 𝐁𝐮𝐬𝐢𝐧𝐞𝐬𝐬 𝐈𝐧𝐧𝐨𝐯𝐚𝐭𝐞𝐬 𝐚𝐧𝐝 𝐎𝐩𝐞𝐫𝐚𝐭𝐞𝐬
VINCI Digital - Industrial IoT (IIoT) Strategic Advisory
 
Generative-AI-in-enterprise-20230615.pdf
Liming Zhu
 
Unleashing the Power of OpenAI GPT-3 in FME Data Integration Workflows
Safe Software
 
Episode 2: The LLM / GPT / AI Prompt / Data Engineer Roadmap
Anant Corporation
 
An Introduction to Generative AI
Cori Faklaris
 
Leveraging Generative AI & Best practices
DianaGray10
 
Data Mesh using Microsoft Fabric
Nathan Bijnens
 
Databricks Overview for MLOps
Databricks
 
Generative AI Risks & Concerns
Ajitesh Kumar
 
BI Consultancy - Data, Analytics and Strategy
Shivam Dhawan
 
How Does Generative AI Actually Work? (a quick semi-technical introduction to...
ssuser4edc93
 
Generative AI
All Things Open
 
Machine Learning Pipelines
jeykottalam
 
Power BI Overview
James Serra
 
Introducing Databricks Delta
Databricks
 
Let's talk about GPT: A crash course in Generative AI for researchers
Steven Van Vaerenbergh
 
Understanding GenAI/LLM and What is Google Offering - Felix Goh
NUS-ISS
 
Microsoft + OpenAI: Recent Updates (Machine Learning 15minutes! Broadcast #74)
Naoki (Neo) SATO
 

Similar to ChatGPT for Data Science Projects (20)

PDF
btNOG 10: Preparing for IPv6 implementation using AI
APNIC
 
PDF
Automatic machine learning (AutoML) 101
QuantUniversity
 
PPTX
Integrating Azure Machine Learning and Predictive Analytics with SharePoint O...
Bhakthi Liyanage
 
PDF
1111 ChatGPT Prompts PDF Free Download - Prompts for ChatGPT
iSEO AI
 
PDF
900 keynote abbott
Rising Media, Inc.
 
PDF
Data Science Meets DevOps: GitOps with OpenShift (1).pdf
HemaVeeradhi1
 
PPTX
Python for Machine Learning_ A Comprehensive Overview.pptx
KuldeepSinghBrar3
 
PPTX
ODSC18, London, How to build high performing weighted XGBoost ML Model for Re...
Alok Singh
 
PDF
[Analyst Research Slides] Build vs. Buy: Finding the Best Path to Network Aut...
Enterprise Management Associates
 
PDF
Ai in finance
QuantUniversity
 
PPTX
From Data Science to MLOps
Carl W. Handlin
 
PPTX
Understanding ChatGPT and Its Implications.pptx
Hexaview Technologies Inc.
 
PPTX
ITCamp 2019 - Andy Cross - Machine Learning with ML.NET and Azure Data Lake
ITCamp
 
PDF
DataBench Toolbox Demo, Ivan Martinez, Tomas Pariente Lobo, BDV Meet-Up Riga,...
DataBench
 
PDF
Future-Proof Your Streaming Analytics Architecture- StreamAnalytix Webinar
Impetus Technologies
 
PDF
Open Data Science Conference Big Data Infrastructure – Introduction to Hadoop...
DataKitchen
 
PDF
Crossing the Analytics Chasm and Getting the Models You Developed Deployed
Robert Grossman
 
PDF
The path to success with Graph Database and Graph Data Science
Neo4j
 
PDF
OSMC 2023 | Experiments with OpenSearch and AI by Jochen Kressin & Leanne La...
NETWAYS
 
PDF
Supercharge your data analytics with BigQuery
Márton Kodok
 
btNOG 10: Preparing for IPv6 implementation using AI
APNIC
 
Automatic machine learning (AutoML) 101
QuantUniversity
 
Integrating Azure Machine Learning and Predictive Analytics with SharePoint O...
Bhakthi Liyanage
 
1111 ChatGPT Prompts PDF Free Download - Prompts for ChatGPT
iSEO AI
 
900 keynote abbott
Rising Media, Inc.
 
Data Science Meets DevOps: GitOps with OpenShift (1).pdf
HemaVeeradhi1
 
Python for Machine Learning_ A Comprehensive Overview.pptx
KuldeepSinghBrar3
 
ODSC18, London, How to build high performing weighted XGBoost ML Model for Re...
Alok Singh
 
[Analyst Research Slides] Build vs. Buy: Finding the Best Path to Network Aut...
Enterprise Management Associates
 
Ai in finance
QuantUniversity
 
From Data Science to MLOps
Carl W. Handlin
 
Understanding ChatGPT and Its Implications.pptx
Hexaview Technologies Inc.
 
ITCamp 2019 - Andy Cross - Machine Learning with ML.NET and Azure Data Lake
ITCamp
 
DataBench Toolbox Demo, Ivan Martinez, Tomas Pariente Lobo, BDV Meet-Up Riga,...
DataBench
 
Future-Proof Your Streaming Analytics Architecture- StreamAnalytix Webinar
Impetus Technologies
 
Open Data Science Conference Big Data Infrastructure – Introduction to Hadoop...
DataKitchen
 
Crossing the Analytics Chasm and Getting the Models You Developed Deployed
Robert Grossman
 
The path to success with Graph Database and Graph Data Science
Neo4j
 
OSMC 2023 | Experiments with OpenSearch and AI by Jochen Kressin & Leanne La...
NETWAYS
 
Supercharge your data analytics with BigQuery
Márton Kodok
 
Ad

More from Ajitesh Kumar (6)

PPTX
GPT-3 Models Overview
Ajitesh Kumar
 
PPTX
Machine Learning Terminologies
Ajitesh Kumar
 
PPTX
Mastering Analytical Thinking: A Comprehensive Guide to Problem-Solving and D...
Ajitesh Kumar
 
PPTX
A Beginner's Guide to Large Language Models
Ajitesh Kumar
 
PPTX
How to Identify Analytics Use Cases
Ajitesh Kumar
 
PPTX
What is first principles thinking
Ajitesh Kumar
 
GPT-3 Models Overview
Ajitesh Kumar
 
Machine Learning Terminologies
Ajitesh Kumar
 
Mastering Analytical Thinking: A Comprehensive Guide to Problem-Solving and D...
Ajitesh Kumar
 
A Beginner's Guide to Large Language Models
Ajitesh Kumar
 
How to Identify Analytics Use Cases
Ajitesh Kumar
 
What is first principles thinking
Ajitesh Kumar
 
Ad

Recently uploaded (20)

PDF
A GraphRAG approach for Energy Efficiency Q&A
Marco Brambilla
 
PPTX
Listify-Intelligent-Voice-to-Catalog-Agent.pptx
nareshkottees
 
PDF
Optimizing Large Language Models with vLLM and Related Tools.pdf
Tamanna36
 
PDF
JavaScript - Good or Bad? Tips for Google Tag Manager
📊 Markus Baersch
 
PDF
apidays Singapore 2025 - How APIs can make - or break - trust in your AI by S...
apidays
 
PPTX
apidays Helsinki & North 2025 - API access control strategies beyond JWT bear...
apidays
 
PDF
apidays Singapore 2025 - Surviving an interconnected world with API governanc...
apidays
 
PDF
apidays Singapore 2025 - Trustworthy Generative AI: The Role of Observability...
apidays
 
PDF
The Best NVIDIA GPUs for LLM Inference in 2025.pdf
Tamanna36
 
PPT
Growth of Public Expendituuure_55423.ppt
NavyaDeora
 
PDF
apidays Singapore 2025 - Building a Federated Future, Alex Szomora (GSMA)
apidays
 
PPTX
apidays Singapore 2025 - The Quest for the Greenest LLM , Jean Philippe Ehre...
apidays
 
PDF
apidays Singapore 2025 - From API Intelligence to API Governance by Harsha Ch...
apidays
 
PDF
OOPs with Java_unit2.pdf. sarthak bookkk
Sarthak964187
 
PPTX
Aict presentation on dpplppp sjdhfh.pptx
vabaso5932
 
PDF
apidays Singapore 2025 - Streaming Lakehouse with Kafka, Flink and Iceberg by...
apidays
 
PPTX
apidays Helsinki & North 2025 - Running a Successful API Program: Best Practi...
apidays
 
PDF
Using AI/ML for Space Biology Research
VICTOR MAESTRE RAMIREZ
 
PDF
1750162332_Snapshot-of-Indias-oil-Gas-data-May-2025.pdf
sandeep718278
 
PDF
Business implication of Artificial Intelligence.pdf
VishalChugh12
 
A GraphRAG approach for Energy Efficiency Q&A
Marco Brambilla
 
Listify-Intelligent-Voice-to-Catalog-Agent.pptx
nareshkottees
 
Optimizing Large Language Models with vLLM and Related Tools.pdf
Tamanna36
 
JavaScript - Good or Bad? Tips for Google Tag Manager
📊 Markus Baersch
 
apidays Singapore 2025 - How APIs can make - or break - trust in your AI by S...
apidays
 
apidays Helsinki & North 2025 - API access control strategies beyond JWT bear...
apidays
 
apidays Singapore 2025 - Surviving an interconnected world with API governanc...
apidays
 
apidays Singapore 2025 - Trustworthy Generative AI: The Role of Observability...
apidays
 
The Best NVIDIA GPUs for LLM Inference in 2025.pdf
Tamanna36
 
Growth of Public Expendituuure_55423.ppt
NavyaDeora
 
apidays Singapore 2025 - Building a Federated Future, Alex Szomora (GSMA)
apidays
 
apidays Singapore 2025 - The Quest for the Greenest LLM , Jean Philippe Ehre...
apidays
 
apidays Singapore 2025 - From API Intelligence to API Governance by Harsha Ch...
apidays
 
OOPs with Java_unit2.pdf. sarthak bookkk
Sarthak964187
 
Aict presentation on dpplppp sjdhfh.pptx
vabaso5932
 
apidays Singapore 2025 - Streaming Lakehouse with Kafka, Flink and Iceberg by...
apidays
 
apidays Helsinki & North 2025 - Running a Successful API Program: Best Practi...
apidays
 
Using AI/ML for Space Biology Research
VICTOR MAESTRE RAMIREZ
 
1750162332_Snapshot-of-Indias-oil-Gas-data-May-2025.pdf
sandeep718278
 
Business implication of Artificial Intelligence.pdf
VishalChugh12
 

ChatGPT for Data Science Projects

  • 1. ChatGPT for Data Science Projects https://vitalflux.com 5/6/2023 https://vitalflux.com 1
  • 2. Topics Setting up ChatGPT for Data Analysis Data Exploration and Analysis with ChatGPT Building Predictive Models with ChatGPT Model Evaluation and Selection with ChatGPT 5/6/2023 https://vitalflux.com 2
  • 3. Setting Up ChatGPT for Data Analysis 5/6/2023 https://vitalflux.com 3
  • 4. Setting Up ChatGPT for Data Analysis Execute the following prompt for ChatGPT to get set up for data analysis Be an expert data scientists. Help me extract insights from the data. Here is the X no. of records as dataset. 18.0 8. 307.0 130.0 3504. 12.0 70. 1. "chevrolet chevelle malibu" 15.0 8. 350.0 165.0 3693. 11.5 70. 1. "buick skylark 320“ Have you understood the dataset and related information? 5/6/2023 https://vitalflux.com 4
  • 5. Data Exploration and Analysis with ChatGPT 5/6/2023 https://vitalflux.com 5
  • 6. Data Exploration and Analysis with ChatGPT • Give me top 3 insights from the dataset • What are the most common values for each attribute? • Are there any trends or patterns in the data? Identify Insights • Identify any outliers in the dataset and decide on a strategy for handling them, such as removing them or replacing them with a more reasonable value. • How many outliers are there in the data? • What is the range of values for each attribute? Find Outliers 5/6/2023 https://vitalflux.com 6
  • 7. Data Exploration and Analysis with ChatGPT • What are the correlations between the attributes in the dataset? • Which attributes are most strongly correlated with the target variable? • Are there any correlations between the categorical variables? • Write Python code for visualizing correlations Identify Correlations • What is the distribution of values for each attribute? • Are the distributions skewed or symmetric? • Are there any outliers in the distribution? • Write Python code for visualizing distributions Discover Distributions 5/6/2023 https://vitalflux.com 7
  • 8. Data Exploration and Analysis with ChatGPT • What hypothesis do you think can be tested from the data given earlier? • Write Python code for performing hypothesis test Hypothesis Testing • Write Python code that can help visualize the relationships existing in the dataset? •Data Visualization 5/6/2023 https://vitalflux.com 8
  • 9. Data Exploration and Analysis with ChatGPT • Which attributes are most relevant to the target variable? • Can we create new features by combining existing attributes? • Which features are redundant or irrelevant and can be removed? • Write Python code for extracting most relevant features Extract Features 5/6/2023 https://vitalflux.com 9
  • 10. Building Predictive Models with ChatGPT 5/6/2023 https://vitalflux.com 10
  • 11. Building Predictive Models with ChatGPT Can I build a predictive model using this data? What can I predict? What is the distribution of the target variable? Are there any outliers in the target variable? Identify Target Variables Which attributes are most relevant to the target variable? Can we create new features by combining existing attributes? Which features are redundant or irrelevant and can be removed? Select Predictors 5/6/2023 https://vitalflux.com 11
  • 12. Building Predictive Models with ChatGPT Which algorithm is most appropriate for the problem and the data? •Should we use a linear regression, decision tree, or neural network model? •What are the pros and cons of each algorithm? Choose Algorithm Create Python code for training the model using {algorithm} Create Python code for training the model using {algorithm} while also performing hyperparameter tuning This prompt can be used repeatedly for different algorithms. Train Model 5/6/2023 https://vitalflux.com 12
  • 13. Model Evaluation & Selection with ChatGPT 5/6/2023 https://vitalflux.com 13
  • 14. Model Evaluation and Selection with ChatGPT Evaluate Model Performance • How can I evaluate the performance of the model trained using {algorithm}? • What are the metrics for evaluating models trained using {algorithm}? • Rewrite the python code while including evaluation metrics and printing them Hyperparameter tuning • What are different hyperparameters which can be tuned for the model trained using {algorithm}? • How can I fine-tune the hyperparameters of the model trained using {algorithm} in order to improve performance? • Rewrite the model training python code with hyperparameters tuning 5/6/2023 https://vitalflux.com 14
  • 15. Model Evaluation and Selection with ChatGPT Model Selection • How can I compare the performance of different models trained using {algorithm1}, {algorithm2}? • What are the advantages and disadvantages of each model? 5/6/2023 https://vitalflux.com 15