PYTHON IN DATA SCIENCE WORK
RICK BAHAGUE, DATA SCIENTIST

RBAHAGUEJR@GMAIL.COM
Our Agenda
What is Data Science?
Introduction to Python
Python Tools for Data Science
A bit of Python for Big Data Processing
Questions
Data Science
Source: Python Data Analytics
A data scientist asks relevant
real-world questions
Source: http://drewconway.com/zia/2013/3/26/the-data-science-venn-diagram
And hopefully,
discovers
actionable
recommendations
from data
TOOLS
WHAT IS
PYTHON?
“THE NAME PYTHON COMES
FROM THE SURREAL BRITISH
COMEDY GROUP MONTY PYTHON,
NOT FROM THE SNAKE. PYTHON
PROGRAMMERS ARE
AFFECTIONATELY CALLED
PYTHONISTAS, AND BOTH MONTY
PYTHON AND SERPENTINE
REFERENCES USUALLY PEPPER
PYTHON TUTORIALS AND
DOCUMENTATION.”

Automate the Boring Stuff with Python
import antigravity
Installing Python
https://www.continuum.io/downloads
Launching Anaconda Python
Distribution
When is data ready and
prepared for analysis?
Image source: http://blog.kaggle.com/2016/07/21/approaching-almost-any-machine-learning-problem-abhishek-thakur/
Github: https://github.com/RickBahague/dspop
Sample Data Set:
Github: https://github.com/veekun/pokedex
Pandas: Python Data Analysis
Library
Import pandas library
Reading/Writing Data
Series
DataFrame
Selecting Internal Elements
Assigning Values to Elements
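The topics on this slide can be sketched in a few lines. This is a minimal illustration, not the code from the talk: the tiny table is typed by hand and only loosely modeled on the sample data set, and the column names (name, type, hp) are assumptions rather than the actual veekun/pokedex schema.

```python
import io

import pandas as pd  # Import pandas library

# Hand-typed CSV text standing in for a real file (assumed columns).
csv_text = "name,type,hp\nbulbasaur,grass,45\ncharmander,fire,39\nsquirtle,water,44\n"

# Reading data: read_csv accepts a file path or a file-like object.
df = pd.read_csv(io.StringIO(csv_text))

# Writing data: to_csv without a path returns the CSV as a string.
round_trip = df.to_csv(index=False)

# Series: a one-dimensional labelled array.
hp = pd.Series([45, 39, 44], index=["bulbasaur", "charmander", "squirtle"], name="hp")

# DataFrame: a table of named columns (df above is one).
# Selecting internal elements by row label and column name.
first_type = df.loc[0, "type"]
hp_column = df["hp"]
charmander_hp = hp["charmander"]

# Assigning values to elements.
df.loc[1, "hp"] = 40
hp["squirtle"] = 50
```

Label-based access with .loc is used here because it keeps selection and assignment symmetric; position-based access with .iloc would work as well.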
Pandas: Python Data Analysis
Library
Evaluating Values (unique, isin, value_counts,
NaN)
Filtering Values
Transpose
Operations between DataFrame and Series
Statistics Functions, Correlation/Covariance
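A short sketch of the operations listed on this slide follows. The numbers, row labels, and column names are made up for illustration and are not taken from the talk's data set.

```python
import numpy as np
import pandas as pd

# Illustrative values only; one attack value is deliberately missing.
df = pd.DataFrame(
    {"attack": [49.0, 52.0, 48.0, np.nan], "defense": [49.0, 43.0, 65.0, 35.0]},
    index=["a", "b", "c", "d"],
)
types = pd.Series(["grass", "fire", "water", "fire"], index=df.index)

# Evaluating values.
unique_types = types.unique()                     # distinct values, in order of appearance
type_counts = types.value_counts()                # how often each value occurs
is_fire_or_water = types.isin(["fire", "water"])  # boolean membership mask
missing_attack = df["attack"].isna()              # True where the value is NaN

# Filtering values with a boolean condition.
strong_defense = df[df["defense"] > 45]

# Transpose: rows become columns and vice versa.
transposed = df.T

# Operation between a DataFrame and a Series: df.mean() is a Series indexed
# by column name, so the subtraction is aligned column by column.
centered = df - df.mean()

# Statistics functions, correlation and covariance (NaN values are skipped).
summary = df.describe()
corr = df.corr()
cov = df.cov()
```

Note that the arithmetic between df and df.mean() aligns on column labels; aligning on the row index instead would need a method such as df.sub(series, axis=0).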
Scikit-learn & ML Basics
... learning from experience either
with or without supervision of
humans
Mastering Machine Learning with scikit-learn
ML Flow
Image source: http://blog.kaggle.com/2016/07/21/approaching-almost-any-machine-learning-problem-abhishek-thakur/
Machine Learning with
Scikit-learn
Source: http://scikit-learn.org/stable/
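As a rough illustration of a supervised learning flow in scikit-learn (split the data, fit a model, predict, evaluate), the sketch below may help. The iris data set, the scaler, and the logistic regression model are choices made here for brevity; the slides do not say which data or estimator the talk used.

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Built-in toy data set (an assumption, not the talk's data).
X, y = load_iris(return_X_y=True)

# Hold out a quarter of the rows so the model is scored on unseen data.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y
)

# Chain preprocessing and the estimator so both are fitted on training data only.
model = make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000))
model.fit(X_train, y_train)

# Predict labels for the held-out rows and measure the fraction that match.
predictions = model.predict(X_test)
accuracy = accuracy_score(y_test, predictions)
print(f"held-out accuracy: {accuracy:.2f}")
```

Putting the scaler inside the pipeline is one way to keep test data from leaking into preprocessing; any other scikit-learn estimator could be swapped in for the logistic regression.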
A bit of Big Data Processing
Source: Python Data Analytics
Creative Commons License
Python in Data Science Work by Rick Bahague is licensed under a Creative Commons
Attribution-NonCommercial-ShareAlike 4.0 International License.
Based on a work at https://medium.com/@rbahaguejr.
Permissions beyond the scope of this license may be available at
https://medium.com/@rbahaguejr.
