SlideShare a Scribd company logo
Python for ML
MOST BASIC LIBRARIES
1
Libraries And ML Scope
ML
Data Gathering
Data Cleaning
Exploring DataBuilding Model
Visualization
2
Data Gathering
Beautiful Soup
• Is a Python library for pulling data
out of HTML and XML files. It works
with your favorite parser to provide
idiomatic ways of navigating,
searching, and modifying the parse
tree. It commonly saves
programmers hours or days of work.
Requests
• Is the de facto standard for making
HTTP requests in Python. It abstracts
the complexities of making requests
behind a beautiful, simple API so that
you can focus on interacting with
services and consuming data in your
application.
Pandas
• Is an open source, BSD-licensed
library providing high-performance,
easy-to-use data structures and data
analysis tools for
the Python programming language.
3
Data Cleaning 4
NumPy
• Is the fundamental package for scientific computing with
Python. It contains among other things:
• a powerful N-dimensional array object
• sophisticated (broadcasting) functions
• tools for integrating C/C++ and Fortran code
• useful linear algebra, Fourier transform, and random
number capabilities
Pandas
• Is an open source, BSD-licensed library providing high-
performance, easy-to-use data structures and data analysis
tools for the Python programming language.
Exploring Data 5
Seaborn
• is a Python data visualization library
based on matplotlib. It provides a
high-level interface for drawing
attractive and informative statistical
graphics.
Matplotlib.pyplot
• is a collection of command style
functions that make matplotlib work
like MATLAB. Each pyplot function
makes some change to a figure: e.g.,
creates a figure, creates a plotting
area in a figure, plots some lines in a
plotting area, decorates the plot with
labels, etc.
Pandas
• Is an open source, BSD-licensed
library providing high-performance,
easy-to-use data structures and data
analysis tools for
the Python programming language.
Building Model 6
SciKit-learn
• Is an open source machine learning library that
that supports supervised and unsupervised
learning. It also provides various tools for
model fitting, data preprocessing, model
selection and evaluation, and many other
utilities.
Statsmodels
• Is a Python module that provides classes and
functions for the estimation of many different
statistical models, as well as for conducting
statistical tests, and statistical data exploration.
An extensive list of result statistics are
available for each estimator.
Visualization 7
Seaborn
• is a Python data
visualization library based
on matplotlib. It provides a
high-level interface for
drawing attractive and
informative statistical
graphics.
Matplotlib.pyplot
• is a collection of command
style functions that make
matplotlib work like
MATLAB.
Each pyplot function
makes some change to a
figure: e.g.,
Plotly
• is a web-based toolkit to
form data visualizations.
Plotly can also be accessed
from a Python Notebook
and has a great API.
Geoplotlib
• Is a toolbox for creating
maps and plotting
geographical data. You
can use it to create a
variety of map-types, like
choropleths, heatmaps,
and dot density maps.

More Related Content

What's hot (20)

PDF
Red hat infrastructure for analytics
Kyle Bader
 
PDF
Data Analysis and Statistics in Python using pandas and statsmodels
Wes McKinney
 
PDF
Apache Arrow: Cross-language Development Platform for In-memory Data
Wes McKinney
 
PDF
The Bitter Lesson of ML Pipelines
Jim Dowling
 
PPTX
Session 03 acquiring data
bodaceacat
 
PDF
PyCon Colombia 2020 Python for Data Analysis: Past, Present, and Future
Wes McKinney
 
PDF
Apache Arrow Workshop at VLDB 2019 / BOSS Session
Wes McKinney
 
PPT
HyperGraphDb
borislav
 
PPTX
END-TO-END MACHINE LEARNING STACK
Jan Wiegelmann
 
PPT
Graph Analytics for big data
Sigmoid
 
PPT
SubSift web services and workflows for profiling and comparing scientists and...
Simon Price
 
PDF
Machine Learning with Spark MLlib
Todd McGrath
 
PPT
Rapid software evolution
borislav
 
PDF
Employing Graph Databases as a Standardization Model towards Addressing Heter...
Dippy Aggarwal
 
KEY
Large Scale Data Analysis Tools
boorad
 
PPTX
Dropbox Talk at Netflix ML Platform Meetup Spe 2019
Faisal Siddiqi
 
PDF
Building an Enterprise Knowledge Graph @Uber: Lessons from Reality
Joshua Shinavier
 
PPTX
Analyzing Data With Python
Sarah Guido
 
PPTX
Анатолий Старостин (ABBYY) "ABBYY InfoExtractor: технология разработки предме...
AINL Conferences
 
PDF
HypergraphDB
Jan Drozen
 
Red hat infrastructure for analytics
Kyle Bader
 
Data Analysis and Statistics in Python using pandas and statsmodels
Wes McKinney
 
Apache Arrow: Cross-language Development Platform for In-memory Data
Wes McKinney
 
The Bitter Lesson of ML Pipelines
Jim Dowling
 
Session 03 acquiring data
bodaceacat
 
PyCon Colombia 2020 Python for Data Analysis: Past, Present, and Future
Wes McKinney
 
Apache Arrow Workshop at VLDB 2019 / BOSS Session
Wes McKinney
 
HyperGraphDb
borislav
 
END-TO-END MACHINE LEARNING STACK
Jan Wiegelmann
 
Graph Analytics for big data
Sigmoid
 
SubSift web services and workflows for profiling and comparing scientists and...
Simon Price
 
Machine Learning with Spark MLlib
Todd McGrath
 
Rapid software evolution
borislav
 
Employing Graph Databases as a Standardization Model towards Addressing Heter...
Dippy Aggarwal
 
Large Scale Data Analysis Tools
boorad
 
Dropbox Talk at Netflix ML Platform Meetup Spe 2019
Faisal Siddiqi
 
Building an Enterprise Knowledge Graph @Uber: Lessons from Reality
Joshua Shinavier
 
Analyzing Data With Python
Sarah Guido
 
Анатолий Старостин (ABBYY) "ABBYY InfoExtractor: технология разработки предме...
AINL Conferences
 
HypergraphDB
Jan Drozen
 

Similar to Python for ML (20)

PDF
Essential Python Libraries Every Developer Should Know - CETPA Infotech
Cetpa Infotech Pvt Ltd
 
PDF
machineleatrfgkjbvcgfhjkl;kjhgfdsfghjdxfcgvhjklk
kapishverma2005
 
PPTX
python data science libray seaborn.pptx
y18771929
 
PPTX
python libray for data analytics seaborn[1].pptx
y18771929
 
PPTX
Data analysis using python in Jupyter notebook.pptx
ssuserc26f8f
 
PPTX
Abhishek Training PPT.pptx
KashishKashish22
 
PDF
5 Best Python Libraries For Data Analysis
Dhyan Chandra Pandey
 
PDF
Python Libraries for Data Science - A Must-Know List.pdf
TCCI Computer Coaching
 
PDF
Python for Data Science: A Comprehensive Guide
priyanka rajput
 
PPTX
Basic of python for data analysis
Pramod Toraskar
 
PDF
Python Científico
Márcio Ramos
 
PDF
Python For Data Analysis Unlocking Insightsguide Brian P
panchhijar4n
 
PDF
Advance Programming Slides lect.pptx.pdf
mohsinfareed780
 
PPTX
overview of python programming language.pptx
dmsidharth
 
PPTX
Data science with python and related concepts
ShivaKoushik2
 
PPTX
1.pptx why python for AI in engineering field
SwapnilAshtekar3
 
PPTX
Top 10 Data analytics tools to look for in 2021
Mobcoder
 
PPTX
Adarsh_Masekar(2GP19CS003).pptx
hkabir55
 
PDF
Using_python_webdevolopment_datascience.pdf
Sudipta Bhattacharya
 
PDF
Python Libraries Unveiled_ Empowering Data Science Explorations - Uncodemy.pdf
Ahana Sharma
 
Essential Python Libraries Every Developer Should Know - CETPA Infotech
Cetpa Infotech Pvt Ltd
 
machineleatrfgkjbvcgfhjkl;kjhgfdsfghjdxfcgvhjklk
kapishverma2005
 
python data science libray seaborn.pptx
y18771929
 
python libray for data analytics seaborn[1].pptx
y18771929
 
Data analysis using python in Jupyter notebook.pptx
ssuserc26f8f
 
Abhishek Training PPT.pptx
KashishKashish22
 
5 Best Python Libraries For Data Analysis
Dhyan Chandra Pandey
 
Python Libraries for Data Science - A Must-Know List.pdf
TCCI Computer Coaching
 
Python for Data Science: A Comprehensive Guide
priyanka rajput
 
Basic of python for data analysis
Pramod Toraskar
 
Python Científico
Márcio Ramos
 
Python For Data Analysis Unlocking Insightsguide Brian P
panchhijar4n
 
Advance Programming Slides lect.pptx.pdf
mohsinfareed780
 
overview of python programming language.pptx
dmsidharth
 
Data science with python and related concepts
ShivaKoushik2
 
1.pptx why python for AI in engineering field
SwapnilAshtekar3
 
Top 10 Data analytics tools to look for in 2021
Mobcoder
 
Adarsh_Masekar(2GP19CS003).pptx
hkabir55
 
Using_python_webdevolopment_datascience.pdf
Sudipta Bhattacharya
 
Python Libraries Unveiled_ Empowering Data Science Explorations - Uncodemy.pdf
Ahana Sharma
 
Ad

Recently uploaded (20)

PPTX
apidays Helsinki & North 2025 - Agentic AI: A Friend or Foe?, Merja Kajava (A...
apidays
 
PDF
apidays Helsinki & North 2025 - API-Powered Journeys: Mobility in an API-Driv...
apidays
 
PPTX
Exploring Multilingual Embeddings for Italian Semantic Search: A Pretrained a...
Sease
 
PPTX
apidays Helsinki & North 2025 - API access control strategies beyond JWT bear...
apidays
 
PDF
Web Scraping with Google Gemini 2.0 .pdf
Tamanna
 
PPTX
apidays Munich 2025 - Building Telco-Aware Apps with Open Gateway APIs, Subhr...
apidays
 
PDF
R Cookbook - Processing and Manipulating Geological spatial data with R.pdf
OtnielSimopiaref2
 
PDF
apidays Helsinki & North 2025 - REST in Peace? Hunting the Dominant Design fo...
apidays
 
PDF
Merits and Demerits of DBMS over File System & 3-Tier Architecture in DBMS
MD RIZWAN MOLLA
 
PPTX
Listify-Intelligent-Voice-to-Catalog-Agent.pptx
nareshkottees
 
PDF
Context Engineering for AI Agents, approaches, memories.pdf
Tamanna
 
PPTX
SlideEgg_501298-Agentic AI.pptx agentic ai
530BYManoj
 
PDF
Simplifying Document Processing with Docling for AI Applications.pdf
Tamanna
 
PPTX
apidays Singapore 2025 - From Data to Insights: Building AI-Powered Data APIs...
apidays
 
PDF
Avatar for apidays apidays PRO June 07, 2025 0 5 apidays Helsinki & North 2...
apidays
 
PDF
Choosing the Right Database for Indexing.pdf
Tamanna
 
PDF
OOPs with Java_unit2.pdf. sarthak bookkk
Sarthak964187
 
PDF
Data Chunking Strategies for RAG in 2025.pdf
Tamanna
 
PDF
Building Production-Ready AI Agents with LangGraph.pdf
Tamanna
 
PDF
The European Business Wallet: Why It Matters and How It Powers the EUDI Ecosy...
Lal Chandran
 
apidays Helsinki & North 2025 - Agentic AI: A Friend or Foe?, Merja Kajava (A...
apidays
 
apidays Helsinki & North 2025 - API-Powered Journeys: Mobility in an API-Driv...
apidays
 
Exploring Multilingual Embeddings for Italian Semantic Search: A Pretrained a...
Sease
 
apidays Helsinki & North 2025 - API access control strategies beyond JWT bear...
apidays
 
Web Scraping with Google Gemini 2.0 .pdf
Tamanna
 
apidays Munich 2025 - Building Telco-Aware Apps with Open Gateway APIs, Subhr...
apidays
 
R Cookbook - Processing and Manipulating Geological spatial data with R.pdf
OtnielSimopiaref2
 
apidays Helsinki & North 2025 - REST in Peace? Hunting the Dominant Design fo...
apidays
 
Merits and Demerits of DBMS over File System & 3-Tier Architecture in DBMS
MD RIZWAN MOLLA
 
Listify-Intelligent-Voice-to-Catalog-Agent.pptx
nareshkottees
 
Context Engineering for AI Agents, approaches, memories.pdf
Tamanna
 
SlideEgg_501298-Agentic AI.pptx agentic ai
530BYManoj
 
Simplifying Document Processing with Docling for AI Applications.pdf
Tamanna
 
apidays Singapore 2025 - From Data to Insights: Building AI-Powered Data APIs...
apidays
 
Avatar for apidays apidays PRO June 07, 2025 0 5 apidays Helsinki & North 2...
apidays
 
Choosing the Right Database for Indexing.pdf
Tamanna
 
OOPs with Java_unit2.pdf. sarthak bookkk
Sarthak964187
 
Data Chunking Strategies for RAG in 2025.pdf
Tamanna
 
Building Production-Ready AI Agents with LangGraph.pdf
Tamanna
 
The European Business Wallet: Why It Matters and How It Powers the EUDI Ecosy...
Lal Chandran
 
Ad

Python for ML

  • 1. Python for ML MOST BASIC LIBRARIES 1
  • 2. Libraries And ML Scope ML Data Gathering Data Cleaning Exploring DataBuilding Model Visualization 2
  • 3. Data Gathering Beautiful Soup • Is a Python library for pulling data out of HTML and XML files. It works with your favorite parser to provide idiomatic ways of navigating, searching, and modifying the parse tree. It commonly saves programmers hours or days of work. Requests • Is the de facto standard for making HTTP requests in Python. It abstracts the complexities of making requests behind a beautiful, simple API so that you can focus on interacting with services and consuming data in your application. Pandas • Is an open source, BSD-licensed library providing high-performance, easy-to-use data structures and data analysis tools for the Python programming language. 3
  • 4. Data Cleaning 4 NumPy • Is the fundamental package for scientific computing with Python. It contains among other things: • a powerful N-dimensional array object • sophisticated (broadcasting) functions • tools for integrating C/C++ and Fortran code • useful linear algebra, Fourier transform, and random number capabilities Pandas • Is an open source, BSD-licensed library providing high- performance, easy-to-use data structures and data analysis tools for the Python programming language.
  • 5. Exploring Data 5 Seaborn • is a Python data visualization library based on matplotlib. It provides a high-level interface for drawing attractive and informative statistical graphics. Matplotlib.pyplot • is a collection of command style functions that make matplotlib work like MATLAB. Each pyplot function makes some change to a figure: e.g., creates a figure, creates a plotting area in a figure, plots some lines in a plotting area, decorates the plot with labels, etc. Pandas • Is an open source, BSD-licensed library providing high-performance, easy-to-use data structures and data analysis tools for the Python programming language.
  • 6. Building Model 6 SciKit-learn • Is an open source machine learning library that that supports supervised and unsupervised learning. It also provides various tools for model fitting, data preprocessing, model selection and evaluation, and many other utilities. Statsmodels • Is a Python module that provides classes and functions for the estimation of many different statistical models, as well as for conducting statistical tests, and statistical data exploration. An extensive list of result statistics are available for each estimator.
  • 7. Visualization 7 Seaborn • is a Python data visualization library based on matplotlib. It provides a high-level interface for drawing attractive and informative statistical graphics. Matplotlib.pyplot • is a collection of command style functions that make matplotlib work like MATLAB. Each pyplot function makes some change to a figure: e.g., Plotly • is a web-based toolkit to form data visualizations. Plotly can also be accessed from a Python Notebook and has a great API. Geoplotlib • Is a toolbox for creating maps and plotting geographical data. You can use it to create a variety of map-types, like choropleths, heatmaps, and dot density maps.