SlideShare a Scribd company logo
Cleap up your data  (Techniques and tips)
What is Data Cleaning ?
The process of amending or removing data in a
Data Set/ Data Base that is incorrect, incomplete,
improperly formatted, or duplicated.
Before Data Cleaning :
1. Have a Backup to play freely with your Dataset.
2. Understand your Dataset.
3. Specify your Problems.
4. Vision before tools.
5. Set your Priorities.
Main Problems :
1. Duplicates.
- Places.
- Removing.
Main Problems :
2. Missing Data :
- Number.
- Places.
- Filling : Choose the right Decision First.
Main Problems :
3. Spelling Errors :
How to check and fix them ?
Main Problems :
4. Text to Columns :
- How to divide one cell into two ?
Main Problems :
5. Columns to Text :
How to combine two cells in one ?
Main Problems :
6. Part of the Cell to Columns :
- Mid Function.
- Flash Fill.
Main Problems :
7. Replacing Shortcuts / Unification of Variables :
- Why ?
- How ?
Main Problems :
8. Lower / Upper / Proper Case :
- Lower Function.
- Upper Function.
- Proper Function.
- Flash Fill.
Main Problems :
9. Some Data In the Wrong Place.
Main Problems :
10. Day / Month / Year :
- Day Function.
- Month Function.
- Year Function.
- Text Function.
Main Problems :
11. Invisible Spaces :
- Why ?
- How ?
Main Problems :
12. Format of the Cell :
- Is Number Function.
- Is Text Function.
Main Problems :
13. Unwanted Punctuation or Duplicated Parts of
Text :
- Right Function.
- Left Function.
- Replace Function.
- Substitute Function.
- Multi Substitute Function.
My Contact :
 Mobile : 01017008287
01155523721
 Mail : ahmed.essam.mohamed1@gmail.com
 Linkedin : www.linkedin.com/in/ahmed-essam-mohamed/
Cleap up your data  (Techniques and tips)

More Related Content

PPT
Entourage Repair
smith bush
 
PPT
Mac Mail Recovery - Get Back Lost Emails in Macintosh
Henrylapor
 
ODP
Creating relationships with tables
Jhen Articona
 
ODP
ppt on open office.org
Deepansh Goel
 
PPTX
Relationships within the relational database
Janecatalla
 
PPTX
File maker for yap
ericwilliammarshall
 
PPT
Unit 1.3 Introduction to Programming (Part 2)
Intan Jameel
 
PPTX
Spss vs excel
calltutors
 
Entourage Repair
smith bush
 
Mac Mail Recovery - Get Back Lost Emails in Macintosh
Henrylapor
 
Creating relationships with tables
Jhen Articona
 
ppt on open office.org
Deepansh Goel
 
Relationships within the relational database
Janecatalla
 
File maker for yap
ericwilliammarshall
 
Unit 1.3 Introduction to Programming (Part 2)
Intan Jameel
 
Spss vs excel
calltutors
 

Similar to Cleap up your data (Techniques and tips) (20)

PDF
The Magic of Excel – Fromatting Like a Pro
Alliance To Save Energy
 
PDF
Quicktip excel
Sachin Singh
 
PPTX
Etl - Extract Transform Load
ABDUL KHALIQ
 
PPTX
Lecture 4-Prepare data-Clean, transform, and load data in Power BI.pptx
edieali1
 
PPTX
Excel and Pivot Tables.pptx
aryanthakur424401
 
PPTX
Exchange Database: Data loss and Recovery Methods
Ben Tyson
 
PPTX
chapter5_Q&A-.pptx for childers of clas 8
SantanuJana46
 
PPTX
February 2017 HUG: Slow, Stuck, or Runaway Apps? Learn How to Quickly Fix Pro...
Yahoo Developer Network
 
PDF
Tao-of-Excel.pdf
ChandraSarkar13
 
PPTX
OF92-kTAEeiVcg6irwbSYA_3869af6044c011e8af65c7571aacc328_Week-1-Tao-of-Excel-r...
Umar Karimi
 
PDF
Sage Intelligence 40 Microsoft Excel Tips and Tricks
BurCom Consulting Ltd.
 
PPT
Dwh lecture-07-denormalization
Sulman Ahmed
 
PDF
MS ACCESS Tutorials
Duj Law
 
PPT
Ms access tutorial
minga48
 
PPTX
Phpmyadmin administer mysql
Mohd yasin Karim
 
PPTX
machine learning for BCA students .pptx
thrishathanushree230
 
PPT
Intro to Data warehousing Lecture 04
AnwarrChaudary
 
PPTX
17.INTRODUCTION TO SCHEMA REFINEMENT.pptx
AshokRachapalli1
 
PDF
Making Machine Learning Work in Practice - StampedeCon 2014
StampedeCon
 
The Magic of Excel – Fromatting Like a Pro
Alliance To Save Energy
 
Quicktip excel
Sachin Singh
 
Etl - Extract Transform Load
ABDUL KHALIQ
 
Lecture 4-Prepare data-Clean, transform, and load data in Power BI.pptx
edieali1
 
Excel and Pivot Tables.pptx
aryanthakur424401
 
Exchange Database: Data loss and Recovery Methods
Ben Tyson
 
chapter5_Q&A-.pptx for childers of clas 8
SantanuJana46
 
February 2017 HUG: Slow, Stuck, or Runaway Apps? Learn How to Quickly Fix Pro...
Yahoo Developer Network
 
Tao-of-Excel.pdf
ChandraSarkar13
 
OF92-kTAEeiVcg6irwbSYA_3869af6044c011e8af65c7571aacc328_Week-1-Tao-of-Excel-r...
Umar Karimi
 
Sage Intelligence 40 Microsoft Excel Tips and Tricks
BurCom Consulting Ltd.
 
Dwh lecture-07-denormalization
Sulman Ahmed
 
MS ACCESS Tutorials
Duj Law
 
Ms access tutorial
minga48
 
Phpmyadmin administer mysql
Mohd yasin Karim
 
machine learning for BCA students .pptx
thrishathanushree230
 
Intro to Data warehousing Lecture 04
AnwarrChaudary
 
17.INTRODUCTION TO SCHEMA REFINEMENT.pptx
AshokRachapalli1
 
Making Machine Learning Work in Practice - StampedeCon 2014
StampedeCon
 
Ad

More from Ahmed Essam (6)

PPTX
Introduction to data analysis using excel
Ahmed Essam
 
PPTX
Data cleaning using Excel
Ahmed Essam
 
PPTX
Overview on excel 2013 - 6th April 2017
Ahmed Essam
 
DOCX
فرصة للحصول على منحة بمدرسة استونيا للدبلوماسية
Ahmed Essam
 
DOCX
الجهات المهتمة بالتعليم - أسبوع العمل العالمى للتعليم
Ahmed Essam
 
DOCX
جهات مهتمة بتمويل مشروعات مواجهة العنف القائم على النوع الإجتماعى
Ahmed Essam
 
Introduction to data analysis using excel
Ahmed Essam
 
Data cleaning using Excel
Ahmed Essam
 
Overview on excel 2013 - 6th April 2017
Ahmed Essam
 
فرصة للحصول على منحة بمدرسة استونيا للدبلوماسية
Ahmed Essam
 
الجهات المهتمة بالتعليم - أسبوع العمل العالمى للتعليم
Ahmed Essam
 
جهات مهتمة بتمويل مشروعات مواجهة العنف القائم على النوع الإجتماعى
Ahmed Essam
 
Ad

Recently uploaded (20)

PPTX
INFO8116 - Week 10 - Slides.pptx data analutics
guddipatel10
 
PPTX
Web dev -ppt that helps us understand web technology
shubhragoyal12
 
PDF
202501214233242351219 QASS Session 2.pdf
lauramejiamillan
 
PPTX
Fuzzy_Membership_Functions_Presentation.pptx
pythoncrazy2024
 
PPTX
Blue and Dark Blue Modern Technology Presentation.pptx
ap177979
 
PPTX
Probability systematic sampling methods.pptx
PrakashRajput19
 
PDF
717629748-Databricks-Certified-Data-Engineer-Professional-Dumps-by-Ball-21-03...
pedelli41
 
PPTX
World-population.pptx fire bunberbpeople
umutunsalnsl4402
 
PDF
D9110.pdfdsfvsdfvsdfvsdfvfvfsvfsvffsdfvsdfvsd
minhn6673
 
PDF
Blitz Campinas - Dia 24 de maio - Piettro.pdf
fabigreek
 
PPTX
The whitetiger novel review for collegeassignment.pptx
DhruvPatel754154
 
PPTX
Pipeline Automatic Leak Detection for Water Distribution Systems
Sione Palu
 
PPTX
Presentation on animal welfare a good topic
kidscream385
 
PPTX
Presentation (1) (1).pptx k8hhfftuiiigff
karthikjagath2005
 
PPTX
Data Security Breach: Immediate Action Plan
varmabhuvan266
 
PPTX
Introduction to computer chapter one 2017.pptx
mensunmarley
 
PDF
An Uncut Conversation With Grok | PDF Document
Mike Hydes
 
PDF
Fundamentals and Techniques of Biophysics and Molecular Biology (Pranav Kumar...
RohitKumar868624
 
PDF
The_Future_of_Data_Analytics_by_CA_Suvidha_Chaplot_UPDATED.pdf
CA Suvidha Chaplot
 
PPTX
IP_Journal_Articles_2025IP_Journal_Articles_2025
mishell212144
 
INFO8116 - Week 10 - Slides.pptx data analutics
guddipatel10
 
Web dev -ppt that helps us understand web technology
shubhragoyal12
 
202501214233242351219 QASS Session 2.pdf
lauramejiamillan
 
Fuzzy_Membership_Functions_Presentation.pptx
pythoncrazy2024
 
Blue and Dark Blue Modern Technology Presentation.pptx
ap177979
 
Probability systematic sampling methods.pptx
PrakashRajput19
 
717629748-Databricks-Certified-Data-Engineer-Professional-Dumps-by-Ball-21-03...
pedelli41
 
World-population.pptx fire bunberbpeople
umutunsalnsl4402
 
D9110.pdfdsfvsdfvsdfvsdfvfvfsvfsvffsdfvsdfvsd
minhn6673
 
Blitz Campinas - Dia 24 de maio - Piettro.pdf
fabigreek
 
The whitetiger novel review for collegeassignment.pptx
DhruvPatel754154
 
Pipeline Automatic Leak Detection for Water Distribution Systems
Sione Palu
 
Presentation on animal welfare a good topic
kidscream385
 
Presentation (1) (1).pptx k8hhfftuiiigff
karthikjagath2005
 
Data Security Breach: Immediate Action Plan
varmabhuvan266
 
Introduction to computer chapter one 2017.pptx
mensunmarley
 
An Uncut Conversation With Grok | PDF Document
Mike Hydes
 
Fundamentals and Techniques of Biophysics and Molecular Biology (Pranav Kumar...
RohitKumar868624
 
The_Future_of_Data_Analytics_by_CA_Suvidha_Chaplot_UPDATED.pdf
CA Suvidha Chaplot
 
IP_Journal_Articles_2025IP_Journal_Articles_2025
mishell212144
 

Cleap up your data (Techniques and tips)

  • 2. What is Data Cleaning ? The process of amending or removing data in a Data Set/ Data Base that is incorrect, incomplete, improperly formatted, or duplicated.
  • 3. Before Data Cleaning : 1. Have a Backup to play freely with your Dataset. 2. Understand your Dataset. 3. Specify your Problems. 4. Vision before tools. 5. Set your Priorities.
  • 4. Main Problems : 1. Duplicates. - Places. - Removing.
  • 5. Main Problems : 2. Missing Data : - Number. - Places. - Filling : Choose the right Decision First.
  • 6. Main Problems : 3. Spelling Errors : How to check and fix them ?
  • 7. Main Problems : 4. Text to Columns : - How to divide one cell into two ?
  • 8. Main Problems : 5. Columns to Text : How to combine two cells in one ?
  • 9. Main Problems : 6. Part of the Cell to Columns : - Mid Function. - Flash Fill.
  • 10. Main Problems : 7. Replacing Shortcuts / Unification of Variables : - Why ? - How ?
  • 11. Main Problems : 8. Lower / Upper / Proper Case : - Lower Function. - Upper Function. - Proper Function. - Flash Fill.
  • 12. Main Problems : 9. Some Data In the Wrong Place.
  • 13. Main Problems : 10. Day / Month / Year : - Day Function. - Month Function. - Year Function. - Text Function.
  • 14. Main Problems : 11. Invisible Spaces : - Why ? - How ?
  • 15. Main Problems : 12. Format of the Cell : - Is Number Function. - Is Text Function.
  • 16. Main Problems : 13. Unwanted Punctuation or Duplicated Parts of Text : - Right Function. - Left Function. - Replace Function. - Substitute Function. - Multi Substitute Function.
  • 17. My Contact :  Mobile : 01017008287 01155523721  Mail : [email protected]  Linkedin : www.linkedin.com/in/ahmed-essam-mohamed/