Cleap up your data  (Techniques and tips)
What is Data Cleaning ?
The process of amending or removing data in a
Data Set/ Data Base that is incorrect, incomplete,
improperly formatted, or duplicated.
Before Data Cleaning :
1. Have a Backup to play freely with your Dataset.
2. Understand your Dataset.
3. Specify your Problems.
4. Vision before tools.
5. Set your Priorities.
Main Problems :
1. Duplicates.
- Places.
- Removing.
Main Problems :
2. Missing Data :
- Number.
- Places.
- Filling : Choose the right Decision First.
Main Problems :
3. Spelling Errors :
How to check and fix them ?
Main Problems :
4. Text to Columns :
- How to divide one cell into two ?
Main Problems :
5. Columns to Text :
How to combine two cells in one ?
Main Problems :
6. Part of the Cell to Columns :
- Mid Function.
- Flash Fill.
Main Problems :
7. Replacing Shortcuts / Unification of Variables :
- Why ?
- How ?
Main Problems :
8. Lower / Upper / Proper Case :
- Lower Function.
- Upper Function.
- Proper Function.
- Flash Fill.
Main Problems :
9. Some Data In the Wrong Place.
Main Problems :
10. Day / Month / Year :
- Day Function.
- Month Function.
- Year Function.
- Text Function.
Main Problems :
11. Invisible Spaces :
- Why ?
- How ?
Main Problems :
12. Format of the Cell :
- Is Number Function.
- Is Text Function.
Main Problems :
13. Unwanted Punctuation or Duplicated Parts of
Text :
- Right Function.
- Left Function.
- Replace Function.
- Substitute Function.
- Multi Substitute Function.
My Contact :
 Mobile : 01017008287
01155523721
 Mail : ahmed.essam.mohamed1@gmail.com
 Linkedin : www.linkedin.com/in/ahmed-essam-mohamed/
Cleap up your data  (Techniques and tips)

More Related Content

PPT
Entourage Repair
PPT
Mac Mail Recovery - Get Back Lost Emails in Macintosh
ODP
Creating relationships with tables
ODP
ppt on open office.org
PPTX
Relationships within the relational database
PPTX
File maker for yap
PPT
Unit 1.3 Introduction to Programming (Part 2)
PPTX
Spss vs excel
Entourage Repair
Mac Mail Recovery - Get Back Lost Emails in Macintosh
Creating relationships with tables
ppt on open office.org
Relationships within the relational database
File maker for yap
Unit 1.3 Introduction to Programming (Part 2)
Spss vs excel

Similar to Cleap up your data (Techniques and tips) (20)

PDF
The Magic of Excel – Fromatting Like a Pro
PDF
Quicktip excel
PPTX
Etl - Extract Transform Load
PPTX
Lecture 4-Prepare data-Clean, transform, and load data in Power BI.pptx
PPTX
Excel and Pivot Tables.pptx
PPTX
Exchange Database: Data loss and Recovery Methods
PPTX
chapter5_Q&A-.pptx for childers of clas 8
PPTX
February 2017 HUG: Slow, Stuck, or Runaway Apps? Learn How to Quickly Fix Pro...
PDF
Tao-of-Excel.pdf
PPTX
OF92-kTAEeiVcg6irwbSYA_3869af6044c011e8af65c7571aacc328_Week-1-Tao-of-Excel-r...
PDF
Sage Intelligence 40 Microsoft Excel Tips and Tricks
PPT
Dwh lecture-07-denormalization
PDF
MS ACCESS Tutorials
PPT
Ms access tutorial
PPTX
Phpmyadmin administer mysql
PPTX
machine learning for BCA students .pptx
PPT
Intro to Data warehousing Lecture 04
PPTX
17.INTRODUCTION TO SCHEMA REFINEMENT.pptx
PDF
Making Machine Learning Work in Practice - StampedeCon 2014
The Magic of Excel – Fromatting Like a Pro
Quicktip excel
Etl - Extract Transform Load
Lecture 4-Prepare data-Clean, transform, and load data in Power BI.pptx
Excel and Pivot Tables.pptx
Exchange Database: Data loss and Recovery Methods
chapter5_Q&A-.pptx for childers of clas 8
February 2017 HUG: Slow, Stuck, or Runaway Apps? Learn How to Quickly Fix Pro...
Tao-of-Excel.pdf
OF92-kTAEeiVcg6irwbSYA_3869af6044c011e8af65c7571aacc328_Week-1-Tao-of-Excel-r...
Sage Intelligence 40 Microsoft Excel Tips and Tricks
Dwh lecture-07-denormalization
MS ACCESS Tutorials
Ms access tutorial
Phpmyadmin administer mysql
machine learning for BCA students .pptx
Intro to Data warehousing Lecture 04
17.INTRODUCTION TO SCHEMA REFINEMENT.pptx
Making Machine Learning Work in Practice - StampedeCon 2014
Ad

More from Ahmed Essam (6)

PPTX
Introduction to data analysis using excel
PPTX
Data cleaning using Excel
PPTX
Overview on excel 2013 - 6th April 2017
DOCX
فرصة للحصول على منحة بمدرسة استونيا للدبلوماسية
DOCX
الجهات المهتمة بالتعليم - أسبوع العمل العالمى للتعليم
DOCX
جهات مهتمة بتمويل مشروعات مواجهة العنف القائم على النوع الإجتماعى
Introduction to data analysis using excel
Data cleaning using Excel
Overview on excel 2013 - 6th April 2017
فرصة للحصول على منحة بمدرسة استونيا للدبلوماسية
الجهات المهتمة بالتعليم - أسبوع العمل العالمى للتعليم
جهات مهتمة بتمويل مشروعات مواجهة العنف القائم على النوع الإجتماعى
Ad

Recently uploaded (20)

PPTX
1 hour to get there before the game is done so you don’t need a car seat for ...
PPTX
Phase1_final PPTuwhefoegfohwfoiehfoegg.pptx
PDF
Navigating the Thai Supplements Landscape.pdf
DOCX
Factor Analysis Word Document Presentation
PDF
A biomechanical Functional analysis of the masitary muscles in man
PDF
An essential collection of rules designed to help businesses manage and reduc...
PPT
PROJECT CYCLE MANAGEMENT FRAMEWORK (PCM).ppt
PPT
statistic analysis for study - data collection
PPTX
recommendation Project PPT with details attached
PPTX
Lesson-01intheselfoflifeofthekennyrogersoftheunderstandoftheunderstanded
PDF
CS3352FOUNDATION OF DATA SCIENCE _1_MAterial.pdf
PDF
Tetra Pak Index 2023 - The future of health and nutrition - Full report.pdf
PPTX
MBA JAPAN: 2025 the University of Waseda
PPTX
Tapan_20220802057_Researchinternship_final_stage.pptx
PPTX
retention in jsjsksksksnbsndjddjdnFPD.pptx
PPTX
CYBER SECURITY the Next Warefare Tactics
PPTX
Machine Learning and working of machine Learning
PDF
Global Data and Analytics Market Outlook Report
PPTX
Business_Capability_Map_Collection__pptx
PDF
©️ 02_SKU Automatic SW Robotics for Microsoft PC.pdf
1 hour to get there before the game is done so you don’t need a car seat for ...
Phase1_final PPTuwhefoegfohwfoiehfoegg.pptx
Navigating the Thai Supplements Landscape.pdf
Factor Analysis Word Document Presentation
A biomechanical Functional analysis of the masitary muscles in man
An essential collection of rules designed to help businesses manage and reduc...
PROJECT CYCLE MANAGEMENT FRAMEWORK (PCM).ppt
statistic analysis for study - data collection
recommendation Project PPT with details attached
Lesson-01intheselfoflifeofthekennyrogersoftheunderstandoftheunderstanded
CS3352FOUNDATION OF DATA SCIENCE _1_MAterial.pdf
Tetra Pak Index 2023 - The future of health and nutrition - Full report.pdf
MBA JAPAN: 2025 the University of Waseda
Tapan_20220802057_Researchinternship_final_stage.pptx
retention in jsjsksksksnbsndjddjdnFPD.pptx
CYBER SECURITY the Next Warefare Tactics
Machine Learning and working of machine Learning
Global Data and Analytics Market Outlook Report
Business_Capability_Map_Collection__pptx
©️ 02_SKU Automatic SW Robotics for Microsoft PC.pdf

Cleap up your data (Techniques and tips)

  • 2. What is Data Cleaning ? The process of amending or removing data in a Data Set/ Data Base that is incorrect, incomplete, improperly formatted, or duplicated.
  • 3. Before Data Cleaning : 1. Have a Backup to play freely with your Dataset. 2. Understand your Dataset. 3. Specify your Problems. 4. Vision before tools. 5. Set your Priorities.
  • 4. Main Problems : 1. Duplicates. - Places. - Removing.
  • 5. Main Problems : 2. Missing Data : - Number. - Places. - Filling : Choose the right Decision First.
  • 6. Main Problems : 3. Spelling Errors : How to check and fix them ?
  • 7. Main Problems : 4. Text to Columns : - How to divide one cell into two ?
  • 8. Main Problems : 5. Columns to Text : How to combine two cells in one ?
  • 9. Main Problems : 6. Part of the Cell to Columns : - Mid Function. - Flash Fill.
  • 10. Main Problems : 7. Replacing Shortcuts / Unification of Variables : - Why ? - How ?
  • 11. Main Problems : 8. Lower / Upper / Proper Case : - Lower Function. - Upper Function. - Proper Function. - Flash Fill.
  • 12. Main Problems : 9. Some Data In the Wrong Place.
  • 13. Main Problems : 10. Day / Month / Year : - Day Function. - Month Function. - Year Function. - Text Function.
  • 14. Main Problems : 11. Invisible Spaces : - Why ? - How ?
  • 15. Main Problems : 12. Format of the Cell : - Is Number Function. - Is Text Function.
  • 16. Main Problems : 13. Unwanted Punctuation or Duplicated Parts of Text : - Right Function. - Left Function. - Replace Function. - Substitute Function. - Multi Substitute Function.
  • 17. My Contact :  Mobile : 01017008287 01155523721  Mail : [email protected]  Linkedin : www.linkedin.com/in/ahmed-essam-mohamed/