SlideShare a Scribd company logo
3
Most read
6
Most read
8
Most read
How I Turned to the Dark Side. Formats of Data Transfer
What file types are there? The 4/5 most popular are: CSV, TSV XML JSON YAML
CSV / TSV Comma or Tab Separated Values Easy to dump into a spreadsheet Parsable using a library that uses SQL Little space CSV is not very human readable, and if large amounts TSV can get confusing
 
 
YAML YAML Aint Markup Language Very Human Readable Data structures Very useful for config and fixture files Easy for machines to read Whitespace dependent, so can produce very long files. Forced to be a particular structure
 
XML eXtensible Mark up Language If well-formed, can be read by a number of libraries Very common Whitespace independent Layout of data very much up to the individual - but needs documenting!
 
 
 
JSON JavaScript Object Notation Forced to be a particular structure Potentially Dangerous in JavaScript if just evalled Little memory space Data structure is obvious to a human reader if spaced out, although whitespace independent
 
 
What’s the best? Pros CSV/TSV good for sending to Spreadsheets and Databases YAML is great when it needs to be human modifiable, such as fixture data/config files XML is very versatile in how to markup data JSON is very compact and easily parsed into objects
What’s the best? Cons CSV/TSV can be difficult to work with in apps, as no variable names necessarily associated. YAML can be very long files, and needs to adhere to the whitespacing
What’s the best? Cons XML can be confusing if not well documented, and can be longwinded to obtain the information JSON can be less human readable if you are aiming for reducing bandwith by stripping whitespace
What is my Choice? Depends on the application, but: I want the data to be both Human and Machine readable I want the format to be well defined I want it to be convenient to parse I want it to be supported long term
What is my choice? Was XML, Fast becoming JSON Easy to parse Follows rigid structure If laid out it can be Easily eyeballed for the data Easily hand-modified

More Related Content

What's hot (20)

PPTX
Visual Data Mining
Doctoral Student, NCU
 
PDF
Platform as a Service (PaaS) - A cloud service for Developers
Ravindra Dastikop
 
PPTX
Service level agreement in cloud computing an overview
Dr Neelesh Jain
 
PPTX
Zero-day Vulnerabilities
alihassaah1994
 
PPTX
3C Java Graphics Drawing
Java Programming
 
PPTX
Melissa Virus
Minh Nguyen
 
DOCX
Avamar Run Book - 5-14-2015_v3
Bill Oliver
 
PPT
Ecg analysis in the cloud
gaurav jain
 
PPTX
Bastion Host : Amazon Web Services
Akhilesh Joshi
 
PDF
Cloud computing
Parth Maisheri
 
PPTX
Service Oriented Architecture
Luqman Shareef
 
PPT
Network Trends
Arun ACE
 
PPTX
Computer networks and it's applications
varun arora
 
PPT
Benefits of Cloud Computing
KNOWARTH - Software Development Company
 
PPTX
Zimbra mail server
Md. Hasnat Shoheb
 
PPTX
Image captioning
Muhammad Zbeedat
 
PDF
Wireless Personal Area Networks (WPAN): Lowrate amd High Rate
Don Norwood
 
PPTX
Client server model
Gd Goenka University
 
PPTX
introduction to NOSQL Database
nehabsairam
 
PPTX
Cloud computing
albert1234321
 
Visual Data Mining
Doctoral Student, NCU
 
Platform as a Service (PaaS) - A cloud service for Developers
Ravindra Dastikop
 
Service level agreement in cloud computing an overview
Dr Neelesh Jain
 
Zero-day Vulnerabilities
alihassaah1994
 
3C Java Graphics Drawing
Java Programming
 
Melissa Virus
Minh Nguyen
 
Avamar Run Book - 5-14-2015_v3
Bill Oliver
 
Ecg analysis in the cloud
gaurav jain
 
Bastion Host : Amazon Web Services
Akhilesh Joshi
 
Cloud computing
Parth Maisheri
 
Service Oriented Architecture
Luqman Shareef
 
Network Trends
Arun ACE
 
Computer networks and it's applications
varun arora
 
Benefits of Cloud Computing
KNOWARTH - Software Development Company
 
Zimbra mail server
Md. Hasnat Shoheb
 
Image captioning
Muhammad Zbeedat
 
Wireless Personal Area Networks (WPAN): Lowrate amd High Rate
Don Norwood
 
Client server model
Gd Goenka University
 
introduction to NOSQL Database
nehabsairam
 
Cloud computing
albert1234321
 

Viewers also liked (8)

ODP
Pluggable Pipelines
setitesuk
 
ODP
Agile analysis development
setitesuk
 
PPT
Test Presentation
setitesuk
 
PPT
20120301 prezentacja slc niezbędnik właściciela sklepu internetowego
marcinblaszyk
 
ODP
pipeline_structure_overview
setitesuk
 
ODP
Pomodoro lightning talk
setitesuk
 
PDF
Hype vs. Reality: The AI Explainer
Luminary Labs
 
PDF
Study: The Future of VR, AR and Self-Driving Cars
LinkedIn
 
Pluggable Pipelines
setitesuk
 
Agile analysis development
setitesuk
 
Test Presentation
setitesuk
 
20120301 prezentacja slc niezbędnik właściciela sklepu internetowego
marcinblaszyk
 
pipeline_structure_overview
setitesuk
 
Pomodoro lightning talk
setitesuk
 
Hype vs. Reality: The AI Explainer
Luminary Labs
 
Study: The Future of VR, AR and Self-Driving Cars
LinkedIn
 
Ad

Similar to Data Formats (20)

PPS
eXtensible Markup Language
Aditya Raj
 
PPTX
xml and xhtml.pptx
ssusere16bd9
 
PDF
Presentation
IrinaDovgyalo
 
PPT
Xml iet 2015
kiransurariya
 
PPT
XML Technologies
juancpinzone
 
PPT
XML Technologies
juancpinzone
 
ODP
Web based application of Live Scoreboard using XML.
Uttam Kumar
 
PDF
Markup For Dummies (Russ Ward)
STC-Philadelphia Metro Chapter
 
PDF
Schaum s Outline of XML 1st Edition Ed Tittel
lineleporcs
 
PDF
Introduction to xml
soumya
 
PPT
Xml
Vanndy Sun
 
PDF
Download full ebook of Schaums Outline Of Xml 1st Ed Tittel instant download pdf
estefaannike
 
PDF
Dipping Your Toes: Azure Data Lake for DBAs
Bob Pusateri
 
PPTX
Making the semantic web work
Paul Houle
 
PDF
Hadoop compression strata conference
nkabra
 
PPTX
XML | Computer Science
Transweb Global Inc
 
PPT
01 Xml Begin
Dennis Pipper
 
PPTX
CSV vs Xisx compare for computer engineering
jhuth981
 
eXtensible Markup Language
Aditya Raj
 
xml and xhtml.pptx
ssusere16bd9
 
Presentation
IrinaDovgyalo
 
Xml iet 2015
kiransurariya
 
XML Technologies
juancpinzone
 
XML Technologies
juancpinzone
 
Web based application of Live Scoreboard using XML.
Uttam Kumar
 
Markup For Dummies (Russ Ward)
STC-Philadelphia Metro Chapter
 
Schaum s Outline of XML 1st Edition Ed Tittel
lineleporcs
 
Introduction to xml
soumya
 
Download full ebook of Schaums Outline Of Xml 1st Ed Tittel instant download pdf
estefaannike
 
Dipping Your Toes: Azure Data Lake for DBAs
Bob Pusateri
 
Making the semantic web work
Paul Houle
 
Hadoop compression strata conference
nkabra
 
XML | Computer Science
Transweb Global Inc
 
01 Xml Begin
Dennis Pipper
 
CSV vs Xisx compare for computer engineering
jhuth981
 
Ad

Recently uploaded (20)

PPTX
Extensions Framework (XaaS) - Enabling Orchestrate Anything
ShapeBlue
 
PPTX
Machine Learning Benefits Across Industries
SynapseIndia
 
PPTX
python advanced data structure dictionary with examples python advanced data ...
sprasanna11
 
PPTX
Building and Operating a Private Cloud with CloudStack and LINBIT CloudStack ...
ShapeBlue
 
PDF
Julia Furst Morgado The Lazy Guide to Kubernetes with EKS Auto Mode + Karpenter
AWS Chicago
 
PDF
GITLAB-CICD_For_Professionals_KodeKloud.pdf
deepaktyagi0048
 
PDF
Apache CloudStack 201: Let's Design & Build an IaaS Cloud
ShapeBlue
 
PPTX
UI5Con 2025 - Beyond UI5 Controls with the Rise of Web Components
Wouter Lemaire
 
PDF
2025-07-15 EMEA Volledig Inzicht Dutch Webinar
ThousandEyes
 
PDF
Shuen Mei Parth Sharma Boost Productivity, Innovation and Efficiency wit...
AWS Chicago
 
PDF
Empowering Cloud Providers with Apache CloudStack and Stackbill
ShapeBlue
 
PPTX
Earn Agentblazer Status with Slack Community Patna.pptx
SanjeetMishra29
 
PDF
Market Insight : ETH Dominance Returns
CIFDAQ
 
PPTX
Simplifying End-to-End Apache CloudStack Deployment with a Web-Based Automati...
ShapeBlue
 
PPTX
TYPES OF COMMUNICATION Presentation of ICT
JulieBinwag
 
PDF
Novus-Safe Pro: Brochure-What is Novus Safe Pro?.pdf
Novus Hi-Tech
 
PDF
Meetup Kickoff & Welcome - Rohit Yadav, CSIUG Chairman
ShapeBlue
 
PDF
Ampere Offers Energy-Efficient Future For AI And Cloud
ShapeBlue
 
PDF
TrustArc Webinar - Data Privacy Trends 2025: Mid-Year Insights & Program Stra...
TrustArc
 
PDF
The Past, Present & Future of Kenya's Digital Transformation
Moses Kemibaro
 
Extensions Framework (XaaS) - Enabling Orchestrate Anything
ShapeBlue
 
Machine Learning Benefits Across Industries
SynapseIndia
 
python advanced data structure dictionary with examples python advanced data ...
sprasanna11
 
Building and Operating a Private Cloud with CloudStack and LINBIT CloudStack ...
ShapeBlue
 
Julia Furst Morgado The Lazy Guide to Kubernetes with EKS Auto Mode + Karpenter
AWS Chicago
 
GITLAB-CICD_For_Professionals_KodeKloud.pdf
deepaktyagi0048
 
Apache CloudStack 201: Let's Design & Build an IaaS Cloud
ShapeBlue
 
UI5Con 2025 - Beyond UI5 Controls with the Rise of Web Components
Wouter Lemaire
 
2025-07-15 EMEA Volledig Inzicht Dutch Webinar
ThousandEyes
 
Shuen Mei Parth Sharma Boost Productivity, Innovation and Efficiency wit...
AWS Chicago
 
Empowering Cloud Providers with Apache CloudStack and Stackbill
ShapeBlue
 
Earn Agentblazer Status with Slack Community Patna.pptx
SanjeetMishra29
 
Market Insight : ETH Dominance Returns
CIFDAQ
 
Simplifying End-to-End Apache CloudStack Deployment with a Web-Based Automati...
ShapeBlue
 
TYPES OF COMMUNICATION Presentation of ICT
JulieBinwag
 
Novus-Safe Pro: Brochure-What is Novus Safe Pro?.pdf
Novus Hi-Tech
 
Meetup Kickoff & Welcome - Rohit Yadav, CSIUG Chairman
ShapeBlue
 
Ampere Offers Energy-Efficient Future For AI And Cloud
ShapeBlue
 
TrustArc Webinar - Data Privacy Trends 2025: Mid-Year Insights & Program Stra...
TrustArc
 
The Past, Present & Future of Kenya's Digital Transformation
Moses Kemibaro
 

Data Formats

  • 1. How I Turned to the Dark Side. Formats of Data Transfer
  • 2. What file types are there? The 4/5 most popular are: CSV, TSV XML JSON YAML
  • 3. CSV / TSV Comma or Tab Separated Values Easy to dump into a spreadsheet Parsable using a library that uses SQL Little space CSV is not very human readable, and if large amounts TSV can get confusing
  • 4.  
  • 5.  
  • 6. YAML YAML Aint Markup Language Very Human Readable Data structures Very useful for config and fixture files Easy for machines to read Whitespace dependent, so can produce very long files. Forced to be a particular structure
  • 7.  
  • 8. XML eXtensible Mark up Language If well-formed, can be read by a number of libraries Very common Whitespace independent Layout of data very much up to the individual - but needs documenting!
  • 9.  
  • 10.  
  • 11.  
  • 12. JSON JavaScript Object Notation Forced to be a particular structure Potentially Dangerous in JavaScript if just evalled Little memory space Data structure is obvious to a human reader if spaced out, although whitespace independent
  • 13.  
  • 14.  
  • 15. What’s the best? Pros CSV/TSV good for sending to Spreadsheets and Databases YAML is great when it needs to be human modifiable, such as fixture data/config files XML is very versatile in how to markup data JSON is very compact and easily parsed into objects
  • 16. What’s the best? Cons CSV/TSV can be difficult to work with in apps, as no variable names necessarily associated. YAML can be very long files, and needs to adhere to the whitespacing
  • 17. What’s the best? Cons XML can be confusing if not well documented, and can be longwinded to obtain the information JSON can be less human readable if you are aiming for reducing bandwith by stripping whitespace
  • 18. What is my Choice? Depends on the application, but: I want the data to be both Human and Machine readable I want the format to be well defined I want it to be convenient to parse I want it to be supported long term
  • 19. What is my choice? Was XML, Fast becoming JSON Easy to parse Follows rigid structure If laid out it can be Easily eyeballed for the data Easily hand-modified