SlideShare a Scribd company logo
6
Most read
7
Most read
14
Most read
3 pillars of big data :
Structured Data
Semi Structured Data
Unstructured Data
By Prowebscraper
What is big data ?
- Big data stands for large volumes of data.
- It could be both structured and unstructured.
- It’s not about the amount of data but what
businesses do with the given data that counts.
- Businesses leverage on big data for insights that
can propel their growth.
Example of Big Data
Following are some of the examples of 'Big Data' :
- Facebook users send roughly 31.25 million
messages and watch 2.77 million videos, every
minute!!
- Walmart customers’ transactions provide the
company with about 2.5 petabytes of data, every
hour.
Example of Big Data
- YouTube usage jumped three times from 2014-
2016 with users uploading 400 hours of new
video each minute of every day! Now, in 2017,
users are watching 4,146,600 videos every
minute.
- Instagram users upload 46,740 million posts
every minute!
- 5.2 BILLION daily Google Searches in 2017.
As you can see, there are 3 pillars of Big Data :
1. Structured data
2. Unstructured data
3. Semi structured data
1. Structured data
- Structured Data refer to the data which is already
stored in databases, in an ordered manner.
- It accounts for about 20% of the total existing data,
and is used the most in programming and computer-
related activities.
Example of Structured data :
- Meta-data (Time and date of creation, File size,
Author etc.)
- Library Catalogues (date, author, place, subject, etc)
- Census records (birth, income, employment, place
etc.)
- Economic data (GDP, PPI, ASX etc.)
2. Unstructured data :
- Any data with unknown form or the structure is
classified as unstructured data. In addition to the
size being huge, unstructured data poses multiple
challenges in terms of its processing for deriving
value out of it.
- The rest of the data created, about 80% of the total
account for unstructured big data.
Example of Unstructured data :
- Media ( MP3, digital photos, audio and video files )
- Text files (Word processing, spreadsheets,
presentations etc. )
- Social Media ( Data from Facebook, Twitter,
LinkedIn)
3. Semi-Structured data :
- Semi-structured data is a form of structured data
that does not conform with the formal structure of
data models associated with relational databases or
other forms of data tables, but nonetheless contains
tags or other markers to separate semantic elements
and enforce hierarchies of records and fields within
the data. Therefore, it is also known as self-
describing structure.
Example of Semi-Structured data :
- Personal data stored in a XML file-
<rec><name>Harry</name><sex>Male</sex><age>35</age></rec>
<rec><name>Justin</name><sex>Female</sex><age>41</age></rec>
<rec><name>Shawn</name><sex>Male</sex><age>29</age></rec>
<rec><name>Ed sheeran</name><sex>Male</sex><age>26</age></rec>
<rec><name>Drake</name><sex>Male</sex><age>35</age></rec>
- As you can notice, Internet is a maze of unbounded
data.
- But it can be understood, interpreted and used
under these three categories.
- You can benefit from each type of data if you
understand their characteristics and strengths.
- Businesses worldwide construct their empire on
these three pillars and capitalize on their limitless
potential.
The question is,
do you???
References :
http://www.prowebscraper.com/blog/structured-vs-
unstructured-data-best-thing-you-need-to-know/

More Related Content

What's hot (20)

PDF
Introduction to data analytics
SSaudia
 
PPTX
Big Data Analytics
RohithND
 
PDF
Big data Analytics
ShivanandaVSeeri
 
PDF
Introduction to Data Warehouse
SOMASUNDARAM T
 
PPTX
OLAP v/s OLTP
ahsan irfan
 
PPTX
Data science.chapter-1,2,3
varshakumar21
 
PPTX
Data analytics
Dr.Bhuvaneswari Velumani
 
PPT
Data Warehouse Basic Guide
thomasmary607
 
PDF
Data mining & data warehousing (ppt)
Harish Chand
 
PPT
Dimensional Modeling
Sunita Sahu
 
PPTX
1. Data Analytics-introduction
krishna singh
 
PPT
DATA WAREHOUSING AND DATA MINING
Lovely Professional University
 
PPT
Big data ppt
IDBI Bank Ltd.
 
PPTX
Big Data - The 5 Vs Everyone Must Know
Bernard Marr
 
PPTX
Digital data
ShivanandaVSeeri
 
PPT
Map reduce in BIG DATA
GauravBiswas9
 
PPTX
Data Mining: Application and trends in data mining
DataminingTools Inc
 
PPTX
Business intelligence
Randy L. Archambault
 
PPTX
Metadata ppt
Shashikant Kumar
 
PPTX
Big data ppt
Nasrin Hussain
 
Introduction to data analytics
SSaudia
 
Big Data Analytics
RohithND
 
Big data Analytics
ShivanandaVSeeri
 
Introduction to Data Warehouse
SOMASUNDARAM T
 
OLAP v/s OLTP
ahsan irfan
 
Data science.chapter-1,2,3
varshakumar21
 
Data analytics
Dr.Bhuvaneswari Velumani
 
Data Warehouse Basic Guide
thomasmary607
 
Data mining & data warehousing (ppt)
Harish Chand
 
Dimensional Modeling
Sunita Sahu
 
1. Data Analytics-introduction
krishna singh
 
DATA WAREHOUSING AND DATA MINING
Lovely Professional University
 
Big data ppt
IDBI Bank Ltd.
 
Big Data - The 5 Vs Everyone Must Know
Bernard Marr
 
Digital data
ShivanandaVSeeri
 
Map reduce in BIG DATA
GauravBiswas9
 
Data Mining: Application and trends in data mining
DataminingTools Inc
 
Business intelligence
Randy L. Archambault
 
Metadata ppt
Shashikant Kumar
 
Big data ppt
Nasrin Hussain
 

Similar to 3 pillars of big data : structured data, semi structured data and unstructured data (20)

PPTX
Types of Big Data.pptx
varun453331
 
DOCX
Data and Information.docx
swarna627082
 
PPTX
Bigdata Hadoop introduction
Sunitha Mutchintala
 
PPTX
Big data Analytics Fundamentals Chapter 1
karpagavalli38
 
PPTX
Overview of Big Data
LexiConn Content Services
 
PDF
Topic guide big data as a technology
Mrmmbs Vision
 
PPTX
Introduction to Big Data
Akshata Humbe
 
PDF
Unit III.pdf
PreethaSuresh2
 
PPTX
Data set module 1
Data-Set
 
PDF
Unit No2 Introduction to big data.pdf
Ranjeet Bhalshankar
 
PPTX
Data set Introduction to Big Data
Data-Set
 
PDF
big-data.pdf
aditi276464
 
PDF
Bda assignment can also be used for BDA notes and concept understanding.
Aditya205306
 
PDF
bda-unit-bda-unit-materail big data1.pdf
nandan543979
 
PPTX
Evolution & Introduction to Big data-2.pptx
navdeepKaur496978
 
PPTX
Big Data.pptx
ssuser2cc0d4
 
PPTX
Lecture #03
Konpal Darakshan
 
PDF
Intro to big data and applications - day 1
Parviz Vakili
 
PDF
IRJET- Big Data Management and Growth Enhancement
IRJET Journal
 
PDF
Big data Paper
Daryaz Fares
 
Types of Big Data.pptx
varun453331
 
Data and Information.docx
swarna627082
 
Bigdata Hadoop introduction
Sunitha Mutchintala
 
Big data Analytics Fundamentals Chapter 1
karpagavalli38
 
Overview of Big Data
LexiConn Content Services
 
Topic guide big data as a technology
Mrmmbs Vision
 
Introduction to Big Data
Akshata Humbe
 
Unit III.pdf
PreethaSuresh2
 
Data set module 1
Data-Set
 
Unit No2 Introduction to big data.pdf
Ranjeet Bhalshankar
 
Data set Introduction to Big Data
Data-Set
 
big-data.pdf
aditi276464
 
Bda assignment can also be used for BDA notes and concept understanding.
Aditya205306
 
bda-unit-bda-unit-materail big data1.pdf
nandan543979
 
Evolution & Introduction to Big data-2.pptx
navdeepKaur496978
 
Big Data.pptx
ssuser2cc0d4
 
Lecture #03
Konpal Darakshan
 
Intro to big data and applications - day 1
Parviz Vakili
 
IRJET- Big Data Management and Growth Enhancement
IRJET Journal
 
Big data Paper
Daryaz Fares
 
Ad

Recently uploaded (20)

PPTX
apidays Singapore 2025 - Designing for Change, Julie Schiller (Google)
apidays
 
PPTX
apidays Helsinki & North 2025 - API access control strategies beyond JWT bear...
apidays
 
PDF
apidays Helsinki & North 2025 - Monetizing AI APIs: The New API Economy, Alla...
apidays
 
PDF
Data Retrieval and Preparation Business Analytics.pdf
kayserrakib80
 
PPTX
apidays Singapore 2025 - From Data to Insights: Building AI-Powered Data APIs...
apidays
 
PPTX
apidays Helsinki & North 2025 - From Chaos to Clarity: Designing (AI-Ready) A...
apidays
 
PDF
apidays Singapore 2025 - The API Playbook for AI by Shin Wee Chuang (PAND AI)
apidays
 
PPTX
apidays Singapore 2025 - Generative AI Landscape Building a Modern Data Strat...
apidays
 
PDF
apidays Singapore 2025 - Streaming Lakehouse with Kafka, Flink and Iceberg by...
apidays
 
PDF
OPPOTUS - Malaysias on Malaysia 1Q2025.pdf
Oppotus
 
PDF
apidays Helsinki & North 2025 - APIs in the healthcare sector: hospitals inte...
apidays
 
PDF
OOPs with Java_unit2.pdf. sarthak bookkk
Sarthak964187
 
PDF
Avatar for apidays apidays PRO June 07, 2025 0 5 apidays Helsinki & North 2...
apidays
 
PPTX
ER_Model_with_Diagrams_Presentation.pptx
dharaadhvaryu1992
 
PPTX
apidays Helsinki & North 2025 - Running a Successful API Program: Best Practi...
apidays
 
PPTX
apidays Munich 2025 - Building Telco-Aware Apps with Open Gateway APIs, Subhr...
apidays
 
PPTX
apidays Munich 2025 - Building an AWS Serverless Application with Terraform, ...
apidays
 
PPTX
apidays Helsinki & North 2025 - APIs at Scale: Designing for Alignment, Trust...
apidays
 
PPTX
apidays Helsinki & North 2025 - Vero APIs - Experiences of API development in...
apidays
 
PPTX
SlideEgg_501298-Agentic AI.pptx agentic ai
530BYManoj
 
apidays Singapore 2025 - Designing for Change, Julie Schiller (Google)
apidays
 
apidays Helsinki & North 2025 - API access control strategies beyond JWT bear...
apidays
 
apidays Helsinki & North 2025 - Monetizing AI APIs: The New API Economy, Alla...
apidays
 
Data Retrieval and Preparation Business Analytics.pdf
kayserrakib80
 
apidays Singapore 2025 - From Data to Insights: Building AI-Powered Data APIs...
apidays
 
apidays Helsinki & North 2025 - From Chaos to Clarity: Designing (AI-Ready) A...
apidays
 
apidays Singapore 2025 - The API Playbook for AI by Shin Wee Chuang (PAND AI)
apidays
 
apidays Singapore 2025 - Generative AI Landscape Building a Modern Data Strat...
apidays
 
apidays Singapore 2025 - Streaming Lakehouse with Kafka, Flink and Iceberg by...
apidays
 
OPPOTUS - Malaysias on Malaysia 1Q2025.pdf
Oppotus
 
apidays Helsinki & North 2025 - APIs in the healthcare sector: hospitals inte...
apidays
 
OOPs with Java_unit2.pdf. sarthak bookkk
Sarthak964187
 
Avatar for apidays apidays PRO June 07, 2025 0 5 apidays Helsinki & North 2...
apidays
 
ER_Model_with_Diagrams_Presentation.pptx
dharaadhvaryu1992
 
apidays Helsinki & North 2025 - Running a Successful API Program: Best Practi...
apidays
 
apidays Munich 2025 - Building Telco-Aware Apps with Open Gateway APIs, Subhr...
apidays
 
apidays Munich 2025 - Building an AWS Serverless Application with Terraform, ...
apidays
 
apidays Helsinki & North 2025 - APIs at Scale: Designing for Alignment, Trust...
apidays
 
apidays Helsinki & North 2025 - Vero APIs - Experiences of API development in...
apidays
 
SlideEgg_501298-Agentic AI.pptx agentic ai
530BYManoj
 
Ad

3 pillars of big data : structured data, semi structured data and unstructured data

  • 1. 3 pillars of big data : Structured Data Semi Structured Data Unstructured Data By Prowebscraper
  • 2. What is big data ? - Big data stands for large volumes of data. - It could be both structured and unstructured. - It’s not about the amount of data but what businesses do with the given data that counts. - Businesses leverage on big data for insights that can propel their growth.
  • 3. Example of Big Data Following are some of the examples of 'Big Data' : - Facebook users send roughly 31.25 million messages and watch 2.77 million videos, every minute!! - Walmart customers’ transactions provide the company with about 2.5 petabytes of data, every hour.
  • 4. Example of Big Data - YouTube usage jumped three times from 2014- 2016 with users uploading 400 hours of new video each minute of every day! Now, in 2017, users are watching 4,146,600 videos every minute. - Instagram users upload 46,740 million posts every minute! - 5.2 BILLION daily Google Searches in 2017.
  • 5. As you can see, there are 3 pillars of Big Data : 1. Structured data 2. Unstructured data 3. Semi structured data
  • 6. 1. Structured data - Structured Data refer to the data which is already stored in databases, in an ordered manner. - It accounts for about 20% of the total existing data, and is used the most in programming and computer- related activities.
  • 7. Example of Structured data : - Meta-data (Time and date of creation, File size, Author etc.) - Library Catalogues (date, author, place, subject, etc) - Census records (birth, income, employment, place etc.) - Economic data (GDP, PPI, ASX etc.)
  • 8. 2. Unstructured data : - Any data with unknown form or the structure is classified as unstructured data. In addition to the size being huge, unstructured data poses multiple challenges in terms of its processing for deriving value out of it. - The rest of the data created, about 80% of the total account for unstructured big data.
  • 9. Example of Unstructured data : - Media ( MP3, digital photos, audio and video files ) - Text files (Word processing, spreadsheets, presentations etc. ) - Social Media ( Data from Facebook, Twitter, LinkedIn)
  • 10. 3. Semi-Structured data : - Semi-structured data is a form of structured data that does not conform with the formal structure of data models associated with relational databases or other forms of data tables, but nonetheless contains tags or other markers to separate semantic elements and enforce hierarchies of records and fields within the data. Therefore, it is also known as self- describing structure.
  • 11. Example of Semi-Structured data : - Personal data stored in a XML file- <rec><name>Harry</name><sex>Male</sex><age>35</age></rec> <rec><name>Justin</name><sex>Female</sex><age>41</age></rec> <rec><name>Shawn</name><sex>Male</sex><age>29</age></rec> <rec><name>Ed sheeran</name><sex>Male</sex><age>26</age></rec> <rec><name>Drake</name><sex>Male</sex><age>35</age></rec>
  • 12. - As you can notice, Internet is a maze of unbounded data. - But it can be understood, interpreted and used under these three categories. - You can benefit from each type of data if you understand their characteristics and strengths.
  • 13. - Businesses worldwide construct their empire on these three pillars and capitalize on their limitless potential. The question is, do you???