2
Most read
4
Most read
8
Most read
WEB GRAPHS
2
Internet/Web as Graphs
• Graph of the physical layer with routers ,
computers etc as nodes and physical
connections as edges
– It is limited
– Does not capture the graphical connections
associated with the information on the Internet
• Web Graph where nodes represent web
pages and edges are associated with
hyperlinks
3
Web Graph
4
Web Graph Considerations
• Edges can be directed or undirected
• Graph is highly dynamic
– Nodes and edges are added/deleted often
– Content of existing nodes is also subject to
change
– Pages and hyperlinks created on the fly
• Apart from primary connected component
there are also smaller disconnected
components
5
Why the Web Graph?
• Example of a large,dynamic and
distributed graph
• Possibly similar to other complex graphs
in social, biological and other systems
• Reflects how humans organize
information (relevance, ranking) and their
societies
• Efficient navigation algorithms
• Study behavior of users as they traverse
the web graph (e-commerce)
6
Statistics of Interest
• Size and connectivity of the graph
• Number of connected components
• Distribution of pages per site
• Distribution of incoming and outgoing
connections per site
• Average and maximal length of the
shortest path between any two vertices
(diameter)
7
Properties of Web Graphs
• Connectivity follows a power law
distribution
• The graph is sparse
– |E| = O(n) or atleast o(n2
)
– Average number of hyperlinks per page
roughly a constant
• A small world graph
8
Small World Networks
• It is a ‘small world’
– Millions of people. Yet, separated by “six
degrees” of acquaintance relationships
– Popularized by Milgram’s famous experiment
• Mathematically
– Diameter of graph is small (log N) as
compared to overall size
• 3. Property seems interesting given ‘sparse’ nature
of graph but …
• This property is ‘natural’ in ‘pure’ random graphs
Conclusion
The compression techniques are
specializedspecialized for Web Graphs.
The average link sizelink size decreases with the
increase of the graph.
The average link access timelink access time increases
with the increase of the graph.
9
Conclusion
The compression techniques are
specializedspecialized for Web Graphs.
The average link sizelink size decreases with the
increase of the graph.
The average link access timelink access time increases
with the increase of the graph.
9

More Related Content

DOCX
Leave management System
PDF
Query trees
PPTX
Seminar ppt fog comp
PPTX
Event managementsystem
PDF
CS6010 Social Network Analysis Unit V
DOCX
Leave Management System Documentation
PPT
HCI 3e - Ch 14: Communication and collaboration models
PPTX
CS8651 Internet Programming - Basics of HTML, HTML5, CSS
Leave management System
Query trees
Seminar ppt fog comp
Event managementsystem
CS6010 Social Network Analysis Unit V
Leave Management System Documentation
HCI 3e - Ch 14: Communication and collaboration models
CS8651 Internet Programming - Basics of HTML, HTML5, CSS

What's hot (20)

PPTX
Load Balancing in Parallel and Distributed Database
PPTX
Io t system management with
PPT
Cloud adoption and rudiments
PDF
CS6010 Social Network Analysis Unit IV
PPTX
Mobile cloud computing
DOCX
Hospital Management System
DOCX
Online Student Registration System
PPT
HCI 3e - Ch 19: Groupware
PDF
Collaborating Using Cloud Services
DOCX
Attendance Management Report 2016
PPTX
Multidimensional data models
PPTX
WEB INTERFACE DESIGN
PDF
Online News Portal System
PPT
Diabetes prediction using machine learning
DOC
Online Voting System Project File
DOCX
Doctor appointment system.docx
PPTX
Travel and Tour Advisory
PPTX
Pig latin
PDF
drag and drop.pdf
PPTX
connecting smart object in IoT.pptx
Load Balancing in Parallel and Distributed Database
Io t system management with
Cloud adoption and rudiments
CS6010 Social Network Analysis Unit IV
Mobile cloud computing
Hospital Management System
Online Student Registration System
HCI 3e - Ch 19: Groupware
Collaborating Using Cloud Services
Attendance Management Report 2016
Multidimensional data models
WEB INTERFACE DESIGN
Online News Portal System
Diabetes prediction using machine learning
Online Voting System Project File
Doctor appointment system.docx
Travel and Tour Advisory
Pig latin
drag and drop.pdf
connecting smart object in IoT.pptx
Ad

Viewers also liked (20)

PPTX
Case study of goggle map
PDF
Oinarri-plakaren osagaiak
PDF
AIPPLE Catalogue 2016
PDF
Nouth Phouthavongsa - Asian TYA Network event presentation at ricca ricca*fes...
PDF
Hugh Brown - Asian TYA Network event presentation at ricca ricca*festa 2016
ODP
Beth presngksgkis;og
DOCX
Cs ss 2016
PPTX
evolució del web
PDF
Irteerako periferikoak
PPTX
Presentacion infogobierno slideshare.
PDF
Melissa Tan - Asian TYA Network event presentation at ricca ricca*festa 2016
PDF
B uppsats den kyrkliga försvenskningen av bohuslänska hisingen
DOC
CV Tamer Abbass
DOCX
Trabajo contencioso
DOCX
ΠΑΡΟΥΣΙΑΣΗ YOUR NEXT HOME
DOCX
Production schedule
DOCX
PPTX
eczema treatment singapore
PDF
Teta Tulay - Asian TYA Network event presentation at ricca ricca*festa 2016.
PPTX
Contents page powerpoint
Case study of goggle map
Oinarri-plakaren osagaiak
AIPPLE Catalogue 2016
Nouth Phouthavongsa - Asian TYA Network event presentation at ricca ricca*fes...
Hugh Brown - Asian TYA Network event presentation at ricca ricca*festa 2016
Beth presngksgkis;og
Cs ss 2016
evolució del web
Irteerako periferikoak
Presentacion infogobierno slideshare.
Melissa Tan - Asian TYA Network event presentation at ricca ricca*festa 2016
B uppsats den kyrkliga försvenskningen av bohuslänska hisingen
CV Tamer Abbass
Trabajo contencioso
ΠΑΡΟΥΣΙΑΣΗ YOUR NEXT HOME
Production schedule
eczema treatment singapore
Teta Tulay - Asian TYA Network event presentation at ricca ricca*festa 2016.
Contents page powerpoint
Ad

Similar to Case Study Of Webgraph (20)

PPTX
JM Information Retrieval Techniques Unit IV
PDF
Complex Networks Analysis @ Universita Roma Tre
PPTX
lec3_socialnetwork_part1.pptx
ZIP
Social Networks and Computer Science
PPTX
Node XL - features and demo
PPTX
2010 06-08 chania stochastic web modelling - copy
PDF
Community Detection in Social Media
PDF
Graph Structure In The Web
PPT
Webometrics 1.0 from AltaVista to Small Worlds and Genre Drift
PPT
Webometrics 1.0 - from AltaVista to Small Worlds and Genre Drift
PPT
Webometrics 1.0 - from AltaVista to Small Worlds and Genre Drift
PPT
Webometrics 1.0 - from AltaVista to Small Worlds and Genre Drift
PDF
Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...
PPT
PPTX
Small Worlds Social Graphs Social Media
PPTX
Network graphs n’at
PPTX
Discrete mathematics presentation related to application
PPT
The Structure of the Web
PPTX
Graph theory in Search engines and web connectivity.pptx
PDF
Graphs - Chris Dixon & Matt Gattis
JM Information Retrieval Techniques Unit IV
Complex Networks Analysis @ Universita Roma Tre
lec3_socialnetwork_part1.pptx
Social Networks and Computer Science
Node XL - features and demo
2010 06-08 chania stochastic web modelling - copy
Community Detection in Social Media
Graph Structure In The Web
Webometrics 1.0 from AltaVista to Small Worlds and Genre Drift
Webometrics 1.0 - from AltaVista to Small Worlds and Genre Drift
Webometrics 1.0 - from AltaVista to Small Worlds and Genre Drift
Webometrics 1.0 - from AltaVista to Small Worlds and Genre Drift
Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...
Small Worlds Social Graphs Social Media
Network graphs n’at
Discrete mathematics presentation related to application
The Structure of the Web
Graph theory in Search engines and web connectivity.pptx
Graphs - Chris Dixon & Matt Gattis

More from Suraksha Sanghavi (6)

PPTX
Case study
PPTX
Sampling and its type
PPTX
Air pollution monitoring system
PPTX
An eye with an atm
PPTX
Smart water tank
PPTX
Case study
Sampling and its type
Air pollution monitoring system
An eye with an atm
Smart water tank

Recently uploaded (20)

PDF
Principles of operation, construction, theory, advantages and disadvantages, ...
PDF
Mechanics of materials week 2 rajeshwari
PPT
Programmable Logic Controller PLC and Industrial Automation
PPTX
BBOC407 BIOLOGY FOR ENGINEERS (CS) - MODULE 1 PART 1.pptx
PDF
ECT443_instrumentation_Engg_mod-1.pdf indroduction to instrumentation
PDF
Project_Mgmt_Institute_-Marc Marc Marc .pdf
PDF
VTU IOT LAB MANUAL (BCS701) Computer science and Engineering
PPTX
Module1.pptxrjkeieuekwkwoowkemehehehrjrjrj
PDF
Beginners-Guide-to-Artificial-Intelligence.pdf
PPT
Comprehensive Java Training Deck - Advanced topics
PDF
Software defined netwoks is useful to learn NFV and virtual Lans
PDF
Using Technology to Foster Innovative Teaching Practices (www.kiu.ac.ug)
DOCX
ENVIRONMENTAL PROTECTION AND MANAGEMENT (18CVL756)
DOCX
An investigation of the use of recycled crumb rubber as a partial replacement...
PDF
CELDAS DE COMBUSTIBLE TIPO MEMBRANA DE INTERCAMBIO PROTÓNICO.pdf
PDF
Engineering Solutions for Ethical Dilemmas in Healthcare (www.kiu.ac.ug)
PDF
Lesson 3 .pdf
PPTX
INTERNET OF THINGS - EMBEDDED SYSTEMS AND INTERNET OF THINGS
PPTX
Real Estate Management PART 1.pptxFFFFFFFFFFFFF
PPTX
ARCHITECTURE AND PROGRAMMING OF EMBEDDED SYSTEMS
Principles of operation, construction, theory, advantages and disadvantages, ...
Mechanics of materials week 2 rajeshwari
Programmable Logic Controller PLC and Industrial Automation
BBOC407 BIOLOGY FOR ENGINEERS (CS) - MODULE 1 PART 1.pptx
ECT443_instrumentation_Engg_mod-1.pdf indroduction to instrumentation
Project_Mgmt_Institute_-Marc Marc Marc .pdf
VTU IOT LAB MANUAL (BCS701) Computer science and Engineering
Module1.pptxrjkeieuekwkwoowkemehehehrjrjrj
Beginners-Guide-to-Artificial-Intelligence.pdf
Comprehensive Java Training Deck - Advanced topics
Software defined netwoks is useful to learn NFV and virtual Lans
Using Technology to Foster Innovative Teaching Practices (www.kiu.ac.ug)
ENVIRONMENTAL PROTECTION AND MANAGEMENT (18CVL756)
An investigation of the use of recycled crumb rubber as a partial replacement...
CELDAS DE COMBUSTIBLE TIPO MEMBRANA DE INTERCAMBIO PROTÓNICO.pdf
Engineering Solutions for Ethical Dilemmas in Healthcare (www.kiu.ac.ug)
Lesson 3 .pdf
INTERNET OF THINGS - EMBEDDED SYSTEMS AND INTERNET OF THINGS
Real Estate Management PART 1.pptxFFFFFFFFFFFFF
ARCHITECTURE AND PROGRAMMING OF EMBEDDED SYSTEMS

Case Study Of Webgraph

  • 2. 2 Internet/Web as Graphs • Graph of the physical layer with routers , computers etc as nodes and physical connections as edges – It is limited – Does not capture the graphical connections associated with the information on the Internet • Web Graph where nodes represent web pages and edges are associated with hyperlinks
  • 4. 4 Web Graph Considerations • Edges can be directed or undirected • Graph is highly dynamic – Nodes and edges are added/deleted often – Content of existing nodes is also subject to change – Pages and hyperlinks created on the fly • Apart from primary connected component there are also smaller disconnected components
  • 5. 5 Why the Web Graph? • Example of a large,dynamic and distributed graph • Possibly similar to other complex graphs in social, biological and other systems • Reflects how humans organize information (relevance, ranking) and their societies • Efficient navigation algorithms • Study behavior of users as they traverse the web graph (e-commerce)
  • 6. 6 Statistics of Interest • Size and connectivity of the graph • Number of connected components • Distribution of pages per site • Distribution of incoming and outgoing connections per site • Average and maximal length of the shortest path between any two vertices (diameter)
  • 7. 7 Properties of Web Graphs • Connectivity follows a power law distribution • The graph is sparse – |E| = O(n) or atleast o(n2 ) – Average number of hyperlinks per page roughly a constant • A small world graph
  • 8. 8 Small World Networks • It is a ‘small world’ – Millions of people. Yet, separated by “six degrees” of acquaintance relationships – Popularized by Milgram’s famous experiment • Mathematically – Diameter of graph is small (log N) as compared to overall size • 3. Property seems interesting given ‘sparse’ nature of graph but … • This property is ‘natural’ in ‘pure’ random graphs
  • 9. Conclusion The compression techniques are specializedspecialized for Web Graphs. The average link sizelink size decreases with the increase of the graph. The average link access timelink access time increases with the increase of the graph. 9
  • 10. Conclusion The compression techniques are specializedspecialized for Web Graphs. The average link sizelink size decreases with the increase of the graph. The average link access timelink access time increases with the increase of the graph. 9