Web Information Systems
Tanvi Banerjee, Amit Sheth, et al.
1Kno.e.sis Center / Dept. of Computer Science & Engineering
Wright State University
One of the courses under
Big and Smart Data Science Certificate.
Flipped Classroom
Most classes will have three components:
Before the class: Prerecorded online video and
other reading material
During the class: discussions/Q&A of what was
reviewed before the class; exercises with
consultation if needed
Exercises – some of which will be graded
Web Information Systems Course
3
Class 1: Module 1:
Preliminaries and Objectives
Image source: CISCO
“More data has been created in the last
three years than in all the past 40,000 years”
- Teradata
Course Context: How to manage, use
and benefit from Data (on the Web)
This statistics is for all data, but almost all of the data flows on the Internet and is accessible via the Web
Course Objectives
"The World Wide Web is the universe of network-
accessible information, an embodiment of human
knowledge.” [TBL]. The Web has become the
largest repository of documents and data, and the
largest source of information and knowledge.
How do you effectively access and utilize Web
data?
How do you build data centric Web applications?
How do you convert data into information? How
do you build, share and use knowledge?
Web Information Systems Course
DIKW Pyramid: Data to Wisdom
Web Information Systems Course
© Angus McDonald. Also see.
Wisdom: ability to increase
effectiveness; adds value, requires
judgment.
Knowledge: information having been
processed, organized or structured;
being applied or put into action;
convey understanding, experience,
accumulated learning.
Information: inferred from data,
endowed with meaning and purpose,
help answer: "who", "what", "where",
"how many", "when”…
Data: symbols, signals or stimuli.
Observations and Facts.
This course is about data and information aspects as related to the Web.
Also see.
Course Plan
Learn and exercise variety of techniques,
technologies and tools that focus on collecting
and using data for building rich data and
information centric Web applications
Do exercises to become comfortable with the
techniques and technologies
Do a substantial project
Some additional assignments and more
indepth/demanding projects for graduate
students
Web Information Systems Course
8
Class 1: Module 2
Internet and Web:
Brief History and Review
Web Information Systems Course
Internet (the Net) vs WWW (the Web)
The Net: “a world-wide system of computer
networks, … users at any computer can get
information (if they have permissions) get
information from any other computer” [Review:
ARPANet,TCP/IP,… ]
The Web: “all the resources and users on the
internet that are using HTTP, a system of
internetworked hypertext documents, the
universe of network-accessible information”
[Review: http, html, URL, …; also Web History,
Original Proposal]
The World Wide Web functions
as a layer on top of the Internet
Source: Wikipedia
Web Information Systems Course
Web standards
Web Design & Applications: HTML, CSS, SVG,
Ajax
Web Architecture: URIs, HTTP, …
XML Technology: XML, XML Namespaces,
XML Schema, XSLT, ...
Web of Services: WSDL, choreography, policy
specifications,
Semantic Web: RDF, SPARQL, OWL, and
SKOS
Browser & Authoring, Web of Devices
Web Information Systems Course
http://www.w3.org/standards/
Let us focus on Content/Data
Markup language- html; URIs/URLs; linking <A
HREF>
Browser: Mosaic: 1993
Dynamic updates on Web pages: Javascript:
1995
Deepweb – structured data on the web,
dynamically rendered web pages, often nonpublic
data
Directory/Cataloging: Yahoo, Looksmart, DMOZ
Web pages: 2B in 2002, 11+B in 2005, 22+B in 2009
Search: Lycos (1994), Altavista/Excite (1995), Google (1998)
Web Information Systems Course
Evolution of the Web
Web 1.0 (pre 2002)
Retronym. Few content creators, many consumer;
Personal web sites, Static Pages, XML (1996)
Web 2.0 (1999/2002/2003)
Dynamic pages, Web as a platform, RW Web - UGC,
mass use participation- blogging, social web, Web
APIs, mashups, growth of media
Web 3.0 (1999/2009)
Semantic Web, broader content type- esp IoT
Web Information Systems Course
A review of first 5 years of WWW
WWW implementation started: end of 1990;
usable browser – Mosaic: 1993. Here is my
1995 review [Slides, Video first 36 mins] of the first 3-5
years of the Web and some prognosis. The
video made in 2013 also makes connections
between the thoughts presented in 1995 and
have it has turned out in the present.
Web Information Systems Course
Web 1.0 vs Web 2.0
Web 1.0 Web 2.0
Static/personal web pages,
served from server file
system.
Dynamic HTML, often created from RDBMS
HTML 3.2, custom
extensions, GIFs
More standardization, better Web authoring
tools, CSS etc, more client side code, rich user
experience
Limited/controlled authoring Extensive user participation/user generated
content; anyone can author, broader forms of
content
CGI, JSP, ASP Software as a service, APIs, mashups; Client:
AJAX/XML/JSON/jQuery/DOM, Flash/Flex;
Server: PHP, Python, RUBY, PERL
Directory, simple search Decline of directory; Sophisticated search;
PageRank,
Social Web/Networking. Blogging.
Web Information Systems Course
Review & Discussion Questions
URI vs URL
Is www prefix in URL necessary?
What document specification language was the basis of HTML (& married w/
the concept of hypertext)?
What is DNS? Are these two same or different? http://74.125.224.72/ and
http://www.google.com
Is the Web a proper subset of Internet? What is the most used Internet
application? What is the most used Web application?
What was the first inflexion point for the Web?
What format/ data representation (hint: it is a Web standard) made data
exchange and syndication across the We possible?
Web Information Systems Course
Review & Discussion Questions
Background for the following questions:
See what survives of Yahoo! director today and DMOZ. Looksmart
history which will give you one case study about a directory that was
successful in the past. Then read this excellent review of past and
current search engines with some information on directories.
Reflect on the the challenges of maintaining a directory for the Web.?
What was the key advance of Google over initial search engines?
Why did search engines replace directories (Yahoo!, Looksmart,..)?
Web Information Systems Course

Web Information Systems Introduction and Origin of World Wide Web

  • 1.
    Web Information Systems TanviBanerjee, Amit Sheth, et al. 1Kno.e.sis Center / Dept. of Computer Science & Engineering Wright State University One of the courses under Big and Smart Data Science Certificate.
  • 2.
    Flipped Classroom Most classeswill have three components: Before the class: Prerecorded online video and other reading material During the class: discussions/Q&A of what was reviewed before the class; exercises with consultation if needed Exercises – some of which will be graded Web Information Systems Course
  • 3.
    3 Class 1: Module1: Preliminaries and Objectives Image source: CISCO
  • 4.
    “More data hasbeen created in the last three years than in all the past 40,000 years” - Teradata Course Context: How to manage, use and benefit from Data (on the Web) This statistics is for all data, but almost all of the data flows on the Internet and is accessible via the Web
  • 5.
    Course Objectives "The WorldWide Web is the universe of network- accessible information, an embodiment of human knowledge.” [TBL]. The Web has become the largest repository of documents and data, and the largest source of information and knowledge. How do you effectively access and utilize Web data? How do you build data centric Web applications? How do you convert data into information? How do you build, share and use knowledge? Web Information Systems Course
  • 6.
    DIKW Pyramid: Datato Wisdom Web Information Systems Course © Angus McDonald. Also see. Wisdom: ability to increase effectiveness; adds value, requires judgment. Knowledge: information having been processed, organized or structured; being applied or put into action; convey understanding, experience, accumulated learning. Information: inferred from data, endowed with meaning and purpose, help answer: "who", "what", "where", "how many", "when”… Data: symbols, signals or stimuli. Observations and Facts. This course is about data and information aspects as related to the Web. Also see.
  • 7.
    Course Plan Learn andexercise variety of techniques, technologies and tools that focus on collecting and using data for building rich data and information centric Web applications Do exercises to become comfortable with the techniques and technologies Do a substantial project Some additional assignments and more indepth/demanding projects for graduate students Web Information Systems Course
  • 8.
    8 Class 1: Module2 Internet and Web: Brief History and Review Web Information Systems Course
  • 9.
    Internet (the Net)vs WWW (the Web) The Net: “a world-wide system of computer networks, … users at any computer can get information (if they have permissions) get information from any other computer” [Review: ARPANet,TCP/IP,… ] The Web: “all the resources and users on the internet that are using HTTP, a system of internetworked hypertext documents, the universe of network-accessible information” [Review: http, html, URL, …; also Web History, Original Proposal]
  • 10.
    The World WideWeb functions as a layer on top of the Internet Source: Wikipedia Web Information Systems Course
  • 11.
    Web standards Web Design& Applications: HTML, CSS, SVG, Ajax Web Architecture: URIs, HTTP, … XML Technology: XML, XML Namespaces, XML Schema, XSLT, ... Web of Services: WSDL, choreography, policy specifications, Semantic Web: RDF, SPARQL, OWL, and SKOS Browser & Authoring, Web of Devices Web Information Systems Course http://www.w3.org/standards/
  • 12.
    Let us focuson Content/Data Markup language- html; URIs/URLs; linking <A HREF> Browser: Mosaic: 1993 Dynamic updates on Web pages: Javascript: 1995 Deepweb – structured data on the web, dynamically rendered web pages, often nonpublic data Directory/Cataloging: Yahoo, Looksmart, DMOZ Web pages: 2B in 2002, 11+B in 2005, 22+B in 2009 Search: Lycos (1994), Altavista/Excite (1995), Google (1998) Web Information Systems Course
  • 13.
    Evolution of theWeb Web 1.0 (pre 2002) Retronym. Few content creators, many consumer; Personal web sites, Static Pages, XML (1996) Web 2.0 (1999/2002/2003) Dynamic pages, Web as a platform, RW Web - UGC, mass use participation- blogging, social web, Web APIs, mashups, growth of media Web 3.0 (1999/2009) Semantic Web, broader content type- esp IoT Web Information Systems Course
  • 14.
    A review offirst 5 years of WWW WWW implementation started: end of 1990; usable browser – Mosaic: 1993. Here is my 1995 review [Slides, Video first 36 mins] of the first 3-5 years of the Web and some prognosis. The video made in 2013 also makes connections between the thoughts presented in 1995 and have it has turned out in the present. Web Information Systems Course
  • 15.
    Web 1.0 vsWeb 2.0 Web 1.0 Web 2.0 Static/personal web pages, served from server file system. Dynamic HTML, often created from RDBMS HTML 3.2, custom extensions, GIFs More standardization, better Web authoring tools, CSS etc, more client side code, rich user experience Limited/controlled authoring Extensive user participation/user generated content; anyone can author, broader forms of content CGI, JSP, ASP Software as a service, APIs, mashups; Client: AJAX/XML/JSON/jQuery/DOM, Flash/Flex; Server: PHP, Python, RUBY, PERL Directory, simple search Decline of directory; Sophisticated search; PageRank, Social Web/Networking. Blogging. Web Information Systems Course
  • 16.
    Review & DiscussionQuestions URI vs URL Is www prefix in URL necessary? What document specification language was the basis of HTML (& married w/ the concept of hypertext)? What is DNS? Are these two same or different? http://74.125.224.72/ and http://www.google.com Is the Web a proper subset of Internet? What is the most used Internet application? What is the most used Web application? What was the first inflexion point for the Web? What format/ data representation (hint: it is a Web standard) made data exchange and syndication across the We possible? Web Information Systems Course
  • 17.
    Review & DiscussionQuestions Background for the following questions: See what survives of Yahoo! director today and DMOZ. Looksmart history which will give you one case study about a directory that was successful in the past. Then read this excellent review of past and current search engines with some information on directories. Reflect on the the challenges of maintaining a directory for the Web.? What was the key advance of Google over initial search engines? Why did search engines replace directories (Yahoo!, Looksmart,..)? Web Information Systems Course