SlideShare a Scribd company logo
Making Emergent Creativity
Overview of Open Data, Linked Data and Web Science

                                     Haklae Kim, PhD. , August 2012
Best Practices
London 2012: Open Data Olympics




                                  2
Today
This Presentation .....

Conceptual overview   Case Studies   The Semantic Web   What We Will Do




                                                                      3
4
Let’s Start
Big Data
“data that becomes large enough that it cannot be processed using conventional methods”




                  “Big Data is like Sex in High School–Lots of people are talking about it,
                                                                     but few are having it.”
                                                   -Eric Hansen, SiteSpect founder and CEO

                                                                                         5
Definition
What is Open (Government) Data?

                  “Open”
                                        freely
                  material (data) is open if it can be
                  used, reused and redistributed by
                  anyone


                  “Government data”
                               produced or
                   data and information
                  commissioned by government or
                  government controlled entities.

                                      Source: Open Knowledge Foundation, 2010




                                                                         6
•  Transparency
•  Participation
•  Collaboration




“My administration is committed to creating an unprecedented level of
openness in Government.” – Barack Obama
 “Memorandum for the Heads of Executive Departments and Agencies – Transparency and Open Government” Jan 2009
Today
This Presentation .....

Conceptual overview   Case Studies   The Semantic Web   What We Will Do




                                                                          8
9	
  
h"p://www.prac+calpar+cipa+on.co.uk/odi/wp-­‐content/uploads/2010/06/Open-­‐Data-­‐Impacts-­‐Timeline-­‐Dra@-­‐0.1.png	
  
Case Studies
   Top 10 Apps: Data.gov.uk




  Where Does My Money Go             OurProperty.co.uk                  OpenlyLocal.com                PlanningAlerts.com




                                                                                                                      10
Source: Telegraph, 2010, http://www.telegraph.co.uk/technology/news/7044147/Data.gov.uk-Top-Ten-Apps-so-far.html
Public Sector Dataset
The State of Open Government Data




                                    Source: http://tinyurl.com/44rub56

                                                              11
Open Data Strategies
Open data instruments
“The application of the four types of instruments by the five countries is depicted – the larger
the circle the more instruments are applied” – Huijboom & Van den Broek, 2011.	



       Education and training                                       Voluntary approaches


                 US
                                         AU        ES         UK                 DK
                             DK

                      UK
                                    ES                  AU             US


                                                                             ES
                              DK
              US                         ES
                                                   DK             AU
                        AU
                                                                            UK
                                   UK                        US
       Economic instruments                                        Legislation and control

                                                                                             12
Critical factors
    Drivers and barries of open data policy implementation

    1            Strategies and experience in front runner countries   Closed government culture


    2            Political leadership                                  Privacy legislation


    3            Regional initiatives                                  Limited quality of data


    4            Citizen initiatives                                   Limited user-friendliness/information overload


    5            Market initiatives                                    Lack of standardization of open data policy


    6            Emerging technologies                                 Security threats


    7            European legislation                                  Existing charging models


    8            Thought leaders                                       Uncertain economic impact


    9            Possibility of monitoring government                  Digital divide


   10            Budgets cuts                                          Network overload



Source:	
  Huijboom	
  and	
  Van	
  den	
  Broek,	
  2011	
                                                         13
Today
This Presentation .....

Conceptual overview   Case Studies   The Semantic Web   What We Will Do




                                                                      14
Let’s Start
Web in Transition
“a steady progression from a document-centric Web to one that is data-centric, including the mediation of semantics”




                                                                                            (Source: Mike, 2007)	


                                                                                                             15
Overview
The Semantic Web & Linked Data
“The Semantic Web isn't just about putting data on the web. It is about making links, so that a
person or machine can explore the web of data.  With linked data, when you have some of it,
you can find other, related, data” - TBL.	




                                             5      Stars Open linked data

                                                      ★   Make your stuff available on the Web

                                                   ★★     Make it available as structured data

                                                 ★★★      Use open, standard formats (instead of excel)


                                              ★★★★        Use a open data format – URLs, descriptions

                                            ★★★★★         Link your data to other people’s data




                                                                                                 16
Overview
Growth of Interlinks
… Linked Data provides the means to reach the goal of the Semantic Web
– “the emergence of a Web of Data”




   2007-05-01	
   2007-10-08	
   2007-11-10	
   2008-02-28	
   2008-03-31	




   2008-09-18	
   2009-03-05	
   2009-03-27	
   2009-07-14	
   2010-09-22	
                                                                              17
Structured Wikipedia                        Multimedia Content




     DBpedia                                           BBC



            Commercial Product                         Government Data




     Best Buy                                        UK Gov


                     October, 2011                                       18
295 interlinked datasets, approximately 31 billions triples
Question
What is the Semantic Web for?




               Standards	
       Inference	




                Search	
        Intelligence	

                                                 19
Case Studies
Google’s Semantic Search
People should be able to ask questions and we should understand their meaning, or they should be able to
talk about things at a conceptual level. ... A lot of people will turn to things like the semantic Web as a possible
answer to that.“ - Google Vice President of Search Products & User Experience Marissa Mayer	




an initiative launched on 2 June 2011 by Bing, Google and Yahoo!
to "create and support a common set of schemas for
structured
data markup on web pages."


Freebase is an open, Creative Commons licensed repository
of structured data of almost 22 million entities. An entity is a
single person, place, or thing connected by a graph.




The Knowledge Graph is a collection of information sources that
help discern a user’s specified intent with each individual query.
The graph is actually an encyclopedia with structured              http://schema.org/docs/full.html	
information obtained from the web. (currently, 200 million
entities)	
                                                                                                            20
Case Studies
Apple’s Siri
Ask Siri how Apple recorded the best quarter in history for a tech company, and her answer should be: "Me."	



Siri (Speech Interpretation and Recognition Interface) is        Knowledge Navigator (1987)
an intelligent personal assistant and knowledge                  a concept described by former Apple Computer CEO John
navigator which works as an application for Apple's iOS.         Sculley in his 1987 book, Odyssey.	

A Brief History
- In December 2007 Siri, Inc. was formed by Dag Kittlaus
(CEO), Adam Cheyer (VP Engineering), and Tom Gruber
(CTO/VP Design).
- Siri Inc. went after funding and by November 2009 it had
secured $15.5 million investment, resulted in the creation of
the first Siri application, which debuted on the iPhone 3GS in
February 2010.
- Siri acquired by Apple; iPhone becomes the Virtual Personal
Assistant




                                                                 (Source: http://www.youtube.com/watch?v=QRH8eimU_20)	

                                                                                                                          21
Case Studies
Active Ontology
A processing formalism where distinct processing elements are arranged according to ontology notions;
an execution environment.	

                                                      Basic concepts
                                                       * Ontology : A data structure
                                                          - Formal representation for domain knowledge
                                                          - Classes, attributes, relations
                                                       * Active Ontology : A processing environment
                                                         - Processing elements arranged according to ontology
                                                            notions
                                                         - Communication channels
                                                                                    P    movie



                                                          P      genre         P     actor         P      rating



                                                          rule set
                                                              rule
                                                                rule
                                                                  rule
                                                                 condition
                                                                   condition
                                                                     condition
                                                                   action
                                                                     action
                                                                       action
                                                                                             (Baur et al., 2007)	
                                                                                                            22
Why
Linked Data and Open Government Data




                                       23
Linked	
  Data	
  life	
  cycles	
  

1               2              3                4               5                   6

      data          modeling       publishing       discovery       integration         use cases
    awareness




thedatahub      Neologism      Google Refine        VoID        LATC 24/7         datacatalogs

LOD cloud       DataCube       RDB2RDF              DCAT        duke              data.gov

                prefix.cc                           Sindice     Sig.ma            data.gov.uk

                                                    CKAN
Today
This Presentation .....

Conceptual overview   Case Studies   The Semantic Web   What We Will Do




                                                                      25
Reality Check
Data.gov in crisis




 Data.gov, along with a number of other data-related sites of the government
 such as USAspending.gov and Apps.gov, are slated to be shut down due to
 budget cuts. The current annual budget of $37 million will be reduced to $2
 million. – (Guardian April 11)
                                                                               26
Reality Check in Korea
고려 사항

1   정부의 역할: 시스템 구축 vs 생태계 구축             - 통제가 아닌 효율적인 서비스 지향
                                         - 데이터 공개 및 연계를 위한 로드맵 수립


                                         - 정부기관의 데이터 소유 인식 전환 필요
2   데이터 플랫폼: 정부 vs 민간 vs 커뮤니티            - 자발적인 참여와 소비를 촉진하는 전략 필요


                                         - 데이터 범주에 따른 차별화된 공개 전략
3   데이터 민감성: WikiLeaks vs Open Data      - 데이터의 활용에 따른 최적화된 서비스 모델


                                         - 서비스 범위에 따른 구축비용/운영 모델
4   서비스 범위: Domestic vs International    - 국제 표준에 기반한 데이터 접근 서비스 제공


                                          - 통계 기반 시각화에 한정된 모델 지양
5   데이터 내용: 통계/수치 데이터 vs 정보형 데이터          - 데이터 특성에 맞는 기술 적용 모델 수립


                                                 - 지능적인 데이터 매쉬업 지원을
6   데이터 형식: human-readable vs machine-readable   위한 데이터 모델링 검토




                                                                    27
Conceptual Architecture
  Vision of Government Open Data
 “realise significant economic benefits by enabling businesses and non-profit organisations to build
 innovative applications and websites using public data.”	




                                                                                             28
(Ding et al., 2012)
Conceptual Architecture
  Roadmap of linked open government data
 “the combination of machine power and human power and deliver higher-quality data to a wide
 range of data consumers via visualization, mashups, and more.”	




                                                                                         29
(Ding et al., 2012)
Summary
Data on the Web


  Data is information about things



  Data is something machines can process



  Data drives applications (e.g. web sites, mobile services)



  Data is relations among things


                                                               30
Summary
Open Data vs Linked Data

 Open Data starts with making available the data that you already have, in whatever format.



                              •  Equal access for all
     Open Data                •  Licensing, legal issues
                              •  Transparency
                              •  Changing the way government works

                              •  URIs
     Linked Data              •  HTTPs
                              •  RDF vocabularies
                              •  Standards




                                                                                         31
What We Will Do
Interdisciplinary Collaboration




                   Difficult



 Concluding Remarks
 Hope is not a strategy and the “change” has been
 change for the worse, and not better.              32
References
- Charles Baur, Adam Cheyer, Didier Guzzoni, Active, a platform for building intelligent software
- Noor Huijboom and Tijs Van den Broek, Open Data: an international comparison of strategies, European journal of ePractices,
  March/April 2011
- Li Ding, Vassilios Peristeras, and Michael Hausenblas, Linked Open Government Data, IEEE Intelligent Systems, May/June 2012

-  Page 1: http://www.w3.org/DesignIssues/diagrams/websci/Marius%20Watz%20-%20Web%20Science%20artwork.png
-  Page 4: http://www.go-gulf.com/60seconds.jpg
-  Page 9: http://cloud.frontpagemag.com/wp-content/uploads/2012/03/obama11.jpg
-  Page 27: http://www.patentlyapple.com/.a/6a0120a5580826970c0168e5ccdd81970c-800wi
-  Page 29: http://programminggeeks.com/wp-content/uploads/2010/05/Programming-Geeks-Web-Science.jpg
-  Page 29: http://3.bp.blogspot.com/-C0Kyck90Djo/T4KZTg3k1XI/AAAAAAAAAsE/RUp165S0FCQ/s1600/Commitment.jpeg

Page 2 Case Studies

-  http://www.guardian.co.uk/commentisfree/2012/aug/03/london-2012-olympics-open-data
-  http://www.bbc.co.uk/news/uk-19050139
-  http://london2012.nytimes.com/results
-  http://www.guardian.co.uk/sport/interactive/2012/jul/23/could-you-be-a-medallist
-  http://www.guardian.co.uk/sport/datablog/2012/aug/13/olympics-2012-data-journalism
-  http://www.guardian.co.uk/sport/datablog/interactive/2012/jul/26/london-2012-price-olympic-games-visualised




                                                                                                                   33
For more information
contact Haklae Kim via

haklae.kim@gmail.com
Twitter: haklaekim

Or read up on the
sonagi blog at:

http://blogweb.co.kr

http://thedatahub.kr

More Related Content

What's hot (20)

PDF
Google scholar
Abdulrahaman Momin
 
PPTX
Decision trees
Jagjit Wilku
 
PPTX
Clustering in data Mining (Data Mining)
Mustafa Sherazi
 
PDF
Data Mining & Data Warehousing Lecture Notes
FellowBuddy.com
 
PPTX
Citation Databases & Research Metrics U-4.pptx
Ramakrishna1983
 
PPTX
Data Visualization
Marco Torchiano
 
PDF
K means Clustering
Edureka!
 
PPT
4.3 multimedia datamining
Krish_ver2
 
PDF
Clinical prediction models: development, validation and beyond
Maarten van Smeden
 
PDF
Data Mining Module 3 Business Analtics..pdf
Jayanti Pande
 
PPTX
Author Level Metrics
Dr. M Vijayakumar
 
PPTX
Graph databases
Vinoth Kannan
 
PPTX
Clique
sk_klms
 
PPTX
Citation analysis
Masoud Mohammadi
 
PDF
Performance Metrics for Machine Learning Algorithms
Kush Kulshrestha
 
PPT
Clustering
NLPseminar
 
PPTX
Citation Database
Dr. Utpal Das
 
PPTX
Cluster Analysis
Kamal Acharya
 
PPTX
Prediction of heart disease using machine learning.pptx
kumari36
 
PPTX
Information retrieval dynamic indexing
Nadia Nahar
 
Google scholar
Abdulrahaman Momin
 
Decision trees
Jagjit Wilku
 
Clustering in data Mining (Data Mining)
Mustafa Sherazi
 
Data Mining & Data Warehousing Lecture Notes
FellowBuddy.com
 
Citation Databases & Research Metrics U-4.pptx
Ramakrishna1983
 
Data Visualization
Marco Torchiano
 
K means Clustering
Edureka!
 
4.3 multimedia datamining
Krish_ver2
 
Clinical prediction models: development, validation and beyond
Maarten van Smeden
 
Data Mining Module 3 Business Analtics..pdf
Jayanti Pande
 
Author Level Metrics
Dr. M Vijayakumar
 
Graph databases
Vinoth Kannan
 
Clique
sk_klms
 
Citation analysis
Masoud Mohammadi
 
Performance Metrics for Machine Learning Algorithms
Kush Kulshrestha
 
Clustering
NLPseminar
 
Citation Database
Dr. Utpal Das
 
Cluster Analysis
Kamal Acharya
 
Prediction of heart disease using machine learning.pptx
kumari36
 
Information retrieval dynamic indexing
Nadia Nahar
 

Viewers also liked (20)

PDF
Linked Open Data Principles, Technologies and Examples
Open Data Support
 
KEY
Publishing Linked Open Data in 15 minutes
Alvaro Graves
 
PPT
Linked Open Data for Libraries
Lukas Koster
 
PPTX
Self-service Linked Government Data
Fadi Maali
 
PDF
Linked (Open) Data
Bernhard Haslhofer
 
PPTX
Survey on Common Strategies of Vocabulary Reuse in Linked Open Data Modeling ...
JohannWanja
 
PPT
A few metrics about Open Data in the cultural sector
Joris Pekel
 
PDF
The four pillars of Lean Enterprise Execution
Marco Tedone
 
PPT
Asug Gov Sig Data Quality Metrics Report Sapphire 2008
nnorthrup
 
PDF
La resiliència. Primer abordatge amb els professionals de mesures penals alte...
Departament de Justícia. Generalitat de Catalunya.
 
PDF
7139 hib brochure v10 (1)
Linda Madisone
 
PPTX
Fem pac1 david_gómezmartínez
DGM86
 
PDF
TELES AG: Geschäftsbericht 2013 01
TELES AG Informationstechnologien
 
PPTX
Software en Odontología
Debora
 
PPTX
Dynamik im E-Mail Marketing Toolmarkt - so finden Sie das richtige Versandtool
netnomics GmbH
 
DOC
Intervenciones comunitarias
Lorena Alvarez
 
PDF
Silke Gester: Quo vadis, DaF?
VeRBuMPublishing
 
PPTX
Linked Open Data
Lars Marius Garshol
 
PDF
Cuestionario impact. gráficas.docx
vanderweb
 
Linked Open Data Principles, Technologies and Examples
Open Data Support
 
Publishing Linked Open Data in 15 minutes
Alvaro Graves
 
Linked Open Data for Libraries
Lukas Koster
 
Self-service Linked Government Data
Fadi Maali
 
Linked (Open) Data
Bernhard Haslhofer
 
Survey on Common Strategies of Vocabulary Reuse in Linked Open Data Modeling ...
JohannWanja
 
A few metrics about Open Data in the cultural sector
Joris Pekel
 
The four pillars of Lean Enterprise Execution
Marco Tedone
 
Asug Gov Sig Data Quality Metrics Report Sapphire 2008
nnorthrup
 
La resiliència. Primer abordatge amb els professionals de mesures penals alte...
Departament de Justícia. Generalitat de Catalunya.
 
7139 hib brochure v10 (1)
Linda Madisone
 
Fem pac1 david_gómezmartínez
DGM86
 
TELES AG: Geschäftsbericht 2013 01
TELES AG Informationstechnologien
 
Software en Odontología
Debora
 
Dynamik im E-Mail Marketing Toolmarkt - so finden Sie das richtige Versandtool
netnomics GmbH
 
Intervenciones comunitarias
Lorena Alvarez
 
Silke Gester: Quo vadis, DaF?
VeRBuMPublishing
 
Linked Open Data
Lars Marius Garshol
 
Cuestionario impact. gráficas.docx
vanderweb
 

Similar to Overview of Open Data, Linked Data and Web Science (20)

PDF
Big Data on the Web – What We Will Do
Haklae Kim
 
PDF
Open Government Data, Linked Data, and the Missing Blocks in Korea
Haklae Kim
 
PDF
Open Data how to
Lorenzo Benussi
 
PPT
데이터, 미래 그리고 현재
Haklae Kim
 
PDF
What is opendata
Lorenzo Benussi
 
PDF
US National Archives & Open Government Data
3 Round Stones
 
PPT
Open Data and Linked Data
Haklae Kim
 
PDF
US EPA OSWER Linked Data Workshop 1-Feb-2013
3 Round Stones
 
PDF
Opendata challeges&opportunites
Maurizio Napolitano
 
PPT
EDF2012 Nigel Shadbolt - Transparency and Open Data
European Data Forum
 
PDF
Open Data Conference - Andrew Stott - Open Data Position in the EU/UK
Opening-up.eu
 
PDF
The new flow of information
fabiolaeinhorn
 
PDF
W3C TPAC 2012 Breakout Session on Government Linked Data
3 Round Stones
 
PDF
Government Linked Data: A Tipping Point for the Semantic Web
Nigel Shadbolt
 
PDF
Sentara Linked Data Workshop - Sept 10, 2012
3 Round Stones
 
PPTX
ISWC 2012 Keynote
Jeanne Holm
 
PPT
B2: Open Up: Open Data in the Public Sector
Marieke Guy
 
PDF
Open Data for Transportation Agencies
Novavia Solutions
 
PPTX
Commercial opportunities in the Open Data sector, Kiev
Steven De Costa
 
PDF
Dane Wright, London Borough of Brent - open and linked data
Socitm
 
Big Data on the Web – What We Will Do
Haklae Kim
 
Open Government Data, Linked Data, and the Missing Blocks in Korea
Haklae Kim
 
Open Data how to
Lorenzo Benussi
 
데이터, 미래 그리고 현재
Haklae Kim
 
What is opendata
Lorenzo Benussi
 
US National Archives & Open Government Data
3 Round Stones
 
Open Data and Linked Data
Haklae Kim
 
US EPA OSWER Linked Data Workshop 1-Feb-2013
3 Round Stones
 
Opendata challeges&opportunites
Maurizio Napolitano
 
EDF2012 Nigel Shadbolt - Transparency and Open Data
European Data Forum
 
Open Data Conference - Andrew Stott - Open Data Position in the EU/UK
Opening-up.eu
 
The new flow of information
fabiolaeinhorn
 
W3C TPAC 2012 Breakout Session on Government Linked Data
3 Round Stones
 
Government Linked Data: A Tipping Point for the Semantic Web
Nigel Shadbolt
 
Sentara Linked Data Workshop - Sept 10, 2012
3 Round Stones
 
ISWC 2012 Keynote
Jeanne Holm
 
B2: Open Up: Open Data in the Public Sector
Marieke Guy
 
Open Data for Transportation Agencies
Novavia Solutions
 
Commercial opportunities in the Open Data sector, Kiev
Steven De Costa
 
Dane Wright, London Borough of Brent - open and linked data
Socitm
 

More from Haklae Kim (20)

PPTX
The Semantic Web and Linked Open Data
Haklae Kim
 
PDF
OKFN Korea 소개자료
Haklae Kim
 
PDF
센서데이터 웹으로의 비상
Haklae Kim
 
PPT
공공데이터 개방현황 및 포털 발전방향
Haklae Kim
 
PDF
개인건강기록관리 플랫폼에서 링크드 데이터의 활용
Haklae Kim
 
PDF
Extended open data and big data in public sector
Haklae Kim
 
PDF
대한민국, 잇다!
Haklae Kim
 
PDF
Linked Data 이야기
Haklae Kim
 
PDF
Linked Data 이야기
Haklae Kim
 
PDF
오픈 데이터 현황과 과제
Haklae Kim
 
PPT
서울시 링크드 데이터 서비스 사례 소개-모델링
Haklae Kim
 
PPT
서울시 링크드 데이터 서비스 사례 소개-모델링개요
Haklae Kim
 
PPTX
서울시 Linked Data 서비스 소개-열린데이터광장
Haklae Kim
 
PPT
서울시 링크드 데이터 서비스 소개-Overview
Haklae Kim
 
PDF
오픈 데이터에서 링크드 데이터로 진화
Haklae Kim
 
PPT
오픈 데이터에서 링크드 데이터로 진화
Haklae Kim
 
PDF
Data science-2013-heekim
Haklae Kim
 
PDF
Data science (조명대)
Haklae Kim
 
PPT
시민이 함께 만들어가는 서울 열린 데이터광장
Haklae Kim
 
PPT
시민이 함께 만들어가는 서울 열린 데이터광장(서울시청 임성우)
Haklae Kim
 
The Semantic Web and Linked Open Data
Haklae Kim
 
OKFN Korea 소개자료
Haklae Kim
 
센서데이터 웹으로의 비상
Haklae Kim
 
공공데이터 개방현황 및 포털 발전방향
Haklae Kim
 
개인건강기록관리 플랫폼에서 링크드 데이터의 활용
Haklae Kim
 
Extended open data and big data in public sector
Haklae Kim
 
대한민국, 잇다!
Haklae Kim
 
Linked Data 이야기
Haklae Kim
 
Linked Data 이야기
Haklae Kim
 
오픈 데이터 현황과 과제
Haklae Kim
 
서울시 링크드 데이터 서비스 사례 소개-모델링
Haklae Kim
 
서울시 링크드 데이터 서비스 사례 소개-모델링개요
Haklae Kim
 
서울시 Linked Data 서비스 소개-열린데이터광장
Haklae Kim
 
서울시 링크드 데이터 서비스 소개-Overview
Haklae Kim
 
오픈 데이터에서 링크드 데이터로 진화
Haklae Kim
 
오픈 데이터에서 링크드 데이터로 진화
Haklae Kim
 
Data science-2013-heekim
Haklae Kim
 
Data science (조명대)
Haklae Kim
 
시민이 함께 만들어가는 서울 열린 데이터광장
Haklae Kim
 
시민이 함께 만들어가는 서울 열린 데이터광장(서울시청 임성우)
Haklae Kim
 

Recently uploaded (20)

PDF
Achieving Consistent and Reliable AI Code Generation - Medusa AI
medusaaico
 
PDF
Building Real-Time Digital Twins with IBM Maximo & ArcGIS Indoors
Safe Software
 
PPTX
COMPARISON OF RASTER ANALYSIS TOOLS OF QGIS AND ARCGIS
Sharanya Sarkar
 
PPTX
AUTOMATION AND ROBOTICS IN PHARMA INDUSTRY.pptx
sameeraaabegumm
 
PDF
Jak MŚP w Europie Środkowo-Wschodniej odnajdują się w świecie AI
dominikamizerska1
 
PPTX
WooCommerce Workshop: Bring Your Laptop
Laura Hartwig
 
PPTX
UiPath Academic Alliance Educator Panels: Session 2 - Business Analyst Content
DianaGray10
 
PDF
Newgen Beyond Frankenstein_Build vs Buy_Digital_version.pdf
darshakparmar
 
PPTX
"Autonomy of LLM Agents: Current State and Future Prospects", Oles` Petriv
Fwdays
 
PDF
"AI Transformation: Directions and Challenges", Pavlo Shaternik
Fwdays
 
PDF
Using FME to Develop Self-Service CAD Applications for a Major UK Police Force
Safe Software
 
PDF
SWEBOK Guide and Software Services Engineering Education
Hironori Washizaki
 
PDF
Bitcoin for Millennials podcast with Bram, Power Laws of Bitcoin
Stephen Perrenod
 
PDF
DevBcn - Building 10x Organizations Using Modern Productivity Metrics
Justin Reock
 
PDF
From Code to Challenge: Crafting Skill-Based Games That Engage and Reward
aiyshauae
 
PPTX
From Sci-Fi to Reality: Exploring AI Evolution
Svetlana Meissner
 
PDF
New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
PDF
Newgen 2022-Forrester Newgen TEI_13 05 2022-The-Total-Economic-Impact-Newgen-...
darshakparmar
 
PPTX
Q2 FY26 Tableau User Group Leader Quarterly Call
lward7
 
PDF
Agentic AI lifecycle for Enterprise Hyper-Automation
Debmalya Biswas
 
Achieving Consistent and Reliable AI Code Generation - Medusa AI
medusaaico
 
Building Real-Time Digital Twins with IBM Maximo & ArcGIS Indoors
Safe Software
 
COMPARISON OF RASTER ANALYSIS TOOLS OF QGIS AND ARCGIS
Sharanya Sarkar
 
AUTOMATION AND ROBOTICS IN PHARMA INDUSTRY.pptx
sameeraaabegumm
 
Jak MŚP w Europie Środkowo-Wschodniej odnajdują się w świecie AI
dominikamizerska1
 
WooCommerce Workshop: Bring Your Laptop
Laura Hartwig
 
UiPath Academic Alliance Educator Panels: Session 2 - Business Analyst Content
DianaGray10
 
Newgen Beyond Frankenstein_Build vs Buy_Digital_version.pdf
darshakparmar
 
"Autonomy of LLM Agents: Current State and Future Prospects", Oles` Petriv
Fwdays
 
"AI Transformation: Directions and Challenges", Pavlo Shaternik
Fwdays
 
Using FME to Develop Self-Service CAD Applications for a Major UK Police Force
Safe Software
 
SWEBOK Guide and Software Services Engineering Education
Hironori Washizaki
 
Bitcoin for Millennials podcast with Bram, Power Laws of Bitcoin
Stephen Perrenod
 
DevBcn - Building 10x Organizations Using Modern Productivity Metrics
Justin Reock
 
From Code to Challenge: Crafting Skill-Based Games That Engage and Reward
aiyshauae
 
From Sci-Fi to Reality: Exploring AI Evolution
Svetlana Meissner
 
New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
Newgen 2022-Forrester Newgen TEI_13 05 2022-The-Total-Economic-Impact-Newgen-...
darshakparmar
 
Q2 FY26 Tableau User Group Leader Quarterly Call
lward7
 
Agentic AI lifecycle for Enterprise Hyper-Automation
Debmalya Biswas
 

Overview of Open Data, Linked Data and Web Science

  • 1. Making Emergent Creativity Overview of Open Data, Linked Data and Web Science Haklae Kim, PhD. , August 2012
  • 2. Best Practices London 2012: Open Data Olympics 2
  • 3. Today This Presentation ..... Conceptual overview Case Studies The Semantic Web What We Will Do 3
  • 4. 4
  • 5. Let’s Start Big Data “data that becomes large enough that it cannot be processed using conventional methods” “Big Data is like Sex in High School–Lots of people are talking about it, but few are having it.” -Eric Hansen, SiteSpect founder and CEO 5
  • 6. Definition What is Open (Government) Data? “Open” freely material (data) is open if it can be used, reused and redistributed by anyone “Government data” produced or data and information commissioned by government or government controlled entities. Source: Open Knowledge Foundation, 2010 6
  • 7. •  Transparency •  Participation •  Collaboration “My administration is committed to creating an unprecedented level of openness in Government.” – Barack Obama “Memorandum for the Heads of Executive Departments and Agencies – Transparency and Open Government” Jan 2009
  • 8. Today This Presentation ..... Conceptual overview Case Studies The Semantic Web What We Will Do 8
  • 10. Case Studies Top 10 Apps: Data.gov.uk Where Does My Money Go OurProperty.co.uk OpenlyLocal.com PlanningAlerts.com 10 Source: Telegraph, 2010, http://www.telegraph.co.uk/technology/news/7044147/Data.gov.uk-Top-Ten-Apps-so-far.html
  • 11. Public Sector Dataset The State of Open Government Data Source: http://tinyurl.com/44rub56 11
  • 12. Open Data Strategies Open data instruments “The application of the four types of instruments by the five countries is depicted – the larger the circle the more instruments are applied” – Huijboom & Van den Broek, 2011. Education and training Voluntary approaches US AU ES UK DK DK UK ES AU US ES DK US ES DK AU AU UK UK US Economic instruments Legislation and control 12
  • 13. Critical factors Drivers and barries of open data policy implementation 1 Strategies and experience in front runner countries Closed government culture 2 Political leadership Privacy legislation 3 Regional initiatives Limited quality of data 4 Citizen initiatives Limited user-friendliness/information overload 5 Market initiatives Lack of standardization of open data policy 6 Emerging technologies Security threats 7 European legislation Existing charging models 8 Thought leaders Uncertain economic impact 9 Possibility of monitoring government Digital divide 10 Budgets cuts Network overload Source:  Huijboom  and  Van  den  Broek,  2011   13
  • 14. Today This Presentation ..... Conceptual overview Case Studies The Semantic Web What We Will Do 14
  • 15. Let’s Start Web in Transition “a steady progression from a document-centric Web to one that is data-centric, including the mediation of semantics” (Source: Mike, 2007) 15
  • 16. Overview The Semantic Web & Linked Data “The Semantic Web isn't just about putting data on the web. It is about making links, so that a person or machine can explore the web of data.  With linked data, when you have some of it, you can find other, related, data” - TBL. 5 Stars Open linked data ★ Make your stuff available on the Web ★★ Make it available as structured data ★★★ Use open, standard formats (instead of excel) ★★★★ Use a open data format – URLs, descriptions ★★★★★ Link your data to other people’s data 16
  • 17. Overview Growth of Interlinks … Linked Data provides the means to reach the goal of the Semantic Web – “the emergence of a Web of Data” 2007-05-01 2007-10-08 2007-11-10 2008-02-28 2008-03-31 2008-09-18 2009-03-05 2009-03-27 2009-07-14 2010-09-22 17
  • 18. Structured Wikipedia Multimedia Content DBpedia BBC Commercial Product Government Data Best Buy UK Gov October, 2011 18 295 interlinked datasets, approximately 31 billions triples
  • 19. Question What is the Semantic Web for? Standards Inference Search Intelligence 19
  • 20. Case Studies Google’s Semantic Search People should be able to ask questions and we should understand their meaning, or they should be able to talk about things at a conceptual level. ... A lot of people will turn to things like the semantic Web as a possible answer to that.“ - Google Vice President of Search Products & User Experience Marissa Mayer an initiative launched on 2 June 2011 by Bing, Google and Yahoo! to "create and support a common set of schemas for structured data markup on web pages." Freebase is an open, Creative Commons licensed repository of structured data of almost 22 million entities. An entity is a single person, place, or thing connected by a graph. The Knowledge Graph is a collection of information sources that help discern a user’s specified intent with each individual query. The graph is actually an encyclopedia with structured http://schema.org/docs/full.html information obtained from the web. (currently, 200 million entities) 20
  • 21. Case Studies Apple’s Siri Ask Siri how Apple recorded the best quarter in history for a tech company, and her answer should be: "Me." Siri (Speech Interpretation and Recognition Interface) is Knowledge Navigator (1987) an intelligent personal assistant and knowledge a concept described by former Apple Computer CEO John navigator which works as an application for Apple's iOS. Sculley in his 1987 book, Odyssey. A Brief History - In December 2007 Siri, Inc. was formed by Dag Kittlaus (CEO), Adam Cheyer (VP Engineering), and Tom Gruber (CTO/VP Design). - Siri Inc. went after funding and by November 2009 it had secured $15.5 million investment, resulted in the creation of the first Siri application, which debuted on the iPhone 3GS in February 2010. - Siri acquired by Apple; iPhone becomes the Virtual Personal Assistant (Source: http://www.youtube.com/watch?v=QRH8eimU_20) 21
  • 22. Case Studies Active Ontology A processing formalism where distinct processing elements are arranged according to ontology notions; an execution environment. Basic concepts * Ontology : A data structure - Formal representation for domain knowledge - Classes, attributes, relations * Active Ontology : A processing environment - Processing elements arranged according to ontology notions - Communication channels P movie P genre P actor P rating rule set rule rule rule condition condition condition action action action (Baur et al., 2007) 22
  • 23. Why Linked Data and Open Government Data 23
  • 24. Linked  Data  life  cycles   1 2 3 4 5 6 data modeling publishing discovery integration use cases awareness thedatahub Neologism Google Refine VoID LATC 24/7 datacatalogs LOD cloud DataCube RDB2RDF DCAT duke data.gov prefix.cc Sindice Sig.ma data.gov.uk CKAN
  • 25. Today This Presentation ..... Conceptual overview Case Studies The Semantic Web What We Will Do 25
  • 26. Reality Check Data.gov in crisis Data.gov, along with a number of other data-related sites of the government such as USAspending.gov and Apps.gov, are slated to be shut down due to budget cuts. The current annual budget of $37 million will be reduced to $2 million. – (Guardian April 11) 26
  • 27. Reality Check in Korea 고려 사항 1 정부의 역할: 시스템 구축 vs 생태계 구축 - 통제가 아닌 효율적인 서비스 지향 - 데이터 공개 및 연계를 위한 로드맵 수립 - 정부기관의 데이터 소유 인식 전환 필요 2 데이터 플랫폼: 정부 vs 민간 vs 커뮤니티 - 자발적인 참여와 소비를 촉진하는 전략 필요 - 데이터 범주에 따른 차별화된 공개 전략 3 데이터 민감성: WikiLeaks vs Open Data - 데이터의 활용에 따른 최적화된 서비스 모델 - 서비스 범위에 따른 구축비용/운영 모델 4 서비스 범위: Domestic vs International - 국제 표준에 기반한 데이터 접근 서비스 제공 - 통계 기반 시각화에 한정된 모델 지양 5 데이터 내용: 통계/수치 데이터 vs 정보형 데이터 - 데이터 특성에 맞는 기술 적용 모델 수립 - 지능적인 데이터 매쉬업 지원을 6 데이터 형식: human-readable vs machine-readable 위한 데이터 모델링 검토 27
  • 28. Conceptual Architecture Vision of Government Open Data “realise significant economic benefits by enabling businesses and non-profit organisations to build innovative applications and websites using public data.” 28 (Ding et al., 2012)
  • 29. Conceptual Architecture Roadmap of linked open government data “the combination of machine power and human power and deliver higher-quality data to a wide range of data consumers via visualization, mashups, and more.” 29 (Ding et al., 2012)
  • 30. Summary Data on the Web Data is information about things Data is something machines can process Data drives applications (e.g. web sites, mobile services) Data is relations among things 30
  • 31. Summary Open Data vs Linked Data Open Data starts with making available the data that you already have, in whatever format. •  Equal access for all Open Data •  Licensing, legal issues •  Transparency •  Changing the way government works •  URIs Linked Data •  HTTPs •  RDF vocabularies •  Standards 31
  • 32. What We Will Do Interdisciplinary Collaboration Difficult Concluding Remarks Hope is not a strategy and the “change” has been change for the worse, and not better. 32
  • 33. References - Charles Baur, Adam Cheyer, Didier Guzzoni, Active, a platform for building intelligent software - Noor Huijboom and Tijs Van den Broek, Open Data: an international comparison of strategies, European journal of ePractices, March/April 2011 - Li Ding, Vassilios Peristeras, and Michael Hausenblas, Linked Open Government Data, IEEE Intelligent Systems, May/June 2012 -  Page 1: http://www.w3.org/DesignIssues/diagrams/websci/Marius%20Watz%20-%20Web%20Science%20artwork.png -  Page 4: http://www.go-gulf.com/60seconds.jpg -  Page 9: http://cloud.frontpagemag.com/wp-content/uploads/2012/03/obama11.jpg -  Page 27: http://www.patentlyapple.com/.a/6a0120a5580826970c0168e5ccdd81970c-800wi -  Page 29: http://programminggeeks.com/wp-content/uploads/2010/05/Programming-Geeks-Web-Science.jpg -  Page 29: http://3.bp.blogspot.com/-C0Kyck90Djo/T4KZTg3k1XI/AAAAAAAAAsE/RUp165S0FCQ/s1600/Commitment.jpeg Page 2 Case Studies -  http://www.guardian.co.uk/commentisfree/2012/aug/03/london-2012-olympics-open-data -  http://www.bbc.co.uk/news/uk-19050139 -  http://london2012.nytimes.com/results -  http://www.guardian.co.uk/sport/interactive/2012/jul/23/could-you-be-a-medallist -  http://www.guardian.co.uk/sport/datablog/2012/aug/13/olympics-2012-data-journalism -  http://www.guardian.co.uk/sport/datablog/interactive/2012/jul/26/london-2012-price-olympic-games-visualised 33
  • 34. For more information contact Haklae Kim via [email protected] Twitter: haklaekim Or read up on the sonagi blog at: http://blogweb.co.kr http://thedatahub.kr