Open Data
technical approach
    Maurizio Napolitano
     <napo@fbk.eu>



    Internet Governance Forum Italia
    10-12 Novembre 2011
IGF Trento
                  open community data vs closed data




http://tools.geofabrik.de/mc/?mt0=mapnik&mt1=googlemap&lon=11.11372&lat=46.07098&zoom=18
wish list fo the next year
• Open Government Data as a Right
• More Schemas (Knowledge APIs) – keep it focused, let’s not try to boil
  the ocean
• Open Data as a Platform, Not a Commodity
• Massive Interconnection Between Open Data Sites
• Open Corporate Data (for and by Corporates)
• Standards (e.g. for catalog metadata) for Data Portals and Data Hubs
• Open Data for Growth – making clear the the connection
• Strong international norms for data inventories
• Organizational identifiers – Dunn & Bradstreet should be replaced with
  open data
• MiData – getting personal data out of corporates and government back
  into the hands of the people whose data it is


 http://blog.okfn.org/2011/10/23/open-data-wishlist-for-the-next-year/
Tecnical issue
• Open Government Data as a Right
• More Schemas (Knowledge APIs) – keep it focused, let’s not try
  to boil the ocean
• Open Data as a Platform, Not a Commodity
• Massive Interconnection Between Open Data Sites
• Open Corporate Data (for and by Corporates)
• Standards (e.g. for catalog metadata) for Data Portals and Data
  Hubs
• Open Data for Growth – making clear the the connection
• Strong international norms for data inventories
• Organizational identifiers – Dunn & Bradstreet should be replaced with open data
• MiData – getting personal data out of corporates and government back into the hands
  of the people whose data it is
Open data features


Complete Primary Timely Accessible
           Machine-readable
Non-proprietary License-free Reviewable

Discoverable Permanent Access
Redistribution Reuse Description
Metadata Attribution Integrity
Absence of Technological Restriction
Open Linked Data
★ make your stuff available on the Web

(whatever format) under an open license

★★ make it available as structured data

(e.g., Excel instead of image scan of a table)
                                                     Tim Berners Lee
★★★ use non-proprietary formats

(e.g., CSV instead of Excel)

★★★★ use URIs to identify things, so that people can point at your
stuff

★★★★★ link your data to other data to provide context
       http://lab.linkeddata.deri.ie/2010/star-scheme-by-example/
Linked Data Ingredients

                          Things have names
                          (a person, a city, a
                          company)
                          Let this names start
                          with http://
                          Represent data
                          (relations among
                          things) as RDF


http://www.w3.org/DesignIssues/LinkedData.html
Example
Example
Example
Example
Example




●
  Access/browse a global interconnected DB
●
  Merge, mix data
●
  Perform powerful cross-domain queries
The Linking Open Data cloud diagram




http://richard.cyganiak.de/2007/10/lod/
The Linking Open Data cloud diagram




   an italian dataset: ISTAT immigration :)
How to join a dataset in this cloud?

First [...] publish data according to the Linked Data principles.


1.Please add it to CKAN
2.[...]
3.[...]
4.[…]
5.The dataset will be included
next update of the diagram.

http://richard.cyganiak.de/2007/10/lod/#how-to-join
CKAN = The Data Hub




http://www.thedatahub.org
Data portals
Italian data portals
[OT]




       Thanks!
Next steps to the five stars ...

                    Il Trentino deve farsi protagonista del
                    movimento Open Data in Italia e, nel
                    2012, avrà l'occasione di mettersi alla
                    guida dell'avanguardia concentrata sulla
                    creazione di SEMPLICI best practices
                    per una semantica condivisa! (Linked
Federico            open data for dummies: questa è la
Morando             sfida.)
NEXA
                     http://www.igfitalia2011.it/programma/open-trentino




    https://www.facebook.com/federico.muras/posts/171226996304428
Trentino and the semantic web




                                                    TECHNOLOGIES
Nicola Guarino – LOA CNR                 OKKAM




                                PEOPLE
[…]                                      SINDICE
Fausto Giunchiglia – UNITN               ONTOTEXT
[...]                                    MOKI
Paolo Bouquet – UNITN                    [...]
[...]
Luciano Serafini – FBK – DKM
[...]
Bernardo Magnini – FBK – HLT
[...]
Giovanni Tummarello – FBK – WED
[…]
Pavel Shvaiko – Informatica Trentina
[...]
End

    Thanks to
•    Giulio De Petra – Informatica Trentina
•    Federico Morando - NEXA
•    Lorenzo Benussi - TOP-IX
•    Michele Barbera – LinkedOpenData.it
•    Paolo Bouquet – Trentino Open Data
•    Open Knowledge Foundation

                     for the support in this presentation

Open Data - technical approach

  • 1.
    Open Data technical approach Maurizio Napolitano <[email protected]> Internet Governance Forum Italia 10-12 Novembre 2011
  • 2.
    IGF Trento open community data vs closed data http://tools.geofabrik.de/mc/?mt0=mapnik&mt1=googlemap&lon=11.11372&lat=46.07098&zoom=18
  • 3.
    wish list fothe next year • Open Government Data as a Right • More Schemas (Knowledge APIs) – keep it focused, let’s not try to boil the ocean • Open Data as a Platform, Not a Commodity • Massive Interconnection Between Open Data Sites • Open Corporate Data (for and by Corporates) • Standards (e.g. for catalog metadata) for Data Portals and Data Hubs • Open Data for Growth – making clear the the connection • Strong international norms for data inventories • Organizational identifiers – Dunn & Bradstreet should be replaced with open data • MiData – getting personal data out of corporates and government back into the hands of the people whose data it is http://blog.okfn.org/2011/10/23/open-data-wishlist-for-the-next-year/
  • 4.
    Tecnical issue • OpenGovernment Data as a Right • More Schemas (Knowledge APIs) – keep it focused, let’s not try to boil the ocean • Open Data as a Platform, Not a Commodity • Massive Interconnection Between Open Data Sites • Open Corporate Data (for and by Corporates) • Standards (e.g. for catalog metadata) for Data Portals and Data Hubs • Open Data for Growth – making clear the the connection • Strong international norms for data inventories • Organizational identifiers – Dunn & Bradstreet should be replaced with open data • MiData – getting personal data out of corporates and government back into the hands of the people whose data it is
  • 5.
    Open data features CompletePrimary Timely Accessible Machine-readable Non-proprietary License-free Reviewable Discoverable Permanent Access Redistribution Reuse Description Metadata Attribution Integrity Absence of Technological Restriction
  • 6.
    Open Linked Data ★make your stuff available on the Web (whatever format) under an open license ★★ make it available as structured data (e.g., Excel instead of image scan of a table) Tim Berners Lee ★★★ use non-proprietary formats (e.g., CSV instead of Excel) ★★★★ use URIs to identify things, so that people can point at your stuff ★★★★★ link your data to other data to provide context http://lab.linkeddata.deri.ie/2010/star-scheme-by-example/
  • 7.
    Linked Data Ingredients Things have names (a person, a city, a company) Let this names start with http:// Represent data (relations among things) as RDF http://www.w3.org/DesignIssues/LinkedData.html
  • 8.
  • 9.
  • 10.
  • 11.
  • 12.
    Example ● Access/browsea global interconnected DB ● Merge, mix data ● Perform powerful cross-domain queries
  • 13.
    The Linking OpenData cloud diagram http://richard.cyganiak.de/2007/10/lod/
  • 14.
    The Linking OpenData cloud diagram an italian dataset: ISTAT immigration :)
  • 15.
    How to joina dataset in this cloud? First [...] publish data according to the Linked Data principles. 1.Please add it to CKAN 2.[...] 3.[...] 4.[…] 5.The dataset will be included next update of the diagram. http://richard.cyganiak.de/2007/10/lod/#how-to-join
  • 16.
    CKAN = TheData Hub http://www.thedatahub.org
  • 17.
  • 18.
  • 19.
    [OT] Thanks!
  • 20.
    Next steps tothe five stars ... Il Trentino deve farsi protagonista del movimento Open Data in Italia e, nel 2012, avrà l'occasione di mettersi alla guida dell'avanguardia concentrata sulla creazione di SEMPLICI best practices per una semantica condivisa! (Linked Federico open data for dummies: questa è la Morando sfida.) NEXA http://www.igfitalia2011.it/programma/open-trentino https://www.facebook.com/federico.muras/posts/171226996304428
  • 21.
    Trentino and thesemantic web TECHNOLOGIES Nicola Guarino – LOA CNR OKKAM PEOPLE […] SINDICE Fausto Giunchiglia – UNITN ONTOTEXT [...] MOKI Paolo Bouquet – UNITN [...] [...] Luciano Serafini – FBK – DKM [...] Bernardo Magnini – FBK – HLT [...] Giovanni Tummarello – FBK – WED […] Pavel Shvaiko – Informatica Trentina [...]
  • 22.
    End Thanks to • Giulio De Petra – Informatica Trentina • Federico Morando - NEXA • Lorenzo Benussi - TOP-IX • Michele Barbera – LinkedOpenData.it • Paolo Bouquet – Trentino Open Data • Open Knowledge Foundation for the support in this presentation