SlideShare a Scribd company logo
iConference 2011 Archiving as a Service - A Model for the Provision of Shared Archiving Services Using Cloud Computing   Jan Askhoj –   janaskhoej[at]gmail.com Shigeo Sugimoto –  sugimoto[at]slis.tsukuba.ac.jp   Mitsuharu Nagamori –  nagamori[at]slis.tsukuba.ac.jp University of Tsukuba, Japan
The Rise of Cloud Computing Big business: Reported that the cloud computing market will grow to more than $150 billion in 2013 Gartner listed cloud computing as one of the most hyped technologies in 2009.  Many benefits: Reduced cost, increased storage, no software deployment, flexibility, mobility and allowing IT to shift focus. Cloud computing is being used increasingly for  content creation  and  storage . * Global Industry Analysts, 2010
A Cloud Definition (One of Many) Cloud Computing is an  abstracted, scalable  plat-form for service delivery. Cloud computing makes use of  existing technologies  that can be described via a  layered model. Access to both platform and services is available  via the internet . Availability, quality and number  of services are offered according to  agreements with a provider . -  Vaquero et al. 2009
Cloud Computing from an Archiving Perspective In the cloud, archives may not have knowledge of records creation hardware and software . How do we document such formats? Cloud Providers are good at managing data and hosting software.  But what if something happens? There are providers of services for  backup , but not for  preservation .   Can we find and read documents created and stored in the cloud in 10 years from now?
I found the document...  If only I knew how to access it!
Object of Research Providing a reference model for cloud based archiving that makes possible: Offering trusted storage and long term preservation as a cloud based service. Automatically providing preservation metadata and information packages for transfer of digital records. Extending preservation to as early in the records lifecycle as possible.
Current Archive Model: OAIS Reference Model for an Open Archival Information System (OAIS). Defines Entities, Relationships and Information Types in digital archives. Consultative Committee for Space Data Systems, 2002.
OAIS and the Cloud The OAIS Model does not cover the  use of a shared platform for storage , outside the control of an archive. Such functionality overlaps with several OAIS functional entities. An OAIS Archive does not cover  the early stages of the document lifecycle . With a shared platform, digital objects can be immediately accessible to an archive for early preservation planning. In OAIS,  Digital Objects  and  metadata are included in information packages . If Producer and Archive share a common platform, this is not necessary.
Hardware/Facilities Connectivity Abstraction OS   Virtualization Data Metadata Content Applications   APIs   Presentation (User facing) SaaS  (Software as a Service).  Users access applications via user-facing software or APIs.  PaaS  (Platform as a Service). Virtualized platform for executing applications and providing storage. IaaS  (Infrastructure as a Service). Hardware and Infrastructure.  A General Layered Model for Cloud Computing Services
Some Characteristics of the Layered Model In a layered model, each layer offers defined services to the layers above. Services are abstracted and interchangeable. Benefits:  - Makes it easy to offer and take advantage of defined levels of services. - Facilitates resource sharing - Facilitates migration
Archive Digital Object Digital Object Business System Storage Layer Simple Layered Cloud Archiving System Interaction Layer Trusted repository (bit-level integrity)
Expanding the Simple Model Storage  does not equal  preservation .  Information is needed to support: “ Viability, Renderability, Understandability, Authenticity, and Identity  of Digital Objects” (known in OAIS as an Information Package).
Proposed Four Layer Model Interaction Layer : User facing Archives/ Records Management Systems and Business Systems. Preservation Layer : Adds preservation information. Turns Digital Objects into Information Packages for use by Archives/Records Management Systems. SaaS Layer : Applications represent bit-strings as Digital Objects used by systems and users. PaaS Layer : Application platform and trusted repository for storing bit-strings.
Information Object Data Object Represent. Information Digital Object Bit Sequence 1+ 1+ 1+ OAIS Information Package Layered Model Interaction Layer Preservation Layer SaaS Layer PaaS Layer Preservation Description Information Information Package
Where does Preservation Metadata come from? Business System Metadata :  Generated at the time of document creation or records export.  Registry Information :   Pre-provided (semi-static) information about registered Entities and Information Types Event Related Information :  Information describing changes to Digital Objects and metadata taking place during the preservation process.
PaaS Layer SaaS Layer Preservation Layer Interaction Layer Digital Object Type & Metadata Bitstream Storage & API Information Package Layered Model Applications, Information and Provided Services Archive System Package Creator Business Software Storage/ Hosting Platform Application Service Preservation Information Information Package Digital Object Bit-stream Information Type
Case Study: Japanese Government Problems with system incompatibility and insufficient record management has led to a new  Archives Policy  and a new  IT Strategy One part is a cloud computing project: The Kasumigaseki Cloud ( 霞が関クラウド ). This is still in the early stages of planning. We focus on three archiving problem areas to see how these could be resolved using our model.
Platform Platform Platform Record Historic Record Destruction Destruction Common Document Registration System Registration Transfer Plan Preservation Plan Retention Schedule Agency Records Mgmt. Agency National Archives Business System National Archive Current  Workflow Business System Business System Business System Business System Records Mgmt.  System
Problem Areas Lack of system integration : Individual government offices use different systems. Preparing records is a time consuming task. Lack of resources : The burden of transferring records to the National Archives lies with government agencies. The size of the NAJ makes it hard to provide assistance. Preservation : Lack of preservation of records in government agency systems.
Applying the model Assumption that the Kasumigaseki Cloud will offer both a  storage/hosting platform  (PaaS) and  software services  (SaaS) Added functionality  in Preservation Layer: Registration Harvesting Preservation Reporting
Archive System PaaS Layer Package Layer SaaS Layer ARM Layer User Facing Systems Transfer Transfer SaaS Business Systems  ->  Digital Objects Platform  ->  Bit-sequences Preservation Description Information Representation Information Package Information Package Desc. Functionality  ->  Registration, Harvesting, Conversion, Reporting RMS Agency Records Mgmt. Agency National Archives Business System Back-end Transfer Plan Preservation Plan Retention Schedule
Benefits and Limitations in Case Benefits :  Automatic package creation, simplifying records transfer. Early and consistent preservation metadata addition Allows keeping current workflow, but adds automation Limitations/Requirements :  Cloud platform must be truly trustworthy with no unexpected change or loss of service.  Need good export of content and metadata from SaaS business systems Providing semantic or community specific information
Concluding Remarks We believe our model has a number of advantages when developing a cloud archive framework: Builds on OAIS model concepts and information types. Adds trusted storage and preservation to early stages in the document lifecycle.  Simplifies archive system design by allowing organizations choose different levels of service. Current Status : Work on defining information classes and properties. Designing a test system using the model.
Thank you ! ありがとうございました ! University of Tsukuba, Japan
References ISO 15489-1:2001 - Information and documentation - Records management - Part 1: General. 2001. Requirements for Electronic Records Management Systems. 2002.  http://www.nationalarchives.gov.uk/documents/metadatafinal.pdf . Reference Model for an Open Archival Information System (OAIS) . Consultative Committee for Space Data Systems, 2002. Electronic Records Archives ERA Lifecycle. 2004. http://www.archives.gov/era/pdf/era-life-cycle.pdf. National Archives Law . National Archives of Japan, 2007. Outline of the National Archives. 2007. http://www.archives.go.jp/english/abouts/outline.html. Chan, T. Japan to build massive cloud infrastructure for e-government.  Green Telecom . http://www.greentelecomlive.com/2009/05/13/japan-to-build-massive-cloud-infrastructure-for-e-government/. Guenther, R. Understanding and Implementing the PREMIS Data Dictionary for Preservation  Metadata. 2009. http://www.digitalpreservation.gov/news/events/ndiipp_meetings/ndiipp09/docs/June26/premis-ndiipp-20090626.ppt. Koga, T. Recent development of the government information policy in Japan.  International Federation of Library Associations and Institutions, Government Information and Official Publications Section (GIOPS) Newsletter, 8 , (2010), 8-11. Kulovits, H., Becker, C., and Kraxner, M. Plato: A Preservation Planning Tool Integrating Preservation Action Services.  5173/2008 , (2008), 413-414. Okamoto, S. New Developments in Managing Records in Japan - The Establishment, Direction and Structure of the Archive Law. 2010. Sugimoto, S. Ensuring the Preservation and Use of Electronic Records. (2007). Vaquero, L.M., Rodero-Merino, L., and Caceres, J. A Break  in the Clouds: Towards a Cloud Definition.  ACM SIGCOMM Computer Communication Review 39 , 1 (2009),  50-55. Youseff, L., Butrico, M., and DaSilva, D. Toward a Unified Ontology of Cloud Computing.  Grid Computing Environments Workshop , (2008), 1-10.

More Related Content

PDF
Review and Classification of Cloud Computing Research
iosrjce
 
PDF
[IJET-V1I1P3] Author :R.Rajkamal, P.Rajenderan
IJET - International Journal of Engineering and Techniques
 
PDF
Towards a Resource Slice Interoperability Hub for IoT
Hong-Linh Truong
 
PPT
Data as a service
Zoltan Nagy
 
PDF
A Literature Survey on Resource Management Techniques, Issues and Challenges ...
TELKOMNIKA JOURNAL
 
PDF
An efficient resource sharing technique for multi-tenant databases
IJECEIAES
 
PDF
An Enhanced Cloud Backed Frugal File System
IRJET Journal
 
PPTX
Open Cloud Consortium: An Update (04-23-10, v9)
Robert Grossman
 
Review and Classification of Cloud Computing Research
iosrjce
 
[IJET-V1I1P3] Author :R.Rajkamal, P.Rajenderan
IJET - International Journal of Engineering and Techniques
 
Towards a Resource Slice Interoperability Hub for IoT
Hong-Linh Truong
 
Data as a service
Zoltan Nagy
 
A Literature Survey on Resource Management Techniques, Issues and Challenges ...
TELKOMNIKA JOURNAL
 
An efficient resource sharing technique for multi-tenant databases
IJECEIAES
 
An Enhanced Cloud Backed Frugal File System
IRJET Journal
 
Open Cloud Consortium: An Update (04-23-10, v9)
Robert Grossman
 

What's hot (15)

PPTX
The Extreme Data Cloud (XDC) Project
EUDAT
 
PDF
Data Mesh in Practice - How Europe's Leading Online Platform for Fashion Goes...
Dr. Arif Wider
 
PDF
InterSystems IRIS Data Platform : Machine learning on the way
Robert Bira
 
PPT
Cloud computing
Nibi Maouriyan
 
PDF
2018 19 Cloudcomputing
Rajesh Math
 
PPSX
Meeting today’s dissemination challenges – Implementing International Standar...
Jonathan Challener
 
PDF
Unlock Your Data for ML & AI using Data Virtualization
Denodo
 
PDF
The Curse of the Data Lake Monster
Thoughtworks
 
PPTX
Mobile Offline First for inclusive data that spans the data divide
Rob Worthington
 
PPT
Enterprise Information Integration
Sharbani Bhattacharya
 
PPTX
cloud computing
Mukhid Khan LashKari
 
PDF
E04432934
IOSR-JEN
 
PPT
Cloud presentation
Sachin Darekar
 
PPT
Data Federation/EII Uses And Abuses
mark madsen
 
PDF
Accelerating Time to Research Using CloudBank
Sanjay Padhi, Ph.D
 
The Extreme Data Cloud (XDC) Project
EUDAT
 
Data Mesh in Practice - How Europe's Leading Online Platform for Fashion Goes...
Dr. Arif Wider
 
InterSystems IRIS Data Platform : Machine learning on the way
Robert Bira
 
Cloud computing
Nibi Maouriyan
 
2018 19 Cloudcomputing
Rajesh Math
 
Meeting today’s dissemination challenges – Implementing International Standar...
Jonathan Challener
 
Unlock Your Data for ML & AI using Data Virtualization
Denodo
 
The Curse of the Data Lake Monster
Thoughtworks
 
Mobile Offline First for inclusive data that spans the data divide
Rob Worthington
 
Enterprise Information Integration
Sharbani Bhattacharya
 
cloud computing
Mukhid Khan LashKari
 
E04432934
IOSR-JEN
 
Cloud presentation
Sachin Darekar
 
Data Federation/EII Uses And Abuses
mark madsen
 
Accelerating Time to Research Using CloudBank
Sanjay Padhi, Ph.D
 
Ad

Viewers also liked (17)

PPTX
Digital Archiving with Fishbowl Solutions
Billy Cripe
 
PPT
Ssm Appliance Ssm Demo
jerrycarleo
 
PDF
Rethinking Document Management eBook
Kimberly Jones
 
PPT
Dspace Webinar
Gavin Henrick
 
PPTX
Paper and Digital Filing Systems
Amy Geils
 
PDF
Itas profile
Dhanasekar Remo
 
PPT
How Document Management Solutions Benefit Government Agencies
osaminc
 
PPTX
Rivonia Trial Dictabelt Project, Save Your Archive, Gerrit Wagener, Brenda Ko...
FIAT/IFTA
 
PPTX
natural disaster project by mirza ibrahim from greenwich academy
199917
 
PPTX
Disaster response 101
haightv
 
PPT
Digital Archiving at the Meertens Institute
juntez
 
PPTX
“Resurrecting Lost Voices: DIY Digital Archiving” PowerPoint Presentation
Stan Prager
 
PPT
An archivist's view on preserving archaeological data in Flanders (Inge Roosens)
Onroerend Erfgoed
 
PPT
Library Disasters
hoganedix
 
PPTX
Front cover image process
AmanpreetBhopal
 
PDF
Archiveslegalsolutions plaquette a a 2012 1
archiveslegalsolutions
 
PDF
The prevention of conflict damage to archive and library materials
Alessandro Sidoti
 
Digital Archiving with Fishbowl Solutions
Billy Cripe
 
Ssm Appliance Ssm Demo
jerrycarleo
 
Rethinking Document Management eBook
Kimberly Jones
 
Dspace Webinar
Gavin Henrick
 
Paper and Digital Filing Systems
Amy Geils
 
Itas profile
Dhanasekar Remo
 
How Document Management Solutions Benefit Government Agencies
osaminc
 
Rivonia Trial Dictabelt Project, Save Your Archive, Gerrit Wagener, Brenda Ko...
FIAT/IFTA
 
natural disaster project by mirza ibrahim from greenwich academy
199917
 
Disaster response 101
haightv
 
Digital Archiving at the Meertens Institute
juntez
 
“Resurrecting Lost Voices: DIY Digital Archiving” PowerPoint Presentation
Stan Prager
 
An archivist's view on preserving archaeological data in Flanders (Inge Roosens)
Onroerend Erfgoed
 
Library Disasters
hoganedix
 
Front cover image process
AmanpreetBhopal
 
Archiveslegalsolutions plaquette a a 2012 1
archiveslegalsolutions
 
The prevention of conflict damage to archive and library materials
Alessandro Sidoti
 
Ad

Similar to Archiving as a Service - A Model for the Provision of Shared Archiving Services Using Cloud Computing (20)

PPT
Intro To Cloud Computing
prakashjjaya
 
PDF
Introduction Big Data
Frank Kienle
 
PDF
Data Mesh Part 4 Monolith to Mesh
Jeffrey T. Pollock
 
PPTX
Research data management 1.5
John Martin
 
PPT
GSA on Cloud Computing and More
guest163bca0
 
PDF
BFC: High-Performance Distributed Big-File Cloud Storage Based On Key-Value S...
dbpublications
 
PPT
Technology Overview
Liran Zelkha
 
PDF
Privacy preserving public auditing for secured cloud storage
dbpublications
 
PPT
Bob Plumridge - Enabling easier Cloud Solution Deployment (Storage Expo 2010)
VNU Exhibitions Europe
 
PPTX
Big Data Session 1.pptx
ElsonPaul2
 
PPTX
Cloud Computing_ICT Concepts & Trends.pptx
ssuser6063b0
 
PDF
Archonnex at ICPSR
Harshakumar Ummerpillai
 
PDF
ArtigofinalpublicadoASTESJ_060139.pdf
MeftahMehdawi
 
PDF
Enterprise Data Lakes
Farid Gurbanov
 
PDF
Webinar Data Mesh - Part 3
Jeffrey T. Pollock
 
PDF
Cloud Storage System like Dropbox
IRJET Journal
 
PDF
Maginatics @ SDC 2013: Architecting An Enterprise Storage Platform Using Obje...
Maginatics
 
PDF
A Strategy for Improving the Performance of Small Files in Openstack Swift
Editor IJCATR
 
PDF
critical_capabilities_for_ob_271719 copy
Chris Woeppel
 
Intro To Cloud Computing
prakashjjaya
 
Introduction Big Data
Frank Kienle
 
Data Mesh Part 4 Monolith to Mesh
Jeffrey T. Pollock
 
Research data management 1.5
John Martin
 
GSA on Cloud Computing and More
guest163bca0
 
BFC: High-Performance Distributed Big-File Cloud Storage Based On Key-Value S...
dbpublications
 
Technology Overview
Liran Zelkha
 
Privacy preserving public auditing for secured cloud storage
dbpublications
 
Bob Plumridge - Enabling easier Cloud Solution Deployment (Storage Expo 2010)
VNU Exhibitions Europe
 
Big Data Session 1.pptx
ElsonPaul2
 
Cloud Computing_ICT Concepts & Trends.pptx
ssuser6063b0
 
Archonnex at ICPSR
Harshakumar Ummerpillai
 
ArtigofinalpublicadoASTESJ_060139.pdf
MeftahMehdawi
 
Enterprise Data Lakes
Farid Gurbanov
 
Webinar Data Mesh - Part 3
Jeffrey T. Pollock
 
Cloud Storage System like Dropbox
IRJET Journal
 
Maginatics @ SDC 2013: Architecting An Enterprise Storage Platform Using Obje...
Maginatics
 
A Strategy for Improving the Performance of Small Files in Openstack Swift
Editor IJCATR
 
critical_capabilities_for_ob_271719 copy
Chris Woeppel
 

Recently uploaded (20)

PDF
Health-The-Ultimate-Treasure (1).pdf/8th class science curiosity /samyans edu...
Sandeep Swamy
 
PDF
RA 12028_ARAL_Orientation_Day-2-Sessions_v2.pdf
Seven De Los Reyes
 
PPTX
Five Point Someone – Chetan Bhagat | Book Summary & Analysis by Bhupesh Kushwaha
Bhupesh Kushwaha
 
PPTX
Measures_of_location_-_Averages_and__percentiles_by_DR SURYA K.pptx
Surya Ganesh
 
PPTX
An introduction to Prepositions for beginners.pptx
drsiddhantnagine
 
PPTX
Care of patients with elImination deviation.pptx
AneetaSharma15
 
PDF
Antianginal agents, Definition, Classification, MOA.pdf
Prerana Jadhav
 
PDF
What is CFA?? Complete Guide to the Chartered Financial Analyst Program
sp4989653
 
PPTX
CONCEPT OF CHILD CARE. pptx
AneetaSharma15
 
DOCX
Action Plan_ARAL PROGRAM_ STAND ALONE SHS.docx
Levenmartlacuna1
 
PPTX
TEF & EA Bsc Nursing 5th sem.....BBBpptx
AneetaSharma15
 
PPTX
HISTORY COLLECTION FOR PSYCHIATRIC PATIENTS.pptx
PoojaSen20
 
PDF
Review of Related Literature & Studies.pdf
Thelma Villaflores
 
PPTX
BASICS IN COMPUTER APPLICATIONS - UNIT I
suganthim28
 
DOCX
Unit 5: Speech-language and swallowing disorders
JELLA VISHNU DURGA PRASAD
 
PPTX
An introduction to Dialogue writing.pptx
drsiddhantnagine
 
PPTX
How to Close Subscription in Odoo 18 - Odoo Slides
Celine George
 
PPTX
Python-Application-in-Drug-Design by R D Jawarkar.pptx
Rahul Jawarkar
 
PPTX
Artificial Intelligence in Gastroentrology: Advancements and Future Presprec...
AyanHossain
 
PPTX
Applications of matrices In Real Life_20250724_091307_0000.pptx
gehlotkrish03
 
Health-The-Ultimate-Treasure (1).pdf/8th class science curiosity /samyans edu...
Sandeep Swamy
 
RA 12028_ARAL_Orientation_Day-2-Sessions_v2.pdf
Seven De Los Reyes
 
Five Point Someone – Chetan Bhagat | Book Summary & Analysis by Bhupesh Kushwaha
Bhupesh Kushwaha
 
Measures_of_location_-_Averages_and__percentiles_by_DR SURYA K.pptx
Surya Ganesh
 
An introduction to Prepositions for beginners.pptx
drsiddhantnagine
 
Care of patients with elImination deviation.pptx
AneetaSharma15
 
Antianginal agents, Definition, Classification, MOA.pdf
Prerana Jadhav
 
What is CFA?? Complete Guide to the Chartered Financial Analyst Program
sp4989653
 
CONCEPT OF CHILD CARE. pptx
AneetaSharma15
 
Action Plan_ARAL PROGRAM_ STAND ALONE SHS.docx
Levenmartlacuna1
 
TEF & EA Bsc Nursing 5th sem.....BBBpptx
AneetaSharma15
 
HISTORY COLLECTION FOR PSYCHIATRIC PATIENTS.pptx
PoojaSen20
 
Review of Related Literature & Studies.pdf
Thelma Villaflores
 
BASICS IN COMPUTER APPLICATIONS - UNIT I
suganthim28
 
Unit 5: Speech-language and swallowing disorders
JELLA VISHNU DURGA PRASAD
 
An introduction to Dialogue writing.pptx
drsiddhantnagine
 
How to Close Subscription in Odoo 18 - Odoo Slides
Celine George
 
Python-Application-in-Drug-Design by R D Jawarkar.pptx
Rahul Jawarkar
 
Artificial Intelligence in Gastroentrology: Advancements and Future Presprec...
AyanHossain
 
Applications of matrices In Real Life_20250724_091307_0000.pptx
gehlotkrish03
 

Archiving as a Service - A Model for the Provision of Shared Archiving Services Using Cloud Computing

  • 1. iConference 2011 Archiving as a Service - A Model for the Provision of Shared Archiving Services Using Cloud Computing Jan Askhoj – janaskhoej[at]gmail.com Shigeo Sugimoto – sugimoto[at]slis.tsukuba.ac.jp Mitsuharu Nagamori – nagamori[at]slis.tsukuba.ac.jp University of Tsukuba, Japan
  • 2. The Rise of Cloud Computing Big business: Reported that the cloud computing market will grow to more than $150 billion in 2013 Gartner listed cloud computing as one of the most hyped technologies in 2009. Many benefits: Reduced cost, increased storage, no software deployment, flexibility, mobility and allowing IT to shift focus. Cloud computing is being used increasingly for content creation and storage . * Global Industry Analysts, 2010
  • 3. A Cloud Definition (One of Many) Cloud Computing is an abstracted, scalable plat-form for service delivery. Cloud computing makes use of existing technologies that can be described via a layered model. Access to both platform and services is available via the internet . Availability, quality and number of services are offered according to agreements with a provider . - Vaquero et al. 2009
  • 4. Cloud Computing from an Archiving Perspective In the cloud, archives may not have knowledge of records creation hardware and software . How do we document such formats? Cloud Providers are good at managing data and hosting software. But what if something happens? There are providers of services for backup , but not for preservation . Can we find and read documents created and stored in the cloud in 10 years from now?
  • 5. I found the document... If only I knew how to access it!
  • 6. Object of Research Providing a reference model for cloud based archiving that makes possible: Offering trusted storage and long term preservation as a cloud based service. Automatically providing preservation metadata and information packages for transfer of digital records. Extending preservation to as early in the records lifecycle as possible.
  • 7. Current Archive Model: OAIS Reference Model for an Open Archival Information System (OAIS). Defines Entities, Relationships and Information Types in digital archives. Consultative Committee for Space Data Systems, 2002.
  • 8. OAIS and the Cloud The OAIS Model does not cover the use of a shared platform for storage , outside the control of an archive. Such functionality overlaps with several OAIS functional entities. An OAIS Archive does not cover the early stages of the document lifecycle . With a shared platform, digital objects can be immediately accessible to an archive for early preservation planning. In OAIS, Digital Objects and metadata are included in information packages . If Producer and Archive share a common platform, this is not necessary.
  • 9. Hardware/Facilities Connectivity Abstraction OS Virtualization Data Metadata Content Applications APIs Presentation (User facing) SaaS (Software as a Service). Users access applications via user-facing software or APIs. PaaS (Platform as a Service). Virtualized platform for executing applications and providing storage. IaaS (Infrastructure as a Service). Hardware and Infrastructure. A General Layered Model for Cloud Computing Services
  • 10. Some Characteristics of the Layered Model In a layered model, each layer offers defined services to the layers above. Services are abstracted and interchangeable. Benefits: - Makes it easy to offer and take advantage of defined levels of services. - Facilitates resource sharing - Facilitates migration
  • 11. Archive Digital Object Digital Object Business System Storage Layer Simple Layered Cloud Archiving System Interaction Layer Trusted repository (bit-level integrity)
  • 12. Expanding the Simple Model Storage does not equal preservation . Information is needed to support: “ Viability, Renderability, Understandability, Authenticity, and Identity of Digital Objects” (known in OAIS as an Information Package).
  • 13. Proposed Four Layer Model Interaction Layer : User facing Archives/ Records Management Systems and Business Systems. Preservation Layer : Adds preservation information. Turns Digital Objects into Information Packages for use by Archives/Records Management Systems. SaaS Layer : Applications represent bit-strings as Digital Objects used by systems and users. PaaS Layer : Application platform and trusted repository for storing bit-strings.
  • 14. Information Object Data Object Represent. Information Digital Object Bit Sequence 1+ 1+ 1+ OAIS Information Package Layered Model Interaction Layer Preservation Layer SaaS Layer PaaS Layer Preservation Description Information Information Package
  • 15. Where does Preservation Metadata come from? Business System Metadata : Generated at the time of document creation or records export. Registry Information : Pre-provided (semi-static) information about registered Entities and Information Types Event Related Information : Information describing changes to Digital Objects and metadata taking place during the preservation process.
  • 16. PaaS Layer SaaS Layer Preservation Layer Interaction Layer Digital Object Type & Metadata Bitstream Storage & API Information Package Layered Model Applications, Information and Provided Services Archive System Package Creator Business Software Storage/ Hosting Platform Application Service Preservation Information Information Package Digital Object Bit-stream Information Type
  • 17. Case Study: Japanese Government Problems with system incompatibility and insufficient record management has led to a new Archives Policy and a new IT Strategy One part is a cloud computing project: The Kasumigaseki Cloud ( 霞が関クラウド ). This is still in the early stages of planning. We focus on three archiving problem areas to see how these could be resolved using our model.
  • 18. Platform Platform Platform Record Historic Record Destruction Destruction Common Document Registration System Registration Transfer Plan Preservation Plan Retention Schedule Agency Records Mgmt. Agency National Archives Business System National Archive Current Workflow Business System Business System Business System Business System Records Mgmt. System
  • 19. Problem Areas Lack of system integration : Individual government offices use different systems. Preparing records is a time consuming task. Lack of resources : The burden of transferring records to the National Archives lies with government agencies. The size of the NAJ makes it hard to provide assistance. Preservation : Lack of preservation of records in government agency systems.
  • 20. Applying the model Assumption that the Kasumigaseki Cloud will offer both a storage/hosting platform (PaaS) and software services (SaaS) Added functionality in Preservation Layer: Registration Harvesting Preservation Reporting
  • 21. Archive System PaaS Layer Package Layer SaaS Layer ARM Layer User Facing Systems Transfer Transfer SaaS Business Systems -> Digital Objects Platform -> Bit-sequences Preservation Description Information Representation Information Package Information Package Desc. Functionality -> Registration, Harvesting, Conversion, Reporting RMS Agency Records Mgmt. Agency National Archives Business System Back-end Transfer Plan Preservation Plan Retention Schedule
  • 22. Benefits and Limitations in Case Benefits : Automatic package creation, simplifying records transfer. Early and consistent preservation metadata addition Allows keeping current workflow, but adds automation Limitations/Requirements : Cloud platform must be truly trustworthy with no unexpected change or loss of service. Need good export of content and metadata from SaaS business systems Providing semantic or community specific information
  • 23. Concluding Remarks We believe our model has a number of advantages when developing a cloud archive framework: Builds on OAIS model concepts and information types. Adds trusted storage and preservation to early stages in the document lifecycle. Simplifies archive system design by allowing organizations choose different levels of service. Current Status : Work on defining information classes and properties. Designing a test system using the model.
  • 24. Thank you ! ありがとうございました ! University of Tsukuba, Japan
  • 25. References ISO 15489-1:2001 - Information and documentation - Records management - Part 1: General. 2001. Requirements for Electronic Records Management Systems. 2002. http://www.nationalarchives.gov.uk/documents/metadatafinal.pdf . Reference Model for an Open Archival Information System (OAIS) . Consultative Committee for Space Data Systems, 2002. Electronic Records Archives ERA Lifecycle. 2004. http://www.archives.gov/era/pdf/era-life-cycle.pdf. National Archives Law . National Archives of Japan, 2007. Outline of the National Archives. 2007. http://www.archives.go.jp/english/abouts/outline.html. Chan, T. Japan to build massive cloud infrastructure for e-government. Green Telecom . http://www.greentelecomlive.com/2009/05/13/japan-to-build-massive-cloud-infrastructure-for-e-government/. Guenther, R. Understanding and Implementing the PREMIS Data Dictionary for Preservation Metadata. 2009. http://www.digitalpreservation.gov/news/events/ndiipp_meetings/ndiipp09/docs/June26/premis-ndiipp-20090626.ppt. Koga, T. Recent development of the government information policy in Japan. International Federation of Library Associations and Institutions, Government Information and Official Publications Section (GIOPS) Newsletter, 8 , (2010), 8-11. Kulovits, H., Becker, C., and Kraxner, M. Plato: A Preservation Planning Tool Integrating Preservation Action Services. 5173/2008 , (2008), 413-414. Okamoto, S. New Developments in Managing Records in Japan - The Establishment, Direction and Structure of the Archive Law. 2010. Sugimoto, S. Ensuring the Preservation and Use of Electronic Records. (2007). Vaquero, L.M., Rodero-Merino, L., and Caceres, J. A Break in the Clouds: Towards a Cloud Definition. ACM SIGCOMM Computer Communication Review 39 , 1 (2009), 50-55. Youseff, L., Butrico, M., and DaSilva, D. Toward a Unified Ontology of Cloud Computing. Grid Computing Environments Workshop , (2008), 1-10.

Editor's Notes

  • #2: Greet Japan Late in day for talk on archiving – get right started
  • #3: Hot topic and here to stay What is cloud computing
  • #4: Software hosting - virtualization Abstracted – complexity reduced Pay for amount of service, trust service provider to deliver General definition – when it comes to a specific area such as archiving
  • #5: What we are afraid of…
  • #7: To what extent is this possible with existing models?
  • #8: Chosen because it is the de facto standard for archival systems
  • #9: Incompatabilities with cloud computing -Data management, archival storage -starts with SIP. Share platform, shouldn&t be necessary to wait for SIP -references to digital objects The oais model - What does cloud model look like their descriptive information and administrative data is handled by Data Management
  • #10: Layered, SaaS, PaaS, IaaS… HaaS in the case of crowdsourcing Bottom to top
  • #11: 2- With a defined API and classes and properties, possible to exchange one service for another as long as they support 3- Sharing, different programs sharing and taking advantage of similar services in the same layer What would a layered cloud archive system look like? particular set of rules and specifications that a software program can follow to access and make use of the services and resources provided by another particular software program that implements that API
  • #12: Simple 2 layered model. 2 systems taking using a shared cloud repository as a storage backend All well and good, but to offer preservation
  • #13: So to provide this information, we have expanded on the simple model
  • #14: More detail about information provided by each layer
  • #15: place different OAIS Information types in the layers just described. Preservation Layer, lot of different information types needed to generate information package. Where from
  • #16: Putting it all together.
  • #17: Moving on from theory to practice – Application of the model
  • #20: Number of problems raised