INVENIO: A Practical Solution for e‐Government Document Management
Flavio Costa, Jean‐Yves Le Meur and Tim Smith
IT Department, CERN, Geneva, Switzerland
Flavio.Costa@cern.ch
Jean-Yves.Le.Meur@cern.ch
Tim.Smith@cern.ch
Abstract: Invenio is an open source software package which provides tools to manage digital assets in large‐scale, web‐driven document repositories. Its scope covers all aspects of document management, from document ingestion through classification, indexing, and curation to dissemination. Invenio complies with interoperability standards that aim to facilitate the efficient dissemination of content, and it uses a flexible, library‐standard metadata format. Invenio's capabilities make it an ideal solution for repositories of medium to large size (several million records). After introducing Invenio, the paper describes a practical approach to implementing solutions for diverse aspects of e‐Government on top of Invenio. Examples of e‐services include the use of Invenio for blog archiving, e‐procurement processes at CERN, document stores of administrative and social interest, and electronic bulletins. This paper will be useful to any Government body wishing to enhance its relationships with communities, whether outside or inside (citizens, businesses, other Government bodies, as well as employees), based on electronic documents of all types.
Invenio was originally developed at CERN in 2002 to run the CERN Document Server, which today manages around 1.5M records in high‐energy physics. Invenio is now being co‐developed by an international collaboration comprising institutes such as CERN, Deutsches Elektronen‐Synchrotron (DESY), École Polytechnique Fédérale de Lausanne (EPFL), Fermi National Accelerator Laboratory (Fermilab) and Stanford Linear Accelerator Center (SLAC), and is being used by over forty non‐profit scientific and non‐scientific institutions worldwide. Invenio serves a wide variety of electronic documents (including articles, books, journals, photos) and multimedia content such as pictures, presentations, talks, posters, plots, audio podcasts and videos, and is freely available for download. Because of its software architecture and its wide coverage of document types, Invenio is flexible and well suited to very diverse uses.
Keywords: e‐government, document repository, e‐procurement, e‐services, blog, interoperability
1. Introduction to Invenio
Invenio (http://invenio-software.org) is an Open Source software suite of modules enabling you to run your own document repository or digital library on the web. The technology offered by the software covers all aspects of document management, from ingestion through classification, indexing, and curation to dissemination. The flexibility and performance of Invenio make it a comprehensive solution for the management of document repositories of moderate to large size (several million records). Invenio complies with standards such as the Open Archives Initiative (OAI, http://www.openarchives.org/) metadata harvesting protocol (OAI‐PMH) and uses MARC 21 (MARC 21 2012) as its underlying bibliographic format.
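To illustrate what OAI‐PMH compliance means for a harvester, the following minimal Python sketch builds a ListRecords request and extracts record identifiers from a response. The verb, the parameter names and the XML namespace come from the OAI‐PMH 2.0 specification; the base URL, the sample response and the function names are assumptions made for this example and do not describe any particular Invenio installation. The mandatory oai_dc prefix is used here; whether a MARC‐based prefix is offered, and under which name, depends on the repository.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# Hypothetical endpoint: a real repository publishes its own OAI-PMH base URL.
BASE_URL = "https://repository.example.org/oai"

# XML namespace defined by the OAI-PMH 2.0 specification.
OAI_NS = {"oai": "http://www.openarchives.org/OAI/2.0/"}


def list_records_url(base_url, metadata_prefix="oai_dc", from_date=None, set_spec=None):
    """Build an OAI-PMH ListRecords request URL (selective harvesting is optional)."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if from_date:
        params["from"] = from_date  # harvest only records changed since this date
    if set_spec:
        params["set"] = set_spec    # restrict the harvest to one set
    return base_url + "?" + urlencode(params)


def record_identifiers(response_xml):
    """Return the OAI identifiers found in a ListRecords response document."""
    root = ET.fromstring(response_xml)
    path = ".//oai:record/oai:header/oai:identifier"
    return [element.text for element in root.findall(path, OAI_NS)]


# Invented, heavily shortened response used only to exercise the parser.
SAMPLE_RESPONSE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record><header><identifier>oai:repository.example.org:1</identifier></header></record>
    <record><header><identifier>oai:repository.example.org:2</identifier></header></record>
  </ListRecords>
</OAI-PMH>"""

if __name__ == "__main__":
    print(list_records_url(BASE_URL, from_date="2012-01-01"))
    print(record_identifiers(SAMPLE_RESPONSE))
```

The sketch performs no network access. A real harvester would fetch the URL, follow the resumptionToken that the protocol uses for paging, and handle OAI‐PMH error responses.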
1.1 Overview
Invenio has shown its value in domains as different as e‐procurement and web journal publishing, and is expanding into blog preservation and archival. It has typical web 2.0 features that support a collaborative approach to document management, allowing users to customize and share their Invenio experience.
Invenio's features make it a comprehensive digital repository and archiving solution for both the user and the administrator. The powerful search interface enables users to find records according to their own criteria, using multiple fields, boolean queries and regular expressions. Records are organized in collections of various levels, grouped by type or by other common fields, making it easy for users to look for similar records based on their interests. Social tools and features also let users personalize their experience: joining user groups, commenting on or reviewing records, creating personal sets of documents (baskets) and sharing them within groups of users, setting alerts based on their interests, exporting RSS feeds based on their search queries, etc.
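The combination of multi‐field search, boolean operators and regular expressions can be pictured with a small in‐memory sketch in Python, in which a collection is simply a named, stored query. This is an illustration only: the records, field names, helper functions and collection names are invented for the example, and the code reproduces neither Invenio's query syntax nor its implementation.

```python
import re

# Invented sample records; the field names are assumptions for this sketch.
RECORDS = [
    {"id": 1, "type": "article", "title": "Higgs boson searches", "year": 2011},
    {"id": 2, "type": "photo", "title": "LHC tunnel", "year": 2008},
    {"id": 3, "type": "article", "title": "Detector calibration notes", "year": 2009},
    {"id": 4, "type": "thesis", "title": "Higgs decay channels", "year": 2012},
]


def field_matches(field, pattern):
    """Query term: the given field matches a regular expression (case-insensitive)."""
    regex = re.compile(pattern, re.IGNORECASE)
    return lambda record: bool(regex.search(str(record.get(field, ""))))


def all_of(*terms):
    """Boolean AND of several query terms."""
    return lambda record: all(term(record) for term in terms)


def any_of(*terms):
    """Boolean OR of several query terms."""
    return lambda record: any(term(record) for term in terms)


def negate(term):
    """Boolean NOT of a query term."""
    return lambda record: not term(record)


# A collection is modelled here as a named, stored query over the records.
COLLECTIONS = {
    "Articles": field_matches("type", r"^article$"),
    "Higgs documents": all_of(
        field_matches("title", r"\bhiggs\b"),
        negate(field_matches("type", r"^photo$")),
    ),
    "Articles and theses": any_of(
        field_matches("type", r"^article$"),
        field_matches("type", r"^thesis$"),
    ),
}


def collection_ids(name, records=RECORDS):
    """Return the ids of the records that belong to the named collection."""
    query = COLLECTIONS[name]
    return [record["id"] for record in records if query(record)]


if __name__ == "__main__":
    for name in COLLECTIONS:
        print(name, collection_ids(name))
```

Because membership is computed from the query rather than stored per record, a newly ingested record would appear in every collection whose query it satisfies without any manual filing. A production system would evaluate such queries against indexes rather than by scanning all records.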
The system can handle not only articles and books, but also photos, videos, theses, etc. Records maintained by Invenio can be organized in collections that can be defined on top of any query. Users are offered either simple