Accepted Presentations for Plenary Sessions
Session 1: Management Strategy and Policy
Moderator: Mark McFarland
-
The ARROW Project at 3 years: Looking Backwards, Aiming Forwards
Andrew TreloarSince 2003 Arrow has been funded by the Australian Commonwealth Department of Education, Science and Training to identify and test solutions for best institutional repository practices. Andrew Treloar, Monash University, will offer an analysis of how their objectives have evolved, views on repository technology then and now, software development issues, and implementation decisions culled from three years of practice using Fedora.
URL: arrow.edu.au
PDF: treloar.pdf -
How the Principles and Activities of Digital Curation Guide Repository Management and Operations
Leslie JohnstonLeslie Johnson, University of Virginia Library, will share four overarching principles of digital curation that have been successful in making it easier to build trusted discovery and delivery services and tools for the use of digital objects. Principles for Selection, Principles for the Use of Standards, Principles for Trustworthiness, and Principles for Preservation and Sustainability are local principles that have provided a model for the creation of collection development policies, the identification of service goals for a repository and related policies and activities.
URL: www.lib.virginia.edu/digital/
PDF: johnston.pdf -
CURATOR: its developmental strategy
Atsuko Takano, Kazuko Takagi, Sachiyo Arai, Hiroya Takeuchi and Syun TutiyaHow do you enable indexing of Japanese character strings for searching? This presentation describes practical and strategic approaches adopted by Japan's first institutional repository launched by a university library: Chiba University's Repository for Access to Outcome from Research (CURATOR).
URL: mitizane.ll.chiba-u.jp/curator/index_e.html
PDF: takano.pdf
Session 2: Preservation
Moderator: Grace Agnew
-
Policy Frameworks for Institutional Repositories
MacKenzie Smith and Reagan MooreAs repositories begin to federate and interoperate at a large scale, the inability to express local policies as part of the context of the digital collections becomes more problematic. MacKenzie Smith, MIT and Reagan Moore, SDSC, will report on work by the MIT Libraries and the University of California, San Diego Supercomputer Center on the PLEDGE project (PoLicy Enforcement in Data Grid Environments). The project is funded by the US National Archives and Records Administration.
PDF: smith.pdf -
Using OAI-PMH Resource Harvesting and MPEG-21 DIDL for Digital Preservation
Joan Smith and Michael NelsonTo successfully preserve a web site, its resources must be crawled and the structure and relationships among the resources must be maintained. Joan Smith and Michael Nelson, Old Dominion University, propose involving the web server in the preservation process through “mod_oai”, an Apache module to harvest a web site packaged with its associated metadata thereby contributing to its long-term preservation.
URL: www.modoai.org -
CRiB: Preservation Services for Digital Repositories
Miguel Ferreira, Ana Alice Baptista and Jose Carlos RamalhoThe active lifespan of digital materials is much longer than the lifetime of individual storage media, hardware and software components, as well as the formats in which the information is encoded. As hardware and software become obsolete, digital materials become prisoners of their own encodings. Miguel Ferreira, Ana Alice Baptista, and Jose Carlos Ramalho from the University of Minho, Portugal will present the CRiB recommendation service that is designed to help institutions determine optimal migration strategies within a range of choices to preserve authentic materials.
URL: crib.dsi.uminho.pt/
PDF: ferreira.pdf
Session 3: User Services and Workflow
Moderator: Sayeed Choudhury
-
Making Fedora easier to implement with Fez - A free open source content model and workflow management front-end to Fedora
Christiaan Kortekaas, Andrew Bennett and Keith WebsterThe University of Queensland, Australia has developed Fez, a world-leading user-interface and management system for Fedora-based institutional repositories, which bridges the gap between a repository and users. Christiaan Kortekaas, Andrew Bennett and Keith Webster will review this open source software that gives institutions the power to create a comprehensive repository solution without the hassle.
URL: sourceforge.net/projects/fez/
PDF: ferreira.pdf -
Real-time duplicate and plagiarism detection
Simeon WarnerWhile electronic access to documents provides unprecedented opportunity for plagiarism, it also provides an unprecedented opportunity to automate the detection of plagiarism. Simeon Warner, Cornell University, will describe the implementation and the underlying algorithm of a service to compare the full-text of each new submission against all existing submissions in real-time used in managing the arXiv.org repository. ArXiv contains over 390,000 articles, and will grow by more than 10% in the next year.
URL: arxiv.org/ -
An ethnographic study of institutional repository librarians: their experiences of usability
Sally Jo Cunningham, Dave Nichols, Dana McKay and David BainbridgeThe usability of current repository software and its tools is largely unknown when it comes to understanding whether they are adequate and appropriate for the tasks performed by repository managers. Sally Jo Cunningham, Dave Nichols, Dana McKay and David Bainbridge from the University of Waikato, New Zealand, will share their observations based on their ethnographic study of local librarians who support the inclusion of new material in institutional repositories.
URL: nzdl.sadl.uleth.ca/cgi-bin/library
PDF: cunningham.pdf
Session 4: Semantic Web and Web 2.0
Moderator: Sandy Payette
-
Realizing the role of digital repositories in educational applications: Supporting content and context
Huda Khan and Keith MaullDLESE Teaching Boxes are customizable, digital replicas of the traditional collections that most educators create, store (in boxes), re-use and improve on during their years of teaching. Huda Khan and Keith Maull from DLESE: Digital Library for Earth System Education, will review development of the Teaching Box Builder application and discuss questions raised with respect to repository integration with real-time Web 2.0 technologies as well as how this application design provides support for educators’ creation and adaptation of pedagogical content and context.
URL: teachingboxes.org/
PDF: khan.pdf -
Cross-Repository Semantic Interoperability: the MIT SIMILE Project
Richard Rodgers and MacKenzie SmithMany questions are raised as previously unreachable digital content is found in and among new repositories--is each repository an island or a separately searchable resource? SIMILE (Semantic Interoperability of Metadata and Information in Unlike Environments) has developed an extensive 'tool chain' for gathering and manipulating data assets. Richard Rodgers and MacKenzie Smith, MIT, will demonstrate how tools developed by the SIMILE project can be used as powerful instruments for the federation, discovery, exploration, and curation of metadata.
URL: simile.mit.edu/
PDF: rodgers.pdf -
The BibApp: Enabling rapid repository population
Eric LarsonThe University of Wisconsin-Madison Libraries recently launched the Office of Scholarly Communication and Publishing (OSCP) and uses BibApp to consolidate campus directory information with citation data gathered by librarians, departments and research centers into a single online interface. Eric Larson will describe how BibApp alerts OSCP to content that may be suitable for fast “mashup” repository ingest. OSCP has prepared 1,200+ papers for ingest using BibApp.
URL: oscp.library.wisc.edu/response.html#libraries
PDF: larson.pdf
Session 5: Interoperability
Moderator: Carl Lagoze
-
The OAI Object Re-Use & Exchange (ORE) Initiative
Carl Lagoze and Herbert Van de SompelThere are numerous examples of the need to re-use objects across repositories in scholarly communication. Carl Lagoze, Cornell University and Herbert Van de Sompel, Los Alamos National Laboratory, will discuss the ORE (Object Re-Use and Exchange) Initiative that seeks to implement an interoperable fabric consisting of service interfaces shared across repositories, and some shared infrastructure. Repository federation efforts such as aDORe, CORDRA, the Chinese DSpace Federation, DARE, and Pathways (NSF IIS-0430906) suggest that such object re-use is achievable and will create the building blocks of a global scholarly communication federation in which each individual digital object will fuel a variety of applications.
URL: www.openarchives.org/ore/
PDF: lagoze.pdf -
Repository Deposit Service Description
Rachel Heery, Julie Allinson, Jim Downing, Christopher Gutteridge and Martin MorreyRachel Heery, Julie Allinson, Jim Downing, Christopher Gutteridge and Martin Morrey, UKOLN, University of Bath, will update attendees on a three-year UK program that is developing repository infrastructure aimed at increasing open access to scholarly material, while improving management of assets in higher education institutions. This effort is designed to ensure that the emerging network of JISC (Joint Information Services Committee) Digital Repositories is well populated with content. They will present their work towards defining a lightweight Common Repository Deposit Service Description.
URL: www.jisc.ac.uk/whatwedo/programmes/programme_rep_pres.aspx
PDF: heery.pdf -
An analysis of Digital Repository Scenarios, Use Cases and Workflows
Mahendra Mahey, Rachel Heery, Julie Allinson and Robert John RobertsonThis presentation will set out the preliminary results of a study for a cross-section of the diverse repository developments ongoing in the United Kingdom. To date, over 80 scenarios and 20 use cases have been collected covering contexts such as: delineating the community dimensions of learning object repositories, depositing geospatial data, storing versions of content in a repository, developing metadata workflow in a laboratory repository holding research data, and adding digital rights information. The authors will present the methodology developed to collect, compare and analyze scenarios, use cases and workflows for the identification of common functional internal components and interactions with external services in the information landscape.
SPARC Presentation
-
Legislative Update: The Growing Call for Public Access
Heather Joseph, SPARC DirectorAs the scholarly community examines mechanisms for expanding access, use and redistribution of research outputs, the concept of Open Access has played a central role in helping to define key paths forward. Two major paths towards Open Access have been identified: the first, Open Access journal publishing, focuses on the creation of alternatives to the traditional “user pays” model of supporting peer-reviewed journal publications. The second, Open Access repositories, focuses on the creation of freely accessible digital databases, populated by the output of individual researchers, and organized around either disciplinary or geographical (institutional) constraints.
As the concept of freely, or publicly, accessible digital repositories has taken root in the scholarly community, policy makers, particularly those responsible for providing funding for large-scale scientific research, have begun to explore the possibilities that these databases hold in advancing the conduct of research, providing stable, long term archives for research outputs, and maximizing access and use of research results. Government-wide policies geared towards ensuring that value of publicly-funded is maximized have begun to emerge worldwide. These emerging policies share key characteristics and goals, and this paper will explore the both roots of these policies as well as their current status.
Session 6: e-Science and e-Scholarship
Moderator: Andrew Treloar
-
The Eprints Application Profile: a FRBR approach to modelling repository metadata
Julie Allinson, Pete Johnston and Andy PowellJulie Allinson, Pete Johnston and Andy Powell, UKOLN, University of Bath, present recent work on developing a Dublin Core Application Profile (DCAP) for describing "scholarly publications" (eprints). They will explain why the Dublin Core Abstract Model is well suited to creating descriptions based on entity-relational models such as the FRBR-based (Functional Requirements for Bibliographic Records) Eprints data model. The ePrints DCAP highlights the relational nature of the model underpinning Dublin Core and illustrates that the Dublin Core Abstract Model can support the representation of complex data describing multiple entities and their relationships.
PDF: allinson.pdf -
eSciDoc: a Scholarly Information and Communication Platform for the Max Planck Society
Matthias RazumDigital libraries have become tools for everyday work. But are they ready for e-Scholarship? Scholarship produces additional types of information that are not curated by traditional libraries such as primary data, simulations, informal results, and annotations. Matthias Razum, FIZ Karlsruhe, will discuss eSciDoc, a joint project of the Max Planck Society and FIZ Karlsruhe that will create a next-generation platform for communication and publication in research organizations.
URL: www.escidoc-project.de/homepage.html
PDF: razum.pdf -
ChemXSeer: A Chemistry Web Portal for Scientific Literature and Datasets
Levent Bolelli, Xiaonan Lu, Ying Liu, Anuj Jaiswal, Kun Bai, Isaac Councill, Prasenjit Mitra, James Z. Wang, Karl Mueller, James Kubicki, Barbara Garrison, Joel Bandstra and C. Lee GilesChemXSeer portal is designed to be a hub for research in chemistry by facilitating search and access to both scientific literature and experimental datasets, while bridging these information sources in a unified framework.The authors will present an overview of ChemXSeer, a portal for academic researchers in environmental chemistry that integrates scientific literature with experimental, analytical and simulation result datasets. The hybrid repository of ChemXSeer will be comprised of information crawled from the web, manual submissions of scientific documents, and user submitted datasets as well as scientific documents and metadata provided by major publishers.
PDF: bolelli.pdf






