ADT - Australasian Digital Thesis Program

Standards

OAI-PMH Harvesting Guidelines

The Australasian Digital Theses Program (ADT) uses the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) to regularly gather metadata from the institutional repositories of participating members. This document describes how members can conform to the export profile for ADT. Version 1 and version 2 of OAI-PMH are both supported. It also tells you how to harvest metadata from ADT.

ADT will process any well formed XML content in unqualified Dublin Core metadata constructed as defined by the Dublin Core Metadata Initiative.

The following information appears on this page:

  • Testing harvesting
  • Metadata elements used
  • Rules for all metadata elements
  • Use of specific elements
  • Example of Dublin Core in XML
  • OAI-PMH harvesting from ADT

Testing harvesting

ADT has successfully used OAI-PMH to harvest from EPrints, DSpace, ARROW and Digital Commons.

Send the following information to the ADT program to test harvesting from your repository:

  1. URL of your server
  2. SetSpec
  3. SetName

Examples:

URL: http://researchspace.auckland.ac.nz/dspace-oai/request
SetSpec: hdl_2292_2
SetName: PhD Theses

A ":" can be used to denote a hierarchy in the SetSpec, as specified in the Implementation Guidelines for the Open Archives Initiative Protocol for Metadata Harvesting. Otherwise, there are no rules governing the format of the SetSpec or SetName.

Metadata elements used

The following metadata elements are recommended for the ADT Program.

  • title
  • subject
  • description
  • date
  • language
  • publisher
  • rights and conditions of use
  • identifier

The following metadata elements are not used by the ADT Program.

  • contributor (an entity responsible for making contributions to the resource)
  • coverage (geographical and chronological aspects of the content of the work)
  • relation (a version of the work, a significant part of a work, a related work)
  • type

Rules for all metadata elements

  1. There is no limit on the length of an element as stored in ADT.
  2. All metadata should be provided in XML format.
  3. The elements may be provided in any order.
  4. ADT supports storage and display of diacritic characters, but metadata should be supplied in the UTF-8 character set.
  5. Do NOT include html or xml mark-up in the text.

Note: this document recommends the repeatability (number of occurrences) of an element as it applies to theses in the Australasian Digital Theses context.

Use of specific elements

Element

Use

title

Definition: formal name of the thesis as it appears on the thesis title page
Obligation: mandatory
Repeatable: no
Recommended use:

  • Full title including any subtitle

creator

Definition: author of the thesis
Obligation: mandatory
Repeatable: no
Recommended use:

  • Component parts will be in order of family name, given names and/or initials
  • Component parts of the name should be delimited by commas

subject

Definition: the topic of the thesis
Obligation: optional
Repeatable: yes
Recommended use:

  • All the keywords deemed appropriate by the author should be added.
  • All the keywords deemed appropriate by the Librarian should be added.
  • The appropriate qualifier must be supplied when the keywords are from a thesaurus, e.g. scheme="LCSH" and scheme=“ASRC”
  • Each keyword should be provided as a separate occurrence.

description

Definition: abstract supplied in the thesis
Obligation: optional
Repeatable: no
Recommended use:

  • no qualifiers expected

date

Definition: date that the thesis is declared to have completed all the requirements for Award
Obligation: mandatory
Repeatable: no
Recommended use:

  • in the form YYYY according to W3C-DTF7
  • no qualifiers expected

language

Definition: the language of the work
Obligation: optional
Repeatable: yes
Recommended use:

  • use language codes defined in RFC3066 and ISO639.2, e.g. en-aus

publisher

Definition: University name, School name
Obligation: mandatory
Repeatable: no
Recommended use:

  • component parts are in order of: university; faculty; department; school

rights & conditions of use

Definition: refers to both the standard university-wide disclaimer plus the author of the thesis
Obligation: optional
Repeatable: yes
Recommended use:

  • can be a URI that links to a copyright statement

identifier

Definition: the persistent identifier for the work that links to the public view of the thesis
Obligation: mandatory
Repeatable: no
Recommended use:

  • qualifiers should be used to enable hyper-linking from the identifier and title elements on the central ADT service.  Examples:

scheme= hdl
scheme=URI
scheme=DOI
scheme=ISBN

Example of Dublin Core in XML

<record>
<header>
<identifier>oai:repositoryaut.lconz.ac.nz:theses-1221</identifier>
<datestamp>2007-05-08</datestamp>
<setSpec>publication:theses</setSpec>
</header>
<metadata>
<oai_dc:dc
:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
:dc="http://purl.org/dc/elements/1.1/"
:xsi="http://www.w3.org/2001/XMLSchema-instance"
:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/
://www.openarchives.org/OAI/2.0/oai_dc.xsd">
<dc:title>Resident Perceptions of Event Impacts: Taupo and Ironman New Zealand</dc:title>
<dc:creator>Buch, Tina</dc:creator>
<dc:description>In New Zealand the development and promotion of sporting events is becoming increasingly popular, due in part to the role of sport in building the nation's identity, the economic benefits, and a growing awareness of the importance of health and fitness within community. Given this increased popularity it is important that the impacts of these upon the local 'host' communities be understood. Social impacts have been given and more attention in recent years by event researchers, due to the recognition that the term sustainability of such events can only be achieved with the approval and of residents. This research addresses these issues in the context of Taupo, New Zealand using the case of Ironman New Zealand. This thesis adopts a mixed method approach using interviews (n=7), participant observation, and a web-survey (n=111). The findings confirm that it is vital to investigate and understand impacts of events on host communities. The findings are also consistent with theories identified in the literature in terms of the nature of social impacts on host communities. The overall results indicate that residents are aware of both the positive and negative impacts of Ironman New Zealand. Respondents highlight positive impacts such as economic benefits, exposure of Taupo, community togetherness, and the encouragement of, yet they also recognize negative impacts such as inconvenience from traffic congestion, and road closures. Accordingly, the community was grouped into three distinct clusters with positive, negative and ambivalent perceptions, and were labelled 'Lovers', 'Pessimists' and 'Realists'. The findings further suggest that the use of web-surveying for community research is still in its infancy and needs further development enabling it to be as an effective tool. Finally it is proposed that small to medium scale sporting event hosted in regional communities may contribute to a sense of community and to the creation of social capital. Further research is needed to confirm this proposition.</dc:description>
<dc:date>2006</dc:date>
<dc:identifier>http://repositoryaut.lconz.ac.nz/theses/399</dc:identifier>
<dc:publisher>ScholarlyCommons@AUT</dc:publisher>
<dc:subject>Resident Perceptions</dc:subject>
<dc:subject>Sporting Events</dc:subject>
<dc:subject>Community Participation</dc:subject>
<dc:subject>Social Impacts</dc:subject>
<dc:subject>Web-Survey</dc:subject>
</oai_dc:dc>
</metadata>
</record>

OAI-PMH harvesting from ADT

The ADT service exposes metadata for OAI-PMH harvesting. The base URL (address) of the OAI-PMH provider service is : http://metasuite.nun.unsw.edu.au:9004/. The sets are listed at http://metasuite.nun.unsw.edu.au:9004/CMD/OAI?verb=ListSets. There is a set defined for all records with a digital copy attached. It is called adt-digital-copy. Please note that back-ups and updating occur between 3:00 am and 5:20 am Sydney time, which can lock tables and result in time outs for harvesting.

Copyright © Council of Australian University Librarians 1997 - Updated: Monday April 28, 2008 17:38 - Web Co-ordinator