Metadata Interest Group
8:30 am to 10:00 am, US/Pacific
Metadata Migration to Islandora: Is There An Easy Way?: Sai Deng, U of Central Florida Libraries
This presentation introduces UCF’s digital collection migration from DigiTool to Islandora, the new content management system for the state universities in Florida. Discusses the issues in DublinCore (DC) to MODS transformation, explores the possible options, the approach adopted and the tool used for MODS metadata editing.
As part of the state-wide Islandora implementation, UCF has been migrating its collections in DigiTool to MODS records for Islandora to ingest. In migrating from a less granular metadata schema to a more granular one, many issues are involved such as data ambiguity, overly generic data representation, the markup inadequacy in describing sub-elements and element relationships, and a less intricate data structure. Two options were explored: a. revamp the LC's DC-MODS stylesheet to produce more desirable MODS metadata, b. edit the MODS records generated from a more generic stylesheet conversion. Due to consensus needing to be reached among the state universities for any change, only some adjustments such as adding local subjects and online thesauri were made to the LC stylesheet state-wide, and the major work of MODS metadata editing fell on the individual university libraries. At UCF Libraries, Notepad++ was used to edit the MODS records, such as the first set of 847 records in the Political & Rights Issues & Social Movements (PRISM) collection. In batch editing the MODS metadata and dealing with the DC-MODS transformation problems, data patterns in author year, author role terms, publication places, corporate and conference names were identified, data normalization and cleaning was executed, and several types of mark up and editing were performed: authors were marked up to distinguish author year and role from name; personal, corporate and conference names were differentiated; the main entry and added entries were distinguished; subtitle was separated from the main title; publication place was distinguished from publisher; topical, temporal, geographic, genre subdivisions were marked up for subjects; and series name and other common fields were added for the collection.
Discovering Isaac Leeser: Improving access to text collections with TEI markup: Nicole Arbuckle, Backstage Library Works & David McKnight, U of Pennsylvania
In 2013, the Penn Libraries completed its Gershwind-Bennett Isaac Leeser Digital Repository project, which features access to the personal papers and publications of Isaac Leeser, one of the foremost American Jewish figures of the 19th century. Penn Libraries established a template for collecting metadata from Leeser's correspondence, teamed with Backstage to complete TEI markup, and developed an online digital repository. The metadata files and the repository structure provide users with sophisticated tools for full-text search and discovery, with filters for details such as document type, date range, language, author and addressee.
McKnight discusses the background of the collection and the library's preparation of transcriptions and descriptive metadata for the project. Arbuckle discusses the use of that metadata in the TEI markup process. The digital repository can be viewed online at: http://ubuwebser.cajs.upenn.edu/