LITA / ALCTS MARC Formats Transition Interest Group Community
Please join us for the ALCTS/LITA MARC Formats Transition IG at ALA Midwinter! We will meet on Saturday, January 9th from 3-4pm at the Boston Convention and Exhibition Center, Room 105.
Our featured presentations include:
Open Linked Data in Discovery: BIBFRAME & Schema.org in an Experimental Bento View
Jim Hahn, University of Illinois at Urbana-Champaign
With support from an internal innovation grant of the University of Illinois Library at Urbana-Champaign, researchers transformed and enriched nearly 300,000 e-book records in their library catalog from Machine-Readable Cataloging (MARC) records to Bibliographic Framework (BIBFRAME) Linked Data Resources. The process utilized existing marc2bib code from the Library of Congress, and then used additional python processing so that each of the BIBFRAME XML records would include open linked resources. (https://bitbucket.org/minrvaproject-admin/bibframeuiuc/overview)
Following the transformation & enrichment researchers indexed the BIBFRAME resources online, and created two search interfaces for discovery of BIBFRAME linked data.
One of the outputs of the grant was the incorporation of BIBFRAME records within an open source Bento view ( http://sif.library.illinois.edu/megasearch/ ) for the linked library data for e-books utilizing Schema.org and a Google Custom Search Engine. Project page for data sources and enrichment Python code developed is available at : http://sif.library.illinois.edu/bibframe/http://sif.library.illinois.edu/bibframe/
MARC to MODS XML to Linked Data Friendly RDF
Steven Anderson, Boston Public Library
One of the main focuses at the Boston Public Library over the past few months has been in creating a process to move ~40,000 Internet Archive books into the Digital Commonwealth repository. One challenge of this? Our repository speaks MODS; not MARC. As part of using these items with our larger system then, we have custom code to convert the MARC to MODS and, more importantly, automatically enrich it along the way to play nicely with the other objects in our system.
However, MODS XML is not the end destination for this data but rather just a legacy format we are stuck using at this current time. The Linked Data world has exploded... but those of us with great specificity in describing our objects haven't been given a path to join in on the party. As such, the BPL is part of a multi-institutional group to map MODS XML to appropriate RDF predicates in frequent use from multiple different namespaces without giving up any piece of information the group cares about. This talk will go over some of those proposed mappings and what those original MARC records will likely start to be in the end based on that group's work.
BIBFRAME and Linked Data for Archives
Gloria Gonzalez, Zepheira, and Dustin Stokes, Atlas Systems
Zepheira and Atlas Systems have partnered to explore methods for increasing visibility and accessibility for rare and archival materials. To explore Linked Data for Archives, Zepheira and Atlas Systems began working together by hosting a series of focus groups in September and October 2015. This presentation will provide an overview of our findings, along with a summary of the next steps that arose from the focus group discussions. We will discuss how Linked Data principles can be applied to collection-level MARC records and archival finding aids. Attendees will leave with an understanding of how Linked Data can improve how users find archival collections and discover archival context. Attendees will also have the opportunity to provide feedback on the key concepts that we are exploring.
The Visible Library @ Multnomah County
Matthew Timberlake, Multnomah County Public Library
The Libhub Initiative aims to raise the visibility of Libraries on the Web by actively exploring the promise of BIBFRAME and Linked Data. Multnomah County Library joined the Libhub Initiative as a Founding Partner. Mr. Timberlake will share their experience in exploring and implementing Linked Data and making Multnomah County Library's entire catalog available on the web!
Ground Truthing MARC
"Ground truthing" is the verification of remote sensing data by direct observation. Over the last few years, OCLC Research has been doing the equivalent for MARC, by exposing exactly how various fields and subfields have been used by catalogers around the world and over time. Using the massive WorldCat database and modern cluster computing techniques, data is extracted, summarized, and reported, to better inform efforts to move this data into new formats and uses.
Roy Tennant is a Senior Program Officer with OCLC Research.
MarcEdit Tools for Linked Data
Terry Reese will discuss how catalogers can track BIBFRAME developments and participate locally by beginning to add linked data concepts into their MARC data. Mr. Reese will show how to utilize MarcEdit's MARCNext Toolset or the MarcEdit LibHub plugin.
Terry Reese is the Head of Digital Initiatives at The Ohio State University and the developer of MarcEdit.
A BIBFLOW Project Update
BIBFLOW is a two-year project of the UC Davis University Library and Zepheira, funded by IMLS. Its official title is “Reinventing Cataloging: Models for the Future of Library Operations” and it is investigating the future of library technical services, i.e., cataloging and related workflows, in light of modern technology infrastructure such as the Web and new data models and formats such as Resource Description and Access (RDA) and BIBFRAME, the new encoding and exchange format in development by the Library of Congress.
Carl Stahmer is the Director of Digital Scholarship at the University Library, University of California, Davis
The Libhub initiative and Denver Public Library - Leading Libraries to the Web!
The Libhub Initiative aims to raise the visibility of Libraries on the Web by actively exploring the promise of BIBFRAME and Linked Data. Denver Public Library joined the Libhub Initiative as a Founding Partner and has served as its pioneer institution. Ms. Jeske will share DPL’s experience in exploring and implementing linked data and making DPL’s entire catalog available on the web!
Michelle Jeske is the City Librarian of Denver.
Experiments in BIBFRAME: A modular approach
Nancy Fallgren, Metadata Specialist Librarian, National Library of Medicine
The National Library of Medicine has been collaborating with Library of Congress on BIBFRAME development and experimentation from the Early Experimenters to the current Implementers. The goal of BIBFRAME is to develop a data sharing standard to replace MARC that is web-based, flexible, and extensible beyond the traditional library cataloging community (i.e., rule agnostic). To that end, NLM is focusing on flexibility and extensibility by experimenting with a ‘modular’ approach to BIBFRAME. This modeling approach is comprised of a core BIBFRAME vocabulary designed to be extended with existing descriptive schemes already developed by the various sectors of the cultural heritage community. Nancy will talk about this approach and the status of NLM’s experimentation.
BIBFLOW: An IMLS Project
Xiaoli Li, Co-head of Content Support Services Department, University of California Davis
BIBFLOW is a two-year project of the UC Davis University Library and Zepheira, funded by IMLS. Its official title is “Reinventing Cataloging: Models for the Future of Library Operations” and we are investigating the future of library technical services, i.e., cataloging and related workflows, in light of modern technology infrastructure such as the Web and new data models and formats such as Resource Description and Access (RDA) and BIBFRAME, the new encoding and exchange format in development by the Library of Congress. Xiaoli Li will be provide an update on the progress of this work.
Gordon Dunsire and Deborah Fritz will give back-to-back presentations on the topic of interactions between RDA, MARC and BIBFRAME.
Title: RDA, MARC and BIBFRAME: transition and interaction
Summary: How do RDA, MARC 21, and BIBFRAME relate? MARC 21 treats a resource as a single entity, similar to ISBD, BIBFRAME uses two entities, and RDA uses four. How these entities interact is the key to understanding the issues facing cataloging in a linked data environment. How can four go into one? Can two go into four? The presentation describes ongoing work in aligning RDA elements with BIBFRAME and with MARC 21, how an RDA editing tool can input RDA and output MARC21, and discusses the outcomes to date.
Gordon Dunsire is Chair of the Joint Steering Committee for RDA, and has been involved in the development of RDA: Resource Description and Access since the 1997 Toronto conference on the future of AACR. He is also Chair of the IFLA Namespaces Technical Group, and participates in various projects and working groups that develop library and cultural heritage standards for use in linked data applications and the Semantic Web. He was formerly Head of the Centre for Digital Library Research at the University of Strathclyde, following a career in academic libraries as cataloger, systems librarian, and senior manager. He presents and publishes widely at international level (http://www.gordondunsire.com/publications.htm).
Deborah Fritz is a founder and co-owner of The MARC of Quality (TMQ, Inc), a provider of cataloging training and MARC record software, established in 1992. She is the author/co-author of two major works on cataloging using MARC, and is currently in the process of transitioning the company’s training from AACR to RDA, and expanding its software beyond MARC to RIMMF.
Curating MARC Indexing to Optimize Discoverability of Resources at ECU Libraries
East Carolina University
Presents a case study of an implementation of VuFind at the East Carolina U Libraries. Cataloging representatives from the main university library, the health sciences library, and the music library formed a working group to evaluate the MARC fields that would be indexed and displayed for the end user of the VuFind discovery layer. Intent was to clarify scoping of resources based on the indexing tied to particular facets and search functionalities, and to replicate search functions available in the Libraries’ existing Classic Catalog. The end result was to construct search options and parameters that would optimize discoverability of the library’s resources. The presentation will reveal the facet elements selected for the implementation and the process behind mapping those elements. An overview regarding decisions on the integration of MARC fields and elements behind search indices and the rationale behind these decisions will be discussed.
Book Artists Unbound: an EAC-CPF-based Discovery Tool for Contextualization of Creators
Alison Jai O'Dell
University of Miami
For artists’ books, contextual information is available in library catalog databases, but is hidden or dispersed, in part to challenges inherent in the MARC authority format and its typical handling. To expose background and relational information about book artists, this project harvested metadata concerning the U of Miami Libraries Special Collections Artists’ Book Collection from a MARC-format bibliographic database and converted relevant data elements to the EAC-CPF XML schema with the Remixing Archival Metadata Project (RAMP) editor. Using RAMP allowed the data to be semi-automatically enhanced with information from WorldCat Identities and VIAF. The enriched, linked data to come out of this process facilitated development of a widget-based discovery platform that presents users with contextual information about the artists. Interactive record display and various data visualizations nurture exploration of relationships between artists, their studios and spaces, educational and socio-cultural background, themes, techniques, media, and visual/conceptual genres. O’Dell explains advantages discovered when using the EAC-CPF format to deliver and transform creator metadata, focusing on showcasing the discovery product and its design, and conclude with feedback from user focus groups.
Building on the Past: (Re)Constructing Comprehensive Title Histories for Serials Management using MARC Records
When using trusted sources, keeping up with serials title changes remains a major pain point for librarians and those who need to identify the publication history of periodicals. It can be problematic for companies that provide serials resource management tools. Title change data has traditionally been locked up within various fields of MARC records created by individual catalogers. Tthe results of different cataloging rules and interpretations, copy cataloging, typographical errors, and "the cataloger's judgment" have built a huge store of information that can be inconsistent and hard to follow. Stetson will talk about the knowledgebase service he has designed to analyze, mine, and reconstitute this data, resulting in comprehensive title histories that can be used in serials management
The LITA/ALCTS MARC Formats Transition Interest Group is pleased to host two presentations on transitioning MARC metadata to other formats and environments at ALA Annual in Chicago, IL.
Date:Saturday, June 29, 2013
Time: 3:00-4:00 PM
Location: E351 McCormick Place
"On BIBFRAME Instance"
Presenter: Kevin Ford, Digital Project Coordinator, Network Development and MARC Standards Office, Library of Congress
This presentation will explore the nature of a BIBFRAME Instance within the BIBFRAME Model which is currently being developed to replace the MARC Bibliographic communication format. Defined as "an individual, material embodiment of a BIBFRAME Work that can be physical or digital in nature," a BIBFRAME Instance is understood to be an atomic unit of a specific BIBFRAME Work. The Library of Congress's Network Development and MARC Standards Office has been experimenting with this definition by way of transforming MARC Bibliographic records to BIBFRAME resources, an exercise that invariably raises the question: "How do we best identify, and create, BIBFRAME Instances from MARC records?" A number of data elements in MARC - such as provider information, physical details, and publisher identification numbers - present themselves as good starting points, but no one data point is perfect. A number of them, in fact, must be used as part of a complicated calculus to identify and create BIBFRAME Instances. This presentation will look closely at the equation behind identifying BIBFRAME Instances from MARC data points, LC's current experimentation with transforming MARC records, and it will review future implications stemming from a more atomic approach to bibliographic description.
"Redesigning the English Short Title Catalog: From MARC to Data Agnostic Triplets."
Presenters: Brian Geiger, Co-Director, English Short Title Catalog for North America, and Carl Stahmer, Associate Director, English Broadside Ballad Archive
The English Short Title Catalog is the most comprehensive guide available to the output of the press in the English-speaking world before 1801. The project is both a bibliography that aims to record every distinct item printed during the “hand-press era” in England and its territories, and a union catalog that lists both copies of those items held by libraries and, more recently, digital reproductions of those copies. Begun in the late 1970s, the database currently contains about 500,000 records and more than 3 million holdings. The ESTC is co-managed by the British Library (BL) and the Center for Bibliographical Studies and Research at the University of California, Riverside, and is freely available at http://estc.bl.uk. Although the ESTC is an exemplar scholarly research tool, over the last few years it has become increasingly clear that the project needs to be updated and enhanced in order to capitalize on the latest available technologies. Hundreds of thousands of records from contributing libraries and an ever-growing metadata collection from online archives need to be matched against the ESTC to enrich and improve the file, a number that is prohibitively expensive to process with the traditional means of student matchers and professional catalogers. Researchers, no longer content to simply search for and download records, increasingly want both to edit and comment on ESTC data and to harvest and manipulate ESTC data for their own research. Traditional library catalog records and access software, however, make this kind of data curation, mining and acquisition cumbersome at best and, in many cases, impossible. Data transformation will help to ensure that the ESTC meets the changing needs of its users and partners and, by harnessing their energy and enthusiasm, continues to grow in size and accuracy and remains the central organizing tool of the printed history of the early modern era.
The MARC Formats Interest Group invites you to its discussion meeting at ALA Annual.
Saturday, June 23, 2012, 4:00 p.m. to 6:00 p.m.
Anaheim Convention Center, Room 208B
Topic: Issues in the Bibliographic Transition: Extending the Reach of Authority Data
Our speakers at this meeting will be John Riemer (UCLA), Reinhold Heuvelmann (DNB), and Robert Pillow (VTLS). They will discuss issues relating to the bibliographic framework transition initiative, with a focus on the opportunity for an expanded role for authority data.
Abstracts and biographical information appear below.
New Prospects for Library Authority Data
John Riemer will discuss how authority data can be put to new uses outside an online catalog environment, including contributing to scholarly research in a linked data environment, and outline changes in practice (including those concerned with undifferentiated names) that can help it play that role.
John Riemer is Head of UCLA's Cataloging & Metadata Center. Within in the Program for Cooperative Cataloging, he is Chair Emeritus of the Policy Committee and Chair of the PCC Advisory Committee on Initiatives.
John was co-author with Philip Schreur of a discussion paper proposing changes to how undifferentiated personal name records are handled in the LC/NACO Authority File (http://www.loc.gov/aba/pcc/Undiff%20Personal%20NARs%20Discussion%20Paper%20March%202012.doc). Attendees may be interested to note that the PCC Participants Meeting will be an open forum to conclude community input on this discussion paper. The PCC Participants Meeting will take place on Sunday, June 24 from 4:00 - 5:30 p.m., Anaheim Convention Center, Room 213AB.
From Authority Control to Linked Authority Data:
the "Gemeinsame Normdatei" in German Speaking Countries
The presentation will describe the tradition of authority control in the German speaking countries, with its former authority files, SWD, GKD, PND, and EST, and the recently finished project "Gemeinsame Normdatei" (GND). During this project, the files have been merged into one integrated authority file, structured by one common format. The presentation will inform about the partners, the principles, the working packages and the results of the merging process, as well as the next steps to be taken. While trying to balance the needs of legacy data and existing systems and workflows with MARC as a base structure for the cataloging format, an approach was taken that was driven by the "Functional Requirements for Authority Data" (FRAD). A specific focus was put on relationships and their coding, which may be seen as a step into the Linked Data world. Thus, the GND and its underlying concepts can be of significance as a contribution to the "Bibliographic Framework Transition Initiative".
Reinhold Heuvelmann works in the Office for Library Standards at the German National Library. He holds a degree in library science and has worked as a cataloger. For more than 15 years, he has been responsible for designing and implementing interfaces to export and import bibliographic and authority data. In the Office for Data Formats, he is responsible for bibliographic data formats and other standards related to information management in libraries. He played a key role in the changeover of German and Austrian libraries from the MAB-Format to MARC 21. Reinhold is a member of the MARC Advisory Committee, where he represents the German speaking countries at MARBI meetings. He has participated in the work of the RDA/MARC Working Group and the PCC ISBD and MARC Task Group. He is also a member of the MODS/MADS Editorial Committee.
The Bibliographic Transition: a Vendor Perspective
Robert Pillow, VTLS, will discuss the role of the vendor community in the transition to a linked data environment. While LoC's Bibliographic Framework Initiative is still in its infancy, a commitment to a transition is evident; but, do we know where that transition will take us? Many issues remain to be not only resolved, but in fact, discovered. Current practices need not be an impediment to progress.
Robert Pillow joined VTLS Inc. as a Customer Support Librarian in 1996, bringing with him over 16 years of experience in the library field. Mr. Pillow graduated with an M.S.L.S. in Information & Library Science from the University of Tennessee in 1986. Prior to VTLS, his experience includes work as a Cataloger, Reference Librarian, Circulation Manager, ILL Manager, teaching Library Science, and service as a Library Board member within both academic and public libraries. At VTLS he has served in a variety of positions in Customer Services, Sales and Marketing, Design & Development, and Project Management. Currently Mr. Pillow is the Project Manager for the NGILS implementation at the Hong Kong Public Library.
The current focus of the MARC Formats Interest Group is on issues related to the Library of Congress's Bibliographic Transition project.
In recent years this group has held a series of discussion meetings on questions surrounding what users and libraries need in the new environment, and the practices and technologies that may allow these needs to be met. The group will continue to explore this theme at ALA Annual.
Did you attend this Interest Group meeting? Take our post-conference survey at: http://www.surveymonkey.com/s/alctsevents2012