Intellectual Access to Preservation Metadata Interest Group
3:00 pm to 4:00 pm, US/Eastern
Following a short business meeting, our program will feature the work of researchers from the Drexel University College of Computing and Informatics, who are working with a scientific agency to develop an OAIS-compliant joint preservation and reuse data repository.
Decomposing Results Without Burying the Body of Evidence: A Modus Operandi for Developing Metadata and Digital Preservation Requirements
Presented by Lorraine L. Richards (Assistant Professor, Drexel University College of Computing and Informatics) and Adam Townes (Doctoral Candidate, Drexel University College of Computing and Informatics).
Abstract: In February 2013, the Office of Science and Technology Policy (OSTP) issued a directive to each Federal agency with over $100 million in annual conduct of research and development expenditures to develop an “approach for optimizing search, archival, and dissemination features that encourages innovation in accessibility and interoperability, while ensuring long-term stewardship of the results of federally funded research.” We are working directly with scientists, engineers and program managers at the Federal Aviation Administration’s (FAA) William J. Hughes Technical Center (WJHTC) to develop and enhance current data sets and sources. We will jointly develop a prototype technical architecture for the establishment of an OAIS-compliant digital repository to service the Tech Center, Drexel University, and future users. Key components of this project involve:
- Understanding and identifying the appropriate metadata names and development of a metadata taxonomy;
- Enhancing knowledge towards the possible design of the ingest and tagging mechanisms to auto-generate metadata tags for relevant analysis, and machine learning software for this task;
- Conducting research for specification and/or standards for rules and policies for the data sets and access controls; and
- Analyzing user modeling, conducting task analysis for the users of the data sets and the workflows to be executed against the data.
Because data sharing has not been the norm for this agency in the past, particular care must be taken to ensure that metadata is appropriately developed for search, retrievability, and reuse. However, developing metadata for scientific data offers challenges associated with the need to ensure reproducibility of results and appropriate data reuse when data is taken outside its initial disciplinary boundaries. This presentation will discuss these challenges briefly and provide information on the techniques we are using to attempt to mitigate the risks associated with developing requirements for a joint preservation/reuse repository of scientific data.
Full Author List: Lorraine L. Richards, Adam Townes, William C. Regli, and Yuanyuan Feng (all Drexel University College of Computing and Informatics)