Dead Media Beat: POCOS digital preservation glossary

http://www.pocos.org/images/pub_material/POCOS_2_booklet.pdf

THE 2nd SYMPOSIUM ON THE PRESERVATION OF COMPLEX OBJECTS
The Lighthouse, Glasgow, Scotland, 11-12 October 2011
University of Glasgow, Humanities Advanced Technology & Information Institute
www.pocos.org

QUICK REFERENCE (based on digital preservation coalition -dpc-Jargon Buster)

Access: the process of turning an AIP into DIP, ie using data from a digital archive

ADS: Archaeology Data Service, a digital archive specialising in archaeological data based in York

AHDS: Arts and Humanities Data Service, a data service for higher education, closed in 2008

AIMS: Project funded by Mellon foundation to examine archival principles in the digital age

AIP: Archival Information Package, a package of information held within an OAIS

APA: Alliance for Permanent Access, a European network, set up APARSEN

APARSEN: a Network of Excellence funded by the EC, see APA

API: an interface provided by a software program in order to interact with other software applications

Archival Storage: The OAIS entity that contains the services and functions used for the storage and retrieval of AIP

ARCOMEM: ARchive COmmunities MEMories, EC-funded project in digital preservation

ASCII: American Standard Code for Information Interchange, standard for electronic text

BlogForever: EC-funded project working on robust digital preservation, management and dissemination facilities for weblogs

BLPAC: British Library Preservation Advisory Centre – a service of the BL which promotes preservation

BS10008: a British standard pertaining to the evidential weight of digital objects

CCSDS: Consultative Committee for Space Data Systems, originators of the OAIS standard

Characterisation: stage of ingest processes where digital objects are analysed to assess their composition and validity

Checksum: a unique numerical signature derived from a file. Used to compare copies

Cloud (cloud-computing, cloud-based etc): on demand, offsite data storage and processing provided by a third party

DAFD: Data Audit Framework Development, JISC-funded project

Data Dictionary: A formal repository of terms used to describe data

Data Seal of Approval: a process by which organisations can undertake self-evaluation of their DP practices

DCC: Digital Curation Centre, data management advisory service for research

Designated Community: group of users who should be able to understand a particular set of information

DigiCurVE: Digital Curation in Vocational Education, assessment project funded by EU on training provision in Europe

Digital Object: a set of bit sequences, e.g. a single document such as a PDF file, or an image of a (console) game, etc.

DIP: Dissemination Information Package, the data disseminated from an OAIS

DPA: Digital Preservation Award, biannual prize awarded by the DPC, won twice by TNA

DPC: Digital Preservation Coalition, a membership body that supports digital preservation

DPTP: Digital Preservation Training Programme, an intensive training course run by ULCC

DRAMBORA: Digital Repository Audit Methodology Based on Risk Assessment

DROID: tool developed and distributed by TNA to identify file formats. Based on PRONOM

EC: European Commission

Edina: a national data centre based in Edinburgh University mainly funded by JISC

Emulation Framework: a framework that offers emulation services for digital preservation

Emulation: adapts a computer environment so that it can render a software artefact as if it were running on its original environment

Encapsulation: a process where digital objects are captured with information necessary to interpret them

ENSURE: Enabling kNowledge Sustainability Usability and Recovery for Economic value, EC-funded project

GIF: Graphic Interchange Format, an image which typically uses lossy compression

GIS: Geographical Information System, a system that processes mapping and data together

HTML: Hypertext Markup Language, a format used to present text on the World Wide Web Incremental: a project funded by JISC at HATII and Cambridge University

Inegx: tool kit developed by BBC R&D to manage the ingest of audio visual collections

Ingest: the process of turning an SIP into an AIP, ie putting data into a digital archive

ISO: International Organization for Standardization, body that promotes standards

JISC: Joint Information Systems Committee of the Higher Education Funding Councils

JPEG 2000: a revision of the JPEG format which can use lossless compression

JPEG: Joint Photographic Experts Group, a format for digital photographs which is lossy

KB: Koninklijke Bibliotheek, national library of the Netherlands, partner in KEEP and APARSEN; APA home to LIBER and NCDD

KEEP: Keeping Emulation Environments Portable, EC-funded project to develop emulation services to run on a virtual machine

LC: Library of Congress

LIBER: network of European Research Libraries involved in APARSEN and AP, offices at the KB

LiWa: Living web archives, EC-funded project which developed web archiving tools

LOCKSS: Lots of Copies Keeps Stuff Safe a DP principle made into a toolkit for E-Journal preservation, see UKLA

Lossless compression: a mechanism for reducing file sizes that retains all original data

Lossy compression: a mechanism for reducing file sizes which typically discards data

Memento: an innovative tool which allows time based discovery of web pages, winner of DPA 2010

METS: Metadata Encoding and Transmission Standard, a standard for presenting metadata

Migration: the process of moving data from one format to another

MLA: Council of Museum Libraries and Archives, strategic body for such organisations in England

NARA: US National Archives and Records Administration

NCDD: Dutch national digital preservation coalition, closely aligned with APA, DPC and Nestor and hosted by KB

NDAD: UK National Digital Archive of Datasets, formerly funded by TNA and operated by ULCC

NDIIPP: National Digital Information Infrastructure and Preservation Programme – a major programme from the LC

Nestor: German network of expertise in digital preservation, closely aligned to APA and NCDD

OAIS: Open Archival Information System, a reference model describing a digital archive

OCLC: Online Computer Library Center, Inc., US-based library and research group Open source: software in which the underlying code is available for free

OPF: Open Planets Foundation, a membership organisation which sustains outputs from the PLANETS project

PARSE.INSIGHT: EC-funded project that developed a roadmap for DP infrastructure in Europe

PDF/A: a version of the PDF standard intended for archives

PDF: Portable Document Format, a format for producing and sharing documents

PLANETS: a project funded by the EC to develop a suite of DP tools including PLATO. Now maintained by OPF

PLATO: a preservation planning tool which was created by the PLANETS project.

PORTICO: service provider that provides value-added long term digital storage for publishers and libraries

PREMIS: Preservation Metadata: Information Strategies, metadata standard

Preservation planning: defining a series of preservation actions to address an identified risk for a given set of digital objects

PrestoPRIME: EC-funded project which develops tools and services for the preservation of digital audio-visual content

PRONOM: a database of file formats with notes on associated issues. Used with DROID

PROTAGE: Preservation organizations using tools in agent environments, EC-funded project

RCUK: Research Councils UK

RIN: Research Information Network, a group that studies and reports on research needs

RLG: Research Libraries Group, US research group that produced TDR. Now part of OCLC

RLUK: Research Libraries UK

SaaS: software as a service, architecture whereby software is managed remotely by a service provider (see also cloud)

SAP: global IT company based in Germany, leaders of the TIMBUS project

SCAPE: Scalable Preservation Environments, EC-funded project developing scalable preservation actions

SDB: Safety Deposit Box, a digital preservation product supplied by Tessella

SHAMAN: Sustaining Heritage Access through Multivalent Archiving, EC-funded project

Significant properties: concept whereby identifying the most important elements element of a file will aid preservation

SIP: Submission Information Package, data received into an OAIS

SOA: Society of Archivists, now renamed the Archives and Records Association (ARA)

STFC: Science and Technology Facilities Council, a constituent part of RCUK and the lead of APARSEN

STM: Science Technology and Medicine – major area of publishing, sometimes meaning the STM Publishers Association

SWISH: joint venture between RCAHMS and RCAHMW to provide digital services including long term preservation

TDR: Trusted Digital Repository, a standard which characterises ‘trust’ in a digital archive

TIFF: Tagged Image File Format, a common format for images typically lossless

TIMBUS: an EC-funded project which is investigating the preservation of online services

Tessella: an IT company with digital preservation product (SDB). Partner in APARSEN, PLANETS, KEEP

TRAC: Trusted Repository Audit and Certification, toolkit for auditing a digital repository

UKDA: UK Data Archive University of Essex, digital archive for social and economic data

UKLA: UK LOCKSS Alliance, a service of Edina which offers E-journal preservation

UKWAC: UK Web Archiving Consortium

ULCC: University of London Computer Centre, host of NDAD and creators of DPTP

UMD: Universal Media Disc; proprietary CD-ROM format of Sony Computer Entertainment

UML: an industry standard for visualisation, specification construction and documentation of artefacts of software systems

Virtualization: creation of a virtual rather than actual instance of software or hardware (see also emulation)

WF4EVER: Advanced Workflow Preservation Technologies for Enhanced Science, EC-funded project

XML: Extensible Markup Language, a widely used format for encoding information