Metadata Formats
Work Package 1 of Telematics for Libraries project BIBLINK (LB 4034)
The BIBLINK Project
Title page
Table of Contents

Previous - Next

3. Objectives, Scope and Glossary

The objectives of this work package are to:

The outcome of this work package will feed into the consensus building process in WP3 which includes producing an agreed minimum set of data elements for the description of off-line and on-line electronic resources, and agreeing recommended formats for the exchange of data.

3.1 Scope and Assumptions

There has been a considerable amount of recent work and activity in the metadata area, particularly over the last year. It is not our intention to replicate existing documents, and wherever possible we refer to existing work. This report will consider the suitability of various formats but will not prescribe data elements at this stage. More detailed work on data elements is scheduled for WP3. In particular the final choice of which metadata format(s) will be used will be constrained by

As part of this work package we have done some initial investigation of the requirements of national libraries, but as the project progresses these requirements will need to be defined and clarified. As yet we do not have a definition of the publishers' requirements as this will be built up during the consensus building process. For that reason, the suitability of various formats from the publishers' viewpoint will be suggested rather than matched against requirements. During the consensus building process it is anticipated that these criteria for choice of metadata will be refined to enable a final decision on metadata format. This work-package hopes to raise some of the issues involved in the choice, and provide background on the possible options.

Metadata is defined as "data which assists in the identification, description, evaluation and selection of an information object". Metadata can exist for objects at various levels of granularity e.g. it can refer to collections of documents, a single document or a chapter within a document; it can refer to a series, an individual journal, or an article within a journal; it could refer to a web-site, to a particular logical archive on that site, to a web page, or to an image embedded in that page.

At any given level of granularity, metadata may be required in a number of areas:

In addition there may be metadata associated with the record itself. This includes such details as date record created, creator of record, etc. We call this data administrative metadata. No assumption is made that all metadata associated with an object needs to be contained within the same metadata record. Note that the use of particular identification schemes will not be considered as this is dealt with in WP2.

Different implementations may store and transport metadata records in different ways. For example, a record may be stored and transported within the object which it describes, or it may be stored and transported as a self-contained entity. Metadata referring to a single object may be subdivided into different packages, some of which are stored in separate locations and are referenced in the 'master' record. Decisions on the manner in which metadata is transferred will form part of WP5 (Transmission of metadata).

3.2 Work Available in Background

IFLA Section on Cataloguing. Functional requirements for bibliographic records: draft report for world-wide review. May 1996.

Models for the provision of national bibliographic services in Europe : final report / by Michele Lenart; translated and edited by the British Library. - Luxembourg : European Commission, 1996.

Metadata: a survey of current resource description formats. Work Package 3 of Telematics for Research project DESIRE (no. 1004). November 1996, at

<URL:http://www.ukoln.ac.uk/metadata/DESIRE/overview/>
OCLC/NCSA Metadata Workshop Report. S. Weibel, J. Godby, E. Miller and R. Daniel, March 1995. The March 1995 Metadata Workshop, sponsored by the On-line Computer Library Center (OCLC) and the National Center for Supercomputing Applications (NCSA), at

<URL:http://www.oclc.org:5047/oclc/research/publications/weibel/metadata/dublin_core_report.html>
The Warwick Metadata Workshop: A Framework for the Deployment of Resource Description. L.Dempsey, and S.Weibel, D-Lib Magazine, July/August 1996, at

<URL:http://www.ukoln.ac.uk/dlib/dlib/july96/07weibel.html>
A Syntax for Dublin Core Metadata - Recommendations from the Second Metadata Workshop. L.Burnard, E.Miller, L.Quin, and C.Sperberg-McQueen,, April 1996, at

<URL:http://www.uic.edu/~cmsmcq/tech/metadata.syntax.html>
BIC (Book Industry Communication) and British Library BNBRF Book Product Information project.

BIC's web pages provide background information, as well as their publications of standards and guidelines:

BIC Manual on Publishers' Bibliographic Databases, draft, 1994. (unpublished, but available on request from Book Industry Communication.)

Bide, Mark (on behalf of Book Industry Communication) Electronic Tables of Contents (EToCs) for serials: standards for structure & transmission. A research study for the BNB Research Fund. Fourth Draft 1994.

Martin, David. Data elements for an EDI 'Book Product Information' message. British National Bibliography Research fund Report 75. Published for BNB by BIC, 1995.

SSSH: Simplified SGML for Serials Headers, London, Book Industry Communication and PIRA International, 1996.

3.3 Glossary

This Glossary describes terms which are most relevant in the context of the BIBLINK project and this deliverable. Further information about the terms can be found in the following glossaries:

AHDSArts and Humanities Data Service.
Bibliographic descriptiona set of formalised data elements describing a publication.
Bibliographic recorda discrete bibliographic description stored either manually or electronically.
BICBook Industry Communications.
CIMIComputer Interchange of Museum Information.
CIPCataloguing-In-Publication records, created using information supplied pre-publication by the publisher.
CD-ROMCompact Disc Read Only Memory.
CURLConsortium of University Research Libraries.
Database (DB)a computer Program for entering, storing and retrieving items of information in a structured fashion.
Deposit of publicationsa system in operation in most countries, usually legally enforced, whereby publishers must deposit one or more copies of every publication with nominated libraries. Often referred to as Legal Deposit.
DESIREDevelopment of a European Service for Information and Research.
DTDDocument Type Definition.
EDIElectronic Data Interchange. The exchange of structured data messages to enable automated transactions between application systems.
EDIFACTEDI For Administrations, Commerce and Transport. The international EDI standard messaging syntax under the responsibility of the UN, for trading transactions in all industries. Also known as UN-EDIFACT.
Electronic journalsimilar to a traditional journal but published only in electronic form - on a CD-ROM or the World Wide Web.
Electronic publishersee publisher.
Electronic publicationdocument, file, journal, etc. made available in electronic form.
eLibElectronic Library Programme, previously FIGIT (Follett Implementation Group on IT).
FGDCThe Federal Geographic Data Committee.
Formatin the context of bibliographic control, the formalised structure in which the specific elements of bibliographic description are accommodated.
FTPFile Transfer Protocol, an internet standard means of transferring electronic files between computers.
Home pagea World Wide Web page set up as an introductory page by an organisation or individual.
HTMLHypertext Mark-up Language The standard language used for creating Web documents.
HTTPHyperText Transfer Protocol. The protocol used for communication between Web clients and servers.
IAFAInternet Anonymous FTP Archive.
IEEEInstitute of Electrical and Electronics Engineers.
IETFInternet Engineering Taskforce.
IFLAInternational Federation of Library Associations and Institutions.
ILLInter-Library Loan.
Internetthe world wide network of computer systems connected to each other.
ISADInformation Society and Development (conference).
ISBDInternational Standard Bibliographic Description. There are seven specific ISBDs as well as the general ISBD(G): monographs -(M), serial publications -(S), cartographic material -(CM), non-book material -(NBM), printed music -(PM), antiquarian publications -(A), computer files -(CF).
ISSNInternational Standard Serial Number.
Legal Depositsee Deposit of Publications.
MARCMAchine Readable Cataloguing. A family of formats based on ISO 2709 for the exchange of bibliographic and other related information in machine readable form. For example USMARC and UNIMARC.
Metadatainformation about a publication as opposed to the content of the publication; includes not only bibliographic description but also other relevant information such as its subject, price, conditions of use, etc.
Monographa publication either complete in one part or complete, or intended to be completed, in a finite number of separate parts. A non-serial publication.
Multimediaa publication in which images, sound and text are integrated.
National Bibliographya listing of all national publications. May include all publications produced in that country, or in the language of that country, or sometimes about that country.
NCSANational Center for Supercomputing Applications.
NDISNational Document Information Service.
OCLCOn-line Computer Library Center.
On-line resourcean on-line resource is an electronic document which is bibliographically identifiable, which is stored in machine readable form on an electronic storage medium and which is available on-line. For example - a Web page.
Off-line publicationan off-line publication is an electronic document which is bibliographically identifiable, which is stored in machine readable form on an electronic storage medium. For example a CD-ROM.
OPACOnline Public Access Catalogue.
PICSPlatform for Internet Content Selection, an infrastructure for associating labels with Internet content.
PIIPublisher Item Identifier.
pspostscript, a standard format for exchange of printable files.
Publicationsdocuments containing either text or sound or images, or combinations of these, packaged for wider distribution, whether off-line (e.g. printed book, CD-ROM) or on-line (e.g. Web, database for information retrieval).
Publishera person or organisation that produces documents and makes them available. Newly emerging publishers may produce and distribute documents electronically - for instance, on the Web.
RDMResource Description Messages.
Recordsee bibliographic record.
RFCRequest For Comments, a method by which standards (sic) are proposed and agreed, usually with reference to the Internet.
ROADSResource Organisation And Discovery in Subject based services
Seriala publication in any medium issued in successive parts bearing numeric or chronological designations and intended to be continued indefinitely. Serials include periodicals; newspapers; annuals (reports, yearbooks, etc.); the journals, memoirs, proceedings, transactions etc. of societies; and numbered monographic series.
SGML (ISO 8879)Standard Generalised Mark-up Language. ISO standard for document description, separating contents and structure.
SOIFSummary of Object Interchange Formats.
SSSHSimplified SGML for Serials Headers.
SURFnetthe Dutch national computer network for research and education in the Netherlands, joining local networks.
TEIText Encoding Initiative.
URLUniform Resource Locator. The standard way to give the address of a source of information on the WWW. It contains four different parts: the protocol type, the machine name, the directory path and the file name. For example: http://WWW2.echo.lu/libraries/en/libraries.html
Voluntary Depositsee Deposit of Publications.
Web siteused to refer to a single location on the World Wide Web, usually on the same piece of hardware. Part of the Internet that stores and gives access to documents using HTTP.
World Wide Webthe global set of Internet web sites offering world wide access to information using HTTP.
Z39.50A network protocol which allows searching of (usually remote) heterogeneous databases and retrieval of data, most often used for retrieving bibliographic records.
Next Table of Contents