ISO-28500 Historical Revision Information
Information and documentation - WARC file format

ISO-28500 - 1ST EDITION - SUPERSEDED
Show Complete Document History

Document Center Inc. is an authorized dealer of ISO standards.
The following bibliographic material is provided to assist you with your purchasing decision:


ISO 28500:2009 specifies the WARC file format:

  • to store both the payload content and control information from mainstream Internet application layer protocols, such as the Hypertext Transfer Protocol (HTTP), Domain Name System (DNS), and File Transfer Protocol (FTP);
  • to store arbitrary metadata linked to other stored data (e.g. subject classifier, discovered language, encoding);
  • to support data compression and maintain data record integrity;
  • to store all control information from the harvesting protocol (e.g. request headers), not just response information;
  • to store the results of data transformations linked to other stored data;
  • to store a duplicate detection event linked to other stored data (to reduce storage in the presence of identical or substantially similar resources);
  • to be extended without disruption to existing functionality;
  • to support handling of overly long records by truncation or segmentation, where desired.
ORDER

To find similar documents by classification:

35.240.30 (IT applications in information, documentation and publishing Including Standard Generalized Markup Language (SGML), automatic translation machines, etc.)

This document comes with our free Notification Service, good for the life of the document.

This document is available in either Paper or PDF format.

Document Number

ISO 28500:2009

Revision Level

1ST EDITION

Status

Superseded

Publication Date

May 15, 2009

Committee Number

ISO/TC 46/SC 4