What common archiving formats and standards should I know?

Common formats and standards for long-term archives

Choosing widely supported formats and standards helps ensure archived data remains readable and portable. Favor open, well-documented formats and avoid proprietary encodings where long-term access is required.

Common file and container choices:

  • Document formats: PDF/A for long-term document preservation, plain text, CSV for tabular data.
  • Images and media: TIFF or PNG for images, WAV or FLAC for audio, MP4 for video with standard codecs.
  • Data containers: TAR/ZIP for bundled files; use non-proprietary compression when possible.
  • Metadata standards: Dublin Core for general metadata; PREMIS for preservation metadata.

Also consider checksums (MD5, SHA-256) for integrity checks and standardized manifest files that list archive contents. For enterprise scenarios, compliance standards such as ISO 14721 (OAIS) provide frameworks for archival systems.

Using these established formats and documenting encoding, codecs, and software versions reduces the risk of obsolescence and eases migration or legal review years down the line.