Jump to content

File archiver

From Wikipedia, the free encyclopedia

In computing, a file archiver is utility software that combines files into a single archive file – or in less common cases, multiple files.

A minimally designed archiver might concatenate the content of files along with file name and length. A more advanced archiver stores additional metadata, such as the timestamps, file attributes and access control information. An archiver might compress input file content to reduce the size of the resulting archive.

The process of making an archive file is called archiving or packing. Reconstructing the original files from an archive is called unarchiving, unpacking or extracting.

Multics

[edit]

In the early days of computing, Multics provided the archive command – a basic archiver without compression – that descended from the CTSS command of the same name. Multics also provided a magnetic tape archiver command, ta, which was perhaps the forerunner of the Unix command tar.[1]

Unix

[edit]

As the Unix archive tools ar, tar, and cpio do not provide compression, other tools, such as gzip, bzip2, or xz, are used to compress an archive file after it is created and to decompress before extracting.

Not only does separating archiving from compressing follow the Unix philosophy that each tool should provide a single capability; not attempt to accomplish everything with one tool, it has the following advantages:

  • As compression technology progresses, users may use a different compression tool without having to change how they use the archiver
  • Solid compression allows the compressor can exploit redundancy across multiple archived files to achieve better compression than compressing each file individually

Disadvantages include:

  • Extracting a single file requires decompressing the entire file, which can be costly in terms of time and storage space; adding a file to an existing archive requires both decompression and recompression
  • The archive becomes damage-prone; corruption in any part of the file might cause all files to be lost

A challenge:

  • Compression cannot take advantage of redundancy between files unless the compression window is larger than the size of an individual file; for example, gzip uses DEFLATE, which typically operates with a 32768-byte window, whereas bzip2 uses a Burrows–Wheeler transform roughly 27 times bigger; xz defaults to 8 MiB but supports significantly larger windows

Generally, extensions are successively added to the file name to indicate the operations performed and therefore required to read a file. For example, archiving with tar command and then compressing with gzip command might be indicated with the .tar.gz extension.

Windows

[edit]

Archiving tools on Windows tend to have a graphical user interface (GUI) and to include compression – including the built-in Windows feature as well as commonly used, third-party tools such as WinRAR and 7-Zip. Unlike the built-in feature, WinRAR and 7-zip also provide a command-line interface (CLI) and solid compression.

See also

[edit]

References

[edit]
  1. ^ Van Vleck, Tom. "Multics Glossary - A". multicians.org. Retrieved Jan 6, 2016.