Sorting Files for Better Compression

Created on 2024-08-30T16:33:44-05:00

This card pertains to a resource available on the internet.

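Sorting files by a similarity hash before archiving puts similar files next to each other. This increases the odds that a compression algorithm with a small window will still have the earlier copy in its window when it reaches the repeated data, so it can match more of it.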

The example is a collection of many copies of Audacity projects: the audio is mostly the same across copies, but each copy of the project carries its own copy of the files.
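
The idea can be sketched in Python. This is a minimal illustration, not the method used by the linked resource: it assumes a simple 64-bit SimHash over byte shingles as the similarity hash, and zlib (DEFLATE, 32 KiB window) as the small-window compressor. The function names and the shingle and step sizes are made up for the example. Sorting by the numeric hash value always puts identical files together, but only tends to put merely similar files close to each other.

```python
import hashlib
import zlib


def simhash(data: bytes, shingle: int = 16, step: int = 4) -> int:
    """Return a 64-bit SimHash of data (illustrative parameters, not tuned)."""
    votes = [0] * 64
    for start in range(0, max(len(data) - shingle, 0) + 1, step):
        piece = data[start:start + shingle]
        h = int.from_bytes(hashlib.blake2b(piece, digest_size=8).digest(), "big")
        for bit in range(64):
            votes[bit] += 1 if (h >> bit) & 1 else -1
    # Each output bit is the majority vote of that bit across all shingles.
    return sum(1 << bit for bit in range(64) if votes[bit] > 0)


def order_by_similarity(files: dict[str, bytes]) -> list[str]:
    """Sort file names by the SimHash of their contents (name breaks ties)."""
    return sorted(files, key=lambda name: (simhash(files[name]), name))


def compressed_size(files: dict[str, bytes], order: list[str]) -> int:
    """Size in bytes of the files concatenated in `order`, after zlib level 9."""
    return len(zlib.compress(b"".join(files[name] for name in order), 9))
```

Comparing compressed_size for order_by_similarity(files) against a plain sort by path shows whether the reordering helps for a given set of files. The 32 KiB window makes the effect easy to see: two identical 20 KiB files of otherwise incompressible data can only match each other when they are adjacent, not when a third 20 KiB file sits between them.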