Sorting Files for Better Compression
Created on 2024-08-30T16:33:44-05:00
Sorting files by similarity hash increases the odds a (small window) compression algorithm will succeed at matching more repetitive data.
The example has many copies of Audacity projects where audio is mostly the same but the files are repeated for every copy of the project.