Systematically benchmark compression algorithm, compression factor, block size #44

probonopd · 2024-08-08T02:48:13Z

Execute a systematic and reproducible benchmark to find the optimal combination of

Compression algorithm
Compression factor
Block size

in terms of

File size
Applicartion startup time
zsync efficiency (delta update size)

for

zlib
lzma
zstandard
dwarfs
libdeflate

References:

probonopd · 2024-08-08T02:50:56Z

This test matrix includes 5 compression algorithms (zlib, lzma, zstandard, dwarfs, and libdeflate), 3 compression factors (low, mid, high), and 3 block sizes (256KB, 512KB, and 1MB), resulting in a total of 45 test cases.

Compression Algorithm	Compression Factor	Block Size
squashfs with zlib	6	256KB
squashfs with zlib	6	512KB
squashfs with zlib	6	1MB
squashfs with zlib	9	256KB
squashfs with zlib	9	512KB
squashfs with zlib	9	1MB
squashfs with lzma	0 (fast)	256KB
squashfs with lzma	0 (fast)	512KB
squashfs with lzma	0 (fast)	1MB
squashfs with lzma	6 (normal)	256KB
squashfs with lzma	6 (normal)	512KB
squashfs with lzma	6 (normal)	1MB
squashfs with lzma	9 (ultra)	256KB
squashfs with lzma	9 (ultra)	512KB
squashfs with lzma	9 (ultra)	1MB
squashfs with zstandard	3	256KB
squashfs with zstandard	3	512KB
squashfs with zstandard	3	1MB
squashfs with zstandard	10	256KB
squashfs with zstandard	10	512KB
squashfs with zstandard	10	1MB
squashfs with zstandard	19	256KB
squashfs with zstandard	19	512KB
squashfs with zstandard	19	1MB
dwarfs	128	256KB
dwarfs	128	512KB
dwarfs	128	1MB
dwarfs	256	256KB
dwarfs	256	512KB
dwarfs	256	1MB
libdeflate	6	256KB
libdeflate	6	512KB
libdeflate	6	1MB
libdeflate	9	256KB
libdeflate	9	512KB
libdeflate	9	1MB

probonopd · 2024-08-08T02:55:51Z

Volunteers?

Samueru-sama · 2024-08-08T05:15:05Z

Something that needs to be considered as well is to take into account how long it normally takes for the same application not as an appimage to start.

For example we might see that on a very big application a certain algo is 30% faster, but that application even when not being an appimage due to its size takes several seconds to start anyway, and that 30% ends up being a very small percentage of the overall delay for the app.

Same way for very small applications, the speed difference might not matter much, because they are very small and take no time regardless.

Where problems can happen is with the mid size applications that you normally expect to start fast, those are web-browsers in other words.

Right now zstd with the current default block size is actually very good, I will do some benchmarks comparing the size and startup times, however I can't measure zsync efficiency since that seems quite a bit more work.

I'm also interested to know how this affects very old hardware, IE, some pre sandy bridge cpu for example, my hardware is from 2016 and not the worst I would say lol.

probonopd · 2024-08-09T08:26:38Z

Application startup times also depend on the hardware. On systems with slow disk but fast CPU, a highly compressed image may lead to faster application launch times than uncompressed files (iirc, I have seen this myself with a large application in the past, likely with a spinning drive). It always depends where the performance bottleneck is on a particular system.

So for this to be really scientific, we'd have to execute the test matrix for typical defined machines.

But then, we are not exactly writing a dissertation here ;-)

Drsheppard01 · 2024-09-30T01:55:13Z

I'm probably a bit late, but I think EROFS is an interesting option as well.

wide kernel support (since 5.4, Ubuntu 20.04, Alpine since 3.11)
fuse support,
compression:
- zstd,
- lzma,
- lz4

dwarfs is incredibly performant, but gpl3 makes it impossible to use with proprietary programs packaged in appimage. At the same time, AppImage packaging is used by large companies, so the introduction of dwarfs will cut off a significant part of the audience

probonopd added the help wanted label Aug 8, 2024

probonopd mentioned this issue Aug 10, 2024

Default to a sensible zstd compression level AppImage/appimagetool#42

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Systematically benchmark compression algorithm, compression factor, block size #44

Systematically benchmark compression algorithm, compression factor, block size #44

probonopd commented Aug 8, 2024

probonopd commented Aug 8, 2024 •

edited

Loading

probonopd commented Aug 8, 2024

Samueru-sama commented Aug 8, 2024

probonopd commented Aug 9, 2024

Drsheppard01 commented Sep 30, 2024

Systematically benchmark compression algorithm, compression factor, block size #44

Systematically benchmark compression algorithm, compression factor, block size #44

Comments

probonopd commented Aug 8, 2024

probonopd commented Aug 8, 2024 • edited Loading

probonopd commented Aug 8, 2024

Samueru-sama commented Aug 8, 2024

probonopd commented Aug 9, 2024

Drsheppard01 commented Sep 30, 2024

probonopd commented Aug 8, 2024 •

edited

Loading