0xakhil on April 1, 2023 \| on: Llama.cpp 30B runs with only 6GB of RAM now

Some OSes use zram to compress unpinned pages in memory instead of swapping them to disk. Decompressing those pages might be faster than fetching them again from disk. I wonder if this is one reason why folks see different results.
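
For anyone comparing numbers, here is a rough way to check whether zram-backed swap is active on a given machine. This is only a sketch and assumes Linux, where active swap devices are listed in `/proc/swaps` and zram devices usually show up as `/dev/zramN`. The helper name `zram_swap_devices` is made up for illustration.

```python
from pathlib import Path


def zram_swap_devices(swaps_text):
    """Return active swap devices whose name looks like a zram device.

    `swaps_text` is the content of /proc/swaps: one header row, then one
    row per swap device with the device path in the first column.
    """
    devices = []
    for line in swaps_text.splitlines()[1:]:  # skip the header row
        fields = line.split()
        # Assumption: zram swap devices are named /dev/zram0, /dev/zram1, ...
        if fields and fields[0].startswith("/dev/zram"):
            devices.append(fields[0])
    return devices


if __name__ == "__main__":
    swaps = Path("/proc/swaps")
    if swaps.exists():
        found = zram_swap_devices(swaps.read_text())
        print(found if found else "no zram swap active")
    else:
        print("/proc/swaps not found (probably not Linux)")
```

If this prints a device on one machine and nothing on another, that would be one difference worth noting when comparing results, though it does not by itself show that zram explains them.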