Using libjpeg (or libjpeg-turbo) to do JPEG encoding, I was wondering if there is any improvements providing multiple scanlines at once to the jpeg_write_scanlines function. I did some tests on 720x288 images, but I only get 0,5% increase when processing the whole image at once.
I guess this increase is just due to the removal of call stack overhead, but I was expecting a bit more, at least with libjpeg-turbo.
The performance test was run with Callgrind (in Valgrind), so maybe I'm missing something. Or I really misunderstood how JPEG encoder works.