In my experience, LLMs tend to take noticeably longer to process images than tex...

weird-eye-issue · 2025-11-08T16:47:25 1762620445

It has to fetch the image data first, so that's basically just I/O time before any processing starts.

ashed96 · 2025-11-09T06:50:11 1762671011

IIRC there's pre-processing (embedding/tokenization?) before feeding images to LLMs?

Hit this issue while optimizing LLM request times. Ended up lowering the image resolution. Lost some accuracy, but could live with that.
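For anyone trying the same thing, here's a rough sketch of how the target size for a downscaled image could be picked before sending it. This is only an illustration, not what any particular provider does: `fit_within` and the `max_side` cap are hypothetical names, and the right cap depends on the model and on how much accuracy you can give up.

```python
def fit_within(width: int, height: int, max_side: int) -> tuple[int, int]:
    """Scale (width, height) down so the longer side is at most max_side.

    Aspect ratio is preserved (up to integer rounding). Images that already
    fit are returned unchanged; this never upscales.
    """
    if width <= 0 or height <= 0 or max_side <= 0:
        raise ValueError("width, height and max_side must be positive")

    longest = max(width, height)
    if longest <= max_side:
        return width, height

    # Integer round-half-up of side * max_side / longest, clamped to >= 1 px.
    def scale(side: int) -> int:
        return max(1, (side * max_side + longest // 2) // longest)

    return scale(width), scale(height)


if __name__ == "__main__":
    # Hypothetical 4000x3000 photo capped at 1024 px on the longer side.
    new_w, new_h = fit_within(4000, 3000, 1024)
    print(new_w, new_h)
```

The actual resize would then be done with whatever image library you already use, passing it the computed size. Fewer pixels usually means less data to upload and less for the model to pre-process, which is where the time savings would come from.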

psadri · 2025-11-08T16:17:35 1762618655

I wonder if these stay in the prefix cache?