Google Gemini is picky about file formats
The current forms of LLMs are reached by an API call, and to make them easier to work with they are supposed to be stateless. That also means they are not more clever than the people that programmed it.
That means that Google Gemini only supports png, jpeg and webp. That is peculiar. 200 IQ but can not read common images like .gif, .heic, .tiff
That gives us two options:
- Predictable: convert it before giving it to the LLM
- Unpredictable: give the LLM access to file storage and have it use tools to convert it itself