The Feature
The infinity serving engines offers interesting embedding models (among which CLIP models). The rerank feature is already implemented in the proxy, it would be nice to have the embedding feature as well
Motivation, pitch
I need self-hosted multimodal embeddings, which infinity is great at providing
Are you a ML Ops Team?
No
Twitter / LinkedIn details
No response