Do you think https://github.com/openai/CLIP can be ran on it? LLM makes me think...

smarterclayton · 2025-05-20T15:41:30 1747755690

Inference is the process of evaluating a model ("inferring" a response to the inputs). LLMs are uniquely difficult to serve because they push the limits on the hardware.

The models we support come from the model server vLLM https://docs.vllm.ai/en/latest/models/supported_models.html, which has a focus on large generative models. I don't see CLIP in the list.