Deploying any model from HuggingFace Hub using optimized model servers
LLMs (text-generation) and VLMs (image-text-to-text)
Embedding (sentence-similarity and feature-extraction)
sentence-transformers
library we use Text-Embeddings Inference or Infinity to deploy these models.You can use OpenAI
SDK to generate embeddings from the model.Ranking (sentence-similarity and feature-extraction)
sentence-transformers
library we use Text-Embeddings Inference or Infinity to deploy these models.Text Classification (text-classification)
Zero-Shot Classification (zero-shot-classification)
Token Classification (token-classification)
Fill Mask (fill-mask)
[MASK]
Summarization (summarization)
Translation (translation)
Image Generation (text-to-image)
Image Classification (image-classification)
Object Detection (object-detection)
Image to Text (image-to-text)
Automatic Speech Recognition (automatic-speech-recognition)