TrueFoundry AI Gateway supports various types of multimodal inputs, allowing you to work with different data formats including images, audio, and video.
For audio inputs, you can send audio files in supported formats (MP3, WAV, etc.). Please make sure that the model supports audio inputs, otherwise the request will fail. Audio inputs in chat completions are currently supported for Google Gemini models.