File type, then pass it alongside text in a list:
Use vision-capable models for audio processing. Check Model Capabilities to see which models support audio input.
Key Features
- Automatic Processing: Audio files are automatically transcribed to text
- Speech Analysis: Extract content, sentiment, and insights from audio recordings
- File Support: Works with local files, URLs, and base64 data