from definable.media import Audiooutput = await agent.arun( "Transcribe this audio.", audio=[Audio(filepath="/path/to/audio.mp3")],)
Most models do not support raw audio input. Use audio_transcriber=True on the agent to automatically transcribe audio to text before the model sees it.
from definable.media import Videooutput = await agent.arun( "Describe what happens in this video.", videos=[Video(url="https://example.com/video.mp4")],)