Back to Agno

Test Log - _11_audio_transcription

cookbook/data_labeling/_11_audio_transcription/TEST_LOG.md

2.6.8697 B
Original Source

Test Log - _11_audio_transcription

Tested 2026-05-17 against gemini-3-flash-preview (Gemini), agno 2.6.6.

basic.py

Status: PASS

Description: Verbatim transcript from QA-01.mp3 (English Q&A).

Result: Clean transcript of the full clip.


with_diarization.py

Status: PASS

Description: Transcript split into speaker turns over sample_conversation.wav.

Result: Two speakers identified with five turns; speaker labels are consistent across turns.


with_timestamps.py

Status: PASS

Description: Transcript with [start, end] second timestamps per segment.

Result: Segments returned with monotonically increasing timestamps.