Model Sizes
| Model | Speed | Accuracy |
|---|---|---|
| tiny | Fastest | Excellent (default) |
| base | Fast | Excellent |
| small | Medium | Superior |
| medium | Slow | Outstanding |
| large | Slowest | Maximum |
Example
Request:
Example: Transcribe a specific section
Use start and duration to transcribe only a portion of the file, with no manual ffmpeg trimming needed.
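Since start and duration are both given in seconds, a section marked by timestamps has to be converted first. The sketch below shows one way to do that arithmetic. The helper name section_args and the file name podcast.mp3 are hypothetical and not part of Augent; only the audio_path, start, and duration parameter names come from this page.

```python
def section_args(audio_path, start_ts, end_ts):
    """Build start/duration arguments (in seconds) from 'mm:ss' or 'hh:mm:ss' timestamps.

    Hypothetical client-side helper; only the returned keys are
    parameter names documented for transcribe_audio.
    """
    def to_seconds(ts):
        seconds = 0
        for part in ts.split(":"):
            seconds = seconds * 60 + int(part)
        return seconds

    start = to_seconds(start_ts)
    end = to_seconds(end_ts)
    if end <= start:
        raise ValueError("end timestamp must come after start timestamp")
    return {"audio_path": audio_path, "start": start, "duration": end - start}


# Transcribe from 12:30 to 15:00 of a (hypothetical) podcast file.
args = section_args("podcast.mp3", "12:30", "15:00")
print(args)  # {'audio_path': 'podcast.mp3', 'start': 750, 'duration': 150}
```

The resulting dictionary can be passed as the tool arguments in place of trimming the file by hand.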
Example: Export to file
When output is provided, the transcription is written to disk and output_path is added to the response. Use .xlsx for styled spreadsheets with bold headers, or .csv for plain data.
Parameters
| Parameter | Required | Default | Description |
|---|---|---|---|
| audio_path | Yes | — | Path to the audio file |
| model_size | No | tiny | Whisper model size |
| start | No | 0 | Start transcription at this many seconds into the audio |
| duration | No | full file | Only transcribe this many seconds of audio |
| output | No | — | File path to save transcription (.csv or .xlsx) |
| translated_text | No | — | English translation to store alongside the original. Used after translating a non-English transcription. |
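The parameter table can be mirrored in a small client-side check before a call is made. This is a minimal sketch under stated assumptions: build_transcribe_args is a hypothetical helper, the validation rules are inferred from the table and the Model Sizes list, and the tool itself may accept or reject values differently.

```python
from pathlib import Path

# Values taken from the Model Sizes table and the output parameter description.
MODEL_SIZES = ("tiny", "base", "small", "medium", "large")
OUTPUT_SUFFIXES = (".csv", ".xlsx")


def build_transcribe_args(audio_path, model_size="tiny", start=0,
                          duration=None, output=None, translated_text=None):
    """Assemble an argument dict for transcribe_audio (hypothetical helper)."""
    if model_size not in MODEL_SIZES:
        raise ValueError(f"model_size must be one of {MODEL_SIZES}")
    if start < 0:
        raise ValueError("start must be zero or positive")
    if duration is not None and duration <= 0:
        raise ValueError("duration must be positive")
    if output is not None and Path(output).suffix.lower() not in OUTPUT_SUFFIXES:
        raise ValueError("output must end in .csv or .xlsx")

    args = {"audio_path": audio_path, "model_size": model_size, "start": start}
    # Optional parameters are left out unless set, so the defaults apply.
    for name, value in (("duration", duration),
                        ("output", output),
                        ("translated_text", translated_text)):
        if value is not None:
            args[name] = value
    return args


print(build_transcribe_args("talk.mp3", model_size="small", output="talk.xlsx"))
```

Leaving duration unset corresponds to the documented default of transcribing the full file.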
Multilingual
Augent transcribes audio in its original language, such as Chinese, French, Spanish, or Japanese. Translation to English is handled by Claude, which produces far better results than any local translation model. When the transcription language is not English, translation works as follows:
- transcribe_audio returns the original-language transcription with translation_available: true
- Claude translates the text
- Claude calls transcribe_audio again with the same audio_path and translated_text containing the English translation
- A sibling (eng) markdown file is created in memory alongside the original
Both files can then be found with search_memory.
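The two-call translation flow described above can be sketched as follows. Everything here is a stand-in: fake_transcribe_audio and fake_translate simulate the tool and Claude's translation step, and the text response field is an assumed name. Only audio_path, translated_text, and translation_available come from this page.

```python
def transcribe_with_translation(transcribe_audio, translate, audio_path):
    """Transcribe, translate if needed, then store the translation."""
    first = transcribe_audio(audio_path=audio_path)
    if not first.get("translation_available"):
        return first  # nothing to translate or store
    english = translate(first["text"])  # "text" is an assumed field name
    # Second call: same audio_path, plus the English translation.
    return transcribe_audio(audio_path=audio_path, translated_text=english)


# Stand-ins for the real tool and for the translation step (both hypothetical).
calls = []


def fake_transcribe_audio(audio_path, translated_text=None):
    calls.append({"audio_path": audio_path, "translated_text": translated_text})
    if translated_text is None:
        return {"text": "Bonjour le monde", "translation_available": True}
    return {"text": "Bonjour le monde", "translated_text": translated_text}


def fake_translate(text):
    return {"Bonjour le monde": "Hello world"}[text]


result = transcribe_with_translation(fake_transcribe_audio, fake_translate, "clip.mp3")
print(len(calls), result["translated_text"])  # 2 Hello world
```

Passing the tool and the translator in as functions keeps the sketch independent of how the calls are actually dispatched.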
Memory
- Transcriptions are stored by file content hash + model size
- Same file, same model = instant memory hit
- Same file, different model = new transcription
- Modified file = new transcription (hash changes)
- A markdown file is also saved to ~/.augent/memory/transcriptions/
- Translated transcriptions get a sibling (eng) file (e.g., My Video.md + My Video (eng).md)
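The memory rules above follow from keying each entry on the file's content hash together with the model size. The sketch below illustrates that behavior under assumptions: SHA-256 and the key layout are guesses (the page only says "file content hash"), and memory_key and eng_sibling_name are hypothetical helpers, not Augent functions.

```python
import hashlib
import tempfile
from pathlib import Path


def memory_key(audio_path, model_size):
    """Combine a content hash with the model size (hash algorithm assumed)."""
    digest = hashlib.sha256(Path(audio_path).read_bytes()).hexdigest()
    return f"{digest}:{model_size}"


def eng_sibling_name(markdown_name):
    """Derive the sibling name for a translated transcription."""
    p = Path(markdown_name)
    return f"{p.stem} (eng){p.suffix}"


with tempfile.TemporaryDirectory() as tmp:
    audio = Path(tmp) / "clip.wav"
    audio.write_bytes(b"original audio bytes")
    key_tiny = memory_key(audio, "tiny")
    key_tiny_again = memory_key(audio, "tiny")   # same file, same model
    key_base = memory_key(audio, "base")         # same file, different model
    audio.write_bytes(b"edited audio bytes")
    key_modified = memory_key(audio, "tiny")     # modified file

print(key_tiny == key_tiny_again, key_tiny == key_base, key_tiny == key_modified)
print(eng_sibling_name("My Video.md"))  # My Video (eng).md
```

Because the key depends on the bytes rather than the path, renaming a file would still hit memory in this sketch, while any edit to the audio would not.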

