Conversation
| agentLoop.SetMediaStore(mediaStore) | ||
|
|
||
| // Wire up voice transcription if Groq API key is available | ||
| groqAPIKey := cfg.Providers.Groq.APIKey |
There was a problem hiding this comment.
Longer term, we may want to consider multiple transcribe providers and how can we prioritise them. For now this works and address an issue, I'm happy with it.
There was a problem hiding this comment.
@xiaket let me address this then slightly more elegantly
nikolasdehor
left a comment
There was a problem hiding this comment.
LGTM. This is a well-structured fix that moves voice transcription from channel-specific (Telegram only) to agent-level, making it work across all channels.
Key improvements:
- Transcriber interface with Name() and Transcribe() methods -- clean abstraction
- DetectTranscriber auto-detects Groq from either direct provider config or model_list entries
- Agent-level transcription via transcribeAudioInMessage replaces audio annotations [voice] with transcribed text
- Falls back gracefully: if transcription fails, the annotation is left as-is (empty string appended)
- README updates across all translations correctly reflect the change
The regex-based annotation replacement is clean: audioAnnotationRe matches [voice] and [audio:*] patterns, and transcriptions are applied in order. Remaining transcriptions (more audio than annotations) are appended with newlines.
Test coverage is thorough: interface satisfaction, DetectTranscriber with various configs, actual transcription with mocked HTTP server, API errors, and missing files. Good work.
Fix voice transcription
|
@dim Thanks for fixing the voice transcription issue! Audio handling bugs can be tough to track down, glad this one got sorted out. We have a PicoClaw Dev Group on Discord where contributors connect and share ideas. If you'd like to join, send an email to |
Fix voice transcription
|
@Orgmar thanks, that's kind. I am a bit busy at the moment but would be very happy to join in a few weeks time. Will email |
Fix voice transcription
Fix voice transcription
📝 Description
Addresses #945
🗣️ Type of Change
🤖 AI Code Generation
🔗 Related Issue
Fixes #945
📚 Technical Context (Skip for Docs)
🧪 Test Environment
📸 Evidence (Optional)
☑️ Checklist