Issue
Add voice input capability to HumanCLI so users can speak commands instead of typing them. Validate that it works on the Unitree Go2 platform.
Requirements
- Implement speech-to-text using Python audio libraries (e.g.,
pyaudio, speech_recognition, or similar)
- Add directly to HumanCLI module (
dimos/agents/cli/human.py)
- Support microphone input on Go2
- Handle audio device selection/configuration
- Test on actual Go2 hardware
Implementation Considerations
- Use lightweight STT that runs on-device or can call external API
- Handle noise/background audio on robot
- Provide fallback to typed input if voice fails
- Toggle for enabling/disabling voice input
Acceptance Criteria
Related
Issue
Add voice input capability to HumanCLI so users can speak commands instead of typing them. Validate that it works on the Unitree Go2 platform.
Requirements
pyaudio,speech_recognition, or similar)dimos/agents/cli/human.py)Implementation Considerations
Acceptance Criteria
Related