A Voice + CLI Driven Multi-Agent Operating System with Autonomous AI Capabilities
VoiceOS is a next-generation AI operating system interface that combines:
- π€ Real-time voice interaction
- π§ Multi-agent reasoning system
- π€ Autonomous agent execution
- π Permission-based safety architecture
- π³ Docker-based isolated runtime
VoiceOS evolves beyond traditional assistants into a:
Voice-Controlled, Multi-Agent, Autonomous AI System
- Real-time speech input (STT)
- Streaming responses (TTS)
- CLI fallback for development and control
- Core agents (Planner, Router, Safety)
- Dynamic agents (YAML-defined roles)
- Autonomous agent loop (goal-driven execution)
- Iterative reasoning (think β act β observe)
- Tool generation
- Code execution in sandbox
- Multi-step workflow automation
- Search β Fetch β Analyze β Summarize
- Multi-source reasoning
- Generate code
- Edit files
- Execute scripts
- Debug and iterate
- Open applications
- File operations
- OS control (safe and permission-based)
- Explicit user approval required
- Sandboxed execution
- Full logging of actions
graph TD
A[Voice/CLI Input] --> B[Planner]
B --> C{Task Type}
C -->|Simple| D[Tool Execution]
C -->|Complex| E[Dynamic Agent]
C -->|Autonomous| F[Autonomous Agent]
F --> G[Agent Loop]
G --> H[Tool Generator]
G --> I[Tool Executor]
H --> J[Workspace Sandbox]
I --> K[Safety + Permission]
K --> L[Execution]
L --> G
G --> M[Final Output]
M --> N[TTS / CLI Output]
| Mode | Description |
|---|---|
| Simple | Direct tool execution |
| Complex | Dynamic agent execution |
| Autonomous | Iterative agent loop |
VoiceOS/
βββ agents/ # Multi-agent system
β βββ core/ # Core agents (Planner, Router, Safety)
β βββ autonomous/ # Autonomous agent loop
β βββ dynamic/ # Dynamic agent roles
β βββ roles/ # YAML-defined agent roles
βββ tools/ # Native VoiceOS tools
β βββ file_tools/ # File operations
β βββ web_tools/ # Web browsing & scraping
β βββ code_tools/ # Code execution
β βββ document_tools/ # Document processing
β βββ scheduler_tools/ # Task scheduling
βββ core/ # Core system components (restructured)
β βββ config.py # Configuration management
β βββ logger.py # Logging system
β βββ event.py # Event system
β βββ security.py # Security system
β βββ orchestrator.py # System orchestrator
β βββ config_manager.py # Configuration manager
β βββ plugins/ # Plugin system (8 modules)
β β βββ secure_plugin_integration.py
β β βββ plugin_lifecycle.py
β β βββ plugin_registry.py
β β βββ plugin_configuration.py
β β βββ plugin_error_handling.py
β β βββ plugin_monitoring.py
β β βββ plugin_testing.py
β β βββ complete_plugin_integration.py
β βββ helpers/ # Helper system (4 modules)
β β βββ secure_helper_integration.py
β β βββ helper_bridge_integration.py
β β βββ helper_extension_discovery.py
β β βββ helper_extension_monitoring.py
β βββ extensions/ # Extension system (2 modules)
β β βββ secure_extension_integration.py
β β βββ extension_point_system.py
β βββ integration/ # Integration framework (2 modules)
β β βββ integration_patterns.py
β β βββ controlled_execution.py
β βββ monitoring/ # Monitoring system (2 modules)
β β βββ performance_monitor.py
β β βββ error_recovery.py
β βββ events/ # Event system (3 modules)
β β βββ event_bus.py
β β βββ event_handlers.py
β β βββ events.py
β βββ cli/ # CLI system (2 modules)
β β βββ voice_cli_integration.py
β β βββ response_builder.py
β βββ pipelines/ # Pipeline system (1 module)
β β βββ stream_pipeline.py
β βββ system/ # System management (2 modules)
β βββ system_verification.py
β βββ unified_integration_dashboard.py
βββ permissions/ # Permission & safety system
βββ audio/ # Voice processing
βββ llm/ # LLM integration
βββ memory/ # Memory management
βββ plugins/ # Plugin system
βββ workspace/ # Workspace management
βββ frontend/ # Web interface
βββ docs/ # Documentation
βββ models/ # AI models
VoiceOS features a comprehensive integration framework with restructured core components:
- Secure Plugin Integration: Security-first plugin loading and validation
- Plugin Lifecycle: Complete plugin state management (DISCOVERED β ACTIVE β SUSPENDED)
- Plugin Registry: Centralized plugin discovery and registration
- Plugin Configuration: Multi-scope configuration management
- Plugin Error Handling: Comprehensive error recovery and reporting
- Plugin Monitoring: Real-time performance and health monitoring
- Plugin Testing: Built-in security and compatibility testing
- Secure Helper Integration: Categorized helper function management
- Helper Bridge Integration: VoiceOS tool bridging with multiple modes
- Helper Extension Discovery: Background discovery and validation
- Helper Extension Monitoring: System-wide helper metrics
- Secure Extension Integration: Extension type management and security
- Extension Point System: Hook-based extension with decorators
- Extension Decorators: Easy-to-use decorators for common extension points
@before_tool_execution,@after_tool_execution@before_llm_request,@after_llm_response@data_processing,@user_input_validation@error_handling,@logging_decorator
- Integration Patterns: Standardized integration approaches
- Controlled Execution: Sandboxed execution with resource limits
- Performance Monitoring: Real-time system performance tracking
- Error Recovery: Automatic error detection and recovery
- Integration Dashboard: Centralized management interface
- System Verification: Automated system health checks
- Real-time Monitoring: Live system metrics and status
docker build -t voiceos .
docker run -it voiceos# Install dependencies
pip install -r requirements.txt
# Run VoiceOS
python main.py
# Or with specific configuration
VOICEOS_CONFIG=dev python main.py"Open Chrome"
"Search latest AI research"
"Write a Python script to scrape data"
"Analyze this dataset"
"Automate this workflow"
All actions follow:
Agent β Safety β Permission β Execution
- Native VoiceOS tools integration
- Permission-based safety system
- Multi-agent execution modes
- Advanced multi-agent collaboration
- Enhanced GUI interface
- Plugin marketplace
- Distributed agent execution
- Voice-controlled IDE integration
- Real-time collaboration features
Contributions welcome.
MIT License