AjayRajan05/VoiceOS

🎙️ VoiceOS

A Voice + CLI Driven Multi-Agent Operating System with Autonomous AI Capabilities


🚀 Overview

VoiceOS is a next-generation AI operating system interface that combines:

  • 🎤 Real-time voice interaction
  • 🧠 Multi-agent reasoning system
  • 🤖 Autonomous agent execution
  • 🔐 Permission-based safety architecture
  • 🐳 Docker-based isolated runtime

VoiceOS goes beyond traditional assistants to act as a voice-controlled, multi-agent, autonomous AI system.


✨ Key Features

🎤 Voice + CLI Interaction

  • Real-time speech input (STT)
  • Streaming responses (TTS)
  • CLI fallback for development and control

🧠 Hybrid Multi-Agent System

  • Core agents (Planner, Router, Safety)
  • Dynamic agents (YAML-defined roles)
  • Autonomous agent loop (goal-driven execution)

🤖 Autonomous Agent Mode

  • Iterative reasoning (think → act → observe)
  • Tool generation
  • Code execution in sandbox
  • Multi-step workflow automation
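
The think → act → observe cycle listed above can be sketched as a bounded loop. This is a minimal illustration, not the VoiceOS implementation: `AgentLoop`, `scripted_think`, and the `"finish"` convention are hypothetical names, and the `think` function stands in for what would presumably be an LLM call.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple

# Hypothetical type for illustration; not taken from the VoiceOS codebase.
Action = Tuple[str, str]  # (tool name, argument)


@dataclass
class AgentLoop:
    think: Callable[[str, List[str]], Action]  # goal + observations -> next action
    tools: Dict[str, Callable[[str], str]]     # tool name -> callable
    max_steps: int = 5                         # hard cap so the loop always terminates
    observations: List[str] = field(default_factory=list)

    def run(self, goal: str) -> str:
        for _ in range(self.max_steps):
            tool, arg = self.think(goal, self.observations)  # think
            if tool == "finish":
                return arg
            result = self.tools[tool](arg)                   # act
            self.observations.append(f"{tool}({arg}) -> {result}")  # observe
        return "stopped: step limit reached"


def scripted_think(goal: str, observations: List[str]) -> Action:
    # Stand-in for an LLM call: act once, then finish with the last observation.
    if not observations:
        return ("echo", goal)
    return ("finish", observations[-1])


loop = AgentLoop(think=scripted_think, tools={"echo": lambda text: text.upper()})
print(loop.run("list files"))
```

The step cap is the important design choice here: a goal-driven loop with no upper bound can run indefinitely if the reasoning step never decides to finish.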

🔎 Web Research Engine

  • Search → Fetch → Analyze → Summarize
  • Multi-source reasoning
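
As a rough sketch of how the Search → Fetch → Analyze → Summarize stages might compose, the example below chains four small functions over an in-memory stand-in for the web. Everything here is assumed for illustration: `FAKE_WEB`, the function names, and the keyword-matching "analysis" are placeholders, and a real engine would call a search API, an HTTP client, and an LLM instead.

```python
from typing import Dict, List

# In-memory stand-in for the web so the sketch runs offline (hypothetical data).
FAKE_WEB: Dict[str, str] = {
    "https://example.com/a": "Agents plan tasks. Agents call tools.",
    "https://example.com/b": "Sandboxes isolate code. Agents run code in sandboxes.",
}


def search(query: str) -> List[str]:
    """Return URLs whose text mentions the query."""
    return [url for url, text in FAKE_WEB.items() if query.lower() in text.lower()]


def fetch(url: str) -> str:
    """Return the page text for a URL."""
    return FAKE_WEB[url]


def analyze(text: str, query: str) -> List[str]:
    """Keep only the sentences that mention the query."""
    return [s.strip() for s in text.split(".") if query.lower() in s.lower()]


def summarize(findings: List[str]) -> str:
    """Join the findings from all sources into one short summary."""
    return " ".join(f"{sentence}." for sentence in findings)


def research(query: str) -> str:
    findings: List[str] = []
    for url in search(query):  # multi-source: every matching page contributes
        findings.extend(analyze(fetch(url), query))
    return summarize(findings)


print(research("agents"))
```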

💻 Code Development Mode

  • Generate code
  • Edit files
  • Execute scripts
  • Debug and iterate

🛠️ System Automation

  • Open applications
  • File operations
  • OS control (safe and permission-based)

🔐 Safety & Permissions

  • Explicit user approval required
  • Sandboxed execution
  • Full logging of actions

🧠 Architecture

graph TD

A[Voice/CLI Input] --> B[Planner]

B --> C{Task Type}

C -->|Simple| D[Tool Execution]
C -->|Complex| E[Dynamic Agent]
C -->|Autonomous| F[Autonomous Agent]

F --> G[Agent Loop]
G --> H[Tool Generator]
G --> I[Tool Executor]

H --> J[Workspace Sandbox]
I --> K[Safety + Permission]

K --> L[Execution]
L --> G

G --> M[Final Output]
M --> N[TTS / CLI Output]

⚡ Execution Modes

| Mode | Description |
| --- | --- |
| Simple | Direct tool execution |
| Complex | Dynamic agent execution |
| Autonomous | Iterative agent loop |
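
One way to picture the three modes is a classifier that picks a mode and a dispatch table that routes to a handler. The sketch below is an assumption-heavy illustration: the keyword heuristic in `classify` merely stands in for the Planner, and `Mode`, `HANDLERS`, and `dispatch` are hypothetical names rather than VoiceOS APIs.

```python
from enum import Enum
from typing import Callable, Dict


class Mode(Enum):
    SIMPLE = "simple"          # direct tool execution
    COMPLEX = "complex"        # dynamic agent execution
    AUTONOMOUS = "autonomous"  # iterative agent loop


def classify(request: str) -> Mode:
    """Toy keyword heuristic standing in for the Planner's decision."""
    text = request.lower()
    if any(word in text for word in ("automate", "workflow")):
        return Mode.AUTONOMOUS
    if any(word in text for word in ("analyze", "write", "research")):
        return Mode.COMPLEX
    return Mode.SIMPLE


# Placeholder handlers; each would hand off to the matching subsystem.
HANDLERS: Dict[Mode, Callable[[str], str]] = {
    Mode.SIMPLE: lambda req: f"tool: {req}",
    Mode.COMPLEX: lambda req: f"dynamic agent: {req}",
    Mode.AUTONOMOUS: lambda req: f"agent loop: {req}",
}


def dispatch(request: str) -> str:
    return HANDLERS[classify(request)](request)


for command in ("Open Chrome", "Analyze this dataset", "Automate this workflow"):
    print(dispatch(command))
```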

📂 Project Structure

VoiceOS/

├── agents/                    # Multi-agent system
│   ├── core/                 # Core agents (Planner, Router, Safety)
│   ├── autonomous/           # Autonomous agent loop
│   ├── dynamic/              # Dynamic agent roles
│   └── roles/                # YAML-defined agent roles
├── tools/                    # Native VoiceOS tools
│   ├── file_tools/           # File operations
│   ├── web_tools/            # Web browsing & scraping
│   ├── code_tools/           # Code execution
│   ├── document_tools/       # Document processing
│   └── scheduler_tools/      # Task scheduling
├── core/                     # Core system components (restructured)
│   ├── config.py            # Configuration management
│   ├── logger.py            # Logging system
│   ├── event.py             # Event system
│   ├── security.py          # Security system
│   ├── orchestrator.py      # System orchestrator
│   ├── config_manager.py    # Configuration manager
│   ├── plugins/             # Plugin system (8 modules)
│   │   ├── secure_plugin_integration.py
│   │   ├── plugin_lifecycle.py
│   │   ├── plugin_registry.py
│   │   ├── plugin_configuration.py
│   │   ├── plugin_error_handling.py
│   │   ├── plugin_monitoring.py
│   │   ├── plugin_testing.py
│   │   └── complete_plugin_integration.py
│   ├── helpers/             # Helper system (4 modules)
│   │   ├── secure_helper_integration.py
│   │   ├── helper_bridge_integration.py
│   │   ├── helper_extension_discovery.py
│   │   └── helper_extension_monitoring.py
│   ├── extensions/          # Extension system (2 modules)
│   │   ├── secure_extension_integration.py
│   │   └── extension_point_system.py
│   ├── integration/         # Integration framework (2 modules)
│   │   ├── integration_patterns.py
│   │   └── controlled_execution.py
│   ├── monitoring/          # Monitoring system (2 modules)
│   │   ├── performance_monitor.py
│   │   └── error_recovery.py
│   ├── events/              # Event system (3 modules)
│   │   ├── event_bus.py
│   │   ├── event_handlers.py
│   │   └── events.py
│   ├── cli/                 # CLI system (2 modules)
│   │   ├── voice_cli_integration.py
│   │   └── response_builder.py
│   ├── pipelines/           # Pipeline system (1 module)
│   │   └── stream_pipeline.py
│   └── system/              # System management (2 modules)
│       ├── system_verification.py
│       └── unified_integration_dashboard.py
├── permissions/              # Permission & safety system
├── audio/                    # Voice processing
├── llm/                      # LLM integration
├── memory/                   # Memory management
├── plugins/                  # Plugin system
├── workspace/                # Workspace management
├── frontend/                 # Web interface
├── docs/                     # Documentation
└── models/                   # AI models

Core Integration Systems

VoiceOS features a comprehensive integration framework with restructured core components:

🔌 Plugin System

  • Secure Plugin Integration: Security-first plugin loading and validation
  • Plugin Lifecycle: Complete plugin state management (DISCOVERED → ACTIVE → SUSPENDED)
  • Plugin Registry: Centralized plugin discovery and registration
  • Plugin Configuration: Multi-scope configuration management
  • Plugin Error Handling: Comprehensive error recovery and reporting
  • Plugin Monitoring: Real-time performance and health monitoring
  • Plugin Testing: Built-in security and compatibility testing
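
The plugin lifecycle named above (DISCOVERED → ACTIVE → SUSPENDED) reads naturally as a small state machine. The following is a sketch under stated assumptions: only the three state names come from this README, while the `Plugin` class, the `ALLOWED` table, and the choice to let a suspended plugin resume are guesses made for illustration.

```python
from enum import Enum, auto


class PluginState(Enum):
    DISCOVERED = auto()
    ACTIVE = auto()
    SUSPENDED = auto()


# Assumed transition table; allowing SUSPENDED -> ACTIVE (resume) is a guess.
ALLOWED = {
    PluginState.DISCOVERED: {PluginState.ACTIVE},
    PluginState.ACTIVE: {PluginState.SUSPENDED},
    PluginState.SUSPENDED: {PluginState.ACTIVE},
}


class Plugin:
    """Hypothetical plugin record that only permits whitelisted transitions."""

    def __init__(self, name: str) -> None:
        self.name = name
        self.state = PluginState.DISCOVERED

    def transition(self, target: PluginState) -> None:
        if target not in ALLOWED[self.state]:
            raise ValueError(
                f"{self.name}: {self.state.name} -> {target.name} is not allowed"
            )
        self.state = target


plugin = Plugin("weather")
plugin.transition(PluginState.ACTIVE)
plugin.transition(PluginState.SUSPENDED)
print(plugin.state.name)
```

Keeping the legal transitions in one table makes it cheap to reject out-of-order changes, such as suspending a plugin that was never activated.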

🤝 Helper System

  • Secure Helper Integration: Categorized helper function management
  • Helper Bridge Integration: VoiceOS tool bridging with multiple modes
  • Helper Extension Discovery: Background discovery and validation
  • Helper Extension Monitoring: System-wide helper metrics

🔗 Extension System

  • Secure Extension Integration: Extension type management and security
  • Extension Point System: Hook-based extension with decorators
  • Extension Decorators: Easy-to-use decorators for common extension points
    • @before_tool_execution, @after_tool_execution
    • @before_llm_request, @after_llm_response
    • @data_processing, @user_input_validation
    • @error_handling, @logging_decorator

📊 Integration Framework

  • Integration Patterns: Standardized integration approaches
  • Controlled Execution: Sandboxed execution with resource limits
  • Performance Monitoring: Real-time system performance tracking
  • Error Recovery: Automatic error detection and recovery

🎛️ Unified Dashboard

  • Integration Dashboard: Centralized management interface
  • System Verification: Automated system health checks
  • Real-time Monitoring: Live system metrics and status

🐳 Docker Setup

docker build -t voiceos .
docker run -it voiceos

▶️ Run Locally

# Install dependencies
pip install -r requirements.txt

# Run VoiceOS
python main.py

# Or with specific configuration
VOICEOS_CONFIG=dev python main.py

🧪 Example Commands

"Open Chrome"
"Search latest AI research"
"Write a Python script to scrape data"
"Analyze this dataset"
"Automate this workflow"

🔐 Safety Model

All actions follow:

Agent → Safety → Permission → Execution
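
A minimal sketch of this chain, assuming each stage can veto the action before it runs. The deny-list, the `guarded_execute` function, and the callback-based permission prompt are hypothetical; the README states only the ordering of the stages and that actions are logged.

```python
from typing import Callable, List, Optional

# Illustrative deny-list only; a real safety agent would be far more thorough.
BLOCKED_PREFIXES = ("rm -rf", "format")


def safety_check(action: str) -> bool:
    """Return True when the proposed action passes the safety stage."""
    return not action.lower().startswith(BLOCKED_PREFIXES)


def guarded_execute(
    action: str,
    ask_permission: Callable[[str], bool],
    execute: Callable[[str], str],
    log: List[str],
) -> Optional[str]:
    """Run Agent -> Safety -> Permission -> Execution, logging every step."""
    log.append(f"proposed: {action}")        # agent proposes an action
    if not safety_check(action):             # safety stage
        log.append("blocked by safety check")
        return None
    if not ask_permission(action):           # explicit user approval
        log.append("denied by user")
        return None
    result = execute(action)                 # execution
    log.append(f"executed: {result}")
    return result


actions_log: List[str] = []
outcome = guarded_execute(
    "open chrome",
    ask_permission=lambda action: True,      # auto-approve for the demo
    execute=lambda action: f"ran {action}",
    log=actions_log,
)
print(outcome)
print(actions_log)
```

Passing the permission prompt in as a callback keeps the gate testable: a CLI could supply an interactive prompt, while a voice front end could supply a spoken confirmation.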

🚀 Roadmap

  • Native VoiceOS tools integration
  • Permission-based safety system
  • Multi-agent execution modes
  • Advanced multi-agent collaboration
  • Enhanced GUI interface
  • Plugin marketplace
  • Distributed agent execution
  • Voice-controlled IDE integration
  • Real-time collaboration features

🤝 Contributing

Contributions welcome.


📜 License

MIT License

About

VoiceOS: A real-time, voice-driven operating system interface powered by modular AI for speech, reasoning, web research, and OS automation.
