QuentinCrane/ShadowLite

Shadow Puppetry Digital Human

A lightweight, dual-agent interaction system for 2D shadow puppetry digital humans, powered by LLMs and designed for low-resource environments. 

Demo.mp4

Requirements

Before running the system, make sure the following components are ready:

  1. Pull the Gemma 3 (4B) model via Ollama:
ollama pull gemma3:4b
  2. Download the VOSK Chinese speech recognition model:
  • Go to VOSK Models
  • Download the model named vosk-model-cn-0.22
  • Place the unzipped folder inside the model-cn/ directory in this project:
project_root/
├── model-cn/
│   └── vosk-model-cn-0.22/  
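
The directory layout above can be verified before launching the system. The following is a minimal sketch, not part of the project itself: the helper names `vosk_model_path` and `check_vosk_model` are hypothetical, and the check only confirms that the folder exists and is non-empty, not that the model files are valid.

```python
from pathlib import Path

# Directory names taken from the layout shown above.
MODEL_PARENT = "model-cn"
MODEL_NAME = "vosk-model-cn-0.22"


def vosk_model_path(project_root: str) -> Path:
    """Return the expected location of the unzipped VOSK model (hypothetical helper)."""
    return Path(project_root) / MODEL_PARENT / MODEL_NAME


def check_vosk_model(project_root: str = ".") -> bool:
    """Return True if the model folder exists and contains at least one entry."""
    path = vosk_model_path(project_root)
    return path.is_dir() and any(path.iterdir())


if __name__ == "__main__":
    location = vosk_model_path(".")
    if check_vosk_model("."):
        print("VOSK model found at", location)
    else:
        print("VOSK model missing; expected it at", location)
```

Running this from the project root before `python main.py` should catch the common mistake of unzipping the model into the wrong folder.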

Run the Digital Human

To start the digital human interaction system with LLM functionality, run:

python main.py

How to Interact

Once the interface is running, you can interact in two modes:

Text Mode

  • Type Chinese text directly into the input box as a prompt.

Voice Mode

  • Click the Text Mode button to switch to voice interaction.
  • Press the Space key to start recording.
  • Press Space again to stop recording.
  • The recognized speech will appear in the top-left corner of the screen.

Troubleshooting

If the LLM does not respond, you may need to start the Ollama server manually by running ollama serve in a terminal.
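
To tell whether the Ollama server is the problem, a short check like the one below may help. It is a sketch under assumptions rather than part of the project: it assumes Ollama listens on its default local address `http://localhost:11434` and that its `/api/tags` endpoint returns a JSON object with a `models` list whose entries have a `name` field. The function names `list_local_models` and `diagnose` are hypothetical.

```python
import json
import urllib.error
import urllib.request

OLLAMA_URL = "http://localhost:11434"  # assumed default Ollama address
MODEL_NAME = "gemma3:4b"               # model pulled in the Requirements step


def list_local_models(base_url: str = OLLAMA_URL, timeout: float = 2.0):
    """Return the names of locally available models, or None if the server is unreachable."""
    try:
        with urllib.request.urlopen(f"{base_url}/api/tags", timeout=timeout) as resp:
            payload = json.load(resp)
    except (urllib.error.URLError, OSError, ValueError):
        # Connection refused, timeout, or a response that is not valid JSON.
        return None
    return [m.get("name", "") for m in payload.get("models", [])]


def diagnose(models, wanted: str = MODEL_NAME) -> str:
    """Turn the result of list_local_models into a human-readable hint."""
    if models is None:
        return "Ollama is not reachable; try running `ollama serve` in a terminal."
    if wanted not in models:
        return f"Ollama is running, but {wanted} is missing; run `ollama pull {wanted}`."
    return "Ollama is running and the model is available."


if __name__ == "__main__":
    print(diagnose(list_local_models()))
```

If the first message is printed, start the server with ollama serve and run python main.py again.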