AGI - Voice-Activated 3D Avatar

Voice-activated 3D avatar system with Hungarian language support.

Project Structure

C:\Users\USER\Desktop\projects\agi\
├── backend/                    # Express + TypeScript
│   ├── src/
│   │   ├── config/index.ts
│   │   ├── controllers/chat.controller.ts
│   │   ├── services/
│   │   │   ├── audio.service.ts    (Whisper STT + ElevenLabs TTS)
│   │   │   ├── llm.service.ts      (Ollama + LangChain)
│   │   │   └── lipsync.service.ts  (Rhubarb)
│   │   ├── middleware/
│   │   ├── types/
│   │   └── utils/
│   ├── bin/
│   │   ├── ffmpeg/
│   │   │   └── ffmpeg.exe          (Audio conversion)
│   │   └── rhubarb/
│   │       └── rhubarb.exe         (Lip sync generation)
│   ├── package.json
│   └── tsconfig.json
│
└── frontend/                   # React + R3F + TypeScript
    ├── src/
    │   ├── components/
    │   │   ├── Avatar.tsx          (3D avatar with lipsync)
    │   │   ├── Experience.tsx      (R3F scene)
    │   │   └── PushToTalk.tsx      (Status indicator)
    │   ├── hooks/
    │   │   ├── useChat.ts          (API + T key handling)
    │   │   └── useVoiceRecorder.ts (MediaRecorder)
    │   ├── types/
    │   └── utils/
    ├── package.json
    └── vite.config.ts

To Get Started

1. Backend:

cd C:\Users\USER\Desktop\projects\agi\backend
npm install
# Edit .env with your ElevenLabs API key
# Ensure Ollama is running with llama3 model
# Ensure Whisper CLI is installed (pip install openai-whisper)
# FFmpeg and Rhubarb binaries are included in ./bin/
npm run dev

2. Frontend:

cd C:\Users\USER\Desktop\projects\agi\frontend
npm install
# Add your avatar.glb to ./public/models/
npm run dev

3. Use:

Open http://localhost:5173
Hold T key to record voice
Release T to send to backend
Avatar responds with synced lipsync

Prerequisites

Ollama running with llama3 model
Python Whisper CLI (pip install openai-whisper)
ElevenLabs API key
ReadyPlayer Me avatar .glb file

Bundled Binaries

FFmpeg and Rhubarb are included in backend/bin/:

backend/bin/
├── ffmpeg/
│   └── ffmpeg.exe      # Audio conversion (MP3→WAV)
└── rhubarb/
    └── rhubarb.exe     # Lip sync generation

Verify paths in backend/.env:

FFMPEG_PATH=./bin/ffmpeg/ffmpeg.exe
RHUBARB_PATH=./bin/rhubarb/rhubarb.exe

Updating Binaries

FFmpeg: Download from https://ffmpeg.org/download.html

Rhubarb: Download from https://github.com/DanielSWolf/rhubarb-lip-sync/releases

Name		Name	Last commit message	Last commit date
Latest commit History 42 Commits
.claude		.claude
backend		backend
docs		docs
frontend		frontend
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
POC.md		POC.md
README.md		README.md
package-lock.json		package-lock.json
package.json		package.json
prd.md		prd.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AGI - Voice-Activated 3D Avatar

Project Structure

To Get Started

Prerequisites

Bundled Binaries

Updating Binaries

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

AGI - Voice-Activated 3D Avatar

Project Structure

To Get Started

Prerequisites

Bundled Binaries

Updating Binaries

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages