AGENTS.md

Guidance for AI assistants working with this repository.

Project Overview

MetaHuman Engine is a browser-native 3D digital human interaction engine built with React, TypeScript, Three.js, and Zustand.

OpenSpec Workflow

This project uses OpenSpec for spec-driven development. Always follow the spec-driven workflow for new features and changes.

Commands

Command	Description
`/opsx:propose "<idea>"`	Create a new change proposal with specs, design, and tasks
`/opsx:explore`	Investigate problems and clarify requirements
`/opsx:apply`	Implement the current change following the task checklist
`/opsx:archive`	Archive completed change and merge specs

Key Directories

Directory	Purpose
`openspec/specs/`	Source of truth for system behavior
`openspec/changes/`	Active change proposals
`openspec/changes/archive/`	Completed changes

Workflow

Before implementing new features, create a proposal using /opsx:propose
Review and refine the generated specs, design, and tasks
Implement following the task checklist with /opsx:apply
Archive when complete to merge specs with /opsx:archive

When to Create a Proposal

Create a proposal when:

Adding a new feature or capability
Changing existing behavior
Refactoring core systems
Introducing new dependencies

You can skip the proposal process for:

Bug fixes with clear cause
Documentation updates
Configuration changes
Test additions

Tech Stack

Framework: React 18 + TypeScript
Build: Vite 5
3D: Three.js + React Three Fiber + Drei
State: Zustand
Styling: Tailwind CSS
Testing: Vitest + Testing Library

Path alias: @/* → src/*

Architecture

App Shell

src/main.tsx → src/App.tsx
  └── Routes:
      /, /advanced → AdvancedDigitalHumanPage (main)
      /digital-human → DigitalHumanPage (simple demo)

Layer Structure

pages/          → Route-level components
  └── hooks/    → Business logic hooks
       └── core/ → Engine services
            └── store/ → Zustand state

Core Services

Service	File	Purpose
Avatar	`core/avatar/DigitalHumanEngine.ts`	Imperative façade over store for avatar control
Audio	`core/audio/audioService.ts`	TTS synthesis, ASR recognition via Web Speech API
Dialogue	`core/dialogue/dialogueService.ts`	HTTP client with retry, timeout, fallback
Orchestrator	`core/dialogue/dialogueOrchestrator.ts`	Full dialogue turn orchestration
Vision	`core/vision/visionService.ts`	MediaPipe face/pose inference
Performance	`core/performance/deviceCapability.ts`	Device tier detection

State Stores

Store	Scope
`chatSessionStore`	Messages, sessionId
`systemStore`	Connection, errors, performance metrics
`digitalHumanStore`	Avatar state, audio, behavior

Key Patterns

Service → Store Flow

Services read/write store via useXStore.getState() to avoid props drilling:

// In service
const { setSpeaking, setBehavior } = useDigitalHumanStore.getState();
setBehavior('thinking');

Store Selector Pattern

Use selectors to minimize re-renders:

// Good - only subscribes to specific value
const isPlaying = useDigitalHumanStore((s) => s.isPlaying);

// Avoid - subscribes to entire store
const { isPlaying, ...rest } = useDigitalHumanStore();

Fallback Strategy

All external calls have fallbacks:

Model load fails → procedural CyberAvatar
API unavailable → local mock response
TTS fails → text-only display
Vision fails → disable panel gracefully

Backend Contract

Base URL: VITE_API_BASE_URL or http://localhost:8000

Endpoint	Method	Purpose
`/health`	GET	Connectivity check
`/v1/chat`	POST	Dialogue request
`/v1/chat/stream`	POST	SSE streaming dialogue
`/ws`	WS	WebSocket connection

Chat Response Shape:

interface ChatResponse {
  replyText: string;
  emotion: 'neutral' | 'happy' | 'surprised' | 'sad' | 'angry';
  action: 'idle' | 'wave' | 'greet' | 'think' | 'nod' | 'shakeHead' | 'dance' | 'speak';
}

Development Commands

npm install          # Install dependencies
npm run dev          # Start dev server (port 5173)
npm run build        # Production build
npm run preview      # Preview production build
npm run lint         # ESLint check
npm run lint:fix     # Auto-fix ESLint issues
npm run format       # Prettier formatting
npm run typecheck    # TypeScript check
npm run test         # Vitest watch mode
npm run test:run     # Run tests once
npm run test:coverage # Coverage report

Testing Notes

Tests mock Three.js, R3F, and browser APIs
Main test file: src/__tests__/digitalHuman.test.tsx
Browser APIs (speech, media) require mocking

Implementation Tips

Prefer AdvancedDigitalHumanPage for changes (main experience)
Avatar reactions: check DigitalHumanEngine.ts + dialogueOrchestrator.ts
Chat issues: check dialogueService.ts retry/fallback + health check
Speech: verify if logic belongs in audioService.ts or page orchestration
Event listeners: always clean up in useEffect return

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AGENTS.md

Project Overview

OpenSpec Workflow

Commands

Key Directories

Workflow

When to Create a Proposal

Tech Stack

Architecture

App Shell

Layer Structure

Core Services

State Stores

Key Patterns

Service → Store Flow

Store Selector Pattern

Fallback Strategy

Backend Contract

Development Commands

Testing Notes

Implementation Tips

FilesExpand file tree

AGENTS.md

Latest commit

History

AGENTS.md

File metadata and controls

AGENTS.md

Project Overview

OpenSpec Workflow

Commands

Key Directories

Workflow

When to Create a Proposal

Tech Stack

Architecture

App Shell

Layer Structure

Core Services

State Stores

Key Patterns

Service → Store Flow

Store Selector Pattern

Fallback Strategy

Backend Contract

Development Commands

Testing Notes

Implementation Tips