soniqo · ivan-digital · May 13, 2026 · May 13, 2026
diff --git a/.gitignore b/.gitignore
@@ -5,12 +5,8 @@ build/
 .idea/
 local.properties
 
-# ONNX Runtime (downloaded by setup.sh / setup_linux.sh)
+# ONNX Runtime (downloaded by setup.sh)
 /ort/
-/ort-linux/
-
-# Test models (downloaded by linux/tests/download_models.sh)
-linux/tests/models/
 
 # Native build artifacts
 .cxx/

diff --git a/AGENTS.md b/AGENTS.md
@@ -2,17 +2,23 @@
 
 ## Project
 
-speech-android — on-device speech SDK for Android and embedded Linux (VAD + STT + TTS + noise cancellation).
+speech-android — on-device speech SDK for Android (VAD + STT + TTS + noise cancellation).
+
+Thin Kotlin SDK + JNI bridge over the [speech-core](https://github.com/soniqo/speech-core)
+C++ engine, which provides the orchestration pipeline AND the ONNX Runtime
+model wrappers (Silero VAD, Parakeet STT, Kokoro TTS, DeepFilterNet3). This
+repo owns only the Android packaging and a single ~250-line JNI bridge.
+
+Linux/automotive support moved to [speech-core's `examples/linux/`](https://github.com/soniqo/speech-core/tree/main/examples/linux).
 
 ## Structure
 
-- `speech-core/` — C++17 git submodule, pipeline orchestration (do not modify directly)
-- `sdk/src/main/cpp/` — ONNX Runtime model implementations, JNI bridge, audio DSP
+- `speech-core/` — git submodule (do not modify directly; open PRs against soniqo/speech-core)
+- `sdk/src/main/cpp/` — `jni_bridge.cpp` + `CMakeLists.txt`. That's it. All model code lives in speech-core.
 - `sdk/src/main/kotlin/com/soniqo/speech/` — Kotlin public SDK
 - `sdk/src/androidTest/` — instrumented e2e tests
-- `linux/` — embedded Linux C API (automotive/Yocto)
 - `app/` — demo application
-- `setup.sh` — downloads ONNX Runtime, initializes submodule
+- `setup.sh` — downloads ONNX Runtime, initializes the speech-core submodule
 
 ## Build
 
@@ -24,76 +30,25 @@ speech-android — on-device speech SDK for Android and embedded Linux (VAD + ST
 
 ## Tests
 
-### Android (emulator or device)
-
-```bash
-./gradlew :sdk:connectedAndroidTest
-```
-
-Models download automatically via `ModelManager.ensureModels()`.
-23 tests across 5 suites: SileroVadTest, ParakeetSttTest, KokoroTtsTest, PipelineE2ETest, BargeInTest.
-
-### Linux
-
-```bash
-# 1. Download ONNX Runtime
-linux/setup_linux.sh
-
-# 2. Download test models
-linux/tests/download_models.sh
-
-# 3. Build
-cd linux && cmake -B build -DORT_DIR=../ort-linux && cmake --build build
-
-# 4. Run (set model dir)
-SPEECH_MODEL_DIR=tests/models ./build/speech_test
-```
-
-11 tests: config, lifecycle, speech detection, concurrency, null safety.
-
-## Models
-
-ONNX models hosted on HuggingFace under `aufklarer/` org. INT8 is default.
-Parakeet TDT v3 — multilingual STT (114 languages, 8192 BPE vocab).
-ModelManager.kt handles download and caching.
-
-## Key files
-
-- `jni_bridge.cpp` — wires ONNX models to speech-core C API via vtables
-- `SpeechPipeline.kt` — main public API
-- `parakeet_stt.cpp` — STT with TDT greedy decoder + per-feature mel normalization
-- `kokoro_tts.cpp` + `kokoro_phonemizer.cpp` — TTS with dictionary-based phonemizer
-- `silero_vad.cpp` — voice activity detection
-- `deepfilter.cpp` — noise cancellation with STFT/ERB processing
-- `onnx_engine.h` — platform-aware ONNX Runtime wrapper (Android NNAPI / Linux QNN)
-- `linux/src/speech.cpp` — Linux C API implementation
-- `linux/include/speech.h` — Linux public C header
-
-## Workflow
-
-- **Never push directly to main.** Create a feature branch, open a PR, and merge after review.
-- Branch naming: `feat/description`, `fix/description`, `chore/description`
-- PRs should include: summary, test plan, and link to related issues
-- Tag releases from main after PR is merged: `git tag v0.0.X && git push origin v0.0.X`
-- CI runs on tags: builds SDK, runs unit tests, publishes to Maven Central + GitHub Packages, creates GitHub Release with APK
-
-## Testing
-
 ### Unit tests (no device needed)
 
 ```bash
 ./gradlew :sdk:test
 ```
 
-15 tests: download retry, resume, timeout, validation, edge cases.
+Download retry / resume / timeout / validation / edge cases.
 
 ### E2E tests (arm64 emulator or device)
 
 ```bash
 ./gradlew :sdk:connectedAndroidTest
 ```
 
-31 tests across 7 suites: SileroVadTest, ParakeetSttTest, KokoroTtsTest, KokoroMultilingualTest, PipelineE2ETest, BargeInTest, DeepFilterTest.
+Suites: `SileroVadTest`, `ParakeetSttTest`, `KokoroTtsTest`,
+`KokoroMultilingualTest`, `PipelineE2ETest`, `BargeInTest`, `DeepFilterTest`.
+
+Models (~1.2GB) download on first run via `ModelManager.ensureModels()`.
+Subsequent runs use the device-side cache.
 
 #### Emulator setup (arm64, 4GB RAM required)
 
@@ -104,29 +59,50 @@ echo "no" | avdmanager create avd -n speech_test -k "system-images;android-35-ex
 /opt/homebrew/share/android-commandlinetools/emulator/emulator -avd speech_test -no-window -no-audio -no-boot-anim -gpu swiftshader_indirect -memory 4096
 ```
 
-Models (~1.2GB) download on first run. Subsequent runs use cache.
+## Models
+
+ONNX models hosted on HuggingFace under [`aufklarer/`](https://huggingface.co/aufklarer)
+org. INT8 quantized by default.
 
-### Linux
+- `aufklarer/Silero-VAD-v5-ONNX` — VAD
+- `aufklarer/Parakeet-TDT-v3-ONNX` — STT (114 languages, 8192 BPE vocab)
+- `aufklarer/Kokoro-82M-ONNX` — TTS + phonemizer dicts + voice embeddings
+- `aufklarer/DeepFilterNet3-ONNX` — noise enhancer
 
-```bash
-linux/setup_linux.sh
-linux/tests/download_models.sh
-cd linux && cmake -B build -DORT_DIR=../ort-linux && cmake --build build
-SPEECH_MODEL_DIR=tests/models ./build/speech_test
-```
+`ModelManager.kt` handles download and caching. See speech-core's
+[`docs/models.md`](https://github.com/soniqo/speech-core/blob/main/docs/models.md)
+for the full model-file inventory.
+
+## Key files
+
+- `sdk/src/main/cpp/jni_bridge.cpp` — constructs `speech_core::SileroVad`/`ParakeetStt`/`KokoroTts` and feeds them to `speech_core::VoicePipeline`. No vtable adapters — the model wrappers implement the interfaces directly.
+- `sdk/src/main/cpp/CMakeLists.txt` — pulls speech-core in via `add_subdirectory` with `SPEECH_CORE_WITH_ONNX=ON`; the speech_core_models target provides every model wrapper.
+- `sdk/src/main/kotlin/com/soniqo/speech/SpeechPipeline.kt` — main public Kotlin API.
+- `sdk/src/main/kotlin/com/soniqo/speech/NativeBridge.kt` — JNI surface (must stay in lockstep with `jni_bridge.cpp`).
+- `sdk/src/main/kotlin/com/soniqo/speech/ModelManager.kt` — model download + caching.
+
+Native code that used to live here (`models/*.{cpp,h}`, `audio/{fft,mel,stft}.cpp`,
+`util/json.h`, `onnx_engine.h`) is now under speech-core. Modify it via a
+speech-core PR, then bump the submodule pointer here.
+
+## Workflow
 
-11 tests: config, lifecycle, speech detection, concurrency, null safety.
+- **Never push directly to main.** Create a feature branch, open a PR, merge after review.
+- Branch naming: `feat/description`, `fix/description`, `chore/description`.
+- PRs should include: summary, test plan, and link to related issues.
+- Tag releases from main after merge: `git tag v0.0.X && git push origin v0.0.X`.
+- CI runs on tags: builds SDK, runs unit tests, publishes to Maven Central + GitHub Packages, creates GitHub Release with APK.
 
 ## Guidelines
 
-- Keep native code in C++17, no external deps beyond ONNX Runtime, OkHttp, and speech-core
-- Kotlin SDK should be minimal — thin wrapper over JNI
-- All model tensor names/shapes must match actual ONNX exports
-- Test on arm64-v8a (Snapdragon) as primary target
-- No Claude attribution in commits, PRs, or model cards
-- **Never push directly to main — always use a PR**
-- **Always ask for confirmation before creating a git commit**
-- **Always ask for confirmation before any action visible to others** — pushing to any branch, opening / commenting on / reviewing / closing / merging PRs or issues, posting to Slack or any external service. The git commit rule above is one instance of this broader principle: never create externally visible artifacts without explicit confirmation.
-- **Run unit tests (`./gradlew :sdk:test`) after making code changes**
-- **Run e2e tests (`./gradlew :sdk:connectedAndroidTest`) before tagging a release**
-- **README translations must stay in sync.** Any change to `README.md` must be mirrored in all translated copies: `README_zh.md`, `README_ja.md`, `README_ko.md`, `README_es.md`, `README_de.md`, `README_fr.md`, `README_hi.md`, `README_pt.md`, `README_ru.md`
+- Keep native code in C++17. No external deps beyond ONNX Runtime, OkHttp, and speech-core.
+- Kotlin SDK stays minimal — thin wrapper over JNI.
+- All model tensor names/shapes must match the published ONNX exports under `aufklarer/`.
+- Test on arm64-v8a (Snapdragon) as primary target.
+- **No Claude attribution** in commits, PRs, or model cards. Strip both the `🤖 Generated with [Claude Code]` footer and the `Co-Authored-By: Claude …` trailer from defaults.
+- **Never push directly to main — always use a PR**.
+- **Always ask for confirmation before creating a git commit**.
+- **Always ask for confirmation before any externally-visible action** — pushing to any branch, opening / commenting on / reviewing / closing / merging PRs or issues, posting to Slack or any external service. The git commit rule above is one instance of this broader principle.
+- **Run unit tests (`./gradlew :sdk:test`) after making code changes**.
+- **Run e2e tests (`./gradlew :sdk:connectedAndroidTest`) before tagging a release**.
+- **README translations must stay in sync.** Any change to `README.md` must be mirrored in all translated copies: `README_zh.md`, `README_ja.md`, `README_ko.md`, `README_es.md`, `README_de.md`, `README_fr.md`, `README_hi.md`, `README_pt.md`, `README_ru.md`.