Popular repositories
- llama.cpp-turboquant-hip (Public)
  TurboQuant KV cache compression for llama.cpp: HIP/ROCm port for AMD RDNA3 (gfx1100)
- hermes-skills (Public)
  Reusable skills and dependency graph system for Hermes Agent: bitwarden secrets, email composition, autoresearch loop
  Python 6
- turboquant-hip (Public)
  TurboQuant KV cache compression for AMD GPUs (HIP/ROCm). First working HIP port. 4.9x compression, zero accuracy loss.
  C++ 2
- agentsearch (Public)
  CLI tool to index and search AI agent sessions across Hermes, Moltis, Nanobot, and markdown notes. BM25 via tantivy.
  Rust 1
- triattention-ggml (Public)
  Frequency-based KV cache pruning for llama.cpp: 25% cache reduction, improved PPL at long context. GPU compaction kernel for HIP/ROCm.
  Python 1
- moltis (Public, forked from dr34ming/moltis)
  A Rust-native claw you can trust. One binary: sandboxed, secure, auditable. Voice, memory, MCP tools, and multi-channel access built-in.
  Rust



