Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
3 changes: 2 additions & 1 deletion .typos.toml
Original file line number Diff line number Diff line change
Expand Up @@ -8,7 +8,8 @@ extend-ignore-identifiers-re = [
"noe",
"NOE",
"BA",
"lin2-rin2"
"lin2-rin2",
"SME"
]
extend-ignore-re = [
"lin2-rin2",
Expand Down
13 changes: 8 additions & 5 deletions docs/common/ai/_llama_cpp.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -11,7 +11,7 @@ llama.cpp 是一个基于纯 C/C++ 实现的高性能大模型推理框架,它
<NewCodeBlock tip="Device" type="device">

```bash
git clone https://github.com/ggml-org/llama.cpp.git
git clone https://github.com/ggml-org/llama.cpp.git && cd llama.cpp
```

</NewCodeBlock>
Expand All @@ -23,7 +23,7 @@ git clone https://github.com/ggml-org/llama.cpp.git
<NewCodeBlock tip="Device" type="device">

```bash
sudo apt install cmake gcc g++
sudo apt install cmake gcc g++ libcurl4-openssl-dev
```

</NewCodeBlock>
Expand All @@ -39,15 +39,18 @@ cmake --build build --config Release -j$(nproc)

</NewCodeBlock>

:::info[KleidiAI]
:::info[ARMv9]
对于采用 ARM-v9 架构的 [瑞莎星睿 O6 / O6N](/orion/o6) 设备,可以开启 armv9-a 和 KleidiAI 编译选项进行硬件级优化。

**请使用 `4aced7a` commit**
:::

<NewCodeBlock tip="Device" type="device">

```bash
git checkout 4aced7a
cmake -B build -DGGML_NATIVE=OFF -DGGML_CPU_ARM_ARCH=armv9-a+i8mm+dotprod -DGGML_CPU_KLEIDIAI=ON
cmake --build build --config Release
cmake --build build --config Release -j$(nproc)
```

</NewCodeBlock>
Expand All @@ -60,7 +63,7 @@ Llama.cpp 已集成 Arm KleidiAI 库,该库针对 SME、I8MM 及点积加速

```bash
cmake -B build -DGGML_CPU_KLEIDIAI=ON
cmake --build build --config Release
cmake --build build --config Release -j$(nproc)
```

</NewCodeBlock>
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -23,7 +23,7 @@ git clone https://github.com/ggml-org/llama.cpp.git
<NewCodeBlock tip="Device" type="device">

```bash
sudo apt install cmake gcc g++
sudo apt install cmake gcc g++ libcurl4-openssl-dev
```

</NewCodeBlock>
Expand All @@ -39,15 +39,18 @@ cmake --build build --config Release -j$(nproc)

</NewCodeBlock>

:::info[KleidiAI]
:::info[ARMv9]
On devices using the ARMv9 architecture, such as [Radxa Orion O6 / O6N](/orion/o6), you can enable the `armv9-a` and KleidiAI build options for hardware-level optimization.

**Please use `4aced7a` commit**.
:::

<NewCodeBlock tip="Device" type="device">

```bash
git checkout 4aced7a
cmake -B build -DGGML_NATIVE=OFF -DGGML_CPU_ARM_ARCH=armv9-a+i8mm+dotprod -DGGML_CPU_KLEIDIAI=ON
cmake --build build --config Release
cmake --build build --config Release -j$(nproc)
```

</NewCodeBlock>
Expand All @@ -60,7 +63,7 @@ llama.cpp integrates the Arm KleidiAI library, which provides highly optimized m

```bash
cmake -B build -DGGML_CPU_KLEIDIAI=ON
cmake --build build --config Release
cmake --build build --config Release -j$(nproc)
```

</NewCodeBlock>
Expand Down
Loading