benchncnn_llm#6711

Open
nihui wants to merge 4 commits into Tencent:master from nihui:benchmark-llm-1

@nihui (Member) commented May 9, 2026

No description provided.

@tencent-adm (Member)

CLA assistant check
Thank you for your submission, we really appreciate it. Like many open source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution.

@chatgpt-codex-connector chatgpt-codex-connector Bot left a comment

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: d4da0f8dd2

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

Comment threads on benchmark/benchncnn_llm.cpp: 2 outdated, 7 fixed.

@nihui changed the title from "benchncnn_llm and benchncnn_diffusion" to "benchncnn_llm" on May 9, 2026

@nihui (Member, Author) commented May 9, 2026

@codex review

@chatgpt-codex-connector

Codex Review: Didn't find any major issues. Can't wait for the next one!

Copilot AI (Contributor) left a comment

Pull request overview

Adds a new LLM-focused benchmark target (benchncnn_llm) to the benchmark/ suite, along with embedded NCNN decoder param assets for several small LLMs so they can be benchmarked without external model files.

Changes:

  • Introduce a benchncnn_llm executable that times prefill (1k tokens) and decode (1 token against a 1k KV cache) for multiple decoder graphs.
  • Extend benchmark/CMakeLists.txt to generate and embed .ncnn.param headers for LLM decoder models (benchncnn_llm_param_data.h) and build the new target.
  • Add new LLM decoder .ncnn.param files under benchmark/models/llm/.
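
The prefill and decode timing described above could be measured along the following lines. This is a minimal sketch, not the code in benchmark/benchncnn_llm.cpp: `BenchResult`, `run_timed`, and `tokens_per_second` are hypothetical names, and the callable passed to `run_timed` stands in for whatever the real driver does for one prefill or one decode step.

```cpp
#include <algorithm>
#include <chrono>
#include <functional>
#include <limits>

// Hypothetical result record; not a type taken from the PR.
struct BenchResult
{
    double min_ms; // fastest timed run, in milliseconds
    double avg_ms; // mean of all timed runs, in milliseconds
};

// Call fn `warmup` times without timing, then `loops` times with timing.
BenchResult run_timed(const std::function<void()>& fn, int warmup, int loops)
{
    if (loops <= 0)
        return {0.0, 0.0};

    for (int i = 0; i < warmup; i++)
        fn();

    double min_ms = std::numeric_limits<double>::max();
    double total_ms = 0.0;
    for (int i = 0; i < loops; i++)
    {
        const auto t0 = std::chrono::steady_clock::now();
        fn();
        const auto t1 = std::chrono::steady_clock::now();
        const double ms = std::chrono::duration<double, std::milli>(t1 - t0).count();
        min_ms = std::min(min_ms, ms);
        total_ms += ms;
    }
    return {min_ms, total_ms / loops};
}

// Throughput for `tokens` tokens processed in `ms` milliseconds.
double tokens_per_second(int tokens, double ms)
{
    return ms > 0.0 ? tokens * 1000.0 / ms : 0.0;
}
```

Under these assumptions, a prefill over 1024 tokens would be reported as `tokens_per_second(1024, result.avg_ms)`, and a single decode step as `tokens_per_second(1, result.avg_ms)`. Whether the actual benchmark reports the minimum, the average, or both is not stated on this page.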

Reviewed changes

Copilot reviewed 9 out of 9 changed files in this pull request and generated 2 comments.

| File | Description |
| --- | --- |
| benchmark/CMakeLists.txt | Adds param-header generation + build rules for new benchncnn_llm benchmark target. |
| benchmark/benchncnn_llm.cpp | New benchmark driver that loads embedded LLM decoder params, builds inputs, runs warmup, and reports timing/tokens-per-second. |
| benchmark/models/llm/tinyllama_1.1b_decoder.ncnn.param | Adds embedded TinyLlama decoder graph for benchmarking. |
| benchmark/models/llm/qwen2.5_0.5b_decoder.ncnn.param | Adds embedded Qwen2.5 0.5B decoder graph for benchmarking. |
| benchmark/models/llm/llama3.2_1b_decoder.ncnn.param | Adds embedded Llama 3.2 1B decoder graph for benchmarking. |
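
To illustrate what embedding a .ncnn.param file in a header makes possible, the sketch below keeps a tiny param text in a string constant and reads its header without touching the filesystem. It assumes the plain-text ncnn param layout (magic number 7767517 on the first line, then the layer and blob counts). `example_param_text`, `ParamHeader`, and `read_param_header` are hypothetical, and the real layout of benchncnn_llm_param_data.h is not shown on this page.

```cpp
#include <sstream>
#include <string>

// Hypothetical stand-in for a generated header; a two-layer toy graph,
// not one of the decoder graphs added by the PR.
static const char example_param_text[] =
    "7767517\n"
    "2 2\n"
    "Input in0 0 1 in0\n"
    "ReLU relu0 1 1 in0 out0\n";

// Hypothetical record for the first two lines of a text param file.
struct ParamHeader
{
    int magic;
    int layer_count;
    int blob_count;
};

// Parse the magic number and the layer/blob counts from in-memory param text.
// Returns false if the text is malformed or the magic number does not match.
bool read_param_header(const char* text, ParamHeader& header)
{
    std::istringstream ss{std::string(text)};
    if (!(ss >> header.magic >> header.layer_count >> header.blob_count))
        return false;
    return header.magic == 7767517;
}
```

Because the graph lives in the binary, a benchmark built this way needs no model files on the target device, which matches the stated goal of benchmarking without external model files.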

Comment threads on benchmark/benchncnn_llm.cpp: 2 outdated.
4 participants