[Cpp API Compatibility] Add CUDABlas related APIs by youge325 · Pull Request #78060 · PaddlePaddle/Paddle

youge325 · 2026-02-27T09:45:14Z

PR Category

Execute Infrastructure

PR Types

New features

Description

Add the at::cuda::getCurrentCUDABlasHandle and at::cuda::blas::gemm<T> interfaces.

Add the Allocator struct.

For documentation, see PFCCLab/PaddleCppAPITest#46.
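As a rough illustration of what a BLAS-style `gemm<T>` computes, the sketch below is a CPU reference for `C = alpha * op(A) * op(B) + beta * C` on column-major data. The names `gemm_sketch`, `elem`, and `ref_gemm` are hypothetical and not part of this PR. The argument order follows the usual BLAS convention and is only assumed to resemble the compat interface; no cuBLAS call is made.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical helpers for illustration; not the compat layer's code.
namespace gemm_sketch {

// Reads element (row, col) of op(M), where M is stored column-major with
// leading dimension ld and op is either identity or transpose.
template <typename T>
T elem(const T* m, std::int64_t ld, bool trans, std::int64_t row,
       std::int64_t col) {
  return trans ? m[row * ld + col] : m[col * ld + row];
}

// CPU reference: C (m x n) = alpha * op(A) (m x k) * op(B) (k x n) + beta * C.
template <typename T>
void ref_gemm(char transa, char transb, std::int64_t m, std::int64_t n,
              std::int64_t k, T alpha, const T* a, std::int64_t lda,
              const T* b, std::int64_t ldb, T beta, T* c, std::int64_t ldc) {
  assert(m >= 0 && n >= 0 && k >= 0);
  const bool ta = (transa == 't' || transa == 'T');
  const bool tb = (transb == 't' || transb == 'T');
  for (std::int64_t j = 0; j < n; ++j) {
    for (std::int64_t i = 0; i < m; ++i) {
      T acc = T(0);
      for (std::int64_t p = 0; p < k; ++p) {
        acc += elem(a, lda, ta, i, p) * elem(b, ldb, tb, p, j);
      }
      c[j * ldc + i] = alpha * acc + beta * c[j * ldc + i];
    }
  }
}

// Multiplies the transpose of a 2x2 matrix by the identity and returns C.
inline std::vector<float> demo_transposed() {
  const std::vector<float> a = {1, 3, 2, 4};  // columns (1,3) and (2,4)
  const std::vector<float> b = {1, 0, 0, 1};  // identity
  std::vector<float> c(4, 0.0f);
  ref_gemm<float>('t', 'n', 2, 2, 2, 1.0f, a.data(), 2, b.data(), 2, 0.0f,
                  c.data(), 2);
  return c;
}

}  // namespace gemm_sketch
```

A reference like this can be handy when checking a GPU result element by element, since the leading-dimension and transpose conventions are easy to get wrong.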

Does this change affect numerical precision?

No

paddle-bot · 2026-02-27T09:45:20Z

Your PR has been submitted. Thanks for your contribution!
Please wait for the CI results first. See the Paddle CI Manual for details.

Copilot

Pull request overview

This PR extends Paddle’s LibTorch/ATen compatibility layer by introducing a lightweight CUDA context interface and adding missing CUDA/CUBLAS handle accessors needed for C++ API compatibility (notably at::cuda::getCurrentCUDABlasHandle), along with a compat c10::Allocator abstraction.

Changes:

Add c10::cuda::device_count() API to the compat CUDA functions header.
Introduce compat c10::Allocator (plus DataPtr::release_context() support) for raw allocation APIs.
Add ATen/cuda/CUDAContextLight.{h,cpp} and switch ATen/cuda/CUDAContext.h to include the new light header; wire the new .cpp into the compat build.
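To make the role of `DataPtr::release_context()` in raw allocation concrete, here is a minimal, self-contained sketch. Everything in the `alloc_sketch` namespace is a hypothetical stand-in written for illustration, assuming a design where a `DataPtr` owns an opaque context plus a deleter; it is not the compat `c10::Allocator` code from this PR.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdlib>
#include <memory>

// Hypothetical stand-ins for illustration; not Paddle's compat types.
namespace alloc_sketch {

using DeleterFn = void (*)(void*);

// Owns an opaque context pointer and frees it with a deleter on destruction.
class DataPtr {
 public:
  DataPtr(void* data, void* ctx, DeleterFn deleter)
      : data_(data), ctx_(ctx, deleter ? deleter : &noop) {}

  void* get() const { return data_; }
  void* get_context() const { return ctx_.get(); }

  // Hands the context to the caller; the destructor no longer frees it.
  void* release_context() { return ctx_.release(); }

 private:
  static void noop(void*) {}
  void* data_;
  std::unique_ptr<void, DeleterFn> ctx_;
};

struct Allocator {
  virtual ~Allocator() = default;
  virtual DataPtr allocate(std::size_t n) = 0;
  virtual DeleterFn raw_deleter() const { return nullptr; }

  // Raw API: returns a bare pointer that the caller must pass back to
  // raw_deallocate(). Only valid when data and context are the same pointer.
  void* raw_allocate(std::size_t n) {
    DataPtr p = allocate(n);
    assert(p.get() == p.get_context());
    return p.release_context();  // detach so ~DataPtr does not free it
  }

  void raw_deallocate(void* ptr) {
    if (DeleterFn d = raw_deleter()) d(ptr);
  }
};

// Simple host allocator backed by malloc/free.
struct MallocAllocator final : Allocator {
  static void free_ctx(void* p) { std::free(p); }

  DataPtr allocate(std::size_t n) override {
    void* p = std::malloc(n);
    return DataPtr(p, p, &free_ctx);
  }
  DeleterFn raw_deleter() const override { return &free_ctx; }
};

}  // namespace alloc_sketch
```

Without the `release_context()` call, the temporary `DataPtr` in `raw_allocate` would free the buffer as soon as the function returned, which is why the raw API needs a way to detach ownership.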

Reviewed changes

Copilot reviewed 6 out of 6 changed files in this pull request and generated 5 comments.

Show a summary per file

| File | Description |
| --- | --- |
| paddle/phi/api/include/compat/c10/cuda/CUDAFunctions.h | Adds `device_count()` to the compat c10 CUDA API surface. |
| paddle/phi/api/include/compat/c10/core/Allocator.h | Adds a compat `c10::Allocator` interface and `DataPtr::release_context()`. |
| paddle/phi/api/include/compat/CMakeLists.txt | Adds the new `CUDAContextLight.cpp` to the compat compilation sources. |
| paddle/phi/api/include/compat/ATen/cuda/CUDAContextLight.h | Declares lightweight `at::cuda` CUDA context APIs, including cuBLAS handle getters. |
| paddle/phi/api/include/compat/ATen/cuda/CUDAContextLight.cpp | Implements the lightweight CUDA context APIs via `phi::GPUContext`. |
| paddle/phi/api/include/compat/ATen/cuda/CUDAContext.h | Redirects CUDAContext to the new lightweight header. |

codecov-commenter · 2026-02-27T14:52:27Z

Codecov Report

❌ Patch coverage is 95.18072% with 24 lines in your changes missing coverage. Please review.
⚠️ Please upload report for BASE (develop@362b943). Learn more about missing BASE report.

| Files with missing lines | Patch % | Lines |
| --- | --- | --- |
| .../api/include/compat/ATen/cuda/CUDAContextLight.cpp | 89.74% | 8 Missing ⚠️ |
| paddle/phi/api/include/compat/c10/core/Storage.h | 93.79% | 8 Missing ⚠️ |
| ...ddle/phi/api/include/compat/ATen/core/TensorBase.h | 92.72% | 4 Missing ⚠️ |
| ...addle/phi/api/include/compat/c10/core/DeviceType.h | 90.90% | 2 Missing ⚠️ |
| ...ddle/phi/api/include/compat/ATen/core/TensorBody.h | 0.00% | 1 Missing ⚠️ |
| paddle/phi/api/include/compat/c10/core/Allocator.h | 97.61% | 1 Missing ⚠️ |

Additional details and impacted files

@@            Coverage Diff             @@
##             develop   #78060   +/-   ##
==========================================
  Coverage           ?   95.18%           
==========================================
  Files              ?       16           
  Lines              ?      498           
  Branches           ?        0           
==========================================
  Hits               ?      474           
  Misses             ?       24           
  Partials           ?        0

youge325 · 2026-03-24T06:19:35Z

/re-run windows-gpu

SigureMo · 2026-03-24T06:47:33Z


 #pragma once

-#include <cublasXt.h>


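Huh, what is the reason for this change? Is the include simply unnecessary?

If this line is not removed, the Linux-DCU build fails at compile time with a redefinition error, because it conflicts with #include <cublas_v2.h>. I originally meant to swap the include order, but cpplint switches it straight back. I later found that simply removing the line works fine: cublasXt.h is not needed here for now, and it can be included again later when it is actually needed. For details, see the commits related to the DCU fix.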

SigureMo · 2026-03-24T12:15:16Z

The formal conclusion for this round is changed to APPROVE.

This PR really was not easy.

youge325 · 2026-03-26T03:51:49Z

/re-run all-failed

youge325 · 2026-03-26T05:49:42Z

/re-run all-failed

SigureMo

LGTMeow

Copilot AI review requested due to automatic review settings February 27, 2026 09:45

paddle-bot Bot added the contributor External developers label Feb 27, 2026

Copilot started reviewing on behalf of youge325 February 27, 2026 09:45

Copilot AI reviewed Feb 27, 2026

youge325 force-pushed the cOthers branch 2 times, most recently from c807df9 to 8ebc8c8 Compare February 28, 2026 13:34

youge325 changed the title ~~[Cpp API Compatibility] add at::cuda::getCurrentCUDABlasHandle~~ [Cpp API Compatibility] add CUDABlas related APIs Feb 28, 2026

youge325 force-pushed the cOthers branch from e9d1663 to 6d2592c Compare March 2, 2026 13:27

BingooYang reviewed Mar 3, 2026

Comment thread paddle/phi/api/include/compat/c10/core/Allocator.h

youge325 force-pushed the cOthers branch 17 times, most recently from 7a5c35e to c1cbf14 Compare March 5, 2026 09:51

youge325 force-pushed the cOthers branch 2 times, most recently from 1067c2f to adb8b58 Compare March 14, 2026 06:33

youge325 added 6 commits March 23, 2026 21:24

Fix compat storage live synchronization

60f0ccc

Stabilize compat parity and CUDA gating

e160055

Normalize compat Device include order

af338fb

Merge branch 'develop' into cOthers

e6d0f30

fix Windows build

f5aca72

fix use_count

59abc3f

SigureMo reviewed Mar 24, 2026

youge325 added 2 commits March 24, 2026 16:27

Merge branch 'develop' into cOthers

fce1903

fix storage use count invariant across independent wrappers

439c6f4

youge325 added 11 commits March 24, 2026 20:16

Merge branch 'develop' into cOthers

9cc2fbc

fix compiling error: 'ResetHolder' is not a member of 'at::TensorImpl'

1fa8416

Revert "fix compiling error: 'ResetHolder' is not a member of 'at::Te…

0a16785

…nsorImpl'" This reverts commit 1fa8416.

Reapply "fix compiling error: 'ResetHolder' is not a member of 'at::T…

86d9ffe

…ensorImpl'" This reverts commit 0a16785.

fix compat_basic_test

a4bdf37

refactor sparse_coo_tensor implementation

dab0730

Merge branch 'develop' into cOthers

3ef6514

fix coverage rate

b2573e0

Merge branch 'develop' into cOthers

c464279

fix coverage again as Codecov succeed

fa69b0e

fix coverage rate according to full Codecov report

4cd1c9f

SigureMo changed the title ~~[Cpp API Compatibility] add CUDABlas related APIs~~ [Cpp API Compatibility] Add CUDABlas related APIs Mar 26, 2026

SigureMo approved these changes Mar 26, 2026

SigureMo added the skip-ci: approval label Mar 26, 2026

SigureMo merged commit 59e6de0 into PaddlePaddle:develop Mar 26, 2026
125 of 130 checks passed

youge325 deleted the cOthers branch March 26, 2026 09:13

youge325 mentioned this pull request Mar 31, 2026

revert some modifications PFCCLab/DeepGEMM#9

Open

5 participants
