Open
Commits
403 commits
b5e3a56
Compiling kernel.bc
parnenziniGT Feb 14, 2024
53cd7b2
checking errors
parnenziniGT Feb 14, 2024
9c423ea
Adding compilation kernel.bc
parnenziniGT Feb 14, 2024
954119c
Fix issue compilation kernel
parnenziniGT Feb 14, 2024
0bb31bc
Fix issue compilation kernel - again
parnenziniGT Feb 14, 2024
0192f82
Fix LLVM_PREFIX
parnenziniGT Feb 14, 2024
0203138
Adding compilation kernel wrapper
parnenziniGT Feb 14, 2024
0bd34e8
Fix Vortex path
parnenziniGT Feb 14, 2024
c7e6c30
Reverting Vortex path
parnenziniGT Feb 14, 2024
15d1ce7
Investigation vx_path.h
parnenziniGT Feb 14, 2024
25d6225
Fix 2
parnenziniGT Feb 14, 2024
5a5a490
Fix 3
parnenziniGT Feb 14, 2024
d0b6115
Fix 4
parnenziniGT Feb 14, 2024
a8f1b97
Fix 5
parnenziniGT Feb 14, 2024
76d592b
Fix 6
parnenziniGT Feb 14, 2024
1559abe
Change directory Vortex
parnenziniGT Feb 14, 2024
4d5347a
Fixed path for build
parnenziniGT Feb 14, 2024
0f91cc2
Added installation dependencies
parnenziniGT Feb 14, 2024
96f946f
Added risc-v toolchain
parnenziniGT Feb 14, 2024
dde06b3
Added log messages
parnenziniGT Feb 14, 2024
cadc730
Added file for log messages
parnenziniGT Feb 14, 2024
10e60cd
Changed sysroot path
parnenziniGT Feb 14, 2024
88ff5b6
Added compiling kernel.elf
parnenziniGT Feb 14, 2024
e1927c9
Complete kernel compilation
parnenziniGT Feb 14, 2024
96dfda8
Complete host compilation
parnenziniGT Feb 14, 2024
88b5246
Complete execution
parnenziniGT Feb 14, 2024
ddbf388
Disable lint.yml
parnenziniGT Feb 15, 2024
4f0571b
Update build.yml to use Node.js 20
parnenziniGT Feb 15, 2024
b53cabb
Checking version of Node.js used
parnenziniGT Feb 15, 2024
3b9c3e1
Update build.yml to use checkout@v3
parnenziniGT Feb 15, 2024
aece54e
Fixed syntax error
parnenziniGT Feb 15, 2024
8c04e7d
Upgrade actions in build.yml
parnenziniGT Feb 15, 2024
d079b94
Update setup-node action to v4
parnenziniGT Feb 15, 2024
de770c6
kjrun.sh file for micro/gauss/bfs changed to include in-land machines
Feb 27, 2024
eb51bc0
32bit run fixed, (abuf_size was wrong)
Mar 1, 2024
3b51382
32bit run fixed, (abuf_size was wrong)
Mar 1, 2024
f63a41f
insert_warp_loop_printout
Mar 20, 2024
885c6c6
Merge branch 'vortex' of https://github.com/cupbop/CuPBoP_dev into vo…
Mar 20, 2024
92281fe
minor modification for handling the empty kernel function
Mar 26, 2024
278e974
upstreaming to latest version of vortex
Sep 11, 2024
972d3f2
fix re-allocation bug
Oct 22, 2024
6caf818
fix callback index
Oct 22, 2024
91bfd62
new branch for llvm18
Nov 15, 2024
70c894a
llvm18 related changes
Nov 16, 2024
a727431
printf changes and add kjrun_llvm18
Nov 20, 2024
e2ae623
add setup.sh file
Dec 10, 2024
926366e
commit changes to reflect syncthread changes
Dec 10, 2024
d5ea35d
Readme update
Dec 10, 2024
7ecfb54
get func_name fix, cache redundant calculation, and some cg support work
CharliePu Dec 18, 2024
e3db474
temp_fix for FCCM
Jan 17, 2025
3b1242d
temp_fix for FCCM
Jan 17, 2025
2c73ae3
temp_fix for FCCM
Jan 17, 2025
1c9ae0a
add kjrun_llvm18
Jan 18, 2025
a5408e7
btree filestructure
Jan 18, 2025
d049bc8
btree filestructure
Jan 18, 2025
38efa26
btree filestructure
Jan 18, 2025
10cdd5e
btree filestructure
Jan 18, 2025
5a13913
btree filestructure
Jan 18, 2025
523f0d6
btree filestructure
Jan 18, 2025
9590062
btree filestructure
Jan 18, 2025
1669e7e
Merge remote-tracking branch 'origin/vortex-llvm18' into cg-support
CharliePu Feb 7, 2025
477870c
work on one on one mapping
CharliePu Mar 11, 2025
03a5822
Fix threadIdx calls for one on one mapping
CharliePu Mar 18, 2025
99376b3
Support for block or warp sync on schedule 2
CharliePu Mar 19, 2025
29e8dac
Divergence error fix (blockid)
Mar 20, 2025
f25ddd4
Divergence error fix (blockid)
Mar 20, 2025
27dc0e5
enhance context array
Mar 21, 2025
9cf3dad
fix device to device copy
Mar 26, 2025
366d412
overwriting Jisheng's changes
May 14, 2025
8608331
Merge branch 'vortex-llvm18' into sche_2.0
CharliePu May 14, 2025
204568b
Update README.md
ChihyoA May 15, 2025
a7a68ab
Update README.md
ChihyoA May 15, 2025
8030622
Update README.md
ChihyoA May 15, 2025
1506629
Update README.md
ChihyoA May 15, 2025
1ee3e3e
Update README.md
ChihyoA May 15, 2025
9dba11c
Update README.md
ChihyoA May 15, 2025
fdc0112
Merge remote-tracking branch 'origin/vortex-llvm18' into sche_2.0
CharliePu May 23, 2025
39a46c9
add setup file
May 23, 2025
3b882ad
Merge branch 'vortex-llvm18' of https://github.com/cupbop/CuPBoP_dev …
May 23, 2025
da817cf
Merge remote-tracking branch 'origin/vortex-llvm18' into sche_2.0
CharliePu May 23, 2025
c1927ee
Update: refactor warp level function
Y-jiji May 30, 2025
741dc5c
Fix: replace warp level primitive pass
Y-jiji May 30, 2025
d48596f
Create docs folder
VineethNareddy May 31, 2025
a2fdb49
Upload docs
VineethNareddy May 31, 2025
e731354
Update README.md
VineethNareddy May 31, 2025
aa8e636
Update README.md
VineethNareddy May 31, 2025
515a023
Update README_Defs.md
VineethNareddy May 31, 2025
e998360
Update README_Defs.md
VineethNareddy May 31, 2025
20766eb
Update README.md
VineethNareddy May 31, 2025
2eb5c81
Update Architecture Image
VineethNareddy May 31, 2025
e8a8cdc
Merge pull request #14 from Y-jiji/vortex-tianji
hyesoon Jun 3, 2025
c739f64
Merge remote-tracking branch 'origin/vortex-llvm18' into sche_2.0
CharliePu Jul 1, 2025
accbcdc
Match the startup address with vortex
CharliePu Jul 1, 2025
e49c673
Fix support for cudaGetDeviceProperties; add vecadd test; revert inse…
CharliePu Jul 11, 2025
65ec921
Remove unnecessary .ll file and reflect all the changed that Mark mad…
Jul 15, 2025
1e0de57
Add new documentation files
VineethNareddy Jul 26, 2025
bb0c806
Update and rename ComonIssues.md to CommonIssues.md
VineethNareddy Jul 26, 2025
79dc4e3
Update FinalSetup.md Line Spacing
VineethNareddy Jul 26, 2025
237818f
Update README.md
VineethNareddy Jul 26, 2025
daaaf7c
Warp funcs support attempt
CharliePu Sep 2, 2025
993bf1f
Add hecbench tests
CharliePu Sep 4, 2025
43fb5c7
modified: examples/bscan-cuda/main.cu
CharliePu Sep 4, 2025
c7fdeee
new benchmarks added (same as OpenCL) merging warp_level feature branch
Sep 5, 2025
b36a8ba
kjrun file added, batch run bash file added
Sep 11, 2025
659d7dc
same as previous
Sep 11, 2025
e41e14b
collectresult.py
Sep 11, 2025
7b3d24a
minor changes
Sep 12, 2025
e950e22
minor changes
Sep 12, 2025
8d5da19
minor changes
Sep 12, 2025
1793e88
minor changes
Sep 12, 2025
1fc6f92
warp benchmarks update
CharliePu Sep 17, 2025
24ab615
minor update
Dec 18, 2025
21851ff
minor update
Dec 18, 2025
e55fe6e
minor changes
Dec 18, 2025
7f90400
minor changes
Dec 18, 2025
caefcdc
minor changes
Dec 19, 2025
e4ccc8f
minor update
Dec 19, 2025
c380958
minor changes
CharliePu Dec 20, 2025
0570c01
minor update
Dec 20, 2025
8a2b1b4
minor update
Dec 20, 2025
7bef313
stencil data add
Dec 20, 2025
95bca9d
README update
Dec 25, 2025
799891e
readme update
Dec 27, 2025
be2b36b
configured .github
Jan 21, 2026
f43dba4
CI moved from CuPBoP_dev
Jan 21, 2026
2994a26
Update rg-ci-env-setup.sh paths from CuPBoP_Dev
aakan511 Jan 21, 2026
97a1928
Update run-crnch-cuda.sh
aakan511 Jan 21, 2026
d3e890c
Update run-crnch-cuda-test.sh
aakan511 Jan 21, 2026
d1ab6b5
Update run-crnch-cuda.sh
aakan511 Jan 21, 2026
d79d2c8
Update run-crnch-cuda.sh
aakan511 Jan 21, 2026
4a7d7dc
Create rg-ci-setup.sh
aakan511 Jan 21, 2026
f3dcbce
Delete ci/rg-ci-env-setup.sh
aakan511 Jan 21, 2026
1a8592a
Update run-crnch-cuda-test.sh
aakan511 Jan 21, 2026
68ee423
Apptainer test
Jan 30, 2026
88c2e19
Scaffolding Apptainer CI exec
Jan 30, 2026
a2985bd
removed conflicting cudaGetDeviceProperties_v2() definition
Jan 30, 2026
2af5e7d
Updating tests to use Apptainer
Jan 30, 2026
b76c6ea
shrinking test suite temporarily
Jan 30, 2026
7ca2bf1
Default test script (no output check)
Jan 30, 2026
4a1bd16
added nn test
Jan 30, 2026
e17bb35
3 tests running
Jan 30, 2026
0d8f3a9
refactor and test update
Jan 30, 2026
644dbf2
Trying to add file output to tests
Jan 30, 2026
dbf423c
waiting for test to complete
Jan 30, 2026
4bae386
improved debug
Jan 30, 2026
3063f30
figuring out test output
Jan 30, 2026
a4728b5
sbatch troubleshooting
Jan 30, 2026
a6d1841
stop removing build directory
Jan 30, 2026
cce157e
printing last 10 lines of log
Jan 30, 2026
f573f43
switching from sbatch to srun
Jan 30, 2026
e9351cf
trying to simplify SLURM JOB ID
Jan 30, 2026
b6a48a8
trying to use symlink
Jan 30, 2026
67b508b
typo fix
Jan 30, 2026
53a836f
Adding run success verification
Jan 30, 2026
fab2910
no continue on error test
Jan 30, 2026
76fe8da
updated kjrun-llvm18 for pathfinder
Feb 2, 2026
00cd901
flag update
Feb 2, 2026
da58ad7
expanded test suite
Feb 2, 2026
c7ffff4
removing gcc path in kjrun
Feb 2, 2026
295dac4
Update run-crnch-cuda-test.sh
aakan511 Feb 8, 2026
6173b94
Update run-crnch-cuda-test.sh
aakan511 Feb 8, 2026
7d09bb5
Shrinking btree and transpose test data; Adding vortex schedule algo …
Feb 9, 2026
80977eb
debug test
Feb 9, 2026
e25d818
Create CI_Docs.md
aakan511 Feb 13, 2026
3dcdd3a
Update CI_Docs.md
aakan511 Feb 15, 2026
91648b2
Update CI_Docs.md
aakan511 Feb 16, 2026
73ef5be
Update CI_Docs.md
aakan511 Feb 16, 2026
b84e429
Update CI_Docs.md
aakan511 Feb 16, 2026
42776bb
WIP: local changes before merging CI
Feb 20, 2026
b83602a
Merge remote-tracking branch 'origin/CI'
Feb 20, 2026
908e755
sche2 performance improvement and makefile structure for benchmarks w…
Mar 24, 2026
ec6f154
cudaRuntimeImpl.cpp updated for CI
Mar 24, 2026
6f76b59
CI update
Mar 24, 2026
50d6870
CI update
Mar 24, 2026
694a9b3
CI update
Mar 24, 2026
1b9f06d
bfs data added
Mar 24, 2026
4b6596a
triton initial commit
Mar 24, 2026
d3615fa
Merge upstream/master - resolve generate_wrapper.cpp conflict
Mar 24, 2026
a6edcd1
stencil data added:
Mar 24, 2026
090e20e
stencil update
Mar 24, 2026
d20ed8b
Merge branch 'master' of https://github.com/cupbop/CuPBoP_Vortex into…
Mar 24, 2026
612ca59
triton changes
Mar 24, 2026
223ba2d
pytorch runtime added
Mar 24, 2026
5ed408d
Triton change runtime update
Mar 25, 2026
7cf04cb
CI update, additional nvvm function support added
Mar 25, 2026
d8afacc
CI update
Mar 25, 2026
14e0e1f
CI update
Mar 25, 2026
1944021
CI update
Mar 25, 2026
3d93a9c
CI pass decision updated
Mar 26, 2026
e2d5645
CI update
Mar 26, 2026
eefe31b
CI update
Mar 26, 2026
cc27878
CI update
Mar 26, 2026
2efee2e
CI update
Mar 26, 2026
570c7ef
CI update
Mar 26, 2026
0e2dfa6
CI update
Mar 26, 2026
ad555e3
CI update
Mar 26, 2026
e53cd19
CI update
Mar 26, 2026
4db86d7
CI update
Mar 26, 2026
8f3a7d7
CI update
Mar 26, 2026
5c2d8ae
dynamic shared memory update for sche=2, lmem=0, also CI temp update …
Mar 27, 2026
66bad33
CI structure update
Mar 27, 2026
7d0fb1a
CI updated to have smaller size
Mar 27, 2026
3c4fcbb
CI final update
Mar 27, 2026
209b7db
benchmark debugging
Mar 28, 2026
34f9f72
benchmark debug2
Mar 28, 2026
6293405
benchmark debug 3
Mar 30, 2026
fe99ed2
Benchmark debug4
Mar 30, 2026
9870887
TCU, DXA first commit
Mar 30, 2026
e7a9733
benchmark debug5
Mar 30, 2026
cf436f4
using vortex bugfix branch
Mar 30, 2026
6fba7c5
using vortex bugfix branch
Mar 30, 2026
6fe1fae
using vortex bugfix branch
Mar 30, 2026
5d6ca87
using vortex bugfix branch
Mar 30, 2026
48ec281
TCU base address fix and memcpyToSymbol dedup
Mar 30, 2026
2cc6758
Add sgemm_tcu, sgemm_dxa, sgemm_tcu_dxa to CI
Mar 30, 2026
ea90ea0
Merge master into vortex_bugfix_integration
Mar 30, 2026
f809bb9
Reduce bh-cuda and huffman CI sizes to avoid timeout
Mar 30, 2026
92464be
Fix DEVICE_BC default sm_50→sm_70 to match cuda-gpu-arch
Mar 31, 2026
686b08d
Reduce cc-cuda ThreadsPerBlock 1024→256 for Vortex
Mar 31, 2026
ab57306
Remove unconditional debug prints from tool.cpp
Mar 31, 2026
a30799a
Reduce bh-cuda THREADS 256→32 for Vortex CI timeout
Mar 31, 2026
dd56d8b
Pass env vars to kernel .o build and remove .o from git
Mar 31, 2026
0be892b
Comment out all unconditional debug IR prints
Mar 31, 2026
130754d
Force clean+rebuild of kernel .o files in cmake build
Mar 31, 2026
94ef565
Guard debug prints with cupbop_debug() instead of removing
Mar 31, 2026
ff2fc82
Reduce nw CI input size 128→32, update correctness check
Mar 31, 2026
4d315c2
Update nw golden file for CI_RUN_ARGS=32, restore golden diff check
Mar 31, 2026
2f9557a
Fix cg_sync.cpp build error: remove cupbop_debug() call
Mar 31, 2026
e802390
Move cupbop_debug() to tool.h so all source files can use it
Mar 31, 2026
8d43536
Remove duplicate static cupbop_debug() from 5 source files
Mar 31, 2026
e958e87
Skip stale perf entries in CI summary (status N/A → skip)
Mar 31, 2026
49691cc
Clean previous CI result files before build
Mar 31, 2026
245b3e8
Fix SCHE_0 dyn_cast crashes: null-check getCalledFunction() in 7 loca…
Mar 31, 2026
d5a39de
Fix SCHE_0 warp_shfl undefined reference and remaining null-check cra…
Mar 31, 2026
ab18cf0
Fix SCHE_0 threadIdx.y/z, block_size 3D, shfl idx/xor, C++ shfl wrapp…
Apr 1, 2026
7b4e01a
Revert C++ shfl wrapper detection (causes type mismatch and dominatio…
Apr 1, 2026
38f8d91
SCHE_0: fix alloca context save for warp shuffle kernels
Apr 3, 2026
fecf762
CI: fix false PASS when benchmark outputs both PASSED and FAIL
Apr 3, 2026
96d6260
Fix SCHE_2 regression + SCHE_0 shfl domination error
Apr 3, 2026
6d1c164
Fix jacobi-cuda: reduce size + golden comparison
Apr 3, 2026
c7bcc27
Fix srad_v2 SCHE_0 build + lower_constant_expr improvements
Apr 4, 2026
30faa80
Fix srad_v2 SCHE_0: reduce size + golden update + revert workarounds
Apr 4, 2026
1b6b5a7
CI: update LLVM path to llvm_edited (with LowerSwitch fix)
Apr 4, 2026
c98c3e0
Add mem_device2global: convert __device__ vars from addrspace(1) to 0
Apr 4, 2026
68845f0
Fix cc-cuda: addrspace(1) conversion + kernel name matching + device …
Apr 5, 2026
433e172
:Fix cc-cuda: addrspace(1) conversion + kernel name matching + device…
Apr 5, 2026
8e119e5
Fix cfd regression, cc-cuda device var persistence, score SCHE_0 domi…
Apr 5, 2026
ad33bbe
Reduce score-cuda CI size for faster SCHE_0 testing
Apr 6, 2026
5cfba85
SCHE_0 shfl barrier fix (cupbop.shfl.barrier) + huffman CI input size
Apr 7, 2026
71d0d86
Fix link to BenchmarkList in README.md
zqb-all Apr 7, 2026
29 changes: 29 additions & 0 deletions .devcontainer/Dockerfile
@@ -0,0 +1,29 @@
FROM ubuntu:22.04
ARG DEBIAN_FRONTEND=noninteractive
RUN apt-get update && apt-get install -y lsb-release wget software-properties-common gnupg git cmake zlib1g zlib1g-dev g++ python3 python3-pip
RUN pip install pre-commit
RUN wget -qO- https://apt.llvm.org/llvm.sh | bash -s -- 18
RUN apt-key del -y 7fa2af80 && \
    wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64/cuda-keyring_1.1-1_all.deb && dpkg -i cuda-keyring_1.1-1_all.deb && \
    apt-get update && apt-get install -y nvidia-cuda-toolkit=11.5.1-1ubuntu1
ENV LLVM_SYS_181_PREFIX=/usr/lib/llvm-18
ENV PATH=$PATH:/usr/lib/llvm-18/bin
ENV CUDA_PATH=/usr/lib/cuda
ENV LLVM_CONFIG_PATH=/usr/lib/llvm-18/bin/llvm-config
RUN mkdir -p /workspaces && \
    git clone --depth=1 --recursive https://github.com/vortexgpgpu/vortex.git /workspaces/vortex --branch master
WORKDIR /workspaces/vortex
RUN ./ci/install_dependencies.sh
RUN mkdir build && cd build && \
    ../configure --xlen=64 --tooldir=$HOME/tools --prefix=/usr/vortex && \
    ./ci/toolchain_install.sh --all
WORKDIR /workspaces/vortex/build
RUN . ./ci/toolchain_env.sh && \
    unset VERILATOR_ROOT && \
    make -s && \
    make install
ENV TOOLDIR=/root/tools
ENV POCL_ROOT=/workspaces/pocl
ENV VORTEX_HOME=/usr/vortex
ENV LLVM_VORTEX=/usr/lib/llvm-18
ENV CuPBoP_PATH=/workspaces/cupbop-vx
13 changes: 13 additions & 0 deletions .devcontainer/devcontainer.json
@@ -0,0 +1,13 @@
{
  "name": "CuPBoP VX",
  "build": {
    "dockerfile": "Dockerfile"
  },
  "customizations": {
    "vscode": {
      "extensions": [
        "llvm-vs-code-extensions.vscode-clangd"
      ]
    }
  }
}
157 changes: 0 additions & 157 deletions .github/workflows/build.yml

This file was deleted.
