-
LINAK
- Denmark
- https://www.linkedin.com/in/tugbars-heptaskin/
Pinned Loading
-
SMC-square-with-CPMMH-Rejuvenation
SMC-square-with-CPMMH-Rejuvenation PublicA GPU-accelerated SMC² framework with Rao–Blackwellized inner filters and correlated PMMH rejuvenation, designed to amortize likelihood-based trial-and-error through massive parallelism.
Cuda
-
Flash-Attention-PTX-CUDA
Flash-Attention-PTX-CUDA PublicHand-written PTX flash attention kernel achieving 58% tensor core utilization on RTX 5080, matching A100's Flash Attention 2 without WGMMA, TMA, or datacenter hardware. 136 TFLOPS FP16.
Cuda
-
Bootstrap-Particle-Filter-in-PTX
Bootstrap-Particle-Filter-in-PTX PublicBPF Bootstrap Particle Filter — Hand-Written PTX: For educational purposes.
Cuda 1
-
ICEEMDAN-MKL
ICEEMDAN-MKL PublicHigh-performance ICEEMDAN implementation using Intel MKL. Header-only C++17, OpenMP parallelized, ~11ms @ 2048 samples. Cubic/Akima splines, multiple processing modes (Standard/Finance/Scientific).
-
Savitzky-Golay-Filter
Savitzky-Golay-Filter PublicHigh-performance Savitzky-Golay filter in C: batch, streaming, and 2D image processing. Embedded-friendly with coefficient export for MCUs. MATLAB-validated.
If the problem persists, check the GitHub status page or contact support.

