Getting Started

This guide walks you through installing SpawnDev.ILGPU and running your first GPU kernel — in the browser via Blazor WebAssembly or on desktop via a console app.

SpawnDev.ILGPU extends ILGPU — a high-performance .NET GPU computing framework by Marcel Koester — with three browser backends (WebGPU, WebGL, Wasm) while also supporting ILGPU's native desktop backends (CUDA, OpenCL, CPU). Your existing ILGPU kernels work across all six backends with zero changes to the kernel code.

Prerequisites

.NET 10 SDK (or later)

For browser (Blazor WebAssembly):

Blazor WebAssembly project
A modern browser (Chrome 113+ for WebGPU, any modern browser for WebGL/Wasm)

For desktop (Console, WPF, ASP.NET):

Any .NET 10 project
NVIDIA GPU + driver (for CUDA) or OpenCL 2.0+ GPU (for OpenCL) — or CPU-only

Installation

dotnet add package SpawnDev.ILGPU

This installs SpawnDev.ILGPU along with the bundled ILGPU compiler and SpawnDev.BlazorJS for browser interop.

Configure Program.cs

SpawnDev.ILGPU requires SpawnDev.BlazorJS to be initialized. This replaces the standard RunAsync() call:

using SpawnDev.BlazorJS;

var builder = WebAssemblyHostBuilder.CreateDefault(args);
builder.RootComponents.Add<App>("#app");
builder.RootComponents.Add<HeadOutlet>("head::after");

// Add BlazorJS services (required for browser interop)
builder.Services.AddBlazorJSRuntime();

// Use BlazorJSRunAsync instead of RunAsync
await builder.Build().BlazorJSRunAsync();

Desktop apps do not require SpawnDev.BlazorJS. Use SpawnDev.ILGPU directly — no special initialization needed.

Publishing Configuration

When publishing your app, you must disable IL trimming and AOT compilation. ILGPU relies on IL reflection to compile kernels at runtime:

<PropertyGroup>
  <!-- Required: ILGPU uses IL reflection for kernel compilation -->
  <PublishTrimmed>false</PublishTrimmed>
  <RunAOTCompilation>false</RunAOTCompilation>
</PropertyGroup>

Your First Kernel

Here's a complete example that adds two arrays on the GPU:

@page "/gpu-demo"
@using ILGPU
@using ILGPU.Runtime
@using SpawnDev.ILGPU

<h3>GPU Vector Addition</h3>
<p>@_result</p>

@code {
    private string _result = "Running...";

    protected override async Task OnAfterRenderAsync(bool firstRender)
    {
        if (!firstRender) return;

        // 1. Create a context with all available backends
        using var context = await Context.CreateAsync(builder => builder.AllAcceleratorsAsync());

        // 2. Create the best available accelerator
        // Browser: WebGPU > WebGL > Wasm | Desktop: CUDA > OpenCL > CPU
        using var accelerator = await context.CreatePreferredAcceleratorAsync();

        // 3. Allocate GPU buffers
        int length = 256;
        using var bufA = accelerator.Allocate1D(
            Enumerable.Range(0, length).Select(i => (float)i).ToArray());
        using var bufB = accelerator.Allocate1D(
            Enumerable.Range(0, length).Select(i => (float)i * 2f).ToArray());
        using var bufC = accelerator.Allocate1D<float>(length);

        // 4. Load and invoke the kernel
        var kernel = accelerator.LoadAutoGroupedStreamKernel<
            Index1D, ArrayView<float>, ArrayView<float>, ArrayView<float>>(VectorAddKernel);
        kernel((Index1D)length, bufA.View, bufB.View, bufC.View);

        // 5. Wait for GPU to finish (MUST use async version in Blazor WASM)
        await accelerator.SynchronizeAsync();

        // 6. Read results back to CPU
        var results = await bufC.CopyToHostAsync<float>();

        _result = $"Backend: {accelerator.Name} | " +
                  $"result[0]={results[0]}, result[1]={results[1]}, result[255]={results[255]}";
        StateHasChanged();
    }

    // The kernel — this is the code that runs on the GPU
    // It's a standard ILGPU kernel: static, void, Index as first param
    static void VectorAddKernel(
        Index1D index,
        ArrayView<float> a,
        ArrayView<float> b,
        ArrayView<float> c)
    {
        c[index] = a[index] + b[index];
    }
}

Understanding the Code

Context and Backend Discovery

using var context = await Context.CreateAsync(builder => builder.AllAcceleratorsAsync());

Context.CreateAsync is the async version of ILGPU's Context.Create. The AllAcceleratorsAsync() extension probes the environment for all available backends — browser backends (WebGPU, WebGL, Wasm) in Blazor, or native backends (CUDA, OpenCL, CPU) on desktop — and registers them.

Accelerator Creation

using var accelerator = await context.CreatePreferredAcceleratorAsync();

This picks the best available backend: WebGPU → WebGL → Wasm in the browser, or CUDA → OpenCL → CPU on desktop. You can also target a specific backend — see Backends.

The Kernel

static void VectorAddKernel(Index1D index, ArrayView<float> a, ArrayView<float> b, ArrayView<float> c)
{
    c[index] = a[index] + b[index];
}

A kernel is a static void method. The first parameter is always an index type (Index1D, Index2D, or Index3D). Think of it as the body of a parallel for loop — each thread gets a unique index value and runs the same code.

Key rules:

Kernels must be static methods
Only value types are allowed (no classes, no string, no reference types)
No throw statements — see Limitations
No ref or out parameters
Use ArrayView<T> to access GPU memory (like a Span<T> for the GPU)

Async Synchronization

await accelerator.SynchronizeAsync();

Critical: In Blazor WASM, you must use SynchronizeAsync() instead of Synchronize(). The main thread is single-threaded — calling the synchronous version will deadlock. On desktop, both sync and async work, but async is recommended for cross-platform code.

Data Readback

var results = await bufC.CopyToHostAsync<float>();

This copies data from the GPU back to a C# array. It works with all six backends automatically.

Next Steps

Backends — Learn about each backend's capabilities and configuration options
Capabilities & Backend Selection — Declare what your kernel needs upfront so the right backend is auto-selected (and incompatible ones rejected with a typed exception instead of silent garbage)
Writing Kernels — Deeper dive into kernel programming, math functions, and advanced patterns
Memory & Buffers — Buffer allocation, transfer patterns, and zero-allocation readback

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Getting Started

Prerequisites

Installation

Configure Program.cs

Publishing Configuration

Your First Kernel

Understanding the Code

Context and Backend Discovery

Accelerator Creation

The Kernel

Async Synchronization

Data Readback

Next Steps

Uh oh!

FilesExpand file tree

getting-started.md

Latest commit

History

getting-started.md

File metadata and controls

Getting Started

Prerequisites

Installation

Configure Program.cs

Publishing Configuration

Your First Kernel

Understanding the Code

Context and Backend Discovery

Accelerator Creation

The Kernel

Async Synchronization

Data Readback

Next Steps