Enterprise GPU Resource Billing & Multi-Tenant Management Platform
Chinese Documentation (中文文档) • Architecture • Quick Start • Features
Managing shared GPU clusters across multiple teams creates critical operational and financial challenges:
For Platform Administrators:
- How do you fairly allocate expensive GPU resources across competing teams?
- How do you prevent resource hogging while ensuring everyone gets their fair share?
- How do you track who's using what and implement accurate chargeback?
- How do you maintain strict multi-tenant isolation without complex manual configuration?
For Finance & Budget Teams:
- How do you implement automated chargeback for GPU usage without manual accounting?
- How do you prevent budget overruns before they happen?
- How do you generate accurate cost reports for internal billing?
For Development Teams:
- How do you get predictable, isolated access to GPU resources?
- How do you know when you're approaching your budget limits?
- How do you avoid impacting other teams' workloads?
Traditional Approach:
- Manual quota configuration per namespace
- Excel-based billing calculations
- No real-time cost visibility
- Complex multi-tool setup (quota management + cost tracking + billing system)
- Frequent resource conflicts and budget surprises
graph TB
subgraph WITHOUT["Without Bison"]
P1[❌ Manual Quota Management<br/>Per-namespace configuration]
P2[❌ Spreadsheet Billing<br/>Manual calculations & reports]
P3[❌ No Resource Isolation<br/>Teams compete for resources]
P4[❌ Budget Overruns<br/>No proactive alerts]
P5[❌ Complex Tooling<br/>Multiple systems to manage]
end
subgraph WITH["With Bison"]
S1[✅ Automated Team Quotas<br/>Capsule-powered isolation]
S2[✅ Real-Time Billing<br/>OpenCost integration]
S3[✅ True Multi-Tenancy<br/>Shared/Exclusive modes]
S4[✅ Proactive Alerts<br/>Balance monitoring & auto-suspend]
S5[✅ Unified Platform<br/>Single pane of glass]
end
P1 -.Transform.-> S1
P2 -.Transform.-> S2
P3 -.Transform.-> S3
P4 -.Transform.-> S4
P5 -.Transform.-> S5
style WITHOUT fill:#ffebee
style WITH fill:#e8f5e9
style S1 fill:#4caf50,color:#fff
style S2 fill:#4caf50,color:#fff
style S3 fill:#4caf50,color:#fff
style S4 fill:#4caf50,color:#fff
style S5 fill:#4caf50,color:#fff
Bison combines:
- 🔐 Kubernetes-native multi-tenancy (Capsule) - True team isolation with shared or exclusive node pools
- 💰 Real-time cost tracking (OpenCost + Prometheus) - Per-pod, per-namespace, per-team cost visibility
- 💳 Automated billing & budgets - Prepaid balances, auto-deduction, low-balance alerts, and auto-suspension
- 📊 Unified dashboard - Single interface for admins, team leaders, and finance teams
- 🔧 Zero external dependencies - All data stored in Kubernetes ConfigMaps (etcd-backed)
Result: Deploy once, get complete GPU resource management with automated billing in under 30 minutes.
What you see:
- Cluster Overview - Total teams, projects, resource pools, and quotas at a glance
- Resource Utilization - Visual breakdown showing which teams are consuming resources
- 7-Day Cost Trends - Historical cost data to identify spending patterns
- Top 5 Cost Rankings - Quickly identify heavy GPU consumers
- Team Budget Status - Real-time balance monitoring with color-coded alerts
Who benefits:
- Platform Administrators get instant visibility into cluster health and usage patterns
- Finance Teams can track costs in real-time without waiting for monthly reports
- Team Leaders can compare their usage against other teams
What you see:
- Team List with real-time status indicators:
- 🟢 Green balance = Healthy budget
- 🟡 Yellow balance = Approaching threshold
- 🔴 Red balance = Low balance or suspended
- Resource Allocation - CPU/Memory/GPU quotas per team (e.g., "cpu 0/10" means 0 used out of 10 allocated)
- Project Count - Number of namespaces/projects under each team
- Quick Actions - Edit quotas, recharge balance, or delete team with one click
Who benefits:
- Team Leaders monitor their budget status and resource usage at a glance
- Administrators manage multiple teams from a single unified view
- Finance Teams see which teams need recharging
What you see:
- Per-Resource Pricing - Set custom prices for CPU (per core-hour), Memory (per GB-hour), GPU (per GPU-hour)
- Currency Selection - Support for CNY, USD, EUR, and other currencies
- Enable/Disable Toggle - Turn billing on/off for specific resources with one click
- Billing Rules - Define how resources are metered (hourly, daily, etc.)
- Alert Thresholds - Configure when to send low-balance warnings
Who benefits:
- Finance Teams align cloud costs with internal chargeback policies
- Administrators adjust pricing based on actual hardware costs
- Budget Managers set appropriate warning thresholds to prevent overruns
- ✅ Multi-Tenant Management - Capsule-powered team isolation with OIDC
- ✅ Real-Time Billing - Usage-based billing with configurable pricing
- ✅ Dynamic Resource Quotas - CPU, Memory, GPU, or any K8s resource
- ✅ Team Balance & Wallet - Prepaid balance with auto-deduction
- ✅ Auto-Recharge - Scheduled balance top-ups (weekly/monthly)
- ✅ Balance Alerts - Multi-channel notifications (Webhook, DingTalk, WeChat)
- ✅ Usage Reports - Team/project analytics with export
- ✅ Audit Logging - Complete operation history
graph TB
subgraph USER_LAYER[User Layer]
UI[Web UI<br/>React + Ant Design]
API[REST API<br/>Go + Gin]
end
subgraph CORE[Core Services]
BS[Billing Service]
TS[Tenant Service]
QS[Quota Service]
end
subgraph K8S[Kubernetes Layer]
CA[Capsule<br/>Multi-Tenancy]
OC[OpenCost<br/>Cost Tracking]
PR[Prometheus<br/>Metrics]
end
subgraph DATA[Data Layer]
CM[ConfigMaps<br/>Zero Database]
end
UI --> API
API --> BS & TS & QS
BS --> OC
TS --> CA
QS --> CA
BS & TS --> CM
OC --> PR
- Kubernetes 1.22+
- Helm 3.0+
- kubectl configured
# Install Capsule (multi-tenancy)
helm repo add projectcapsule https://projectcapsule.github.io/charts
helm install capsule projectcapsule/capsule -n capsule-system --create-namespace
# Install Prometheus + OpenCost
helm repo add prometheus-community https://prometheus-community.github.io/helm-charts
helm install prometheus prometheus-community/kube-prometheus-stack -n monitoring --create-namespace
helm repo add opencost https://opencost.github.io/opencost-helm-chart
helm install opencost opencost/opencost -n opencost --create-namespace \
--set opencost.prometheus.internal.serviceName=prometheus-kube-prometheus-prometheus \
--set opencost.prometheus.internal.namespaceName=monitoring
# Install directly from GitHub Container Registry
helm install bison oci://ghcr.io/supermarioyl/bison/bison \
--namespace bison-system \
--create-namespace \
--set auth.enabled=true \
--version 0.0.2
Note: Requires Helm >= 3.8.0 for OCI support
# Download Helm chart from GitHub Release
wget https://github.com/SuperMarioYL/Bison/releases/download/v0.0.2/bison-0.0.2.tgz
# Install from downloaded chart
helm install bison bison-0.0.2.tgz \
--namespace bison-system \
--create-namespace \
--set auth.enabled=true
# Clone and deploy from source
git clone https://github.com/SuperMarioYL/Bison.git
cd Bison
helm install bison ./deploy/charts/bison \
--namespace bison-system \
--create-namespace \
--set auth.enabled=true
# Get admin password
kubectl get secret bison-auth -n bison-system -o jsonpath='{.data.password}' | base64 -d
# Port forward
kubectl port-forward svc/bison-api 8080:8080 -n bison-system
# Access API
curl http://localhost:8080/api/v1/health
Bison serves different user roles with tailored workflows. Here's how each persona uses the platform:
Scenario: Sarah is a platform engineer setting up a shared GPU cluster for 5 ML teams.
flowchart TD
Start([Admin Logs In]) --> ViewDash[View Dashboard<br/>Check cluster status]
ViewDash --> CreateTeam[Create New Team]
CreateTeam --> SetName[Set Team Info<br/>Name: ml-team<br/>Owner: team-lead@company.com]
SetName --> ChooseMode{Resource<br/>Mode?}
ChooseMode -->|Shared Pool| SharedConfig[Use Shared Nodes<br/>Cost-effective for small teams]
ChooseMode -->|Exclusive| ExclusiveConfig[Bind Dedicated Nodes<br/>Full performance isolation]
SharedConfig --> SetQuota[Set Resource Quotas<br/>10 GPUs, 100 CPU cores<br/>500Gi Memory]
ExclusiveConfig --> SelectNodes[Select Dedicated Nodes<br/>Label: bison.io/pool=team-ml]
SelectNodes --> SetQuota
SetQuota --> InitBalance[Set Initial Balance<br/>$10,000]
InitBalance --> ConfigAlerts[Configure Alerts<br/>Warning: 20%<br/>Critical: 10%]
ConfigAlerts --> TeamCreated[✅ Team Created]
TeamCreated --> SendKubeconfig[Generate & Send<br/>Kubeconfig to Team Leader]
SendKubeconfig --> Monitor[Monitor Usage<br/>via Dashboard]
Monitor --> CheckAlerts{Low Balance<br/>Alert?}
CheckAlerts -->|Yes| ApproveRecharge[Approve Recharge<br/>Request]
CheckAlerts -->|No| Continue[Continue Monitoring]
ApproveRecharge --> Monitor
Continue --> Monitor
style Start fill:#4caf50,color:#fff
style TeamCreated fill:#8bc34a,color:#fff
style CheckAlerts fill:#ff9800,color:#fff
Key Actions:
- Create team with appropriate resource mode (shared vs exclusive)
- Set quotas based on team size and workload requirements
- Initialize balance and configure alert thresholds
- Provide team leader with kubeconfig for namespace access
- Monitor cluster-wide metrics and respond to alerts
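For exclusive mode, the dedicated nodes need the pool label before the team binds to them. A minimal sketch, assuming node names are placeholders (the `bison.io/pool` label key is the one the platform uses):

```shell
# Sketch: preparing a dedicated node pool for an exclusive-mode team.
# Node names (gpu-node-1, gpu-node-2) are illustrative placeholders.
kubectl label node gpu-node-1 bison.io/pool=team-ml
kubectl label node gpu-node-2 bison.io/pool=team-ml

# Verify which nodes now belong to the pool
kubectl get nodes -l bison.io/pool=team-ml
```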
Scenario: Mike leads the Computer Vision team and needs to manage budgets across multiple projects.
flowchart TD
Start([Team Leader Logs In]) --> ViewTeam[View Team Dashboard]
ViewTeam --> CheckBalance[Check Current Balance<br/>$5,234 remaining<br/>Burn rate: $120/day]
CheckBalance --> Decision{What to do?}
Decision -->|Create Project| NewProject[Create New Project]
Decision -->|Monitor Costs| ViewReports[View Usage Reports]
Decision -->|Low Balance| RequestRecharge[Request Recharge]
NewProject --> ProjName[Set Project Name<br/>e.g., face-recognition]
ProjName --> AllocQuota[Allocate from Team Quota<br/>3 GPUs, 20 CPU cores]
AllocQuota --> AssignMembers[Assign Team Members]
AssignMembers --> ProjCreated[✅ Project Created<br/>New namespace ready]
ProjCreated --> NotifyDev[Notify Developers]
ViewReports --> SelectPeriod[Select Time Range<br/>Last 7 days]
SelectPeriod --> Breakdown[View Cost Breakdown<br/>By project/resource type]
Breakdown --> Export[Export to CSV<br/>Share with finance]
RequestRecharge --> FillAmount[Enter Amount: $10,000<br/>Add reason/PO number]
FillAmount --> SubmitRequest[Submit to Admin]
SubmitRequest --> WaitApproval{Approved?}
WaitApproval -->|Yes| BalanceUpdated[✅ Balance Updated]
WaitApproval -->|No| ReviseRequest[Revise & Resubmit]
BalanceUpdated --> SetAutoRecharge[Optional: Enable Auto-recharge<br/>Monthly: $8,000]
style Start fill:#2196f3,color:#fff
style ProjCreated fill:#8bc34a,color:#fff
style WaitApproval fill:#ff9800,color:#fff
style BalanceUpdated fill:#4caf50,color:#fff
Key Actions:
- Monitor team balance and consumption rate daily
- Create projects and allocate quotas to sub-teams
- Request recharges before balance runs out
- View detailed cost breakdowns by project
- (Optional) Set up auto-recharge for predictable budgets
Scenario: Alex is a data scientist deploying a GPU training job.
flowchart TD
Start([Developer Receives<br/>Kubeconfig]) --> SetContext[Set kubectl Context<br/>kubectl config use-context ml-training]
SetContext --> WriteYAML[Write Job Manifest<br/>Request 2 GPUs, 8 CPU cores]
WriteYAML --> ApplyJob[kubectl apply -f job.yaml]
ApplyJob --> K8sValidation{Kubernetes<br/>Validation}
K8sValidation -->|Syntax Error| FixYAML[Fix YAML Syntax]
FixYAML --> ApplyJob
K8sValidation -->|OK| CapsuleCheck{Capsule<br/>Quota Check}
CapsuleCheck -->|Quota Exceeded| QuotaError[❌ ERROR: Quota Exceeded<br/>requested: 2 GPU<br/>available: 0 GPU]
QuotaError --> ContactLead[Contact Team Leader<br/>Request more quota]
ContactLead --> WaitQuota[Wait for Quota Increase]
WaitQuota --> ApplyJob
CapsuleCheck -->|OK| Scheduled[✅ Pod Scheduled]
Scheduled --> NodeSelect{Resource<br/>Mode?}
NodeSelect -->|Shared| SharedNode[Runs on Shared Pool<br/>Might share node with other teams]
NodeSelect -->|Exclusive| ExclusiveNode[Runs on Team's Dedicated Nodes<br/>Full isolation]
SharedNode --> Running[Pod Running]
ExclusiveNode --> Running
Running --> BisonBilling[Bison Tracks Cost<br/>Real-time deduction]
BisonBilling --> MonitorJob[Monitor via kubectl logs<br/>Check cost in Dashboard]
MonitorJob --> JobComplete{Job<br/>Complete?}
JobComplete -->|No| MonitorJob
JobComplete -->|Yes| CleanUp[kubectl delete job<br/>Stop billing]
CleanUp --> TotalCost[Final Cost: $15.60<br/>Deducted from team balance]
style Start fill:#9c27b0,color:#fff
style QuotaError fill:#f44336,color:#fff
style Scheduled fill:#4caf50,color:#fff
style Running fill:#8bc34a,color:#fff
style TotalCost fill:#2196f3,color:#fff
Key Actions:
- Receive kubeconfig from team leader
- Write Kubernetes manifest (Job/Pod) with GPU requests
- Deploy workload - Capsule automatically enforces quotas
- Monitor job progress and cost accumulation
- Clean up resources after job completes to stop billing
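A Job manifest matching the flow above might look like the sketch below; the Job name, namespace, image, and command are illustrative placeholders, while `nvidia.com/gpu` is the standard resource key exposed by the NVIDIA device plugin:

```shell
# Sketch: a GPU training Job requesting 2 GPUs and 8 CPU cores.
cat <<'EOF' | kubectl apply -f -
apiVersion: batch/v1
kind: Job
metadata:
  name: train-demo
  namespace: ml-training
spec:
  template:
    spec:
      restartPolicy: Never
      containers:
      - name: trainer
        image: pytorch/pytorch:latest
        command: ["python", "train.py"]
        resources:
          requests:
            cpu: "8"
            nvidia.com/gpu: 2
          limits:
            nvidia.com/gpu: 2   # extended resources must have requests == limits
EOF

# Clean up when done so billing stops
kubectl delete job train-demo -n ml-training
```

Note that GPU requests and limits must be equal: Kubernetes does not allow overcommitting extended resources.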
Resource Isolation in Action:
- User's pods are automatically scheduled according to team's resource mode
- In Exclusive Mode: Capsule injects nodeSelector: {bison.io/pool: team-ml}, so pods only run on the team's dedicated nodes
- In Shared Mode: Pods can run on any node in the shared pool
- Quota enforcement prevents over-allocation regardless of mode
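Quota state can be checked directly with standard tooling; a sketch, assuming the namespace from the examples above (Capsule's Tenant CRD lives in the capsule.clastix.io group):

```shell
# See used vs. hard limits for the project namespace
kubectl get resourcequota -n ml-training
kubectl describe resourcequota -n ml-training

# Admin view of Capsule tenants and their namespaces
kubectl get tenants.capsule.clastix.io
```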
Bison integrates seamlessly with the Kubernetes ecosystem, leveraging proven open-source components:
graph TB
subgraph BISON[Bison Platform]
style BISON fill:#e3f2fd
BISON_UI[Bison Web UI<br/>React + Ant Design]
BISON_API[Bison API Server<br/>Go + Gin]
BISON_SCHED[Billing Scheduler<br/>Hourly Jobs]
end
subgraph K8S_CORE[Kubernetes Core]
style K8S_CORE fill:#fff3e0
K8S_API[Kubernetes API Server]
ETCD[etcd<br/>ConfigMaps Storage]
end
subgraph TENANT[Multi-Tenancy Layer]
style TENANT fill:#f3e5f5
CAPSULE[Capsule Controller]
TENANT_CRD[Tenant CRDs]
end
subgraph COST[Cost Tracking Stack]
style COST fill:#e8f5e9
OPENCOST[OpenCost]
PROM[Prometheus]
NODE_EXP[Node Exporter]
KUBE_STATE[kube-state-metrics]
end
subgraph ALERT["Alerting (Optional)"]
style ALERT fill:#fce4ec
WEBHOOK[Webhook]
DINGTALK[DingTalk]
WECHAT[WeChat Work]
end
BISON_UI -->|HTTPS| BISON_API
BISON_API -->|Create/Update Tenants| CAPSULE
BISON_API -->|Read/Write ConfigMaps| K8S_API
BISON_API -->|Query Costs| OPENCOST
BISON_SCHED -->|Hourly Billing| OPENCOST
BISON_SCHED -->|Update Balances| K8S_API
BISON_SCHED -->|Send Alerts| WEBHOOK
BISON_SCHED -.->|Optional| DINGTALK
BISON_SCHED -.->|Optional| WECHAT
CAPSULE -->|Watch/Reconcile| TENANT_CRD
CAPSULE -->|Create Namespaces| K8S_API
K8S_API -->|Persist Data| ETCD
OPENCOST -->|Query Metrics| PROM
PROM -->|Scrape| NODE_EXP
PROM -->|Scrape| KUBE_STATE
style BISON_API fill:#4caf50,color:#fff
style BISON_UI fill:#2196f3,color:#fff
style CAPSULE fill:#ff9800,color:#fff
style OPENCOST fill:#9c27b0,color:#fff
style PROM fill:#e65100,color:#fff
Key Integration Points:
- Kubernetes API - All Bison data stored in ConfigMaps (zero external database)
- Capsule - Provides multi-tenant isolation; team = Tenant, project = Namespace
- OpenCost - Tracks per-pod, per-namespace resource costs
- Prometheus - Provides metrics for OpenCost cost calculation
- Alert Channels - Flexible notification via Webhook, DingTalk, or WeChat
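OpenCost's allocation API, which Bison queries per the diagram, can also be hit directly for debugging; the /allocation endpoint on port 9003 with `window` and `aggregate` parameters is OpenCost's standard route:

```shell
# Query per-namespace costs for the last day straight from OpenCost
kubectl port-forward -n opencost svc/opencost 9003:9003 &
sleep 2   # give the port-forward a moment to establish
curl -s "http://localhost:9003/allocation?window=1d&aggregate=namespace"
```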
Bison uses Capsule to enforce strict resource isolation between teams. Here's how it works:
graph TB
subgraph K8S_CLUSTER[Kubernetes Cluster]
subgraph TEAM_A["Team A (Exclusive Mode)"]
style TEAM_A fill:#e3f2fd
T1[Capsule Tenant: team-ml]
T1_NS1[Namespace: ml-training<br/>ResourceQuota: 10 GPU]
T1_NS2[Namespace: ml-inference<br/>ResourceQuota: 5 GPU]
T1_POD1[Pod: trainer-1<br/>GPU: 2]
T1_POD2[Pod: serve-1<br/>GPU: 1]
T1 --> T1_NS1
T1 --> T1_NS2
T1_NS1 --> T1_POD1
T1_NS2 --> T1_POD2
end
subgraph TEAM_B["Team B (Shared Mode)"]
style TEAM_B fill:#fce4ec
T2[Capsule Tenant: team-cv]
T2_NS1[Namespace: cv-research<br/>ResourceQuota: 5 GPU]
T2_POD1[Pod: detector-1<br/>GPU: 2]
T2 --> T2_NS1
T2_NS1 --> T2_POD1
end
subgraph NODES[Node Pools]
style NODES fill:#f3e5f5
N1[Node 1<br/>label: bison.io/pool=team-ml<br/>GPUs: 4]
N2[Node 2<br/>label: bison.io/pool=team-ml<br/>GPUs: 4]
N3[Node 3<br/>label: bison.io/pool=shared<br/>GPUs: 8]
N4[Node 4<br/>label: bison.io/pool=shared<br/>GPUs: 8]
end
end
T1_POD1 -.Scheduled ONLY on.-> N1
T1_POD2 -.Scheduled ONLY on.-> N2
T2_POD1 -.Scheduled on.-> N3
T2_POD1 -.Scheduled on.-> N4
style T1 fill:#2196f3,color:#fff
style T2 fill:#e91e63,color:#fff
style N1 fill:#4caf50,color:#fff
style N2 fill:#4caf50,color:#fff
style N3 fill:#ff9800,color:#fff
style N4 fill:#ff9800,color:#fff
Isolation Mechanisms:
| Isolation Type | Implementation | Benefit |
|---|---|---|
| Namespace | Capsule Tenant owns namespaces | Teams can only create/access their own namespaces |
| Compute | ResourceQuota per Tenant | Teams cannot exceed allocated CPU/Memory/GPU |
| Node (Exclusive) | NodeSelector injection | Team's pods ONLY run on dedicated nodes |
| Node (Shared) | Shared pool with quotas | Cost-effective for smaller teams |
| Network (Optional) | NetworkPolicies | Prevent cross-team pod communication |
| Billing | Separate balance per team | One team's spending doesn't affect others |
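The optional network isolation row could be implemented with a per-namespace NetworkPolicy; a minimal sketch (the policy name and namespace are illustrative, and a CNI that enforces NetworkPolicies is assumed):

```shell
# Sketch: allow ingress only from pods in the same namespace.
cat <<'EOF' | kubectl apply -f -
apiVersion: networking.k8s.io/v1
kind: NetworkPolicy
metadata:
  name: same-namespace-only
  namespace: ml-training
spec:
  podSelector: {}            # applies to every pod in the namespace
  policyTypes: ["Ingress"]
  ingress:
  - from:
    - podSelector: {}        # a bare podSelector matches only this namespace
EOF
```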
How Capsule Enforces Isolation:
- Admission Webhook - Intercepts all pod creation requests
- Quota Validation - Checks if team has available quota (CPU/GPU/Memory)
- NodeSelector Injection - Automatically adds node labels for exclusive teams
- Rejection - Blocks requests that exceed quota or violate policies
Example:
# Developer creates this simple pod
apiVersion: v1
kind: Pod
metadata:
name: my-job
namespace: ml-training # Owned by team-ml
spec:
containers:
- name: trainer
image: pytorch:latest
resources:
requests:
nvidia.com/gpu: 2
# Capsule automatically transforms it to:
# (adds nodeSelector for exclusive mode)
apiVersion: v1
kind: Pod
metadata:
name: my-job
namespace: ml-training
spec:
nodeSelector:
bison.io/pool: team-ml # Injected by Capsule!
containers:
- name: trainer
image: pytorch:latest
resources:
requests:
nvidia.com/gpu: 2
sequenceDiagram
participant S as Scheduler
participant OC as OpenCost
participant BS as Billing Service
participant CM as ConfigMaps
participant A as Alert Service
loop Every Hour
S->>OC: Query namespace costs
OC-->>S: Usage data (CPU/Memory/GPU hours)
S->>BS: Calculate billing
BS->>CM: Deduct from team balance
alt Balance < Threshold
BS->>A: Trigger low balance alert
A->>A: Send notifications
end
alt Balance < 0
BS->>BS: Suspend team workloads
end
end
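The hourly deduction in the loop above amounts to a rate-times-usage sum. A sketch using the unit prices from the billing configuration example later in this document (the usage figures are illustrative):

```shell
# Sketch: hourly charge for one namespace, using illustrative usage numbers
# and the example prices (cpu 0.05/core-h, memory 0.01/GB-h, gpu 2.50/GPU-h).
cpu_hours=10; mem_gb_hours=40; gpu_hours=2
cpu_price=0.05; mem_price=0.01; gpu_price=2.50

cost=$(awk -v c=$cpu_hours -v m=$mem_gb_hours -v g=$gpu_hours \
           -v cp=$cpu_price -v mp=$mem_price -v gp=$gpu_price \
           'BEGIN { printf "%.2f", c*cp + m*mp + g*gp }')
echo "hourly charge: \$$cost"   # 10*0.05 + 40*0.01 + 2*2.50 = 5.90
```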
stateDiagram-v2
[*] --> Created: Create Team
Created --> Active: Initial Recharge
Active --> Active: Normal Usage
Active --> Warning: Balance < Threshold
Warning --> Active: Recharge
Warning --> Suspended: Balance ≤ 0
Suspended --> Active: Recharge
Suspended --> [*]: Delete Team
| Parameter | Description | Default |
|---|---|---|
| auth.enabled | Enable authentication | false |
| auth.admin.username | Admin username | admin |
| apiServer.replicaCount | API server replicas | 2 |
| dependencies.opencost.apiUrl | OpenCost API endpoint | http://opencost.opencost:9003 |
Configure pricing through the Web UI or API:
{
"enabled": true,
"currency": "USD",
"pricing": {
"cpu": 0.05,
"memory": 0.01,
"nvidia.com/gpu": 2.50
},
"billingInterval": "hourly"
}
Bison images are available on GitHub Container Registry:
# Pull images
docker pull ghcr.io/supermarioyl/bison/api-server:0.0.1
docker pull ghcr.io/supermarioyl/bison/web-ui:0.0.1
# Or use latest
docker pull ghcr.io/supermarioyl/bison/api-server:latest
docker pull ghcr.io/supermarioyl/bison/web-ui:latest
Available Tags:
- 0.0.1, 3.0, 3 - Semantic version tags
- latest - Latest stable release
Supported Platforms:
- linux/amd64
- linux/arm64
Bison Helm charts are distributed via GitHub Container Registry (GHCR) using the OCI format, which is the modern standard for Helm 3.8+.
Requirements:
- Helm >= 3.8.0 (for OCI support)
- Kubernetes >= 1.22
# Install specific version directly from GHCR
helm install my-bison oci://ghcr.io/supermarioyl/bison/bison --version 0.0.2
# Or pull the chart first, then install
helm pull oci://ghcr.io/supermarioyl/bison/bison --version 0.0.2
helm install my-bison bison-0.0.2.tgz
# Customize installation
helm install my-bison oci://ghcr.io/supermarioyl/bison/bison \
--version 0.0.2 \
--namespace bison-system \
--create-namespace \
--set opencost.url=http://opencost.opencost-system.svc:9003 \
--set auth.enabled=true
# Download chart from GitHub Releases
wget https://github.com/SuperMarioYL/Bison/releases/download/v0.0.2/bison-0.0.2.tgz
# Install from downloaded file
helm install my-bison bison-0.0.2.tgz \
--namespace bison-system \
--create-namespace
Why GHCR OCI Format?
- ✅ No need for separate Helm repository maintenance
- ✅ Unified image and chart management in GHCR
- ✅ Faster installation (direct pull from registry)
- ✅ Better version control and immutability
- ✅ Standard practice for Helm 3.8+
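To inspect a chart version before installing, Helm's standard OCI-aware commands work directly against GHCR:

```shell
# Show chart metadata and default values without installing (Helm >= 3.8)
helm show chart oci://ghcr.io/supermarioyl/bison/bison --version 0.0.2
helm show values oci://ghcr.io/supermarioyl/bison/bison --version 0.0.2
```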
# Install dependencies
make install-deps
# Run locally
make dev # API + Web UI (requires tmux)
make dev-api # API server only
make dev-web # Web UI only
# Build
make build # Docker images
make build-binary # Binary files
# Test
make test
make lint
Bison uses automated GitHub Actions for releases:
1. Create a Git tag to trigger release:
git tag v3.1.0
git push origin v3.1.0
2. GitHub Actions automatically:
- Builds multi-platform Docker images (amd64, arm64)
- Pushes images to GitHub Container Registry
- Packages Helm chart
- Publishes chart to GHCR (OCI format)
- Creates GitHub Release with chart attachment
3. Verify release:
- Check GitHub Releases
- Pull new images:
docker pull ghcr.io/supermarioyl/bison/api-server:0.0.2
- Install chart:
helm install test oci://ghcr.io/supermarioyl/bison/bison --version 0.0.2
bison/
├── api-server/ # Go backend (Gin framework)
│ ├── cmd/ # Entry point
│ ├── internal/
│ │ ├── handler/ # HTTP handlers
│ │ ├── service/ # Business logic
│ │ ├── k8s/ # Kubernetes client
│ │ └── scheduler/ # Background tasks
│ └── Dockerfile
├── web-ui/ # React frontend
│ ├── src/
│ │ ├── pages/ # Page components
│ │ ├── services/ # API clients
│ │ └── contexts/ # React contexts
│ └── Dockerfile
├── deploy/
│ └── charts/bison/ # Helm chart
├── docs/ # Documentation
└── Makefile
- Kubernetes Events integration
- Grafana dashboard templates
- Cost forecasting
- Budget alerts
- API rate limiting
- RBAC fine-grained permissions
MIT License - see LICENSE for details.
Bison - Simplify GPU Resource Management



