KernelGeneration is a repository created to evaluate GPU kernel metrics, including:
- correctness
- performance
- portability
## Requirements

PyTorch nightly (CUDA or ROCm). Install the nightly build that matches your system before running the installer.

- CUDA example:

      pip install --pre torch torchvision --index-url https://download.pytorch.org/whl/nightly/cu124

- ROCm example:

      pip install --pre torch torchvision --index-url https://download.pytorch.org/whl/nightly/rocm6.4
Use the official PyTorch install selector to pick the correct nightly wheel for your OS/driver stack.
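After installing a nightly wheel, it can help to confirm which accelerator backend the build actually targets before running the installer. The check below is an optional sketch, not part of this repository; the helper name `backend_summary` is hypothetical:

```python
# Optional sanity check: report which backend the installed torch wheel targets.
# Hypothetical helper, not part of kernelGen.
import importlib.util


def backend_summary() -> str:
    """Return a short description of the torch build, or a hint if torch is missing."""
    if importlib.util.find_spec("torch") is None:
        return "torch not installed - install a nightly wheel first"
    import torch
    if torch.version.cuda:                    # CUDA wheels report a CUDA version
        return f"CUDA {torch.version.cuda}"
    if getattr(torch.version, "hip", None):   # ROCm wheels report a HIP version
        return f"ROCm/HIP {torch.version.hip}"
    return "CPU-only build"


print(backend_summary())
```

A CUDA nightly should print something like `CUDA 12.4`; a ROCm nightly prints a HIP version instead.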
### (Optional) KernelLLM prompts

If you want to try KernelLLM-style prompting, see the templates in `./prompt/`.
Note: the installer does not clone KernelLLM or install `transformers`/`accelerate`.
After setup, export:
export TRITONBENCH_RUN_CONFIG="$(pwd)/benchmark_helion_runner.yaml"
export TRITONBENCH_HELION_PATH="$(pwd)/helion"

## Repository layout

    kernelGen/
    ├── README.md
    ├── install.py
    ├── benchmark_helion_runner.yaml
    ├── prompt/
    │   └── ...          # optional prompt templates for KernelLLM-style tests
    └── operators/
        ├── bf16_layernorm/
        └── bf16_matmul/
- `install.py`: installs TritonBench, clones Helion (the kernel-gen-rh branch) and installs it in editable mode, and copies the local operators into TritonBench. It does not install PyTorch or KernelLLM.
- `operators/`: GPU kernel operators (copied into TritonBench during install).
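In outline, the installer's steps could be sketched as below. The helper names and the exact git/pip flags are illustrative assumptions, not the actual contents of `install.py`:

```python
# Illustrative sketch of the three steps install.py performs; not the real source.
import shutil
from pathlib import Path

HELION_BRANCH = "kernel-gen-rh"  # the branch named in this README


def clone_cmd(repo_url: str, branch: str, dest: str) -> list[str]:
    """git command to clone only the branch the installer needs."""
    return ["git", "clone", "--branch", branch, "--single-branch", repo_url, dest]


def editable_install_cmd(path: str) -> list[str]:
    """pip command for an editable install of the cloned checkout."""
    return ["pip", "install", "-e", path]


def copy_operators(src: Path, tritonbench_ops: Path) -> None:
    """Copy each local operator directory into TritonBench's operator tree."""
    tritonbench_ops.mkdir(parents=True, exist_ok=True)
    for op_dir in src.iterdir():
        if op_dir.is_dir():
            shutil.copytree(op_dir, tritonbench_ops / op_dir.name, dirs_exist_ok=True)
```

The commands would be run with `subprocess.run(cmd, check=True)`; the real script may differ in flags and ordering.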
## Operators

- MatMul
- LayerNorm: WIP (optional KernelLLM prompts available in `./prompt/`)
- WIP: GELU
## Backends

- TorchInductor
- Triton
- Helion
- KernelLLM (optional, prompts only via `./prompt/`)
- WIP: Mako (based on KernelLLM)
## Quick start

- Install PyTorch nightly (see Requirements above).
- Run the installer (clones TritonBench, clones & installs Helion, and copies operators):
python install.py
- Export environment variables (adjust paths as needed):

      export TRITONBENCH_RUN_CONFIG="$(pwd)/benchmark_helion_runner.yaml"
      export TRITONBENCH_HELION_PATH="$(pwd)/helion"
- Run the benchmark:
python tritonbench/run.py
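The two exports and the run command above can also be combined in a small driver script. This is a hypothetical convenience wrapper, not part of the repository; uncomment the `subprocess.run` line to actually launch the benchmark:

```python
# Hypothetical wrapper combining the env exports and the run command above.
import os
import sys
from pathlib import Path


def benchmark_invocation(repo_root: Path) -> tuple[list[str], dict[str, str]]:
    """Build the command and environment for the benchmark without launching it."""
    env = dict(os.environ)
    env["TRITONBENCH_RUN_CONFIG"] = str(repo_root / "benchmark_helion_runner.yaml")
    env["TRITONBENCH_HELION_PATH"] = str(repo_root / "helion")
    cmd = [sys.executable, "tritonbench/run.py"]
    return cmd, env


cmd, env = benchmark_invocation(Path.cwd())
# import subprocess; subprocess.run(cmd, env=env, check=True)  # launch for real
```

Keeping the paths derived from one `repo_root` argument avoids the two exports drifting out of sync when the checkout moves.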