Your code runs on our servers.
Here's exactly what happens.

Puzzles and Labs share the same foundation: your code builds and runs server-side with independent safety layers around it. Puzzle metrics and lab checks happen next to the code, where the browser cannot fake the result.

Katabench Labs

From course to check

A course organizes the journey. Each lab opens a focused workspace, lessons guide the work, and executable checks prove the behavior before you continue.

Product

Katabench Labs

The guided, executable practice area.

Course

Choose a progression

The Aspire or Microservices course organizes the skills in order.

Lab

Focus the problem

One engineering scenario with its own project and goal.

Lesson

Make the change

Guided work that advances the lab.

Workspace

Build and run

A disposable environment for your code and live output.

Check

Prove the behavior

Executable verification before the next lesson unlocks.

Preview 2 courses, 17 labs, and 107 lessons →

The pipeline

The journey of a submission

From your keystroke to a graded report, five stops, each one assuming the previous could be wrong.

Write

C# in a full editor, right in your browser. Run the samples as often as you like.

Screen

Every submission is checked before anything runs. Hostile code never executes.

Compile

Built server-side with the same compiler you use locally, full diagnostics on errors.

Run, sealed

Executed in a disposable, fully isolated sandbox. One per submission.

Measure & grade

Time, memory, and correctness, measured on the server, where they can't be faked.

Seconds, end to end: the snippet goes out, the graded report comes back, per-test timings, budgets, memory, and diffs.

Containment

Inside the sandbox

We treat every submission as hostile, including yours. That's a feature: the same walls that contain malicious code make the grading fair and the timings clean.

our execution environment

containment log

$ submit Solution.cs

▸ sandbox spawned fresh · single-use

# security hardening, applied to every run

✓ network isolated

✓ file system read-only

✓ cpu · memory · processes capped

▸ tests complete 148 ms

▸ sandbox destroyed nothing persists

▮

This exact lifecycle runs for every submission, spawn, seal, run, measure, destroy. There is no long-lived server your code shares.

The grading

How each track grades you

Both solutions are correct. Only one survives the hidden suite.

Track 01

Algorithms

Correct gets you halfway. The hidden suite scales the input until complexity decides the outcome.

· Visible sample cases plus a hidden suite with inputs large enough that complexity decides the outcome.
· Per-test time budgets: correct-but-slow times out where the efficient solution passes with room to spare.
· Allocation tracking on every run, and on some puzzles a hard allocation budget, where the copy-and-reverse approach fails and the in-place one passes.

The N+1 starter

… ×41 round-trips

✗ Timeout over budget

The set-based rewrite

1 round-trip

✓ Passed 12 ms

Same rows returned. The grader shows you what each approach cost.

Track 02

Database / EF

Your LINQ runs against a real database, and the grader shows you what it really cost.

· Enough data that inefficiency shows up in the timings, not just in code review.
· Expand any test to see the queries your code actually produced, and what they cost.
· Plan-graded puzzles capture the execution plan the engine chose and grade it: full-table reads flagged in red, index usage in green. The right rows the wrong way fails.

method length

limit 25 lines

the starter: 78 ✗ yours: 18 ✓

cyclomatic complexity

limit 8

the starter: 14 ✗ yours: 6 ✓

nesting depth

limit 3 levels

the starter: 5 ✗ yours: 2 ✓

✓ 9 / 9 tests still passing (behavior never changed)

Measured directly from your source. The tests stay green; the mess has to go.

Track 03

Refactoring

Working code you'd hate to inherit. Make it clean, without changing what it does.

· The behavioral tests pass before you touch anything, and they must still pass when you're done.
· Structural gates measured from your source: method length, cyclomatic complexity, nesting depth, duplicate blocks.
· Every flavor of real-world mess (tangled conditionals, god methods, copy-paste blocks, arrow code), including the famous Gilded Rose kata.

Infrastructure

database, email, HTTP

Application

use cases

Domain

pure, depends on nothing

Domain code reaching for the database fails the grade, instantly, on every submission.

Track 04

Architecture

Multi-file refactoring katas, graded on behavior and design together.

· Dependency direction, layering, and abstraction boundaries are verified automatically.
· A wrong dependency fails the submission the same way a failing test does.
· Feedback in seconds, not in a code review three weeks later.

The adversarial suite

../../etc/passwd

path traversal

✓ denied

evil-example.com

suffix confusion

✓ rejected

http://169.254.169.254/

SSRF

✓ blocked

ada\n[INFO] admin granted

log forging

✓ escaped

✓ 6 / 6 functional | ✓ 4 / 4 exploits blocked

Two suites, one grade: close the hole without breaking the feature.

Track 05

Secure Coding

Functionally correct, quietly exploitable. Close the hole without breaking the feature.

· Two suites grade every submission: functional tests prove the feature still works, adversarial tests throw real attack payloads at it.
· The payloads are the classics that hit production systems: path traversal, SQL injection, SSRF, open redirect, log forging, mass assignment.
· The catch: the naive fix (blacklist a bad string, strip a tag) is exactly what the adversarial cases are built to defeat. Only a real fix, encode the output, allowlist the input, parameterize the query, passes.

Your code stays yours

Submissions are stored so you can see your own history and progress, that's it. We don't publish them, and the sandbox they ran in is gone seconds after they finish. The details live in the Privacy Policy.

See the grading for yourself

One puzzle is all it takes to understand why server-measured beats green checkmarks.

Start solving, it's free

Your code runs on our servers. Here's exactly what happens.

From course to check

Katabench Labs

Choose a progression

Focus the problem

Make the change

Build and run

Prove the behavior

The journey of a submission

Write

Screen

Compile

Run, sealed

Measure & grade

Inside the sandbox

How each track grades you

Algorithms

Database / EF

Refactoring

Architecture

Secure Coding

Your code stays yours

See the grading for yourself

Get new puzzles and .NET tips in your inbox

Your code runs on our servers.
Here's exactly what happens.