Documentation

Crabka is a Rust reimplementation of Apache Kafka. It speaks the Kafka wire protocol, stores records in Kafka-compatible log segments, runs metadata on KRaft, and is validated against the official JVM clients and command-line tooling.

This page combines the project introduction and system overview. Use it to learn what Crabka is, how the pieces fit together, and where to go next.

Project Status

Crabka is beta, pre-1.0 software. The Kafka-facing surface is broad. Wire protocol, log storage, replication, KRaft metadata, authorization, quotas, tiered storage, transactions, consumer groups, share groups, Schema Registry, Kubernetes operation, and Rust clients are all implemented to meaningful depth and tested against JVM Kafka behavior.

The important caveat is the same one stated in the repository README: Crabka is still greenfield infrastructure. It has no production users and does not yet promise on-disk compatibility across versions. Use it for evaluation, development, interoperability testing, and non-critical workloads while the project hardens.

Why Crabka

If you run Kafka on Kubernetes today, you usually operate the JVM broker, Strimzi, and Cruise Control as separate pieces. Crabka reimplements the broker in safe Rust and ships the surrounding operational tools in one workspace.

Kafka wire compatibility. Protocol codecs are generated from Apache Kafka message schemas and checked byte-for-byte against kafka-clients.
Works with existing tooling. JVM tools such as kafka-topics.sh, kafka-configs.sh, kafka-acls.sh, and kafka-consumer-groups.sh run against a live Crabka broker.
Rust runtime. Crabka uses tokio, forbids unsafe code across the workspace, and avoids JVM heap tuning and garbage-collection pauses.
KRaft-native. Metadata is stored in a native KRaft quorum. ZooKeeper mode and ZooKeeper-to-KRaft migration are deliberately out of scope.
Operations included. The repository includes a Kubernetes operator, Prometheus metrics, OTLP tracing, Helm charts, OCI images, and a partition rebalancer.
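
To make the wire-compatibility point concrete, here is a minimal sketch of what "byte-for-byte" means for one very small request. It hand-encodes an ApiVersions v0 request (API key 18, request header version 1, empty body) with the 4-byte big-endian length prefix that frames every Kafka request. The function name `encode_api_versions_v0` and the client id `demo` are illustrative assumptions; this is not Crabka's generated codec, only an example of the kind of byte layout such a codec has to reproduce exactly.

```rust
/// Hypothetical helper (not part of Crabka): encodes an ApiVersions v0
/// request as it appears on the wire.
fn encode_api_versions_v0(correlation_id: i32, client_id: &str) -> Vec<u8> {
    const API_KEY_API_VERSIONS: i16 = 18;
    const API_VERSION: i16 = 0;

    // Request header v1: api_key, api_version, correlation_id, client_id.
    let mut payload = Vec::new();
    payload.extend_from_slice(&API_KEY_API_VERSIONS.to_be_bytes());
    payload.extend_from_slice(&API_VERSION.to_be_bytes());
    payload.extend_from_slice(&correlation_id.to_be_bytes());
    // client_id is a nullable string: INT16 length, then the UTF-8 bytes.
    payload.extend_from_slice(&(client_id.len() as i16).to_be_bytes());
    payload.extend_from_slice(client_id.as_bytes());
    // ApiVersions v0 has no body fields, so the payload ends here.

    // Every request is framed by an INT32 length of the bytes that follow.
    let mut frame = Vec::with_capacity(4 + payload.len());
    frame.extend_from_slice(&(payload.len() as i32).to_be_bytes());
    frame.extend_from_slice(&payload);
    frame
}

fn main() {
    let frame = encode_api_versions_v0(1, "demo");
    // Print the frame as hex bytes for comparison with a packet capture.
    println!("{:02x?}", frame);
}
```

A compatibility check in this spirit would compare such output against the bytes produced by kafka-clients for the same logical request, and fail on any difference.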

How It Fits Together

flowchart TD
  Clients[Kafka clients and kafka-*.sh tools] --> Broker[crabka-broker]
  Broker --> Log[Kafka-compatible log segments]
  Broker --> KRaft[KRaft metadata quorum]
  Broker --> Remote[Tiered storage]
  Operator[crabka-operator] --> Broker
  Registry[crabka-schema-registry] --> Broker
  Gateway[crabka-grpc-gateway] --> Broker
  Rebalancer[crabka-rebalancer] --> Broker
  Replicator[crabka-replicator] --> Broker
  RustClients[Rust producer / consumer / admin / streams clients] --> Broker

The broker handles Kafka protocol traffic and persists data to Kafka-compatible logs. Metadata changes flow through the KRaft quorum. The operator, Schema Registry, gateway, rebalancer, replicator, and Rust clients sit around that core as normal Kafka clients or cluster controllers.

Documentation Map

The docs are organized by what you are trying to do:

Start Here covers the local quickstart, the architecture, and the browser-based consensus playground.
Deploy covers Kubernetes operation and Schema Registry.
Build covers the Rust Streams client, data formats, and rustdoc entry points.
Reference is generated from source during the docs build and contains exact CRD fields, broker configuration keys, topic configuration keys, protocol API tables, and consensus failure diagrams.

Next Steps

Run a local broker with the Quickstart.
Understand the system with Architecture.
Deploy a Kubernetes cluster with Operator Deployment.
Look up exact fields and config keys in the generated Reference.
