Every Kafka engineer has been asked the same question by a frustrated developer: "Can we get production data into our test environment?" And every Kafka engineer has had to explain why the answer is complicated. There's no built-in tool for it. There's no standard pattern for it. Whatever solution you build ends up being custom, brittle, and expensive to maintain.
This post walks through how environment cloning works with Kannika, including the operational details that matter when you're putting it in front of a real development team.
Why this was hard to solve
The reason Kafka environment cloning never became a standard pattern is structural. A Kafka cluster is a streaming system with a specific broker topology, partition layout, and offset bookkeeping. Copying data between clusters means dealing with all of it: topic names, partition counts, replication factors, consumer offsets, schema registry references, and the actual records themselves. A naive copy that only moves record payloads leaves you with an environment that doesn't behave like production.
Over the years, teams have built various workarounds. Some use Kafka MirrorMaker to replicate a topic between clusters, which works but ties the two clusters together operationally and doesn't handle point-in-time consistency well. Some write custom consumer-producer loops that read from production and write to test. Some give up and generate synthetic data based on production schemas. None of these options produce a clean, repeatable clone that developers can spin up on demand.
How Kannika handles it
Kannika's restore flow is designed to target any Kafka cluster, not just the one that was backed up. That's the primitive that makes environment cloning work. Your production Kafka is backed up in real time. When your team needs a test environment, they apply a restore definition pointing at a different cluster.
The restore definition is a YAML file that specifies the source backup, the target cluster, and the configuration for the restore operation. Two parameters matter for cloning specifically. The target cluster is where the restored data lands. The topic prefix, if you set one, gets prepended to every restored topic name.
Setting a prefix is a small detail that turns out to be important. Without it, you're restoring topics with their original names, which conflicts with anything already in your target cluster. With it, all your production topics land in the target cluster as "dev.orders", "dev.inventory", "dev.payments", or whatever prefix you choose. Your existing dev topics stay untouched. Your applications can be configured to read from the prefixed topics during testing.
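To make the shape of this concrete, here is a sketch of what such a restore definition could look like. The field names below are illustrative assumptions, not Kannika's documented schema; check the actual product documentation for the real format.

```yaml
# Hypothetical restore definition -- field names are assumptions for illustration.
kind: Restore
metadata:
  name: clone-prod-to-dev
spec:
  source:
    backup: prod-kafka-backup    # the continuous backup of the production cluster
  target:
    cluster: dev-kafka           # any reachable Kafka cluster, not just the source
  topicPrefix: "dev."            # prepended to every restored topic name
```

The two cloning-relevant parameters from the text are visible here: the target cluster, and the topic prefix that keeps restored topics from colliding with anything already in dev.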
You apply the restore, Kannika starts hydrating the target cluster from the backup, and the UI gives you visibility into the progress. When the restore finishes, your dev environment has the full dataset from production, organized under the prefix, ready for testing.
Handling sensitive data
Production data comes with an obvious problem: it is production data. PII, account numbers, personal identifiers, business secrets. Moving that data into a development environment without transformation is a compliance violation waiting to happen.
Kannika handles this through transformation plugins. A transformation plugin runs during the restore and applies a transformation to records before they land in the target cluster. The plugin can mask specific fields, replace them with synthetic values, encrypt them, or strip them entirely. The transformation happens inline, so by the time the data reaches your dev cluster, the sensitive fields are already handled.
The plugin system is extensible. Kannika ships with a baseline plugin that handles common cases, and the implementation is open enough that you can write your own transformations to match your specific data model. If you have a custom record schema with domain-specific PII, you can build a plugin that understands your format and applies the right transformation rules. The plugin is packaged as a WASI module, which means you can use whatever language you prefer to write the transformation logic.
This matters for compliance because it turns environment cloning into a GDPR-compatible operation. Your dev team gets data that looks and behaves like production, but the sensitive fields are handled according to your privacy requirements. The auditor sees a documented transformation step, not a raw data copy.
Repeatable test environments
There's one more use case that emerges once you have environment cloning working, and that's repeatable test sets. The idea is simple. You capture a specific snapshot of production data, save it as a named set, and reuse it as a baseline for testing. Every time a developer needs a fresh environment, they restore from the same set. Every time a test run corrupts the data, they throw it away and restore again.
This pattern solves a problem that used to require weeks of engineering. Setting up repeatable, isolated, production-like test environments on Kafka was previously the domain of custom infrastructure code. With Kannika, it's a restore operation against a saved baseline. You can use it for regression testing, performance testing, disaster recovery drills, or any workflow that benefits from a known starting state.
For performance testing specifically, there's a nice property of the restore mechanism. Kannika can replay records at their original production rate, not at maximum throughput. That means your test environment sees traffic with realistic timing, not a firehose. If your application has timing-sensitive logic, latency budgets, or windowing operations, this matters. A test that runs at the wrong speed isn't testing the right thing.
Operational considerations
A few things to keep in mind when you're putting environment cloning into a real workflow:
The restore targets a Kafka cluster you control. That means you need a destination cluster configured and reachable from Kannika. For most teams, this is a dev or acceptance cluster that already exists. If you're spinning up ephemeral test environments, you'll need to provision the cluster as part of your workflow before the restore runs.
The transformation plugin runs for every record during the restore. For large datasets, plan for the processing time. A simple field masking operation is fast. A complex transformation with external lookups is slower. Benchmark your plugin against realistic data volumes before you put it in a critical path.
The prefix is a convention, not a guarantee of isolation. Your applications need to be configured to read from the prefixed topics, or you need to route them to the dev cluster entirely. Getting this wrong means tests that quietly run against stale dev topics instead of the restored production data, passing in dev and then failing in production.
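One simple way to keep applications honest about the prefix convention is to resolve topic names through a single helper driven by per-environment configuration. The environment variable name here is a hypothetical convention, not anything Kannika prescribes.

```python
import os

def resolve_topic(base: str) -> str:
    """Prepend the environment's topic prefix, if any.

    Assumption: TOPIC_PREFIX is an env var your deployment sets per
    environment (e.g. "dev." in the cloned cluster, unset in production).
    """
    prefix = os.environ.get("TOPIC_PREFIX", "")
    return f"{prefix}{base}"
```

With every consumer and producer going through one resolver like this, pointing an application at the cloned, prefixed topics becomes a deployment setting rather than a code change.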
Try it yourself
Spin up the Kannika sandbox at kannika.io to explore the UI and get a feel for how restores are configured. The sandbox covers the basic actions, so you can see the building blocks. To run the full environment cloning flow against your own clusters, with prefixing, transformation plugins, and target routing, get in touch and we can set up a deeper walkthrough.