A cyber range is the closest analog to a live-fire range that a defense organization has for developing and testing cyber capabilities. Like a physical range, it must be realistic enough to produce genuine skill transfer, safe enough to prevent damage to operational systems, and repeatable enough to measure improvement across training rotations. Unlike a physical range, the target environment is entirely software-defined – which means every design decision about what to emulate, how to orchestrate scenarios, and how to score trainee performance is an architectural choice that directly determines training effectiveness. This article examines the principal architectural components of a defense cyber range and the engineering decisions that distinguish functional ranges from ones that fail to transfer to operations.
Core architecture: four layers of a defense cyber range
A well-structured cyber range separates concerns across four layers, each with distinct engineering requirements and operational interfaces.
1. Virtualization and network emulation layer. This layer provides the compute and network fabric on which the emulated target environment runs. Virtualization (KVM, VMware ESXi, or equivalent) hosts operating system instances with real OS images, real application stacks, and real vulnerabilities. The network emulation fabric – typically implemented with a software-defined networking (SDN) controller and virtual switches – controls the topology connecting those instances: which VLANs exist, what bandwidth and latency constraints apply to each link, which firewall rules govern inter-segment traffic. The distinguishing property of this layer is that it runs real binaries: a trainee running a port scanner against a range target gets the same response as they would against a live host, because the target is a live host, just isolated within the range fabric.
2. Scenario orchestration layer. Orchestration is what transforms a collection of running VMs into a training exercise. The orchestrator reads scenario definitions – structured files describing the initial state of the environment, the sequence of adversary actions to inject, the conditions that advance or branch the scenario, and the scoring criteria – and executes them against the virtualization layer via API. When an exercise is launched, the orchestrator provisions the environment from snapshots, configures network state, starts background traffic generators, and hands control to the exercise timer. During the exercise, it injects pre-scripted adversary actions (file drops, registry modifications, network connections, log entries) on schedule or in response to trainee actions. When the exercise ends, it collects telemetry and triggers environment reset.
3. Traffic generation and telemetry layer. A range without realistic baseline traffic is a poor training environment – every anomaly stands out because the baseline is flat. The traffic generation layer produces protocol-authentic background activity: HTTP/S browsing sessions, SMTP and IMAP email flows, Windows domain authentication sequences, file-share access, DNS queries, and where relevant, industrial control system protocol exchanges (Modbus, DNP3, IEC 61850). The telemetry layer simultaneously collects full-packet capture and structured logs from every node into a SIEM instance that trainees use during the exercise. Calibrating traffic volume so that injected adversary traffic is detectable but not trivially obvious is one of the more demanding content-design tasks in range operation.
4. Scoring, AAR, and administration layer. Scoring records exercise outcomes – flags captured, detection timestamps, containment actions, tools deployed – and presents them to exercise controllers in real time and to trainees in after-action review. The administration layer handles user and team management, environment allocation, exercise scheduling, and infrastructure health monitoring. These two concerns are often collapsed into a single web application, but they have different access control requirements: scoring data must be protected from trainees during the exercise, and infrastructure monitoring must be accessible to range operators even when the exercise frontend is unavailable.
Network emulation: topology fidelity and isolation
The network emulation layer is where most defense cyber range architectures make their most consequential tradeoffs. The fundamental choice is between full virtual machine emulation and container-based emulation. Virtual machines provide the highest fidelity – each host runs a complete OS kernel with the exact patch level, configuration, and service set of the target environment – but consume significant memory and require longer reset times. Containers share the host kernel, start in seconds, and reset almost instantaneously, but cannot emulate kernel-level vulnerabilities or OS-specific behaviors that differ between Linux distributions and Windows.
Defense ranges frequently use a hybrid approach: Windows hosts and ICS endpoints that require specific kernel behavior run as VMs; Linux-based services and network infrastructure run as containers. The SDN fabric (Open vSwitch with an OpenFlow controller, or a commercial equivalent) connects both without distinction, enforcing the same bandwidth, latency, and ACL rules regardless of the underlying virtualization technology.
Isolation is non-negotiable for any range hosting concurrent exercises. A range architecture that allows traffic to escape from one exercise tenant's network into another's – or, worse, into the production network – is a security incident, not a training environment. Isolation must be enforced at the SDN layer with explicit allow-listing, not by relying on VM-internal firewall configuration that trainees might modify as part of the exercise. The management network, used by range operators and the orchestration platform, must be on a physically or cryptographically separate segment that range participants cannot reach.
ICS and OT environment emulation
Defense organizations increasingly require ranges that emulate operational technology (OT) environments: industrial control systems, SCADA networks, and the IT/OT boundary that is a primary attack surface in infrastructure-focused threats. Emulating an ICS at training fidelity requires both software emulators for PLC logic and engineering workstations, and accurate protocol simulation for Modbus TCP, DNP3, and IEC 61850. Several open platforms provide software PLC emulation sufficient for training purposes – the key requirement is that the emulated PLC responds to protocol queries in a way that real attack and defense tools recognize as a live device.
Scenario orchestration: the exercise control plane
Scenario orchestration is the component that most directly determines the quality of the training experience. A sophisticated virtualization layer running a mediocre scenario produces mediocre training. The scenario definition must encode not just what adversary actions are injected, but the realistic artifacts each action produces – the specific log entries, registry keys, network flows, and file system modifications that a competent defender would detect and investigate.
Scenario definitions should be version-controlled alongside the infrastructure they reference. A scenario that was written for a specific OS image version may produce different artifact patterns on an updated image – log format changes, event ID changes, network stack behavior differences – that silently break the training validity. Treating scenario content as code, with the same review and testing discipline applied to software, is the practice that distinguishes operationally mature range programs from ones that produce confusing and inconsistent trainee experiences.
The inject pipeline – the mechanism by which the orchestrator implants adversary artifacts into running VMs – is a security-sensitive component. It typically operates via an agent running on each VM that accepts signed inject commands from the orchestration layer. The agent must validate command signatures before execution to prevent a compromised trainee workstation from injecting unauthorized artifacts. The inject agent's communication channel must be on the management network, not the exercise network – if it is reachable from the exercise network, a skilled trainee could use it to modify the scenario state.
Traffic generation: producing a realistic baseline
Background traffic generation is the least glamorous and most frequently underinvested component of a cyber range. The consequence of underinvestment is a range where every injected indicator of compromise is immediately obvious because it is the only non-trivial traffic on the wire. This produces overconfident trainees who learn to detect obvious threats in a sterile environment and struggle when those same techniques are applied against the noisy baseline of a real network.
A credible traffic generator must produce flows that are semantically coherent, not just statistically plausible. An HTTP session generator that produces random byte sequences at HTTP-shaped timing intervals will not fool a trainee examining packet captures – the payloads will not contain valid HTML, the content-type headers will not match the body. Tools such as PCAP replay, user-simulation agents, and purpose-built range traffic platforms all offer different fidelity-versus-complexity tradeoffs. PCAP replay from captures of real enterprise networks provides the highest payload fidelity but cannot adapt to the specific topology of the range or generate new authentication events with valid credentials.
Key insight: The most common failure mode in defense cyber range programs is not the attack-side content – it is the absence of a realistic baseline. When every log line and every packet in the range was injected by the orchestrator, defenders learn to treat every artifact as significant. On a real network, the skill is discrimination – distinguishing adversary activity from legitimate noise. Ranges that skip baseline investment systematically fail to train this discrimination skill.
Scoring and after-action review
Scoring architecture for a defense cyber range differs from capture-the-flag competitions in an important respect: the primary training objective is detection and containment speed, not exploitation depth. A scoring model that only rewards flag submission (proof-of-exploitation strings) incentivizes trainees to optimize for finding flags, not for building the detection and response workflows that transfer to operations.
Defense range scoring should instrument the defender workflow directly. Detection metrics record when a trainee queries the SIEM with a search that matches the injected indicator, opens an alert generated by the injected traffic, or executes an investigation action on a compromised host. Containment metrics record when a trainee isolates a host, blocks a C2 callback address, or resets compromised credentials – each with a timestamp relative to the inject that required those actions. These timestamps, combined with the exercise scenario timeline, produce a detection gap metric: the interval between an adversary action and the defender's first response. That metric is the primary measure of trainee competence that a red team versus blue team exercise is designed to reduce.
After-action review requires replay capability: the ability to reconstruct the full exercise timeline – adversary injects, network traffic, SIEM queries, trainee terminal commands – in a synchronized view that lets instructors walk through the exercise with trainees. Full-packet capture and session recording from every node are storage-intensive but operationally essential. A range that produces detailed scoring metrics but cannot explain why the score was what it was provides limited instructional value. For a broader view of how simulation architectures support training outcomes across domains, see the article on military training simulation software architecture.
Environment reset: infrastructure as a repeatable artifact
The ability to reset the range environment to a known-good baseline rapidly and reliably is what separates a professional range from an improvised lab. A range that takes hours to reset between exercise rotations limits training throughput and increases operational cost. A range that resets inconsistently – where some runs start from a clean baseline and others carry over state from previous exercises – introduces confounding variables that make performance comparison across rotations meaningless.
Snapshot-based reset for VMs and image replacement for containers are the standard mechanisms. Both depend on the snapshot or image being genuinely clean – not a snapshot taken after a previous exercise that introduced configuration drift. Range operators should validate baseline health after every reset using automated checks: service availability probes, configuration compliance scans, and traffic-generation heartbeats that confirm the exercise environment is in the expected state before handing it to trainees.
Infrastructure-as-code is the practice that makes reset both fast and reliable. When every VM, network rule, and service configuration is defined in version-controlled templates – and the range platform can instantiate the entire environment from those templates – reset becomes a deterministic operation rather than a manual procedure. It also enables the range to be deployed to new hardware or a different cloud region with confidence that the resulting environment is identical to the one on which the exercise content was validated.
Build and run your cyber training exercises with WARG
WARG is Corvus Intelligence's wargaming and exercise platform – purpose-built for defense organizations that need repeatable, scored cyber training environments with realistic scenario orchestration and full after-action replay. Purpose-built for the tempo and classification requirements of defense training programs.
This analysis was prepared by Corvus Intelligence engineers who build mission-critical software for defense and government organizations. Learn about our team →