Skip to content
On this page

eBPF Interpreter Implementation

The Linux interpreter lies here: kernel/bpf/core.c#___bpf_prog_run

To understand what exactly each instruction means and their undefined behaviors, given that no documentation ever states that (for now), we have to look into an actual, official implementation.

The following table uses a lot macros:

  • u64 *regs: The array of the registers,
  • #define DST regs[insn->dst_reg]: the dst_reg,
  • #define SRC regs[insn->src_reg]: the src_reg,
  • #define IMM insn->imm: the immediate number (which is i32 by the way).

To save some space, when SRC and IMM follows the same pattern, I am going to skip the latter.

INFO

Just in case someone forgets about type conversions in C like me, (u64_v) op (i32_v) actually does (u64_v) op (i32_v as i64 as u64) in Rust.

Ops / VariantCodeNotes
Shifts (64-bit)DST = DST OP (SRC & 63)
Shifts (32-bit)DST = (u32) DST OP ((u32) SRC & 31)Upper-half zeroed
Shifts (s64)(*(s64 *) &DST) >>= (SRC & 63)
Shifts (s32)DST = (u64) (u32) (((s32) DST) >> (SRC & 31))Upper-half zeroed
ALU (64-bit)DST = DST OP SRC+ - & | ^ *
ALU (32-bit)DST = (u32) DST OP (u32) SRCUpper-half zeroed
ALU (NEG64)DST = -DST
ALU (NEG32)DST = (u32) -DSTUpper-half zeroed
ALU (MOV64)DST = SRC
ALU (MOV32)DST = (u32) SRCUpper-half zeroed
ALU (MOV_K 64)DST = (__s32) IMMSign-extending
LD_IMM_DWSee LD_IMM_DW
MOD (64-bit)div64_u64_rem(DST, SRC, &AX); DST = AX;Unknown
MOD (32-bit)AX = (u32) DST; DST = do_div(AX, (u32) SRC);Unknown
DIV (64-bit)DST = div64_u64(DST, SRC)Unknown
DIV (32-bit)AX = (u32) DST; do_div(AX, (u32) SRC); DST = (u32) AX;Unknown
EndiannessDST = (__force u??) cpu_to_be??(DST)Upper bits zeroed
Function callsPre-processed by the verifier. Skipping.
JMPinsn += insn->off+1 by outer loop
Conditional JMPif ((SIGN##64) DST CMP_OP (SIGN##64) SRC) { ... }Casts
STX*(SIZE *)(unsigned long) (DST + insn->off) = SRCCasts
ST*(SIZE *)(unsigned long) (DST + insn->off) = IMMCasts?
LDXDST = *(SIZE *)(unsigned long) (SRC + insn->off)Casts
Atomic...WIP

Notes

Note that the interpreter implementation is not necessarily the actual specification.