Prevent loading of untrusted initrds #75

2023-01-23T17:49:16Z

alois31 commented

2023-01-23 17:49:16 +00:00

(Migrated from github.com)

Using a malicious boot loader specification entry, the kernel could be instructed to load untrusted initrds in two ways:

Since the kernel was signed, it could be loaded directly, with an arbitrary initrd (since neither systemd-boot nor Linux verify initrds).
The cmdline our stub received was passed on to the kernel. By loading the stub indirectly via boot loader specification, this could be abused to point the kernel to an arbitrary initrd, as well as instructing it to perform various other fun stuff.

To prevent these security bypasses, the following solution is implemented:

Use a custom PE loader, just enough to load the kernel, in order not to rely on LoadImage. This loader does not need to verify signatures, because we already confirmed using the hash that the kernel is trusted.
Stop signing kernels on installation, preventing the first bypass.
Always pass the cmdline built into the stub UKI to the kernel. By virtue of being built-in, this cmdline is trusted, and the second bypass is fixed.

Fixes: https://github.com/nix-community/lanzaboote/issues/65

Using a malicious boot loader specification entry, the kernel could be instructed to load untrusted initrds in two ways: * Since the kernel was signed, it could be loaded directly, with an arbitrary initrd (since neither systemd-boot nor Linux verify initrds). * The cmdline our stub received was passed on to the kernel. By loading the stub indirectly via boot loader specification, this could be abused to point the kernel to an arbitrary initrd, as well as instructing it to perform various other fun stuff. To prevent these security bypasses, the following solution is implemented: * Use a custom PE loader, just enough to load the kernel, in order not to rely on `LoadImage`. This loader does not need to verify signatures, because we already confirmed using the hash that the kernel is trusted. * Stop signing kernels on installation, preventing the first bypass. * Always pass the cmdline built into the stub UKI to the kernel. By virtue of being built-in, this cmdline is trusted, and the second bypass is fixed. Fixes: https://github.com/nix-community/lanzaboote/issues/65

❤️ 1

alois31 commented

2023-01-23 18:00:38 +00:00

(Migrated from github.com)

Please note that this implementation is still a bit rough around the edges. In particular, there are the following known shortcomings or changes in behavior (roughly in ascending order of severity, in my view):

The kernel image does not get measured into PCR 4 any more. This should be harmless, because its hash already influenced PCR 4 as part of measuring the stub UKI.
The PE loader is extremely basic: it performs nearly no checks at all, and doesn't support most relocation types. I don't think this is a problem, because we only load trusted Linux kernels.
NX is not supported. I'm honestly not quite sure how to properly do that given that some pages contain sections with different permissions:

  0 .setup        00003dc0  0000000001000200  0000000001000200  00000200  2**4
                  CONTENTS, ALLOC, LOAD, READONLY, CODE
  1 .reloc        00000020  0000000001003fc0  0000000001003fc0  00003fc0  2**0
                  CONTENTS, ALLOC, LOAD, READONLY, DATA
  2 .compat       00000020  0000000001003fe0  0000000001003fe0  00003fe0  2**0
                  CONTENTS, ALLOC, LOAD, READONLY, DATA
  3 .text         0082b000  0000000001004000  0000000001004000  00004000  2**4
                  CONTENTS, ALLOC, LOAD, READONLY, CODE

Stuff is completely broken on architectures with incoherent instruction caches. This is relevant for supporting AArch64, which I already see in another pull request. This can be easily fixed by flushing the icache after relocating the kernel image, which I think requires a bit of architecture-specific code. Fixed, in the sense that this now breaks the build instead of executing things we didn't intend to execute at runtime.

Please note that this implementation is still a bit rough around the edges. In particular, there are the following known shortcomings or changes in behavior (roughly in ascending order of severity, in my view): * The kernel image does not get measured into PCR 4 any more. This should be harmless, because its hash already influenced PCR 4 as part of measuring the stub UKI. * The PE loader is extremely basic: it performs nearly no checks at all, and doesn't support most relocation types. I don't think this is a problem, because we only load trusted Linux kernels. * NX is not supported. I'm honestly not quite sure how to properly do that given that some pages contain sections with different permissions: ``` 0 .setup 00003dc0 0000000001000200 0000000001000200 00000200 2**4 CONTENTS, ALLOC, LOAD, READONLY, CODE 1 .reloc 00000020 0000000001003fc0 0000000001003fc0 00003fc0 2**0 CONTENTS, ALLOC, LOAD, READONLY, DATA 2 .compat 00000020 0000000001003fe0 0000000001003fe0 00003fe0 2**0 CONTENTS, ALLOC, LOAD, READONLY, DATA 3 .text 0082b000 0000000001004000 0000000001004000 00004000 2**4 CONTENTS, ALLOC, LOAD, READONLY, CODE ``` * ~Stuff is completely broken on architectures with incoherent instruction caches. This is relevant for supporting AArch64, which I already see in another pull request. This can be easily fixed by flushing the icache after relocating the kernel image, which I think requires a bit of architecture-specific code.~ Fixed, in the sense that this now breaks the build instead of executing things we didn't intend to execute at runtime.

nikstur commented

2023-01-23 18:13:59 +00:00

(Migrated from github.com)

Thanks for the PR! I think @blitz can review this best.

blitz (Migrated from github.com) reviewed

 @ -0,0 +147,4 @@
             loaded_image.set_load_options(
                 load_options.as_ptr() as *const u8,
                 u32::try_from(load_options.num_bytes()).unwrap(),
             );

 @ -0,0 +173,4 @@
         status
     }
 }

Rows
Columns