Adding support for a new operating system

Introduction

HVMI is a technology designed to offer security from outside a virtual machine, by leveraging the hardware isolation provided by Intel VT-x. An issue with running outside a VM is the semantic gap, meaning that the Introspection engine will only be able to access raw memory with no apparent meaning.

Intel VT-x allows the hypervisor to notify the Introspection engine on some events (such as MSR writes, CR writes, interrupts, memory accesses, etc.) and also query the current guest state (MSRs, CRs, IDT, GDT, GPRs, raw memory, etc.) which help to overcome the semantic gap.

However, there’s only so much information one can get by classic introspection techniques while also maintaining a relatively low performance impact for the guest OS. For this reason, HVMI relies on an update file that contains important information about the guest OS that helps extract valuable information from the raw guest bytes.

For now, we’ll talk about adding support for a guest. Please see the official documentation for more information about the update file and the way HVMI interacts with it.

Adding support for a Guest

The cami project in the HVMI root directory is responsible for generating the update file. Said project is split into 3 main directories:

sources, which contains yaml files that contain the information needed to support a guest;
scripts, which contains the codebase responsible to pack said information into a format that will later be loaded by HVMI;
tools which contains a number of utilities to make life easier when adding support for a guest.

The sources directory already contains a couple of yaml files that describe a number of guests. When adding support for a new guest, the user should add a new file in this folder since this is the default location where the generator scripts search for update files.

Adding support for a Windows Guest

To add support for a new windows guest, the user needs to have a copy of said guests ntoskrnl.exe and ntdll.dll. Both of these are found in the WINDIR:\Windows\System32 folder, where WINDIR is the drive with the Windows installation (generally the C drive). Note that the kernel image may have different names, such as ntkrnlmp.exe, ntkrnlpa.exe or ntkrpamp.exe.

After acquiring those files, it’s time to shift focus to the tools directory, specifically to the r2cami directory within it. r2cami is a project containing a couple of python 3 scripts dedicated to automating the task of adding support for a new Windows OS. To do that, it uses existing tools such as radare2 and BitDefender Disassembler.

Note that r2cami takes advantage of r2pipe which is a collection of wrapper APIs that allow users to create scripts such as this. If your radare installation did not include r2pipe, please check the official r2pipe or radare2 documentation.

To generate an update file, a user should simply run r2cami as demonstrated below.

python r2cami.py --kernel=/path/to/ntoskrnl.exe --ntdll=/path/to/ntdll.dll --outfile=/path/to/desired/outfile

Now, the file at /path/to/desired/outfile will contain all of the guest specific fields, function patterns and other obscure fields that HVMI needs to successfully initialize and protect the guest.

Adding support for a Linux Guest

To add support for a Linux guest, one needs the kernel debug symbols and the configuration file (which is usually located under /boot directory) for the desired version. Extracting the layout of structures is done by a GDB module that creates a new command, all-offsets \<output-file\>. Running this command will create a new file containing the fields offsets and the size of each structure. In order to import this command, the user has to either run gdb from hvmi/cami/tools/linux directory, or import the offsets.py script with the following commands sequence:

(gdb) python
>gdb.execute('source hvmi/cami/tools/linux/offsets.py')
>end

Next, because HVMI does not need that much information, the user has to run another script, generate_yaml.py which will create a yaml file that should be placed under the hvmi/cami/sources/linux/opaque_fields directory.

The following commands sequence has to be executed to generate the support file for a kernel version, assuming hvmi/cami/tools/linux as current working directory:

$ gdb path/to/vmlinux
(gdb) all-offsets offsets.json
(gdb) exit
$ python3 generate_yaml.py offsets.json supported_os.yaml config-file

HVMI also requires a signature for the very first page from the kernel code section. There is no tool able to generate this yet, so this step must be done manually. The user needs to select a sequence of bytes generic enough to not be affected by alternative changes. This sequence should be placed in hvmi/cami/sources/linux/dist_sigs.yaml file.

Generating the binary file

Since HVMI can’t parse yaml files, one needs to pack it into a format that HVMI can understand and load. To do that, one should run the cami generator script from the scripts folder as follows:

python scripts/main.py --major 1 --minor 4 --buildnumber 0 --sources=sources

The sources argument specifies where the script will search for the yaml files. In this case, it is the path to the folder where the user saved the outfile generated earlier.

The other arguments are simply version control arguments. The update file is designed to be backwards compatible. That means that HVMI will NOT load an older file than supported, but will load a newer one. The values specified in the examples are the current HVMI compatible versions.

Please see the official documentation to fully understand the update file and the way HVMI interacts with it.