Add a target runner for running tests in VMs #5285

andrewjstone · 2024-03-19T00:08:24Z

This PR allows us to run tests inside falcon VMs via nextest. Because nextest lacks per-test target runners, we enable the target runner via an environment variable and run one specific test at a time.

To run a test, do the following. The trailing `--nocapture` is optional but encouraged for now.

cd omicron
CARGO_TARGET_X86_64_UNKNOWN_ILLUMOS_RUNNER=./test-utils/src/bin/falcon_runner.sh cargo nextest run -p <package> <test name> --nocapture

An example:

CARGO_TARGET_X86_64_UNKNOWN_ILLUMOS_RUNNER=./test-utils/src/bin/falcon_runner.sh cargo nextest run -p omicron-test-utils launch --nocapture
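For anyone who wants to drive this from a helper tool rather than typing the environment variable by hand, the invocation above could be assembled roughly as follows. This is only a sketch: `vm_test_command` is a hypothetical helper name that does not exist in the PR, and the runner path is copied verbatim from the command line above.

```rust
use std::process::Command;

/// Environment variable Cargo consults to find the runner for the illumos target.
const RUNNER_ENV: &str = "CARGO_TARGET_X86_64_UNKNOWN_ILLUMOS_RUNNER";
/// Runner path as written in the PR description, relative to the omicron checkout.
const RUNNER_PATH: &str = "./test-utils/src/bin/falcon_runner.sh";

/// Builds (but does not run) the nextest invocation for a single VM test.
/// Hypothetical helper, not part of the PR.
pub fn vm_test_command(package: &str, test_name: &str) -> Command {
    let mut cmd = Command::new("cargo");
    cmd.env(RUNNER_ENV, RUNNER_PATH)
        .args(["nextest", "run", "-p", package, test_name, "--nocapture"]);
    cmd
}
```

Calling `vm_test_command("omicron-test-utils", "launch").status()` from the `omicron` directory would then be equivalent to the example command.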

A few things remain to be done, although this is safe to merge without adding them in this PR.

* Right now, only one test can be run per test binary, although we can change this if we want. We should also error out if the user doesn't pass a specific test.
* We don't actually check the error result of the test in the VM. We should do this, and then automatically leave the VM running if the test fails.
* We need a small binary to allow us to log in to a VM on a failed VM test. I hacked up the solo example in falcon for a demo, but we'll want one in Omicron for ease of use.
* We need a good way to get log files from failing tests out of the VMs. We can actually do this with our hypothetical binary and the falcon exec call, like `<BIN> exec <VM_NAME> 'cat <FILE>' > <LOCALFILE>`. However, a better mechanism would be to make the p9 filesystem read/write instead of read-only, or to inject ssh keys and pull the files that way.
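The "error out if no specific test was passed" item might look something like the sketch below. It rests on an assumption that is not stated in this PR: that the runner receives the test binary's arguments in the form `<test-binary> --exact <test-name> [--nocapture]`. `single_test_name` is a hypothetical helper, not code from the PR.

```rust
/// Extracts the single test name from the arguments handed to the runner.
///
/// ASSUMPTION (not confirmed by the PR): the test name follows an `--exact` flag.
/// Hypothetical helper for illustration only.
pub fn single_test_name(args: &[String]) -> Result<&str, String> {
    let mut names = Vec::new();
    let mut iter = args.iter();
    while let Some(arg) = iter.next() {
        if arg == "--exact" {
            match iter.next() {
                // Accept the next argument as a test name unless it is another flag.
                Some(name) if !name.starts_with("--") => names.push(name.as_str()),
                _ => return Err("`--exact` was not followed by a test name".to_string()),
            }
        }
    }
    match names.as_slice() {
        [name] => Ok(*name),
        [] => Err("no specific test was requested; pass exactly one test name".to_string()),
        _ => Err(format!("expected exactly one test, got {}", names.len())),
    }
}
```

Returning an error rather than silently picking a default keeps the one-test-per-binary limitation visible to whoever runs the command.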

smklein

This is extremely rad, great work getting this set up. Took me seconds to check out this PR and get the VMM test up-and-running -- truly incredible!

> Right now, only one test can be run per test binary, although we can change this if we want.

Given the hoops we're trying to jump through, and the current limitations of nextest, this seems totally reasonable, but probably worthwhile adding some docs in the form of a README ("here is how you add a VMM test, here is how you run it, etc"), even if just to make it "easy to follow along" with the way to use this thing as it develops.

The "requires an environment variable to work" + "single-test-per-binary" constraints are tricky, so we should document them pretty explicitly.

> We don't actually check the error result of the test in the VM. We should do this, and then automatically leave the VM running if the test fails.

I'd consider this a hard-blocker before encouraging anyone to use this as a test harness. Even if we aren't extracting all the log info we want yet, this seems like it's "dangerous enough to omit" that it should probably be fixed first.

smklein · 2024-03-19T00:25:12Z

test-utils/src/dev/falcon.rs

+//! Mechanisms to launch and control propolis VMs via falcon
+
+#[cfg(test)]
+mod test {


Idea, take it or leave it -- it might be kinda nice to have a way to validate "this test is being run from a VMM context, bail out if that isn't true?"

I dunno what we'd want to set in the falcon_runner, but just something to distinguish it from the host, since "getting the config right" is one of the trickiest bits to launch these VM tests right now

andrewjstone · Mar 23, 2024

I added a check for the hostname, which is quite unique. We can certainly inject a file instead, though, which might work better once we start using different names for VMs. We'll want to use different names to run tests in parallel.
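A hostname guard of the kind described here could be sketched as below. The function names are hypothetical, the expected hostname is left as a parameter because the actual value is not shown in this thread, and shelling out to `hostname` is just one assumed way to read it.

```rust
use std::process::Command;

/// Fails unless `actual` matches the hostname the test VM is expected to have.
/// Hypothetical helper; the real check in the PR may look different.
pub fn ensure_vm_hostname(actual: &str, expected: &str) -> Result<(), String> {
    if actual.trim() == expected {
        Ok(())
    } else {
        Err(format!(
            "refusing to run: hostname {:?} is not the expected test VM {:?}",
            actual.trim(),
            expected
        ))
    }
}

/// Reads the current hostname by running the `hostname` command.
/// ASSUMPTION: `hostname` is on PATH inside the VM image.
pub fn current_hostname() -> std::io::Result<String> {
    let output = Command::new("hostname").output()?;
    Ok(String::from_utf8_lossy(&output.stdout).trim().to_string())
}
```

A test would call `ensure_vm_hostname(&current_hostname()?, expected)` first and bail out on `Err`, so that running it on the host by accident fails fast instead of touching the host system.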

This still needs a symlink of the .falcon dir to work.

andrewjstone · 2024-03-25T20:46:03Z

This is probably good enough for a real review now. I added a README to get started and also leave the VM up when a test fails.

andrewjstone · 2024-03-25T20:47:32Z

This is probably good enough for a real review now. I added a README to get started and also leave the VM up when a test fails.

There are a few significant limitations listed in the README, but they can be resolved fairly quickly. It would probably be good to get this in to let people play with it, then figure out how we want to perform setup for more sophisticated tests that require injecting data into VMs, and for running in parallel.

smklein · 2024-03-25T23:41:45Z

test-utils/README.md

+* The name of the test `Runner` and `VM` is currently hardcoded. Therefore,
+tests can only be run serially for now. This is an easy limitation to lift, we


nit - the names are hard-coded, but these aren't the names used, right? It's the string including "launchpad_mcduck"?
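One possible way to lift the hard-coded-name limitation is to derive a name per test invocation. The sketch below is an assumption-laden illustration: `unique_vm_name` is hypothetical, and it is not known from this thread which characters or lengths falcon accepts in names, so the sanitization rule here is a conservative guess.

```rust
/// Derives a per-invocation name from a prefix, the test name, and a process id.
/// Hypothetical helper; falcon's actual naming constraints are not shown in this PR.
pub fn unique_vm_name(prefix: &str, test_name: &str, pid: u32) -> String {
    // Replace anything that is not ASCII alphanumeric (such as `::`) with `_`.
    let sanitized: String = test_name
        .chars()
        .map(|c| if c.is_ascii_alphanumeric() { c } else { '_' })
        .collect();
    format!("{prefix}_{sanitized}_{pid}")
}
```

Passing `std::process::id()` as `pid` would give two concurrently running runner processes different names, which is the property parallel runs need.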

smklein · 2024-03-25T23:44:40Z

test-utils/src/bin/falcon_runner.rs

+    let exit_code_index = out.rfind('\n').unwrap();
+    let exit_code: u8 = (&out[exit_code_index + 1..]).parse().unwrap_or(255);


Couple nitpicks here:

* If the `\n` character didn't exist in the output we might want to get better feedback from the test (via an `.expect` call, perhaps?)
* The indexing in `out` seems like it could easily go out-of-bounds -- what if `\n` was the last character emitted?
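Both nitpicks can be addressed by returning an error instead of unwrapping. The sketch below is one possible shape, not the fix that landed; `parse_exit_code` is a hypothetical name, and it assumes the exit code is the last non-empty line of the captured output. For what it's worth, slicing a Rust `str` at exactly `len()` yields an empty string rather than panicking, so in the quoted code a trailing `\n` would fall through to the `255` default; the missing-newline case is the one that panics.

```rust
/// Parses the exit code from the last non-empty line of the VM's output.
/// Hypothetical helper; assumes the runner prints the code on its own final line.
pub fn parse_exit_code(out: &str) -> Result<u8, String> {
    let last_line = out
        .trim_end() // tolerate a trailing newline after the exit code
        .lines()
        .last()
        .ok_or_else(|| "VM output was empty; no exit code found".to_string())?;
    last_line
        .trim()
        .parse::<u8>()
        .map_err(|e| format!("could not parse exit code from {last_line:?}: {e}"))
}
```

The caller can then decide whether an unparseable exit code should count as a failure (for example by mapping `Err` to 255 after logging the message) instead of having that choice hidden inside `unwrap_or`.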

andrewjstone added 10 commits March 11, 2024 23:00

wip

2a96fe1

Merge branch 'main' into falcon-nextest

21c7818

wip

f3c4af3

A running test

f63ad85

pick the proper bin

377967a

Merge branch 'main' into falcon-nextest

7b3628c

wip

0d01550

Merge branch 'main' into falcon-nextest

4da2eff

Latest falcon updates and run with env target

54d08ec

fix gitignore

12b8e9e

smklein reviewed Mar 19, 2024

andrewjstone and others added 12 commits March 19, 2024 17:35

Merge branch 'main' into falcon-nextest

be45a0d

Add a CLI for interacting with a falcon VMM test

1409092

This still needs a symlink of the .falcon dir to work.

pass or fail based on exit code

8e8e8a4

Use ephemeral falcon dirs and better test failure handling

d1a7b37

Example test fails when not running in a VM

2fc5fd9

Add README.md

7bdeba9

more-readme

e5f1ee0

point back at latest falcon

cdc97a6

clippy

b56174c

use a logger instead of eprintln

a48aee0

do not double destroy

2a8216e

hakari

2ecb8cb

andrewjstone marked this pull request as ready for review March 25, 2024 20:45

illumos only

caae766

smklein approved these changes Mar 25, 2024

smklein mentioned this pull request Mar 26, 2024

[sled-agent] Tracking issue for "test it in a VMM" #5329

Open

7 tasks

review fixes

a016f04

only build for illumos

29f7c03

andrewjstone enabled auto-merge (squash) March 26, 2024 00:43

andrewjstone disabled auto-merge March 26, 2024 00:44

andrewjstone changed the title ~~[WIP] Add a target runner for running tests in VMs~~ Add a target runner for running tests in VMs Mar 26, 2024

andrewjstone enabled auto-merge (squash) March 26, 2024 00:44

andrewjstone added 4 commits March 26, 2024 04:57

fix cli

51292e4

damnit

1f66088

wtf

6f74a8b

derp

6a6bc4a

andrewjstone merged commit a17750d into main Mar 26, 2024
23 of 24 checks passed

andrewjstone deleted the falcon-nextest branch March 26, 2024 14:57
