Add support for a CDCL-type SAT solver using the Resolvo crate #1179

jrray · 2025-02-10T02:19:46Z

🚧 Work in Progress 🚧

It has been frequently requested over the life of this project to adopt a SAT solver in the hopes that it would provide faster solves and/or better messages when things can't solve. The Resolvo looked promising as it supports lazy solving and attempts to present solve errors in a relatively easy to understand format.

I have implemented Resolvo's DependencyProvider trait for spk packages and got enough of the spk package compatibility concepts right that the existing solver tests are all passing when run against the new solver.

I have also started to hard code changing all uses of the original solver into using the new solver just to expedite being able to test the new solver with some real world solves at SPI. There will be more work required to make it possible to choose between these solvers at runtime.

The new solver doesn't return a Graph because it doesn't use Node or State. It doesn't use Decision or Change. It doesn't use the existing validator types since those are designed to validate against a State. Solves aren't run via a SolverRuntime or through a DecisionFormatter. What is similar is the way a Solver is configured, and it still returns a Solution, although the Solution is not necessarily populated with the same details. It does well enough to pass all the tests (we need more tests!).

The high level view of how Resolvo works is that the DependencyProvider needs to be able to return a list of candidates for a request, filter those down based on compatibility, then enumerate all the dependencies of a given candidate. Rather than how our solver validates a package based on the state of the partial solve, Resolvo just needs to know if a given package (build) is compatible with a given request (python/3.11).

One of the first challenges was dealing with global var requests, meaning var requests that have no namespace. Resolvo requires that all requirements are attached to a specific package name. In order to handle global vars, each global var, for example, debug/true, is turned into a pseudo package called "debug". Then, since the DependencyProvider has to return the list of candidates (values) for "debug", all possible values that var can have has to be known in advance. To make that possible, the solve is restarted as new values are discovered. It is likely going to be necessary to maintain a cache that can be reused between solves to minimize the need to restart the solve.

Another challenge I haven't found a better approach for yet is that since all the candidates need to be returned upfront, when getting the candidates for a package, all the builds of all the versions of that package need to be visited, which includes reading in each build's package spec. This would benefit greatly from some kind of package index in the repo so this data could be read in bulk. I tried something where the list of candidates is first just the list of versions available for a package, but then it wasn't giving me the ability to filter those versions later after reading the builds.

What comes next is getting a test build up and running inside SPI and experimenting with some solves.

jrray · 2025-02-13T06:59:35Z

Something of a PSA for anyone trying this code out, we've discovered that #1130 introduced a bug that makes the original solver not respect host options. We're working on a fix.

jrray · 2025-02-13T19:38:55Z

An update on the topic we discussed in the weekly meeting about getting different results from the solver between different runs. I realized that what is happening is currently builds of a version are not being sorted and so the build the solver will pick is subject to one or more per-process random seeds.

If I ask for python and it depends on gcc, it will first try to use the highest version of python available, but it might by chance pick a build of python that depends specifically on gcc/6. This is all due to how the candidates of python are ordered. I will revisit the sort logic to make it behave more like the original solver. This should make the solve result deterministic between runs if the repo contents are the same.

Copilot

Pull Request Overview

This PR adds support for a CDCL‐type SAT solver via integration of the Resolvo crate. Key changes include replacing direct repository usage with a solver interface across several CLI commands, updating Cargo.toml to include the new spk‑solve workspace, and modifying the BinaryPackageBuilder API to accept a solver instance while removing legacy formatter resolver functions.

Reviewed Changes

Copilot reviewed 60 out of 60 changed files in this pull request and generated no comments.

Show a summary per file

File	Description
crates/spk-cli/cmd-make-binary/src/cmd_make_binary.rs	Replaced the old repositories field with a new solver field and updated decision formatter usages.
crates/spk-cli/cmd-install/src/cmd_install.rs, crates/spk-cli/cmd-explain/src/cmd_explain.rs, crates/spk-cli/cmd-env/src/cmd_env.rs	Updated to utilize the new solver.decision_formatter_settings instead of the removed formatter_settings.
crates/spk-build/src/build/binary.rs	Modified BinaryPackageBuilder to be generic over a solver, replacing old resolver callbacks with calls to run_and_print_resolve, and updating related type constraints.
Various test and macro files	Updated invocations to use from_recipe_with_solver and added a new macro parameter for specifying the solver to use.
Cargo.toml files	Added the spk-solve workspace dependency and updated version numbers for rstest and tokio.

Comments suppressed due to low confidence (2)

crates/spk-cli/cmd-make-binary/src/cmd_make_binary.rs:29

The field 'solver' replaces the old 'repos' field; please ensure that all references and functionality formerly provided by 'repos' (including repository management) are now fully supported by the new solver configuration.

pub solver: flags::Solver,

crates/spk-build/src/build/binary.rs:450

[nitpick] Consider removing commented-out legacy resolver code to improve clarity and maintain maintainability now that the new run_and_print_resolve approach is being used.

//let (solution, graph) = self.source_resolver.solve(&self.solver).await?;

dcookspi · 2025-04-17T18:15:20Z

crates/spk-cli/common/src/flags.rs

@@ -232,6 +235,9 @@ pub struct Solver {
    #[clap(flatten)]
    pub repos: Repositories,

+    #[clap(flatten)]
+    pub decision_formatter_settings: DecisionFormatterSettings,


This change makes it more difficult to create a solver programmaitically when using spk as a library in another tool. The problem is DecisionFormatterSettings has some fields that are not public (my fault, I didn't make those ones public and I don't think there was a reason) and this prevents a program making this object direclty. You have to make a list of string for the correct arguments and feed them through a clap parser to be able to get get_solver() to get a properly configured solver. That's fine for spk command line commands, but it is a bit clunky from a program that already has all the values not as strings.

An example is a tool we have that used to run spk bake... command and process the output. We changed it to use spk as a library and constuct a configured solver to run a solve and get data from calls without all the parsing (of input and output). But now it can't construct Solver clap objects directly and it has to create and parse fake command line arg strings to get a configured solver.

I think moving the DecisionFormatterSettings inside the Solver struct is fine. But it would be nice to either:

Be able to construct the clap objects directly (by having all their fields public, or having them support default(), or having some contructors for them), or

Have a way of constructing a solver, using the same logic that's in get_solver(), without needing the clap objects (that probably means another layer of builder objects or similar structs to use as parameters to some create a solver function, that get_solver() would also call).

dcookspi · 2025-04-17T18:21:04Z

crates/spk-solve/src/io.rs

    AllImpossibleChecks,
    // This isn't a solver on its own. It indicates: the run all the
    // solvers in parallel but show the output from the unchanged one.
+    // This runs all the solvers implemented in the original solver. At least
+    // for now, it is not possible to run both the original solver and the
+    // new solver in parallel.


Is this because of its current relative slowness, having to read all the specs in? Or is it something else, and what is that something else? Is it likely to change in future? I could see a situation where running them all in parallel for confirmation testing, or a part of a transition to just using the new solver, would be good. I guess making the 'because ...' part of the comment would help our future selves.

This is documenting the state of the code but I didn't intend to imply it is impossible to implement.

rydrman

Haven't made it through the whole cdcl implementation yet, but here's my start

crates/spk-build/src/build/binary.rs

crates/spk-solve/src/abstract_solver.rs

rydrman · 2025-04-30T21:13:27Z

From the meeting today:

it would be great to augment this page with examples from the new solver so that we at least have some basics on helping new users grok the new style of output: https://spkenv.dev/use/solver/

jrray · 2025-05-19T18:53:00Z

From the meeting today:

* it would be great to augment this page with examples from the new solver so that we at least have some basics on helping new users grok the new style of output: https://spkenv.dev/use/solver/

I opted to work on this in a separate PR: #1217

Resolvo is a CDCL sat solver for package managers and will hopefully prove to be much better at solving quickly than our mostly brute-force approach. Making a checkpoint here after achieving a simple "solve" to get a feel for how spk type map onto Resolvo's concepts. Although currently the smallest unit the solver works with is a package, and that likely needs to change to a component. Signed-off-by: J Robert Ray <jrray@jrray.org>

By using an enum here, the global var handling can use its own enum variant instead of the flawed string prefix approach. Now it is simply not possible for a package name to be confused for a global var. Signed-off-by: J Robert Ray <jrray@jrray.org>

Messed up the logic when checking :all cases. Add test for this case. Signed-off-by: J Robert Ray <jrray@jrray.org>

These are effectively turned into more requests. Now the host options are being looked at. Signed-off-by: J Robert Ray <jrray@imageworks.com>

It is safe to update the known values without restarting the solve here, since these functions are only called before starting the solve. But that's not enforced by the code in any way at the moment. Signed-off-by: J Robert Ray <jrray@imageworks.com>

The requested by column is not populated. Signed-off-by: J Robert Ray <jrray@imageworks.com>

At SPI the newest python version available on centos is an embedded package, but it is undesirable for `spk env python` to choose an embedded python package. It also follows that it would have preferred to build from source a newer version of a package instead of use an existing build of an older version. Signed-off-by: J Robert Ray <jrray@imageworks.com>

Improve performance by not reading builds for packages that are for versions that are not applicable to the root requests. Signed-off-by: J Robert Ray <jrray@imageworks.com>

Log an info message for now, in order to gauge how many retries are happening in practice. Signed-off-by: J Robert Ray <jrray@imageworks.com> Signed-off-by: J Robert Ray <jrray@jrray.org>

The abstract solver now has a `run_and_print_resolve` that takes a formatter, which makes it possible to run either the old or new solver in a consistent way. The original `run_and_print_resolve` function has become `pub(crate)` to catch all the places that needed to be modified to use the new function. Signed-off-by: J Robert Ray <jrray@imageworks.com> Signed-off-by: J Robert Ray <jrray@jrray.org>

Making the original function `pub(crate)` to force code to change to use the trait instead. Remove the unused `run_and_log_decisions` function. Signed-off-by: J Robert Ray <jrray@imageworks.com> Signed-off-by: J Robert Ray <jrray@jrray.org>

The Solver trait was split into multiple traits with some blanket implementations for refs while chasing compile errors but this may not have actually been necessary. Add the "solver" flags to more commands so the user can choose what solver to use in more cases. Working towards being able to choose between the original solver and the new solver for all operations. More tests now run the test with both solves. Signed-off-by: J Robert Ray <jrray@imageworks.com> Signed-off-by: J Robert Ray <jrray@jrray.org>

Making it possible to choose what solver to use when running spk tests. Signed-off-by: J Robert Ray <jrray@imageworks.com>

However, this can still become a recursive call that blows out the stack if the bounds on Solver don't fit the bounds on the "real" `test` implementation. Signed-off-by: J Robert Ray <jrray@imageworks.com>

In order for `flags::Solver::get_solver` to offer a choice in what solver is returned, it needs to have access to the `solver_to_run` setting in `flags::DecisionFormatterSettings`. This change continues the theme of changing from a DecisionFormatter "containing" a solver to a solver "containing" a DecisionFormatter. Signed-off-by: J Robert Ray <jrray@jrray.org>

Signed-off-by: J Robert Ray <jrray@jrray.org>

`key_entry_names` is unused after it is cloned into `ordered_names`. Signed-off-by: J Robert Ray <jrray@jrray.org>

In order to reuse this logic for the resolvo solver, extract it to an external function. It is not practical for the resolvo solver to directly use a SortedBuildIterator. Signed-off-by: J Robert Ray <jrray@jrray.org>

Potential speedup by avoiding doing the reversal as a separate step. Signed-off-by: J Robert Ray <jrray@jrray.org>

Reuse the same build sorting logic as the original solver in order to get a deterministic result out of the solve. The solve can change drastically based on which build is selected first. Signed-off-by: J Robert Ray <jrray@jrray.org>

This is a public repo. Signed-off-by: J Robert Ray <jrray@imageworks.com>

Signed-off-by: J Robert Ray <jrray@imageworks.com>

When populating the Solution object, embedded packages need to be categorized properly or else when calculating the layers for the solution it ends up with an empty blob as one of the layers, which is an error. Handle source builds in resolvo solve When not building from source. Signed-off-by: J Robert Ray <jrray@imageworks.com>

The case where a stub only depended on the Run component of the parent still needs to declare a dependency to bring it in. Signed-off-by: J Robert Ray <jrray@imageworks.com>

To promote better test coverage, the build_package family of macros now require the name of the solver to use, and all the tests using these macros now run the test against all three solver choices (ignoring "all"). Workaround too many arguments lint and rstest la10736/rstest#226 (comment) Bumped to the latest version of rstest first to see if that would help. Signed-off-by: J Robert Ray <jrray@imageworks.com> Signed-off-by: J Robert Ray <jrray@jrray.org>

The solver tests were not passing if this feature is enabled, and this exposed a few problems in not handling ':all' requests correctly. When generating the "VersionSet" for dependencies, the PkgRequest needs to have the correct component populated. The logic for skipping or not skipping candidates when build from source needed to handle ':all' and it was restructured for clarity. When filtering candidates for a request of a component of a specific build, source builds needed to be filtered out. Signed-off-by: J Robert Ray <jrray@imageworks.com>

Fix commented out code in cmd_test for build stage tests and add a new test that checks the basic behavior of a build stage test that the test environment includes the build dependencies of the package being tested. Lift the old solver's configure_for_build_environment method to the Solver trait. Although the other test stages have code that only lives in the cmd_test crate to configure the solver properly, the build stage uses this method that was defined in the solver itself. While the logic of configure_for_build_environment could be duplicated in cmd_test, it is also reused by other code in the solver so it made sense to preserve the logic as a reusable solver method. Signed-off-by: J Robert Ray <jrray@jrray.org>

jrray added the enhancement New feature or request label Feb 10, 2025

jrray self-assigned this Feb 10, 2025

jrray marked this pull request as draft February 10, 2025 02:19

jrray force-pushed the resolvo branch from dbf8382 to 021b424 Compare February 10, 2025 23:03

jrray force-pushed the resolvo branch 2 times, most recently from 03ee415 to daa4be4 Compare February 15, 2025 02:14

jrray force-pushed the resolvo branch 3 times, most recently from 9a3c596 to c8a558a Compare March 15, 2025 02:28

jrray mentioned this pull request Apr 8, 2025

Bump tokio from 1.39.2 to 1.43.1 #1198

Merged

jrray requested a review from Copilot April 16, 2025 23:06

Copilot AI reviewed Apr 16, 2025

View reviewed changes

dcookspi reviewed Apr 17, 2025

View reviewed changes

rydrman reviewed Apr 30, 2025

View reviewed changes

jrray force-pushed the resolvo branch from 5a2b256 to 18180a9 Compare May 17, 2025 02:18

jrray changed the base branch from main to rename-solver May 18, 2025 17:34

jrray added the pr-chain This PR doesn't target the main branch, don't merge! label May 18, 2025

jrray force-pushed the resolvo branch from 18180a9 to 4e396db Compare May 18, 2025 20:15

jrray force-pushed the rename-solver branch from 45718f7 to f341570 Compare May 18, 2025 20:32

jrray marked this pull request as ready for review May 18, 2025 20:37

jrray force-pushed the resolvo branch from 4e396db to 2ba2880 Compare May 18, 2025 20:58

Base automatically changed from rename-solver to main May 23, 2025 22:19

jrray removed the pr-chain This PR doesn't target the main branch, don't merge! label May 23, 2025

rydrman approved these changes May 28, 2025

View reviewed changes

jrray and others added 27 commits May 28, 2025 15:21

Add a crude solve result output to cmd_explain

7e35be3

Fix requests for :all not filtering invalid versions

cfd6135

Messed up the logic when checking :all cases. Add test for this case. Signed-off-by: J Robert Ray <jrray@jrray.org>

Implement update_options in new solver

855ada4

These are effectively turned into more requests. Now the host options are being looked at. Signed-off-by: J Robert Ray <jrray@imageworks.com>

Wire up the normal solution output in cmd_explain

c2152b1

The requested by column is not populated. Signed-off-by: J Robert Ray <jrray@imageworks.com>

Avoid reading packages where possible

a5c558a

Improve performance by not reading builds for packages that are for versions that are not applicable to the root requests. Signed-off-by: J Robert Ray <jrray@imageworks.com>

Add feedback for solver retries

99f93dc

Log an info message for now, in order to gauge how many retries are happening in practice. Signed-off-by: J Robert Ray <jrray@imageworks.com> Signed-off-by: J Robert Ray <jrray@jrray.org>

Add solver arg to spk test

8adfb26

Making it possible to choose what solver to use when running spk tests. Signed-off-by: J Robert Ray <jrray@imageworks.com>

Change these calls to mimic the one in install.rs

eae5253

However, this can still become a recursive call that blows out the stack if the bounds on Solver don't fit the bounds on the "real" `test` implementation. Signed-off-by: J Robert Ray <jrray@imageworks.com>

Make it possible to choose resolvo with --solver-to-run

8a8ef8d

Signed-off-by: J Robert Ray <jrray@jrray.org>

Remove unnecessary clone

018408f

`key_entry_names` is unused after it is cloned into `ordered_names`. Signed-off-by: J Robert Ray <jrray@jrray.org>

Refactor some build key logic from SortedBuildIterator

28b1d0d

In order to reuse this logic for the resolvo solver, extract it to an external function. It is not practical for the resolvo solver to directly use a SortedBuildIterator. Signed-off-by: J Robert Ray <jrray@jrray.org>

Do a reverse sort using std::cmp::Reverse

0bdcfdf

Potential speedup by avoiding doing the reversal as a separate step. Signed-off-by: J Robert Ray <jrray@jrray.org>

Rework build sorting in the new solver

cd1834e

Reuse the same build sorting logic as the original solver in order to get a deterministic result out of the solve. The solve can change drastically based on which build is selected first. Signed-off-by: J Robert Ray <jrray@jrray.org>

Don't inject credentials for resolvo repo

a578048

This is a public repo. Signed-off-by: J Robert Ray <jrray@imageworks.com>

Convenience conversion from EmbeddedBuildSource to BuildIdent

b124cf3

Signed-off-by: J Robert Ray <jrray@imageworks.com>

Resolvo: fix stubs getting added to solve without parent

b4f5690

The case where a stub only depended on the Run component of the parent still needs to declare a dependency to bring it in. Signed-off-by: J Robert Ray <jrray@imageworks.com>

jrray force-pushed the resolvo branch from 2ba2880 to cfba38f Compare May 28, 2025 22:54

jrray merged commit 3c32425 into main May 28, 2025
5 checks passed

jrray deleted the resolvo branch May 28, 2025 23:05

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add support for a CDCL-type SAT solver using the Resolvo crate #1179

Add support for a CDCL-type SAT solver using the Resolvo crate #1179

Uh oh!

jrray commented Feb 10, 2025

Uh oh!

jrray commented Feb 13, 2025

Uh oh!

jrray commented Feb 13, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

dcookspi Apr 17, 2025

Uh oh!

dcookspi Apr 17, 2025

Uh oh!

jrray May 9, 2025

Uh oh!

rydrman left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

rydrman commented Apr 30, 2025

Uh oh!

jrray commented May 19, 2025

Uh oh!

Uh oh!

Uh oh!

Add support for a CDCL-type SAT solver using the Resolvo crate #1179

Add support for a CDCL-type SAT solver using the Resolvo crate #1179

Uh oh!

Conversation

jrray commented Feb 10, 2025

Uh oh!

jrray commented Feb 13, 2025

Uh oh!

jrray commented Feb 13, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

dcookspi Apr 17, 2025

Choose a reason for hiding this comment

Uh oh!

dcookspi Apr 17, 2025

Choose a reason for hiding this comment

Uh oh!

jrray May 9, 2025

Choose a reason for hiding this comment

Uh oh!

rydrman left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

rydrman commented Apr 30, 2025

Uh oh!

jrray commented May 19, 2025

Uh oh!

Uh oh!

Uh oh!