-
Notifications
You must be signed in to change notification settings - Fork 1.6k
RFC: result_ffi_guarantees #3391
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Changes from 3 commits
4b5a891
f647e37
e729531
2c258cc
e8ecdc4
40b65f3
7320c01
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,94 @@ | ||
# RFC: result_ffi_guarantees | ||
|
||
- Feature Name: `result_ffi_guarantees` | ||
- Start Date: 2023-02-15 | ||
- RFC PR: [rust-lang/rfcs#0000](https://github.com/rust-lang/rfcs/pull/0000) | ||
- Rust Issue: [rust-lang/rust#0000](https://github.com/rust-lang/rust/issues/0000) | ||
|
||
# Summary | ||
[summary]: #summary | ||
|
||
This RFC gives specific layout and ABI guarantees when wrapping "non-zero" data types from `core` in `Option` or `Result`. This allows those data types to be used directly in FFI, in place of the primitive form of the data (eg: `Result<(), NonZeroI32>` instead of `i32`). | ||
|
||
# Motivation | ||
[motivation]: #motivation | ||
|
||
Rust often needs to interact with foreign code. However, foreign function type signatures don't normally support any of Rust's rich type system. Particular function inputs and outputs will simply use 0 (or null) as a sentinel value and the programmer has to remember when that's happening. | ||
|
||
Though it's common for "raw bindings" crates to also have "high level wrapper" crates that go with them (eg: `windows-sys`/`windows`, or `sdl2-sys`/`sdl2`, etc), someone still has to write those wrapper crates which use the foreign functions directly. Allowing Rust programmers to use more detailed types with foreign functions makes their work easier. | ||
|
||
# Guide-level explanation | ||
[guide-level-explanation]: #guide-level-explanation | ||
|
||
I'm not sure how to write a "guide" portion of this that's any simpler than the "reference" portion, which is already quite short. | ||
|
||
# Reference-level explanation | ||
[reference-level-explanation]: #reference-level-explanation | ||
|
||
When either of these two `core` types: | ||
|
||
* `Option<T>` | ||
* `Result<T, E>` where either `T` or `E`: | ||
* Are a zero-sized type with alignment 1 (a "1-ZST"). | ||
* Either have no fields (eg: `()` or `struct Foo;`) or have `repr(transparent)` if there are fields. | ||
* Do not have the `#[non_exhaustive]` attribute. | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. The issue is that it's very easy to add a private field and accidentally break the guaranteed layout. Imho there should be some explicit way to opt into such optimizations for ZSTs which are guaranteed to remain ZST. I don't think Is the "private attribute" opinion documented somewhere? Afaik it isn't, and it's not uncommon to treat it as an API guarantee. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I asked Josh via Zulip and he said it would be fine to further restrict the rules here even while FCP is active. I will make an update later today to simply require no fields at all (and no non_exhaustive). Then it will be the usual semver breaking change if a field is added. I believe that should satisfy things? There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. No. The thing that really worries me is that, being a guaranteed optimization, breaking the layout won't produce any errors. The optimization will just silently stop applying. This will lead to UB on the FFI side, and I don't see any reliable way to check for such breakage. An upstream crate makes a semver breaking change by adding some inner field (possibly even zero-sized one, but perhaps not in a guaranteed way). A downstream crate which used the changed type in Currently the solution would involve explicitly ABI-stable types, and conversions between them and native Rust types. Such conversions would either work correctly, fail to compile or fail at runtime (assuming they are properly written and verify their preconditions). Also note that this isn't an issue for current types with guaranteed layout optimizations, since their number is very small and they are all provided by the language. There is only a finite number of non-generic For this reason I would prefer to have an explicit way to specify that the layout optimization must apply, rather than deduce it from implicit properties of code, which may be changed by the programmer without even considering that someone could rely on them. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. If the crate you're getting a type from puts some attribute on their type, can't they just remove that attribute (presumably as a semver break) in a future version? Then isn't it just as bad for the downstream crates relying on it? Honestly I don't think people should do this in the first place with any type that's not their own type or There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. the point of having an attribute is that explicitly having to remove it makes people consider it a semver break since it's obvious, whereas having it completely implicit makes it much more likely people won't realize they need a semvar break to change their library type -- most people aren't going to think about There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
We do have There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
|
||
|
||
Is combined with a non-zero or non-null type (see the chart), the combination has the same layout (size and alignment) and the same ABI as the primitive form of the data. | ||
|
||
| Example combined Type | Primitive Type | | ||
|:-|:-| | ||
| `Result<NonNull<T>, ()>` | `*mut T` | | ||
| `Result<&T, ()>` | `&T` | | ||
| `Result<&mut T, ()>` | `&mut T` | | ||
| `Result<fn(), ()>` | `fn()` | | ||
| `Result<NonZeroI8, ()>` | `i8` | | ||
| `Result<NonZeroI16, ()>` | `i16` | | ||
| `Result<NonZeroI32, ()>` | `i32` | | ||
| `Result<NonZeroI64, ()>` | `i64` | | ||
| `Result<NonZeroI128, ()>` | `i128` | | ||
| `Result<NonZeroIsize, ()>` | `isize` | | ||
| `Result<NonZeroU8, ()>` | `u8` | | ||
| `Result<NonZeroU16, ()>` | `u16` | | ||
| `Result<NonZeroU32, ()>` | `u32` | | ||
| `Result<NonZeroU64, ()>` | `u64` | | ||
| `Result<NonZeroU128, ()>` | `u128` | | ||
| `Result<NonZeroUsize, ()>` | `usize` | | ||
|
||
* While `fn()` is listed just once in the above table, this rule applies to all `fn` types (regardless of ABI, arguments, and return type). | ||
|
||
For simplicity the table listing only uses `Result<_, ()>`, but swapping the `T` and `E` types, or using `Option<T>` is also valid. | ||
What changes are the implied semantics: | ||
* `Result<NonZeroI32, ()>` is "a non-zero success value" | ||
* `Result<(), NonZeroI32>` is "a non-zero error value" | ||
* `Option<NonZeroI32>` is "a non-zero value is present" | ||
* they all pass over FFI as if they were an `i32`. | ||
|
||
Which type you should use with a particular FFI function signature still depends on the function. | ||
Rust can't solve that part for you. | ||
However, once you've decided on the type you want to use, the compiler's normal type checks can guide you everywhere else in the code. | ||
|
||
# Drawbacks | ||
[drawbacks]: #drawbacks | ||
|
||
* The compiler has less flexibility with respect to discriminant computation and pattern matching optimizations when a type is niche-optimized. | ||
|
||
# Rationale and alternatives | ||
[rationale-and-alternatives]: #rationale-and-alternatives | ||
|
||
It's always possible to *not* strengthen the guarantees of the language. | ||
|
||
# Prior art | ||
[prior-art]: #prior-art | ||
|
||
The compiler already suports `Option` being combined with specific non-zero types, this RFC mostly expands the list of guaranteed support. | ||
|
||
# Unresolved questions | ||
[unresolved-questions]: #unresolved-questions | ||
|
||
None at this time. | ||
|
||
# Future possibilities | ||
[future-possibilities]: #future-possibilities | ||
|
||
* This could be expanded to include [ControlFlow](https://doc.rust-lang.org/nightly/core/ops/enum.ControlFlow.html) and [Poll](https://doc.rust-lang.org/nightly/core/task/enum.Poll.html). | ||
* This could be extended to *all* similar enums in the future. However, without a way to opt-in to the special layout and ABI guarantees (eg: a trait or attribute) it becomes yet another semver hazard for library authors. The RFC is deliberately limited in scope to avoid bikesheding. |
Uh oh!
There was an error while loading. Please reload this page.