VirtualPtr<dyn Trait>
VirtualPtr
is the key pointer type enabling all the FFI-safe dyn
-support machinery in safer-ffi
.
-
In order to better convey its purpose and semantics, other names considered for this type (besides a
VPtr
shorthand) have been:DynPtr<dyn Trait>
DynBox<dyn Trait>
VirtualBox<dyn Trait>
/VBox
(this one has been very strongly considered)
Indeed, this type embodies owning pointer semantics, much like Box
does.
But it does so with a twist, hence the dedicated special name: the owning mode is, itself, virtual/dyn
!
-
As will be seen in the remainder of this post, this aspect of
VirtualPtr
is gonna be the key element to allow full type unification across even different pointer types!For instance, consider:
fn together<'r>( a: Box<impl 'r + Trait>, b: Rc<impl 'r + Trait>, c: &'r impl Trait, ) -> [???; 3] // 🤔🤔🤔 { [a.into(), b.into(), c.into()] }
With
VirtualPtr
, we can fully type-erase and thus type-unify all these three types into a common one:
This allows a unified type able to cover all of Box<dyn Trait>
, {A,}Rc<dyn Trait>
, &[mut] dyn Trait
under one same umbrella
One type to unify them all,
One type to coërce them,
One type to bring them all
and in the erasure bind them.
Constructing a VirtualPtr
from Rust
That is, whilst a Box<impl Trait>
can1 be "coërced" .into()
a VirtualPtr<dyn Trait>
, Box
will oftentimes not be the sole pointer/indirection with that capability. Indeed, there will often be other similar "coërcions" from a &impl Trait
, a &mut impl Trait
, a Rc<impl Trait>
, or a Arc<impl Trait + Send + Sync>
!
provided that dyn Trait
be a ReprCTrait
, i.e., that the Trait
definition have been #[derive_ReprC(dyn)]
-annotated.
Here is the complete list of possible conversion at the moment:
-
Given
<T> where T : 'T + Trait
, -
With
Trait
"beingReprC
" / FFI-safe (i.e.,dyn Trait : ReprCTrait
)
From<...> | .into() | Notes for Trait |
---|---|---|
Box<T> | VirtualPtr<dyn 'T + Trait> | • (requires T : Clone when Clone -annotated) |
&T | VirtualPtr<dyn '_ + Trait> | • cannot have &mut self methods |
&mut T | VirtualPtr<dyn '_ + Trait> | • cannot be Clone -annotated |
Rc<T> | VirtualPtr<dyn 'T + Trait> | • must be Clone -annotated• cannot have &mut self methods |
Arc<T> | VirtualPtr<dyn 'T + Trait + Send + Sync> | • must be Clone -annotated• cannot have &mut self methods• requires T : Send + Sync |
- Where "
Clone
-annotated" refers to the#[derive_Repr(dyn, Clone)]
case.
Remarks
-
Whenever
T : 'static
, we can pick'T = 'static
, so thatdyn 'T + Trait
may be more succintly written asdyn Trait
. -
If the trait has methods with a
Pin
nedself
receiver, then theFrom<…>
-column needs to bePin
-wrapped. -
+ Send
and/or+ Sync
can always be added inside aVirtualPtr
, in which caseT : Send
and/orT : Sync
(respectively) will be required.- The only exception here is
Rc
, sinceRc<dyn Trait + Send + Sync>
& co. are oxymorons which have been deemed not to deserve the necessary codegen (if multiple ownership andSend + Sync
is required, useArc
, otherwise, useRc
).
Tip: Since
+ Send + Sync
is so pervasive(ly recommended for one's sanity) when doing FFI, these can be added as super-traits of ourTrait
, so that they be implied in bothT : Trait
anddyn Trait
, thereby alleviating the syntax without compromising the thread-safety:#[derive_ReprC(dyn, /* Clone */)] trait Trait : Send + Sync {
- But be aware that, even with such a super trait annotation,
dyn Trait
anddyn Trait + Send + Sync
will remain being distinct types as far as Rust is concerned! ⚠️
- The only exception here is
Its FFI-layout: constructing and using VirtualPtr
from the FFI
Given some:
#[derive_ReprC(dyn, /* Clone */)]
trait Trait {
fn get(&self, _: bool) -> i32;
fn set(&mut self, _: i32);
fn method3(&… self, _: Arg1, _: Arg2, …) -> Ret;
…
}
- (with
Arg1 : ReprC<CLayout = CArg1>
, etc.)
A VirtualPtr<dyn Trait>
will be laid out as the following:
type ErasedPtr = ptr::NonNull<ty::Erased>; /* modulo const/mut */
#[repr(C)]
struct VirtualPtr<dyn Trait> {
ptr: ErasedPtr,
// Note: it is *inlined* / *no* pointer indirection!
vtable: {
// the `drop` / `free`ing function.
release_vptr: unsafe extern fn(ErasedPtr),
/* if `Clone`-annotated:
retain_vptr: unsafe extern fn(ErasedPtr) -> VirtualPtr<dyn Trait>, */
/* and the FFI-safe virtual methods of the trait: */
get: unsafe extern fn(ErasedPtr, _: CLayoutOf<bool>) -> i32,
set: unsafe extern fn(ErasedPtr, _: i32),
method3: unsafe extern fn(ErasedPtr, _: CArg1, _: CArg2, …) -> CRet,
…
},
}
A fully virtual owning mode
Remember the sentence above?
But it does so with a twist, hence the dedicated special name: the owning mode is, itself, virtual/
dyn
!
What this means is that all of the destructor is virtual / dyn
amically-dispatched, for instance (and ditto for .clone()
ing, when applicable).
Non-fully-virtual examples
To better understand this nuance, consider the opposite (types which are not fully virtual / dyn
amically dispatched, such as Box<dyn …>
): what happens when you drop a Box<dyn Trait>
vs. dropping a Rc<dyn Trait>
?
-
when you drop a
Box<dyn Trait>
:- It virtually/
dyn
-amically queries theLayout
knowledge of thatdyn Trait
type-erased data; - It virtually/
dyn
-amically drops thedyn Trait
pointee in place; - It then calls
dealloc
(free
) of the backing storage using the aforementioned dataLayout
(as the layout of the whole allocation, since aBox<T>
allocates exactly as much memory as needed to hold aT
)
This last step is thus statically dispatched, thanks to the static/compile-time knowledge of the hard-coded
Box
type inBox<dyn Trait>
!Pseudo-code
//! Pseudo-code! fn drop(self: &mut Box<dyn …>) { let layout = self.dyn_layout(); // (self.vtable.layout)(self.ptr) unsafe { // SAFETY: this is conceptual pseudo-code and may have bugs. self.dyn_drop_in_place(); // (self.vtable.drop_in_place)(self.ptr) dealloc(&mut *self as *mut _, layout); } }
- It virtually/
-
when you drop a
Rc<dyn Trait>
:- It virtually/
dyn
-amically queries theLayout
knowledge of thatdyn Trait
type-erased data; - It then embiggens the aforementioned layout so as to get the layout of all of the
Rc
's actual pointee / actual allocation (that is, theRcBox
, i.e., the data alongside two reference counters), so as to be able to access those counters, - and then decrements the appropriate counters (mostly the strong count);
- if it detects that it was the last owner (strong count from 1 to 0):
- It virtually/
dyn
-amically drops thedyn Trait
pointee in place; - It then calls
dealloc
(free
) for that wholeRcBox
's backing storage (when there are no outstandingWeak
s).
- It virtually/
The steps
2.
,3.
and4.2
are thus statically dispatched, thanks to the static/compile-time knowledge of the hard-codedRc
type inRc<dyn Trait>
!Pseudo-code
//! Pseudo-code! fn drop(self: &mut Rc<dyn …>) { let layout = self.dyn_layout(); // (self.vtable.layout)(self.ptr) unsafe { // SAFETY: this is conceptual pseudo-code and may have bugs. let rcbox: &RcBox<dyn …> = adjust_ptr(self.ptr, layout); let prev = rcbox.strong_count.get(); rcbox.strong_count.set(prev - 1); if prev == 1 { // if last strong owner rcbox.data.dyn_drop_in_place(); // (….vtable.drop_in_place)(….ptr) if rcbox.weak_count == … { // if no outstanding weaks dealloc(rcbox as *const _ as *mut _, layout); } } } }
- It virtually/
We can actually even go further, and wonder what Rust does:
-
when a
&mut dyn Trait
or a&dyn Trait
goes out of scope:-
Nothing.
(Since it knows that the
&[mut] _
types have no drop glue whatsoever)
This step (or rather, lack thereof) is another example of statically dispatched logic.
-
It should thus now be clear that:
-
whilst type erasure of the pointee does happen whenever your deal with a
ConcretePtr<dyn Trait>
such asBox<dyn Trait>
,&mut dyn Trait
, etc. -
on the other hand, the
ConcretePtr
behind which such erasure happens is not, itself, type-erased! It is still statically-known, and functionality such asDrop
,Clone
, or evenCopy
may take advantage of that information (e.g.,&dyn Trait
isCopy
).
Another example: dyn_clone()
Let's now compare, in the context of type-erased dyn Trait
pointees, a static operation vs. a virtual / dyn
amically dispatched one.
For starters, let's consider the following Trait
definition:
trait Trait : 'static {
// &dyn Trait
fn dyn_clone(self: &Self) -> Box<dyn Trait>;
}
impl<T : 'static + Clone> Trait for T {
fn dyn_clone(self: &T) -> Box<dyn Trait> {
Box::new(T::clone(self)) /* as Box<dyn Trait> */
}
}
and now, let's think about and compare the behaviors of the two following functions:
fn clone_box(b: &Box<dyn Trait>) -> Box<dyn Trait> {
b.dyn_clone()
}
fn clone_rc(r: &Rc<dyn Trait>) -> Rc<dyn Trait> {
r.clone() // Rc::clone(r)
}
clone_box
isdyn
amically calling and delegating todyn Trait
'sdyn_clone
virtual method;clone_rc
is statically / within-hard-coded code logic performing a (strong) reference-count increment inside theRcBox<dyn Trait>
pointee, thereby never interacting with thedyn Trait
value itself.
(Granted, the former is performing a statically-dispatched Deref
coercion beforehand, and the latter may be dyn
amically looking up dyn Trait
's Layout
, but the main point still stands).
From partially dyn
amic to fully dyn
amic
From all this, I hope the hybrid static-dyn
amic nature of Rust's ConcretePtr<dyn ErasedPointee>
(wide) pointers logic is now more apparent and clearer.
From there, we can then wonder what happens if we made it all fully dynamic
: VirtualPtr
is born!
Summary
-
all of the
drop
glue is to bedyn
amically dispatched (through some virtualfn
pointer performing adrop_ptr
operation)://! Pseudo-code impl<T> DynDrop for Box<T> { fn dyn_drop_ptr(self) { drop::<Box<T>>(self); } } impl<T> DynDrop for Arc<T> { fn dyn_drop_ptr(self) { drop::<Arc<T>>(self); } } impl<T> DynDrop for &mut T { fn dyn_drop_ptr(self) {} } impl<T> DynDrop for &T { fn dyn_drop_ptr(self) {} }
Notice how this shall therefore imbue with
move
/ownership semantics originally-Copy
pointers such as&T
. Indeed, once we go fully virtual, by virtue of being compatible/type-unified with non-Copy
pointers such asBox<T>
or&mut T
, it means we have to conservatively assume anyVirtualPtr<…>
instance may have to run significant drop glue at most once, which thence makesVirtualPtr
s not beCopy
, even when they've originated from a&T
reference. -
Clone
, if any, is also to be fullydyn
amically dispatched as well://! Pseudo-code impl<T> DynClone for Box<T> where T : Clone, { fn dyn_clone_ptr(self: &Self) -> Self { Box::new(T::clone(&**self)) } } impl<T> DynClone for Arc<T> { fn dyn_clone_ptr(self: &Self) -> Self { Arc::clone(self) } } /* * no `Clone` for `&mut`, obviously: * thus, no `From<&mut T>` for `VirtualPtr<dyn DynClone>` either. */ impl<T> DynClone for &'_ T { fn dyn_clone_ptr(self: &Self) -> Self { // `&T : Copy` *self } }
Regarding the previous point about
&T
-originatedVirtualPtr
s not beingCopy
anymore, we can see we can get the functional API back (i.e.,Clone
), if we pinky promise not to mix suchVirtualPtr
s with non-Clone
-originating pointers (such as&mut T
)Bonus:
&mut T
-reborrowingIf you think about
&mut T
, whilst notCopy
, it's still kind of an interesting pointer, since a&'short mut &'long mut T
can yield a&'short mut T
through reborrowing, thereby removing one layer of indirection, by "squashing" the lifetimes together into their intersection (which here happens to be the shortest one,'short
, since'long : 'short
).In explicit API parlance, this would become:
Despite intellectually interesting, this is nonetheless a niche and contrived API which is therefore not exposed through
safer-ffi
'sVirtualPtr
type, for it is deemed that&'short mut VirtualPtr<dyn 'long + …>
ought to offer most of the API of a reborrowedVirtualPtr<dyn 'short + …>
.
Related read/concept: dyn *
In a way, the API/functionality of VirtualPtr
is quite similar to the very recent dyn *
experimental2 unstable feature of Rust.
As of this writing2, there isn't that much proper documentation about it, and one would have to wander through Zulip discussions to know more about it, but for the following post:
A Look at dyn* Code Generation — by Eric Holk
as of 1.68.0
Click here to see my own summary of dyn *
The gist of it is that barring repr(C)
/ extern "C"
/ FFI-compatibility considerations about which dyn *
does not worry, the idea is kind of the same as VirtualPtr
, but for one extra quirk. Instead of a simple thin pointer-sized data pointer, a dyn *
will rather use the following "erased data" type:
union ErasedData<const N: usize = 1> {
ptr: *mut ty::Erased,
inline: [MaybeUninit<usize>; N],
}
struct dyn* Trait<const N: usize = 1> {
data: ErasedData<N>,
vtable: …,
}
Historically, most platforms have featured *mut _
and usize
having the same layout, so on those platforms and in the case of N = 1
(and papering over MaybeUninit
/ padding bytes), you may still be seeing a *mut ty::Erased
directly rather than an ErasedData
.
For instance, we could imagine all this applied to our VirtualPtr
s: we'd now be able to implement it for non-pointer types, provided they fit within the ErasedData
inline storage!
//! Pseudo-code: currently *NOT* in `safer-ffi`
#[derive_ReprC(dyn)]
trait Example : Clone {
fn dyn_print(&self);
}
impl Example for usize /* which fits inside `*mut Erased` */ {
/* auto-generated:
fn dyn_drop(self)
{
/* nothing to do to drop a `usize` since it is `Copy` */
}
fn dyn_clone(self: &Self)
-> Self
{
*self
} */
fn dyn_print(self: &Self)
{
dbg!(*self);
}
}
fn main()
{
let n: usize = 42;
// look ma: no indirection!
let vptr: VirtualPtr<dyn Example> = n.into();
/* let vptr = VirtualPtr {
vptr: ErasedData { inline: n },
vtable: VTable {
release_vptr: usize::dyn_drop,
retain_vptr: usize::dyn_clone,
dyn_print: usize::dyn_print,
},
}; */
vptr.dyn_print(); // correctly prints `42`.
}
- Note that this is currently deemed too niche and is not featured by
safer-ffi
.