Updated book with changes to library error handling.

A new section covers error handling with a couple of recommendations. All examples and existing sections are updated with the changes.
Smithay · Feb 1, 2022 · 4d966b8 · 4d966b8
1 parent 4f9e01a
commit 4d966b8
Show file tree

Hide file tree

Showing 11 changed files with 256 additions and 75 deletions.
diff --git a/doc/src/SUMMARY.md b/doc/src/SUMMARY.md
@@ -9,6 +9,7 @@
   - [Ping](ch02-03-ping.md)
   - [Channels](ch02-04-channels.md)
   - [Unix Signals](ch02-05-signals.md)
+  - [Error handling](ch02-06-errors.md)
 - [I need async/await!](ch03-00-async-await.md)
   - [Run async code](ch03-01-run-async-code.md)
   - [Async IO types](ch03-02-async-io-types.md)

diff --git a/doc/src/adapt_io_example.rs b/doc/src/adapt_io_example.rs
@@ -2,7 +2,6 @@
 use calloop::EventLoop;
 
 // ANCHOR: use_futures_io_traits
-// futures = "0.3"
 use futures::io::{AsyncReadExt, AsyncWriteExt};
 // ANCHOR_END: use_futures_io_traits
 
@@ -15,10 +14,12 @@ fn main() -> std::io::Result<()> {
     let mut event_loop = EventLoop::try_new()?;
     let handle = event_loop.handle();
 
-    handle.insert_source(exec, |evt, _metadata, _shared| {
-        // Print the value of the async block ie. the return value.
-        println!("Async block ended with: {}", evt);
-    })?;
+    handle
+        .insert_source(exec, |evt, _metadata, _shared| {
+            // Print the value of the async block ie. the return value.
+            println!("Async block ended with: {}", evt);
+        })
+        .map_err(|e| e.error)?;
     // ANCHOR_END: decl_loop
 
     // ANCHOR: decl_io

diff --git a/doc/src/async_example.rs b/doc/src/async_example.rs
@@ -14,10 +14,12 @@ fn main() -> std::io::Result<()> {
     let mut event_loop = EventLoop::try_new()?;
     let handle = event_loop.handle();
 
-    handle.insert_source(exec, |evt, _metadata, _shared| {
-        // Print the value of the async block ie. the return value.
-        println!("Async block ended with: {}", evt);
-    })?;
+    handle
+        .insert_source(exec, |evt, _metadata, _shared| {
+            // Print the value of the async block ie. the return value.
+            println!("Async block ended with: {}", evt);
+        })
+        .map_err(|e| e.error)?;
     // ANCHOR_END: decl_loop
 
     // ANCHOR: decl_async

diff --git a/doc/src/ch02-06-errors.md b/doc/src/ch02-06-errors.md
@@ -0,0 +1,101 @@
+# Error handling in Calloop
+
+## Super quick advice
+
+Use [*Thiserror*](https://crates.io/crates/thiserror) to create structured errors for your library, and [*Anyhow*](https://crates.io/crates/anyhow) to propagate and pass them across API boundaries, like in [the ZeroMQ example](ch04-06-the-full-zeromq-event-source-code.md).
+
+## Overview
+
+Most error handling crates/guides/documentation for Rust focus on one of two situations:
+
+- Creating errors that an API can propagate out to a user of the API, or
+- Making your library deal nicely with the `Result`s from closure or trait methods that it might call
+
+Calloop has to do both of these things. It needs to provide a library user with errors that work well with `?` and common error-handling idioms in their own code, and it needs to handle errors from the callbacks you give to `process_events()` or `insert_source()`. It *also* needs to provide some flexibility in the `EventSource` trait, which is used both for internal event sources and by users of the library.
+
+Because of this, error handling in Calloop leans more towards having separate error types for different concerns. This may mean that there is some extra conversion code in places like returning results from `process_events()`, or in callbacks that use other libraries. However, we try to make it smoother to do these conversions, and to make sure information isn't lost in doing so.
+
+The place where this becomes the most complex is in the `process_events()` method on the `EventSource` trait.
+
+## The Error type on the EventSource trait
+
+The `EventSource` trait contains an associated type named `Error`, which forms part of the return type from `process_events()`. This type must implement `std::error::Error` and be `Sync + Send`.
+
+As a rule, if you implement `EventSource` you should try to split your errors into two different categories:
+
+- Errors that make sense as a kind of event. These should be a part of the `Event` associated type eg. as an enum or `Result`.
+- Errors that mean your event source simply cannot process more events. These should form the `Error` associated type.
+
+For an example, take Calloop's channel type, [`calloop::channel::Channel`](api/calloop/channel/struct.Channel.html). When the sending end is dropped, no more messages can be received after that point. But this is not returned as an error when calling `process_events()`, because you still want to (and can!) receive messages sent before that point that might still be in the queue. Hence the events received by the callback for this source can be `Msg(e)` or `Closed`.
+
+However, if the internal ping source produces an error, there is no way for the sending end of the channel to notify the receiver. It is impossible to process more events on this event source, and the caller needs to decide how to recover from this situation. Hence this is returned as a `ChannelError` from `process_events()`.
+
+Another example might be an event source that represents a running subprocess. If the subprocess exits with a non-zero status code, or the executable can't be found, those don't mean that events can no longer be processed. They can be provided to the caller through the callback. But if the lower level sources being used to run (eg. an asynchronous executor or subprocess file descriptor) fail to work as expected, `process_events()` should return an error.
+
+If your crate already has some form of structured error handling, Calloop's error types should pose no problem to integrate into this. All of Calloop's errors implement `std::error::Error` and can be manipulated the same as any other error types.
+
+If you want a more flexible or general approach, and you're not sure where to start, here are some suggestions that might help.
+
+> Please note that in what follows, the name `Error` can refer to one of two different things:
+> - the trait `std::error::Error` - this will be whenever it qualifies a trait object ie. `dyn Error` means `dyn std::error::Error`
+> - the associated type `Error` on the `EventSource` trait ie. as `type Error = ...`
+
+### Thiserror and Anyhow
+
+[*Thiserror*](https://crates.io/crates/thiserror) and [*Anyhow*](https://crates.io/crates/anyhow) are two excellent error handling crates crated by David Tolnay. Thiserror provides procedural macros for creating structured error types with minimal runtime cost. Anyhow provides some extremely flexible ways to combine errors from different sources and propagate them. This is the approach used in [the ZeroMQ example](ch04-06-the-full-zeromq-event-source-code.md).
+
+One wrinkle in this approach is that `anyhow::Error` does not, in fact, implement `std::error::Error`. This means it can't directly be used as the associated type `calloop::EventSource::Error`. That's where Thiserror comes in.
+
+The basic idea is that you use Thiserror to create an error type to use as the associated type on your event source. This could be a single element struct like this:
+
+```rust,noplayground
+#[derive(thiserror::Error, Debug)]
+#[error(transparent)]
+pub struct MyError(#[from] anyhow::Error);
+```
+
+This creates a minimal implementation for a struct that forwards the important `std::error::Error` trait methods to the encapsulated `anyhow::Error` value. (You could also use Thiserror to create an error with a specific variant for, and conversion from, `zmq::Error` if that's useful.)
+
+But how do we get from one of Calloop's errors (or a third party library's) to this "anyhow" value? One way is to use Anyhow's `context` trait method, which is implemented for any implementation of `std::error::Error`. This is doubly useful: it creates an `anyhow::Error` from the original error, and also adds a message that appears in the traceback. For example:
+
+```rust,noplayground
+self.socket
+    .send_multipart(parts, 0)
+    .context("Failed to send message")?;
+```
+
+Here, the result of `send_multipart()` might be a `zmq::Error`, a type that is completely unrelated to Calloop. Calling `context()` wraps it in an `anyhow::Error` with the message *"Failed to send message"*, which will appear in a traceback if the error (or one containing it) is printed with `{:?}`. The `?` operator then converts it our own `MyError` type if it needs to return early.
+
+### Arc-wrapped errors
+
+Since any error can be converted to a `Box<dyn Error>`, this suggests another simple approach for error handling. Indeed it's pretty common to find libraries returning `Result<..., Box<dyn Error>>`.
+
+Unfortunately you cannot simply set `type Error = Box<dyn Error + Sync + Send>` in your event source. This is for the same reason as with Anyhow: `Box<dyn Error>` does not actually implement the `Error` trait.
+
+There is a smart pointer type in `std` that *does* allow this though: setting `type Error = std::sync::Arc<dyn Error + Sync + Send>` works fine. You can do this with the `map_err()` method on a `Result`:
+
+```rust,noplayground
+type Error = Arc<dyn Error + Sync + Send>;
+
+fn process_events<F>(...) -> Result<calloop::PostAction, Self::Error> where ... {
+    self.nested_source
+        .process_events(readiness, token, |_, _| {})
+        .map_err(|e| Arc::new(e) as Arc<dyn Error + Sync + Send>)?;
+}
+```
+
+The `Arc::new(e) as ...` is known as an [unsized coercion](https://doc.rust-lang.org/reference/type-coercions.html#unsized-coercions). You can even just do:
+
+```rust,noplayground
+    self.nested_source
+        .process_events(readiness, token, |_, _| {})
+        .map_err(Box::from)?;
+```
+
+...since the `?` takes care of the second step of the conversion (`Box` to `Arc` in this case).
+
+### Which to choose
+
+Arc-wrapping errors only really has the advantage of fewer 3rd-party dependencies, and whether that really is an advantage depends on context. If it's a matter of policy, or simply not needing anything more, use this approach.
+
+Anyhow and Thiserror are both extremely lean in terms of code size, performance and their own dependencies. The extra `context()` call is exactly the same number of lines of code as `map_err()` but has the advantage of providing more information. Using Thiserror also lowers the effort for more structured error handling in the future. If those seem useful to you, use this approach.
diff --git a/doc/src/ch04-02-creating-our-source-part-1-our-types.md b/doc/src/ch04-02-creating-our-source-part-1-our-types.md
@@ -12,7 +12,7 @@ So at a minimum, our type needs to contain these:
 pub struct ZeroMQSource
 {
     // Calloop components.
-    socket_source: calloop::generic::Generic<calloop::generic::Fd>,
+    socket_source: calloop::generic::Generic<std::os::unix::io::RawFd>,
     mpsc_receiver: calloop::channel::Channel<?>,
     wake_ping_receiver: calloop::ping::PingSource,
 }
@@ -26,7 +26,7 @@ What else do we need? If the `PingSource` is there to wake up the loop manually,
 pub struct ZeroMQSource
 {
     // Calloop components.
-    socket_source: calloop::generic::Generic<calloop::generic::Fd>,
+    socket_source: calloop::generic::Generic<std::os::unix::io::RawFd>,
     mpsc_receiver: calloop::channel::Channel<?>,
     wake_ping_receiver: calloop::ping::PingSource,
 
@@ -59,14 +59,16 @@ where
 > Remember that it's not just `Vec<T>` and other sequence types that implement `IntoIterator` — `Option<T>` implements it too! There is also `std::iter::Once<T>`. So if a user of our API wants to enforce that all "multi"-part messages actually contain exactly one part, they can use this API with `T` being, say, `std::iter::Once<zmq::Message>` (or even just `[zmq::Message; 1]` in Rust 2021 edition).
 
 ## Associated types
-The `EventSource` trait has three associated types:
+The `EventSource` trait has four associated types:
 
 - `Event` - when an event is generated that our caller cares about (ie. not some internal thing), this is the data we provide to their callback. This will be another sequence of messages, but because we're constructing it we can be more opinionated about the type and use the return type of `zmq::Socket::recv_multipart()` which is `Vec<Vec<u8>>`.
 
 - `Metadata` - this is a more persistent kind of data, perhaps the underlying file descriptor or socket, or maybe some stateful object that the callback can manipulate. It is passed by exclusive reference to the `Metadata` type. In our case we don't use this, so it's `()`.
 
 - `Ret` - this is the return type of the callback that's called on an event. Usually this will be a `Result` of some sort; in our case it's `std::io::Result<()>` just to signal whether some underlying operation failed or not.
 
+- `Error` - this is the error type returned by `process_events()` (not the user callback!). Having this as an associated type allows you to have more control over error propagation in nested event sources. We will use [Thiserror](https://crates.io/crates/thiserror) to have a transparent wrapper around [Anyhow](https://crates.io/crates/anyhow), both very useful error libraries. The wrapper will be named `ZmqError`.
+
 So together these are:
 
 ```rust,noplayground
@@ -78,10 +80,11 @@ where
     type Event = Vec<Vec<u8>>;
     type Metadata = ();
     type Ret = io::Result<()>;
+    type Error = ZmqError;
     // ...
 }
 ```
 
 ----
 
-I have saved one surprise for later to emphasise some important principles, but for now, let's move on to defining some methods!
+I have saved one surprise for later to emphasise some important principles, but for now, let's move on to defining some methods!
diff --git a/doc/src/ch04-03-creating-our-source-part-2-setup-methods.md b/doc/src/ch04-03-creating-our-source-part-2-setup-methods.md
@@ -38,9 +38,9 @@ pub fn from_socket(socket: zmq::Socket) -> io::Result<(Self, calloop::channel::S
 
 Calloop's event sources have a kind of life cycle, starting with *registration*. When you add an event source to the event loop, under the hood the source will *register* itself with the loop. Under certain circumstances a source will need to re-register itself. And finally there is the *unregister* action when an event source is removed from the loop. These are expressed via the `calloop::EventSource` methods:
 
-- `fn register(&mut self, poll: &mut calloop::Poll, token_factory: &mut calloop::TokenFactory) -> std::io::Result<()>`
-- `fn reregister(&mut self, poll: &mut calloop::Poll, token_factory: &mut calloop::TokenFactory) -> std::io::Result<()>`
-- `fn unregister(&mut self, poll: &mut calloop::Poll) -> std::io::Result<()>`
+- `fn register(&mut self, poll: &mut calloop::Poll, token_factory: &mut calloop::TokenFactory) -> calloop::Result<()>`
+- `fn reregister(&mut self, poll: &mut calloop::Poll, token_factory: &mut calloop::TokenFactory) -> calloop::Result<()>`
+- `fn unregister(&mut self, poll: &mut calloop::Poll) -> calloop::Result<()>`
 
 The first two methods take a *token factory*, which is a way for Calloop to keep track of why your source was woken up. When we get to actually processing events, you'll see how this works. But for now, all you need to do is recursively pass the token factory into whatever sources your own event source is composed of. This includes other composed sources, which will pass the token factory into *their* sources, and so on.
 
@@ -51,7 +51,7 @@ fn register(
     &mut self,
     poll: &mut calloop::Poll,
     token_factory: &mut calloop::TokenFactory
-) -> io::Result<()>
+) -> calloop::Result<()>
 {
     self.socket_source.register(poll, token_factory)?;
     self.mpsc_receiver.register(poll, token_factory)?;
@@ -65,7 +65,7 @@ fn reregister(
     &mut self,
     poll: &mut calloop::Poll,
     token_factory: &mut calloop::TokenFactory
-) -> io::Result<()>
+) -> calloop::Result<()>
 {
     self.socket_source.reregister(poll, token_factory)?;
     self.mpsc_receiver.reregister(poll, token_factory)?;
@@ -77,7 +77,7 @@ fn reregister(
 }
 
 
-fn unregister(&mut self, poll: &mut calloop::Poll)-> std::io::Result<()> {
+fn unregister(&mut self, poll: &mut calloop::Poll)-> calloop::Result<()> {
     self.socket_source.unregister(poll)?;
     self.mpsc_receiver.unregister(poll)?;
     self.wake_ping_receiver.unregister(poll)?;