book: baby fuzzer chapter

andreafioraldi · andreafioraldi · commit 8bb061fa2252 · 2021-05-06T20:10:45.000+02:00
diff --git a/docs/src/baby_fuzzer.md b/docs/src/baby_fuzzer.md
@@ -1 +1,300 @@
 # Baby Fuzzer
+
+This chapter will teach you how to create a naive fuzzer using the LibAFL API, you will learn about basic entities such as `State`, `Observer`, and `Executor`.
+The following chapters will discuss in detail the components of LibAFL, while here we will just scratch the fundamentals.
+
+We are going to fuzz a simple Rust function that panics under a condition. The fuzzer will be single-threaded and will stop after the crash like libFuzzer does normally.
+
+You can find a complete version of this tutorial as an example fuzzer in [`fuzzers/baby_fuzzer`](https://github.com/AFLplusplus/LibAFL/tree/main/fuzzers/baby_fuzzer).
+
+## Creating a project
+
+We use cargo to create a new Rust project with LibAFL as a dependency. 
+
+```sh
+$ cargo new baby_fuzzer
+$ cd baby_fuzzer
+```
+
+The generated _Cargo.toml_ looks like the following:
+
+```toml
+[package]
+name = "baby_fuzzer"
+version = "0.1.0"
+authors = ["Your Name <you@example.com>"]
+edition = "2018"
+
+# See more keys and their definitions at https://doc.rust-lang.org/cargo/reference/manifest.html
+
+[dependencies]
+```
+
+In order to use LibAFl we must add it as dependency adding `libafl = { path = "path/to/libafl/" }` under `[dependencies]`.
+You can use the LibAFL version from crates.io if you want, in this case, you have to use `libafl = "*"` to get the latest version.
+
+As we are going to fuzz Rust code, we want that a panic does not simply cause the program exit, but an abort that can be caught by the fuzzer.
+To do that, we specify `panic = "abort"` in the [profiles](https://doc.rust-lang.org/cargo/reference/profiles.html).
+
+Alongside this setting, we add some optimization flags for the compile when building in release mode.
+
+The final _Cargo.toml_ should look similar to the following:
+
+
+```toml
+[package]
+name = "baby_fuzzer"
+version = "0.1.0"
+authors = ["Your Name <you@example.com>"]
+edition = "2018"
+
+# See more keys and their definitions at https://doc.rust-lang.org/cargo/reference/manifest.html
+
+[dependencies]
+libafl = { path = "path/to/libafl/" }
+
+[profile.dev]
+panic = "abort"
+
+[profile.release]
+panic = "abort"
+lto = true
+codegen-units = 1
+opt-level = 3
+debug = true
+```
+
+## The function under test
+
+Opening `src/main.rs` we have an empty main function.
+To start, we create the closure that we want to fuzz. It takes a buffer as input and panics if it starts with "abc".
+
+```rust
+let mut harness = |buf: &[u8]| {
+    if buf.len() > 0 && buf[0] == 'a' as u8 {
+        if buf.len() > 1 && buf[1] == 'b' as u8 {
+            if buf.len() > 2 && buf[2] == 'c' as u8 {
+                panic!("=)");
+            }
+        }
+    }
+};
+// To test the panic:
+// let input = "abc".as_bytes();
+// harness(&input);
+```
+
+## Generating and running some tests
+
+One of the main components that a LibAFL-based fuzzer uses is the State, a container of the data that is evolved during the fuzzing process, such as the Corpus of inputs.
+In our main so we create a basic State instance like the following:
+
+```rust
+// create a State from scratch
+let mut state = State::new(
+    // RNG
+    StdRand::with_seed(current_nanos()),
+    // Corpus that will be evolved, we keep it in memory for performance
+    InMemoryCorpus::new(),
+    (),
+    // Corpus in which we store solutions (crashes in this example),
+    // on disk so the user can get them after stopping the fuzzer
+    OnDiskCorpus::new(PathBuf::from("./crashes")).unwrap(),
+    (),
+);
+```
+
+It takes a random number generator, that is part of the fuzzer state, in this case, we use the default one `StdRand` but you can choose a different one. We seed it with the current nanoseconds.
+
+As the second parameter, it takes an instance of something implementing the Corpus trait, InMemoryCorpus in this case. The corpus is the container of the testcases evolved by the fuzzer, in this case, we keep it all in memory.
+
+We will discuss later the third and fifth parameters. The fourth is another corpus, in this case, to store the testcases that are considered as "solutions" for the fuzzer. For our purpose, the solution is the input that triggers the panic. In this case, we want to store it to disk under the `crashes` directory so we can inspect it.
+
+Another required component is the EventManager. It handles some events such as the addition of a testcase to the corpus during the fuzzing process. For our purpose, we use the simplest one that just displays the information about these events to the user using a Stats instance.
+
+```rust
+// The Stats trait define how the fuzzer stats are reported to the user
+let stats = SimpleStats::new(|s| println!("{}", s));
+
+// The event manager handle the various events generated during the fuzzing loop
+// such as the notification of the addition of a new item to the corpus
+let mut mgr = SimpleEventManager::new(stats);
+```
+
+Last but not least, we need an Executor that is the entity responsible to run our program under test. In this example, we want to run the harness function in process, and so we use the InProcessExecutor.
+
+```rust
+// Create the executor for an in-process function
+let mut executor =
+    InProcessExecutor::new(&mut harness, (), &mut state, &mut mgr)
+        .expect("Failed to create the Executor".into());
+```
+
+It takes a reference to the harness, the state, and the event manager. We will discuss the second parameter later.
+As the executor expects that the harness returns an ExitKind object, we add `ExitKind::Ok` to our harness function.
+
+Now we have the 3 major entities ready for running our tests, but we still cannot generate testcases.
+
+For this purpose, we use a Generator, RandPrintablesGenerator that generates a string of printable bytes.
+The State's method used to generate and run tests needs a scheduling policy for the corpus. We create it as QueueCorpusScheduler, a scheduler that serves testcases to the fuzzer in a FIFO fashion.
+
+```rust
+// A queue policy to get testcasess from the corpus
+let scheduler = QueueCorpusScheduler::new();
+
+// Generator of printable bytearrays of max size 32
+let mut generator = RandPrintablesGenerator::new(32);
+
+// Generate 8 initial inputs
+state
+    .generate_initial_inputs(&mut executor, &mut generator, &mut mgr, &scheduler, 8)
+    .expect("Failed to generate the initial corpus".into());
+```
+
+Now you can prepend the following `use` directives to your main.rs and compile it.
+
+```rust 
+use std::path::PathBuf;
+use libafl::{
+    corpus::{InMemoryCorpus, OnDiskCorpus, QueueCorpusScheduler},
+    events::SimpleEventManager,
+    executors::{inprocess::InProcessExecutor, ExitKind},
+    generators::RandPrintablesGenerator,
+    state::State,
+    stats::SimpleStats,
+    utils::{current_nanos, StdRand},
+};
+```
+
+When running, you should see something similar to:
+
+```sh
+$ cargo run
+    Finished dev [unoptimized + debuginfo] target(s) in 0.04s
+     Running `target/debug/baby_fuzzer`
+[LOG Debug]: Loaded 0 over 8 initial testcases
+```
+
+## Evolving the corpus with feedbacks
+
+Now you simply ran 8 randomly generated testcases but none of them has been stored in the corpus. If you are very lucky, maybe you triggered the panic by chance but you don't see any saved file in `crashes`.
+
+Now we want to turn our simple fuzzer into a feedback-based one and increase the chance to generate the right input to trigger the panic. We are going to implement a simple feedback based on the 3 conditions that are needed to reach the panic.
+
+To do that, we need a way to keep track of if a condition is satisfied. The component that feeds the fuzzer with information about properties of a fuzzing run, the satisfied conditions in our case, is the Observer. We use the StdMapObserver, the default observer that uses a map to keep track of covered elements. In our fuzzer, each condition is mapped to an entry of such map.
+
+We represent such map as a `static mut` variable:
+
+```rust
+// Coverage map with explicit assignments due to the lack of instrumentation
+static mut SIGNALS: [u8; 16] = [0; 16];
+
+fn signals_set(idx: usize) {
+    unsafe { SIGNALS[idx] = 1 };
+}
+```
+
+As we don't rely on any instrumentation engine, we have to manually track the satisfied conditions in a map modyfing our tested function:
+
+```rust
+// The closure that we want to fuzz
+let mut harness = |buf: &[u8]| {
+    signals_set(0);
+    if buf.len() > 0 && buf[0] == 'a' as u8 {
+        signals_set(1);
+        if buf.len() > 1 && buf[1] == 'b' as u8 {
+            signals_set(2);
+            if buf.len() > 2 && buf[2] == 'c' as u8 {
+                panic!("=)");
+            }
+        }
+    }
+    ExitKind::Ok
+};
+```
+
+The observer can be created directly from the `SIGNALS` map, in the following way:
+
+```rust
+// Create an observation channel using the signals map
+let observer = StdMapObserver::new("signals", unsafe { &mut SIGNALS });
+```
+
+The observers are usually kept in the corresponding executor as they keep track of information that is valid for just one run. We have then to modify our InProcessExecutor creation to include the observer as follows:
+
+```rust
+// Create the executor for an in-process function with just one observer
+let mut executor =
+    InProcessExecutor::new(&mut harness, tuple_list!(observer), &mut state, &mut mgr)
+        .expect("Failed to create the Executor".into());
+```
+
+Now that the fuzzer can observe which condition is satisfied, we need a way to rate an input as interesting (i.e. worth of addition to the corpus) based on this observation. Here comes the notion of Feedback. The Feedback is part of the State and provides a way to rate input and its corresponding execution as interesting looking for the information in the observers. Feedbacks can maintain a cumulative state of the information seen so far, in our case it maintains the set of conditions satisfied in the previous runs.
+
+We use MaxMapFeedback, a feedback that implements a novelty search over the map of the MapObserver. Basically, if there is a value in the observer's map that is greater than the maximum value registered so far for the same entry, it rates the input as interesting and updates its state.
+
+Feedbacks are used also to decide if an input is a "solution". The feedback that does that is called the Objective Feedback and when it rates an input as interested it is not saved to the corpus but to the solutions, written in the `crashes` folder in our case. We use the CrashFeedback to tell the fuzzer that if an input causes the program to crash it is a solution for us.
+
+We need to update our State creation including these feedbacks:
+
+```rust
+// create a State from scratch
+let mut state = State::new(
+    // RNG
+    StdRand::with_seed(current_nanos()),
+    // Corpus that will be evolved, we keep it in memory for performance
+    InMemoryCorpus::new(),
+    // Feedback to rate the interestingness of an input
+    MaxMapFeedback::new_with_observer(&observer),
+    // Corpus in which we store solutions (crashes in this example),
+    // on disk so the user can get them after stopping the fuzzer
+    OnDiskCorpus::new(PathBuf::from("./crashes")).unwrap(),
+    // Feedbacks to recognize an input as solution
+    CrashFeedback::new(),
+);
+```
+
+## The actual fuzzing
+
+Now, after including the correct `use`, we can run the program, but the outcome is not so different from the previous one as the random generator does not take into account what we save as interesting in the corpus. To do that, we need to plug a Mutator.
+
+Another central component of LibAFL is the Fuzzer, an entity that holds a set of Stages that are actions done on individual inputs taken from the corpus. The MutationalStage mutates the input and executes it several times for instance.
+
+As the last step, to have a proper fuzzer, we create a Fuzzer with a single MutationalStage that uses a mutator inspired by the havoc mutator of AFL.
+
+```rust
+// Setup a basic mutator with a mutational stage
+let mutator = StdScheduledMutator::new(havoc_mutations());
+let stage = StdMutationalStage::new(mutator);
+
+// A fuzzer with just one stage
+let mut fuzzer = StdFuzzer::new(tuple_list!(stage));
+
+fuzzer
+    .fuzz_loop(&mut state, &mut executor, &mut mgr, &scheduler)
+    .expect("Error in the fuzzing loop".into());
+```
+
+`fuzz_loop` will request a testcase for each iteration to the fuzzer using the scheduler and then it will invoke the stage.
+
+After adding this code, we have a proper fuzzer, that can run a find the input that panics the function in less than a second.
+
+```
+$ cargo run
+   Compiling baby_fuzzer v0.1.0 (/home/andrea/Desktop/baby_fuzzer)
+    Finished dev [unoptimized + debuginfo] target(s) in 1.56s
+     Running `target/debug/baby_fuzzer`
+[New Testcase] clients: 1, corpus: 2, objectives: 0, executions: 1, exec/sec: 0
+[LOG Debug]: Loaded 1 over 8 initial testcases
+[New Testcase] clients: 1, corpus: 3, objectives: 0, executions: 804, exec/sec: 0
+[New Testcase] clients: 1, corpus: 4, objectives: 0, executions: 1408, exec/sec: 0
+thread 'main' panicked at '=)', src/main.rs:35:21
+note: run with `RUST_BACKTRACE=1` environment variable to display a backtrace
+Crashed with SIGABRT
+Child crashed!
+[Objective] clients: 1, corpus: 4, objectives: 1, executions: 1408, exec/sec: 0
+Waiting for broker...
+Bye!
+```
+
+As you can see, after the panic message, the `objectives` count of the log increased by one and you will find the crashing input in `crashes/id_0`.
diff --git a/docs/src/getting_started/build.md b/docs/src/getting_started/build.md
@@ -22,4 +22,4 @@ Each of these example fuzzers uses particular features of LibAFL, sometimes comb
 
 You can use these crates as examples and as skeletons for custom fuzzers with similar feature sets.
 
-To build an example fuzzer you have to invoke cargo from its respective folder (`fuzzers/[FUZZER_NAME]).
+To build an example fuzzer you have to invoke cargo from its respective folder (`fuzzers/[FUZZER_NAME]`).
diff --git a/fuzzers/baby_fuzzer/src/main.rs b/fuzzers/baby_fuzzer/src/main.rs
@@ -39,13 +39,6 @@ pub fn main() {
         ExitKind::Ok
     };
 
-    // The Stats trait define how the fuzzer stats are reported to the user
-    let stats = SimpleStats::new(|s| println!("{}", s));
-
-    // The event manager handle the various events generated during the fuzzing loop
-    // such as the notification of the addition of a new item to the corpus
-    let mut mgr = SimpleEventManager::new(stats);
-
     // Create an observation channel using the signals map
     let observer = StdMapObserver::new("signals", unsafe { &mut SIGNALS });
 
@@ -64,12 +57,12 @@ pub fn main() {
         CrashFeedback::new(),
     );
 
-    // Setup a basic mutator with a mutational stage
-    let mutator = StdScheduledMutator::new(havoc_mutations());
-    let stage = StdMutationalStage::new(mutator);
+    // The Stats trait define how the fuzzer stats are reported to the user
+    let stats = SimpleStats::new(|s| println!("{}", s));
 
-    // A fuzzer with just one stage
-    let mut fuzzer = StdFuzzer::new(tuple_list!(stage));
+    // The event manager handle the various events generated during the fuzzing loop
+    // such as the notification of the addition of a new item to the corpus
+    let mut mgr = SimpleEventManager::new(stats);
 
     // A queue policy to get testcasess from the corpus
     let scheduler = QueueCorpusScheduler::new();
@@ -87,6 +80,13 @@ pub fn main() {
         .generate_initial_inputs(&mut executor, &mut generator, &mut mgr, &scheduler, 8)
         .expect("Failed to generate the initial corpus".into());
 
+    // Setup a basic mutator with a mutational stage
+    let mutator = StdScheduledMutator::new(havoc_mutations());
+    let stage = StdMutationalStage::new(mutator);
+
+    // A fuzzer with just one stage
+    let mut fuzzer = StdFuzzer::new(tuple_list!(stage));
+
     fuzzer
         .fuzz_loop(&mut state, &mut executor, &mut mgr, &scheduler)
         .expect("Error in the fuzzing loop".into());

Original file line number	Diff line number	Diff line change
`@@ -22,4 +22,4 @@ Each of these example fuzzers uses particular features of LibAFL, sometimes comb`
`22`	`22`
`23`	`23`	`You can use these crates as examples and as skeletons for custom fuzzers with similar feature sets.`
`24`	`24`
`25`		-To build an example fuzzer you have to invoke cargo from its respective folder (`fuzzers/[FUZZER_NAME]).
	`25`	+To build an example fuzzer you have to invoke cargo from its respective folder (`fuzzers/[FUZZER_NAME]`).