carterburn

New Series: Let’s Build a Simple Database…in Rust

2026-04-09T00:00:00+00:00

Hey, all! Back at it again trying to write posts. While I do believe the Roundup posts are great and will hopefully force me to stop coding and write about what I’m doing, I wanted to come up with a solid series about developing something from scratch. I’ve been intrigued by databases and their internals for a while now, and one of the more fascinating posts I have read about database internals is from cstack on GitHub. That tutorial walks you through making a minimal sqlite clone in C. I have written a fair bit of C in my time (refer back to the about page on offensive security tools, they are predominantly in C) and even started working through the code, but I eventually plateaued and stopped. I thought a way to motivate myself to get through it and really learn was to write that clone in Rust, with all of the challenges of the borrow checker, ownership, and the like.

The plan is to try to follow the tutorial’s fifteen parts as closely as possible, but in Rust. I’m also going to attempt to avoid dependencies at all costs for two reasons:

SQLite itself prides itself on minimal dependencies. I’ve heard on a podcast with the creator that with no dependencies comes freedom to do what you want; it’s like backpacking in the wilderness, you’re all on your own.
Using something like the BTreeMap in the Rust standard library feels like cheating. We have to struggle through this!

Extensions

I plan to make a few extensions to the original at the end (if I’m up for it). One of them would be to add the Write Ahead Log (WAL) functionality of SQLite. This functionality is a key component of allowing SQLite to be distributed (many distributed SQLite extensions and components leverage the WAL). With that, I may be able to tie together my Raft implementation with this small sqlite clone.

I’m going to try to do my best and make it close to SQLite. I know I won’t get there but the tutorial seems to focus in on the key components of SQLite and I may want to add some of the other features.

Other extensions are in my mind as well (maybe some python bindings to allow the use of this clone within python) and will be explored as we make our way through.

Here is the table of contents for the entire series. I wanted this to be higher in the post. See below this section for Getting Started!

These are going to take the form of the original series by cstack mostly. More will be added as we work our way through.

Getting Started

To get started, I’m going to create a new Rust crate. I’ll start by creating this as a library with a binary for a command line tool (like the sqlite3 binary you can use to view SQLite databases from the command line). The purpose for this is extensibility with potential future extensions. It makes it a lot easier to start with a library and port that than try to undo a binary.

We’re calling this FoundryDB (foundry -> Rust get it?):

cargo new foundrydb --lib
cd foundrydb
# create the source file for the binary
touch src/main.rs

Then, update Cargo.toml to look like this:

[package]
name = "foundrydb"
version = "0.1.0"
edition = "2024"

[lib]
name = "foundrydb"
path = "src/lib.rs"

[[bin]]
name = "fdb"
path = "src/main.rs"

[dependencies]

I’ll leave the default lib.rs code of add and add some scaffolding to src/main.rs to make sure they’re playing nice.

In src/main.rs:

use foundrydb::add;

fn main() {
    println!("Inside fdb! {}", add(1, 2));
}

Then, we can verify we can test and run the library and binary respectively:

cargo test
cargo run

Of course, we now push to GitHub. You can find the repo here. We’ll add a README later on.

In the spirit of cstack’s original post, I’ll include the SQLite Architecture diagram as referenced at https://www.sqlite.org/arch.html.

Part 1 - SQLite Introduction and Setting up the REPL in fdb

2026-04-09T00:00:00+00:00

Welcome to Part 1! Here, we’ll quickly go over a broad overview of SQLite’s internals (and the database we’ll make!) and then set up a basic Read-Eval-Print-Loop (REPL) for fdb, the binary in our project.

The general “flow” for a SQLite query looks like this:

Paraphrasing part 1 of cstack’s series, the first step in executing some SQL against a SQLite database is to make the API call (in the SQLite library) with your SQL statement. That statement first hits the “frontend” of the engine, which includes the tokenizer, parser, and code generator. Basically, this is classic Computer Science compiler theory: take the SQL string, turn it into a series of tokens, let the parser make sense of those tokens (based on the grammar of the language), and then generate some bytecode to be run in a virtual machine. Another option after the parser is what’s known as a tree-walking interpreter, where you just walk the Abstract Syntax Tree that the parser creates. This is usually easier to conceptualize and code, but it typically hurts performance since it’s really just a large recursive function that builds up a deep call stack along the way. We’ll follow the original design and generate bytecode.
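To make the "SQL string to tokens" step a bit more concrete, here is a minimal sketch of a tokenizer for a tiny SQL subset. The Token enum, the tokenize function, and the four-keyword list are all made up for illustration; this is not SQLite's token set and not necessarily what fdb will end up with.

```rust
/// Hypothetical token type for a tiny SQL subset (not SQLite's real token set).
#[derive(Debug, PartialEq)]
enum Token {
    Keyword(String), // stored uppercased, e.g. "SELECT"
    Ident(String),
    Number(i64),
    Symbol(char),
}

/// Splits the input into tokens, or reports the first character it cannot handle.
fn tokenize(input: &str) -> Result<Vec<Token>, String> {
    // Assumed keyword list, just enough for the example.
    const KEYWORDS: [&str; 4] = ["SELECT", "INSERT", "FROM", "INTO"];
    let mut tokens = Vec::new();
    let mut chars = input.chars().peekable();

    while let Some(&c) = chars.peek() {
        if c.is_whitespace() {
            chars.next();
        } else if c.is_ascii_digit() {
            // Collect a run of digits into one number token.
            let mut digits = String::new();
            while let Some(&d) = chars.peek() {
                if d.is_ascii_digit() {
                    digits.push(d);
                    chars.next();
                } else {
                    break;
                }
            }
            let value = digits
                .parse::<i64>()
                .map_err(|e| format!("bad number {digits}: {e}"))?;
            tokens.push(Token::Number(value));
        } else if c.is_ascii_alphabetic() || c == '_' {
            // Collect a word, then decide whether it is a keyword or an identifier.
            let mut word = String::new();
            while let Some(&d) = chars.peek() {
                if d.is_ascii_alphanumeric() || d == '_' {
                    word.push(d);
                    chars.next();
                } else {
                    break;
                }
            }
            let upper = word.to_ascii_uppercase();
            if KEYWORDS.contains(&upper.as_str()) {
                tokens.push(Token::Keyword(upper));
            } else {
                tokens.push(Token::Ident(word));
            }
        } else if "(),;*".contains(c) {
            tokens.push(Token::Symbol(c));
            chars.next();
        } else {
            return Err(format!("unexpected character: {c}"));
        }
    }
    Ok(tokens)
}
```

Calling tokenize("select id from users;") yields a keyword, an identifier, a keyword, an identifier, and a ';' symbol, which is the kind of stream a parser would then consume.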

Side note: I feel fairly comfortable with this step due to David Beazley’s excellent “Write a Compiler” course. Sadly, it’s no longer being offered, but it was a great way to really dive into the depths of compiler design and theory. (Side note to the side note: I am not claiming I could write a tokenizer and parser for the entirety of the SQL grammar, but I think I can handle the small subset we’ll do here because of this course.)

After we have bytecode generated, the next step is to hit the backend, which consists of the virtual machine, B-tree, pager, and OS interface. The virtual machine will take the generated bytecode and perform operations on the B-tree data structures that make up the tables and indices of the database. When it comes down to it, it’s just a giant match (not switch in Rust!) statement that takes action based on the instruction.
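The "giant match" idea can be sketched in a few lines. The Op enum and run function below are hypothetical, and the example is a toy stack machine doing arithmetic rather than B-tree operations; it only shows the dispatch shape, not SQLite's actual instruction set.

```rust
/// Hypothetical bytecode instructions; a real database VM has far richer opcodes.
#[derive(Debug, Clone, Copy)]
enum Op {
    Push(i64),
    Add,
    Print,
    Halt,
}

/// Runs a program on a tiny stack machine and returns everything it "printed".
fn run(program: &[Op]) -> Result<Vec<i64>, String> {
    let mut stack: Vec<i64> = Vec::new();
    let mut output = Vec::new();

    for op in program {
        // The whole VM is one match on the current instruction.
        match op {
            Op::Push(v) => stack.push(*v),
            Op::Add => {
                let b = stack.pop().ok_or("stack underflow")?;
                let a = stack.pop().ok_or("stack underflow")?;
                stack.push(a + b);
            }
            Op::Print => output.push(*stack.last().ok_or("stack underflow")?),
            Op::Halt => break,
        }
    }
    Ok(output)
}
```

A program like [Push(1), Push(2), Add, Print, Halt] produces a single output value, and anything after Halt is never executed.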

Once the virtual machine executes those operations, it hits the B-tree. The B-tree is truly the defining aspect of SQLite. The B-tree has many nodes that are each a page in length and this layer can retrieve pages from disk or write to disk by issuing commands to the pager.

The pager receives commands from the B-tree to read or write pages of data to the database file. The main function here is to properly write these pages to the correct offset in the database file and keep a cache of recently accessed pages in memory to make access quick and decide when to write them back to disk.
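As a rough sketch of those two jobs (offset math and caching), here is a hypothetical Pager that works over anything implementing Read + Write + Seek, so it can be tried with an in-memory std::io::Cursor before touching a real file. The 4096-byte PAGE_SIZE, the struct, and its method names are assumptions for illustration, and there is no eviction policy at all.

```rust
use std::collections::HashMap;
use std::io::{self, Read, Seek, SeekFrom, Write};

const PAGE_SIZE: usize = 4096; // assumed page size for this sketch

/// Hypothetical pager over any seekable byte store (a File in practice).
struct Pager<S> {
    store: S,
    cache: HashMap<u32, [u8; PAGE_SIZE]>,
}

impl<S: Read + Write + Seek> Pager<S> {
    fn new(store: S) -> Self {
        Pager { store, cache: HashMap::new() }
    }

    /// Byte offset of a page: pages are laid out back to back.
    fn offset(page_num: u32) -> u64 {
        page_num as u64 * PAGE_SIZE as u64
    }

    /// Returns the page, reading it from the store only on a cache miss.
    fn get_page(&mut self, page_num: u32) -> io::Result<&mut [u8; PAGE_SIZE]> {
        if !self.cache.contains_key(&page_num) {
            let mut page = [0u8; PAGE_SIZE];
            self.store.seek(SeekFrom::Start(Self::offset(page_num)))?;
            // Read until the page is full or the store ends; a page past
            // the end of the store is simply left zero-filled.
            let mut filled = 0;
            while filled < PAGE_SIZE {
                let n = self.store.read(&mut page[filled..])?;
                if n == 0 {
                    break;
                }
                filled += n;
            }
            self.cache.insert(page_num, page);
        }
        Ok(self.cache.get_mut(&page_num).expect("page was just cached"))
    }

    /// Writes a cached page back to its offset in the store.
    fn flush_page(&mut self, page_num: u32) -> io::Result<()> {
        if let Some(page) = self.cache.get(&page_num) {
            self.store.seek(SeekFrom::Start(Self::offset(page_num)))?;
            self.store.write_all(page)?;
        }
        Ok(())
    }
}
```

Keeping the store generic is just one design choice; it makes the offset logic testable without a database file on disk.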

The pager then interfaces with the OS interface for the actual reading and writing of files. SQLite also calls this the “Virtual File System” or VFS to provide a generic API for opening, reading, writing, and closing files (and more!). The OS interface then implements the actual mechanisms for the OS that the database is running on (like Windows or Unix). cstack’s original tutorial skipped over this portion to not have to support multiple platforms but luckily in this tutorial, if we just leverage the Rust standard library std::fs module, we get cross-platform support for free!

Making the REPL for fdb

cstack’s original post details the basic REPL, which takes a fair bit more work in C than it will in Rust. cstack clearly appreciates good design upfront and created a C struct for an input buffer to reuse an allocation. This is where C is a fantastic language but clearly dated; it doesn’t have a lot of the niceties of a modern language like Rust. We’ll go on our own here to make the simple REPL, which just accepts input from the user and only recognizes the ‘.exit’ command.

Taking from cstack’s tutorial, this is the basic usage of sqlite3 from the command line:

$ sqlite3
SQLite version 3.34.1 2021-01-20 14:10:07
Enter ".help" for usage hints.
Connected to a transient in-memory database.
Use ".open FILENAME" to reopen on a persistent database.
sqlite> create table users (id int, username varchar(255), email varchar(255));
sqlite> .tables
users
sqlite> .exit
$

We’ll get to the point where we have something similar at the end of this post (minus the create table line).

Of course, there are great crates out there for building elegant REPLs, such as rustyline (which gives you tab completion and history!), but we’ll do it the hard way in this tutorial!

We want to read from stdin line-by-line, so we’ll use the standard library’s BufRead trait to expose read_line() for easy reading. Of note, you first have to get a handle to stdin by locking it from other threads:

use std::io::{self, stdin, BufRead, Write};

fn main() -> Result<(), io::Error> {
    let mut stdin = stdin().lock();
    let mut buffer = String::new();

    loop {
        // print out the prompt and flush the buffer to ensure it prints
        print!("fdb > ");
        io::stdout().flush()?;

        // clear the String 
        buffer.clear();
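        // read a line from stdin (the Result is ignored for now; we handle it below)
        let _ = stdin.read_line(&mut buffer);
        // print it back out
        println!("{}", buffer.trim());
    }
    // no trailing Ok(()) is needed: the loop never breaks, so this point is unreachable
}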

This is a simple REPL that does nothing but print the input back to the user. I added comments in the blog post to explain some things in depth if you are newer to Rust. One note: I decided for main to return a Result<(), io::Error>, which basically means we can propagate errors up to main and have the message printed to the terminal on an unrecoverable error. Other than that, the code should be self-explanatory here.

Instead of just reading a line and printing it, now we want to take some action based on that. In SQLite, there are two main ways to interact. There are “.” commands which are non-SQL “meta” commands to get information from the database. Of these, .exit is probably the easiest, it just exits the process. The other commands are standard SQL commands that we’ll tackle later. The SQL statements will be the bulk of our focus with the tokenizer, parser, etc. For now, we’ll just support .exit to get the REPL started.

In Rust, the easiest approach will be to use the read_line method on the BufRead trait, which has a signature of: fn read_line(&mut self, buf: &mut String) -> Result<usize> (that is, io::Result<usize>). We’ll have to handle the Result. Ok(0) means EOF (i.e. the input has been closed, for example with Ctrl-D on Unix; Ctrl-C is different, since it interrupts the process rather than closing stdin). Ok(n) means n bytes were read into the buffer. This will include the newline at the end of the line, so we’ll use trim() to remove all leading and trailing whitespace. Any Err on the Result will be returned from main, which exits the process.
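If you want to see those return values without typing into a terminal, here is a small sketch that runs read_line over any BufRead source (a byte slice works, since &[u8] implements BufRead). The read_all_lines helper is hypothetical and exists only for this demonstration.

```rust
use std::io::{self, BufRead};

/// Reads every line from `input`, returning (bytes_read, trimmed_text) pairs.
/// The byte count includes the trailing newline; the trimmed text does not.
fn read_all_lines<R: BufRead>(mut input: R) -> io::Result<Vec<(usize, String)>> {
    let mut lines = Vec::new();
    let mut buffer = String::new();
    loop {
        // read_line appends, so clear the buffer before each read
        buffer.clear();
        match input.read_line(&mut buffer)? {
            0 => return Ok(lines), // Ok(0) means EOF
            n => lines.push((n, buffer.trim().to_string())),
        }
    }
}
```

Feeding it the bytes of ".exit\n" gives a count of 6 (five characters plus the newline) alongside the trimmed text ".exit".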

Then, we’ll match on the input and select our action based on that. To keep main minimal (a personal goal of mine), we’ll write a helper function to perform the command matching.

We’ll start with the modified minimal main loop that will change after buffer.clear():

        // -- SNIP --
        match stdin.read_line(&mut buffer) {
            Ok(0) => {
                // EOF; just return (no error)
                return Ok(());
            }
            Ok(_n) => {
                // process the command
                process_command(buffer.trim());
            }
            Err(e) => {
                // print error and return it
                eprintln!("Error while reading input: {e:?}");
                return Err(e);
            }
        }

Then, with process_command, we will just convert the input to all lowercase (to make matching case insensitive) and only take action based on the command. (Here, we could have returned something like a Result and taken action in main but instead we’ll just handle everything here and treat process_command as almost like an inline function).

fn process_command(cmd: &str) {
    let cmd = cmd.to_lowercase();
    match cmd.as_str() {
        ".exit" => {
            // Exit with success
            eprintln!("Goodbye!");
            std::process::exit(0);
        }
        _ => {
            println!("Unrecognized command: {cmd}");
        }
    }
}

Pretty simple. We make another allocation (sadly) to make the command lowercase and then match on the &str of that lowercase string. If the buffer has .exit, we exit the application. Everything else we treat as an unrecognized command and print that to the user.
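For comparison, here is a sketch of the alternative mentioned earlier, where the helper returns a value and lets main decide what to do. It also sidesteps the extra allocation by using str::eq_ignore_ascii_case, which compares in place (ASCII-only, which is enough for ".exit"). The CommandOutcome enum and parse_command name are hypothetical and not part of the fdb code in this post.

```rust
/// Hypothetical result type: what the REPL loop should do next.
#[derive(Debug, PartialEq)]
enum CommandOutcome {
    Exit,
    Unrecognized(String),
}

/// Classifies a line of input without exiting the process itself.
fn parse_command(cmd: &str) -> CommandOutcome {
    let cmd = cmd.trim();
    // eq_ignore_ascii_case compares in place, so no lowercase copy is made
    if cmd.eq_ignore_ascii_case(".exit") {
        CommandOutcome::Exit
    } else {
        CommandOutcome::Unrecognized(cmd.to_string())
    }
}
```

With this shape, main would match on the outcome and return Ok(()) on Exit, which also makes the command handling easy to unit test since nothing calls std::process::exit.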

And that’s it! 41 lines and we have a basic REPL for fdb. Here’s the entire code at this moment:

use std::io::{self, stdin, BufRead, Write};

fn main() -> Result<(), io::Error> {
    let mut stdin = stdin().lock();
    let mut buffer = String::new();

    loop {
        print!("fdb > ");
        io::stdout().flush()?;

        buffer.clear();

        match stdin.read_line(&mut buffer) {
            Ok(0) => {
                // EOF; just return (no error)
                return Ok(());
            }
            Ok(_n) => {
                process_command(buffer.trim());
            }
            Err(e) => {
                eprintln!("Error while reading input: {e:?}");
                return Err(e);
            }
        }
    }
}

fn process_command(cmd: &str) {
    let cmd = cmd.to_lowercase();
    match cmd.as_str() {
        ".exit" => {
            // Exit with success
            eprintln!("Goodbye!");
            std::process::exit(0);
        }
        _ => {
            println!("Unrecognized command: {cmd}");
        }
    }
}

That’s it for now! Next time, we’ll work on the SQL compiler and virtual machine.

Series

This section contains the order of the series for easier navigation.

Previous Article	Next Article
Intro	Coming Soon!

Roundup - April 9th

2026-04-09T00:00:00+00:00

Hello! I am beginning one of my brain dump/minimal review/minimal edits posts with this one! I’m going to call this series “roundup” where I just give a quick few notes on where I’m at with any personal projects. The focus of this roundup is the Raft sans-i/o implementation.

Where were we?

With Raft, I had successfully gotten to the point where I could replicate log entries across the cluster (in a sans-i/o way). This worked great and really could have been a ‘stopping’ point to start moving to integrating a higher-level event loop and then an application like a key-value store on top of it. The one problem: unbounded log growth. I didn’t support log compaction and snapshotting in that implementation. That is fine for an academic project for learning the internals of Raft, but anything that wants to even have the chance to be offered up for production use needs log compaction. So, I started to go down that road.

What happened?

My initial thoughts were to put the work on the application and event loop. To be fair, I’m still doing that, but I had to change RaftCore (the type that represents a Raft node). The idea with this library is that RaftCore simply owns the Raft protocol itself. It accepts minimal input from the event loop (like proposing a new command) and emits Actions for the event loop to take such as SendMessage, ApplyToStateMachine, Persist*. The paper even calls out that the snapshot semantics for Raft are up to the application itself.
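For readers new to the sans-i/o style, here is a toy sketch of the "input in, actions out" shape. Only the SendMessage and ApplyToStateMachine names come from this post; the ToyCore type, its fields, and both methods are hypothetical, and it deliberately skips real Raft logic (terms, elections, and working out the commit index, which is handed in from outside here purely to keep the example short).

```rust
/// Hypothetical action set; the variant fields are made up for illustration.
#[derive(Debug, PartialEq)]
enum Action {
    SendMessage { to: u64, payload: Vec<u8> },
    ApplyToStateMachine { index: u64, command: Vec<u8> },
}

/// Toy core: owns state, performs no I/O, and only returns actions.
struct ToyCore {
    peers: Vec<u64>,
    log: Vec<Vec<u8>>,
    applied: u64, // logical index of the last applied entry (0 = none)
}

impl ToyCore {
    /// Input from the event loop: a new command to replicate.
    fn propose(&mut self, command: Vec<u8>) -> Vec<Action> {
        self.log.push(command.clone());
        // The core never touches a socket; it asks the event loop to send.
        self.peers
            .iter()
            .map(|&to| Action::SendMessage { to, payload: command.clone() })
            .collect()
    }

    /// Input from the event loop: entries up to `commit_index` are committed.
    fn advance_commit(&mut self, commit_index: u64) -> Vec<Action> {
        let mut actions = Vec::new();
        while self.applied < commit_index && (self.applied as usize) < self.log.len() {
            let command = self.log[self.applied as usize].clone();
            self.applied += 1;
            actions.push(Action::ApplyToStateMachine { index: self.applied, command });
        }
        actions
    }
}
```

The event loop would drain the returned Vec and do the actual networking and state machine calls, which is what keeps the core easy to test deterministically.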

That makes sense to me because Raft can be used as the consensus layer for a number of applications like key value stores, databases, or even a simple counter you want replicated. Raft doesn’t need to know what the application cares about (which is also why Raft stores a command as a Vec<u8>; just a serialized blob of whatever the application wants to store).

This means the design will be on the application to tell RaftCore “hey, it’s time to snapshot”. The application can decide when that is the case (maybe a certain number of new entries in a key value store). I plan to also provide some helper functions for the event loop and application to peek at how large the RaftCore log is at any moment. So now most of the snapshotting will be on the application with this design.

Yeah, I totally am punting the problem down the road.

What RaftCore did need to change, though, was tracking whether or not a snapshot exists. The idea is that if a leader has a follower that is sufficiently behind, the leader may need to go all the way back to the snapshot for the follower to catch up.

This, also, suggests to me that more frequent snapshotting is better so that the log in RaftCore stays sufficiently bounded and the amount of time for “catchup” can be minimized in one fell swoop with a snapshot.

With that, now RaftCore needed to track the last index within the snapshot. That’s fine, but the nodes are still going to work on the monotonically increasing “index” of the log and not necessarily worry about the index within RaftCore.log. Now, we’re working on “virtual” or “logical” indices instead of “physical” indices in the RaftCore log.

Should be easy right? Just subtract the last index in the snapshot from the index provided and voila, you have the index in RaftCore.log. Well, it wasn’t that simple because I had chosen to add a sentinel or ‘dummy’ entry at the start of every node’s log that was on term 0. That made indexing really easy within RaftCore because I could just directly reference the index passed (i.e. index 1 in the logical Raft log was actually at index 1 in RaftCore.log). The issue I found was that once the node had a snapshot, all the index math was off. An example may help…

Say I have a Raft log that has 102 entries. And I snapshotted up to and including index 100 in the log. That means the last included index in the snapshot would be set to 100 and RaftCore’s log would be something like: [{index: 101, term: 3}, {index: 102, term: 3}] (note, the index isn’t actually stored in the entries of the log, but just for demonstrative purposes here). Say I wanted index 101 in the Raft log to provide a follower. The old math / simple math talked about would have been index - last_included_index, which in this case is: 101 - 100 = 1 but then I wouldn’t be able to go to RaftCore.log[1] because that’s index 102 in the Raft log! All the index math I was doing was fine until snapshots came up.

So, with great sadness, I added some helpers to do the math and removed the sentinel entry so the math is consistent regardless of whether there is a snapshot. Then, I changed any direct indexing into the log to use the helpers and ensured I wasn’t using RaftCore.log.len() to compute the “last” index in the log, but rather the helper that takes the snapshot’s last included index into account. This created a lot of problems at first (I actually had over half of the tests failing). Guess what the bulk of the problems were? You probably guessed it, off-by-one errors. I had a lot of places where a < should have been a <= or I missed appending -1 to a computation. I ended up adding tests for the helpers too, as a sanity check (Claude made the suggestion after watching the struggle). Eventually, I cleared out those issues and hopefully tested all of the edge cases and have this ready to go.
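To illustrate the kind of helper involved, here is a minimal sketch using the 102-entry example above. The LogView struct and its method names are hypothetical stand-ins, not the actual RaftCore API. It assumes logical indices start at 1, that there is no sentinel entry, and that last_included_index is 0 when no snapshot exists.

```rust
/// Hypothetical stand-in for the log bookkeeping described above.
struct LogView {
    last_included_index: u64, // 0 when no snapshot exists (assumption)
    log: Vec<u64>,            // just each remaining entry's term
}

impl LogView {
    /// Maps a logical Raft index to a position in `log`.
    /// Returns None if the entry was compacted away or does not exist yet.
    fn physical_index(&self, logical: u64) -> Option<usize> {
        // With no sentinel, the first stored entry is last_included_index + 1.
        let offset = logical.checked_sub(self.last_included_index + 1)?;
        if offset < self.log.len() as u64 {
            Some(offset as usize)
        } else {
            None
        }
    }

    /// Logical index of the last entry, counting everything in the snapshot.
    fn last_index(&self) -> u64 {
        self.last_included_index + self.log.len() as u64
    }

    /// Term of the entry at a logical index, if it is still in the log.
    fn term_at(&self, logical: u64) -> Option<u64> {
        self.physical_index(logical).map(|pos| self.log[pos])
    }
}
```

With a snapshot through index 100 and two entries left, logical index 101 maps to position 0 and 102 maps to position 1, while index 100 is reported as gone because it only lives in the snapshot.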

The funny part is that this change, which took a good chunk of time, didn’t really do anything beyond switching to the helpers. Snapshotting and log compaction are still NOT supported in RaftCore, but all of the scaffolding is there to make it ready!

Where next?

Next will be to actually do log compaction and snapshotting between the nodes. I’ll iron out the API between the application, event loop, and RaftCore and hopefully will write some solid tests that ensure the Raft protocol is still upheld once snapshots are added.

After that, I think the last piece I’ll tackle is cluster membership changes. I haven’t dived super deep into it in the paper but have learned (and guessed) that it is one of the harder problems within Raft. I may also pause on that for the moment and work on the higher levels (getting through to a key value store). However, I won’t consider this project ready for “production” until cluster membership changes can be supported. Raft is great until you realize you need more replication to tolerate more failures and have no way to change the cluster.

I’m using the term “production” loosely because I have no idea if this would be considered for production use. I’ll use it maybe to test on real AWS nodes, but I doubt anyone will want to use this.

That’s all for now! The next roundup should have log compaction done!

Roundup Series

In this section of the Roundups, I’ll post the “series” so you can cycle through!

Previous Roundup	Next Roundup
NONE!	Coming Soon!

Hello World

2026-03-30T00:00:00+00:00

Hello world from GitHub Pages!

As seen in the about page, I set this blog up to start getting some written content out to the world. The greatest barrier to entry for me has been trying to capture a “perfect” post and spending so much time getting a post there that I end up getting distracted and never actually publish it. I intend to change that with this refreshed site. I’m planning to write about things I’m currently working on, design decisions, problems, etc. and then just post it. Minimal editing, minimal review, just brain dump a post and publish it.

I fully recognize the importance of technical writing and blog posts. It would take me ages to list the countless blog posts that have guided me through solutions and I hope that my brain dumps can be useful to someone down the line. If they’re not, then at least this is an attempt to force myself to write technical content and get it on the Internet.

Current State

Another one of my weaknesses with respect to personal projects is my inability to finish them. I am not going to make any promises that I will do that (or that this blog will help me finish them).

If we’re being honest, is a project ever “done”?!

Currently, I’m working on implementing the Raft consensus algorithm in Rust. I’m taking a sans-i/o approach to it and focusing on the algorithm correctness at the moment. I plan to finish that soon and then write the event loop (where the I/O will actually start to happen) along with a minimal key-value server on top of Raft (to give a replicated, fault-tolerant KV store).

I’ve also been messing around with Code Crafters, specifically the Redis implementation. I actually got up to the point where multiple servers would interact and that made me switch over to writing the Raft library. I’m not sure if I’ll combine the two (mostly because I like the idea of a raw KV store instead of adding the Redis-specific items), but maybe that’ll happen. Check out my progress so far here.

I’ve also been working on a tunneling tool (there’s those security tools again) that allows a user to tunnel into a network using a tun device on their host machine over a QUIC connection. That isn’t public yet because it only supports TCP tunneling at this point and it doesn’t have tests nor does it provide security with the self-signed certs. I’m uncertain if I’ll continue that at the moment, but it is still a fun project I may share in the future.

After I get through Raft, my plan is to try my hand at the Gossip Gloomers Distributed Systems problems found on fly.io. I had Claude generate some reading materials to help me through because while I did take a distributed systems course for my graduate degree, it appears there were some gaps in that course. I’ll definitely blog about those challenges here.

Previous Work

Two main things to point out as previous Rust work:

1) I wrote a simple crate for a socks5 proxy in Rust. It’s pretty bad, but it was honestly my first attempt at writing any sort of networking code in Rust using tokio. You can find it on crates.io and the source on GitHub. I have to say, greater than 3k downloads is still pretty cool, even if that’s a bunch of crates.io mirrors downloading the crate every once in a while.

2) I wrote a userland exec binary in Rust a little bit ago. I called it santa because it only loads ELF binaries. I drew inspiration from a Python implementation of userland exec and a Rust one. This is another one of those security tools that allows someone to load up a binary inside an existing process’s memory (santa itself) without having the kernel do it. It can load binaries from files, stdin, or from a remote URL. While writing this blog, I forgot I had to add a few things to the README, so I sent Claude off to go do that. Check out the repo here.

Quick Word on AI (blog posts in 2026 must include)

In a lot of my projects, I do leverage Claude Code. My approach to using AI, though, is to use it like a teacher. For example, in my Raft implementation, I don’t have Claude write any code, I just have it give me “tasks” as if it were an assignment and then review my work and give me guidance on places I can improve. I think this has worked well for me because it allows me to ensure I understand what I’m writing and forces me to write idiomatic code. I think it’s making me better without relying too much on Claude to write the code for me. I definitely recognize the time and place to have Claude do that work and have considered a few projects to let Claude go do its thing, but I do fear if I do that too frequently, I miss out on the opportunity to truly learn and then won’t be able to effectively use AI coding assistants. The power of an AI coding assistant is when the human can effectively guide (and check) the assistant, which requires deep understanding of the topic at hand. I need to get to that point first, so I use Claude as my personal teacher (because I don’t really have one).

Hopefully this is a blog that will interest you! We’ll see how it goes. Until next time!
