Roadmap and Phases

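AxiomDB is developed in phases, each of which adds a coherent vertical slice of functionality. The phases are grouped into blocks. The first three are summarized here; Block 4 (platform surfaces and storage evolution) appears under Phase Progress below.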

Block 1 (Phases 1–7): Core engine — storage, indexing, WAL, transactions, SQL parsing, and concurrent MVCC.
Block 2 (Phases 8–14): SQL completeness — full query planner, optimizer, advanced SQL features, and MySQL wire protocol.
Block 3 (Phases 15–34): Production hardening — replication, backups, distributed execution, column store, and AI/ML integration.

Current Status

Last completed subphase: 40.1b CREATE INDEX on clustered tables — removed ensure_heap_runtime guard; CREATE INDEX / CREATE UNIQUE INDEX now work on clustered (PRIMARY KEY) tables using ClusteredSecondaryLayout-based index build with partial index, NULL-skipping, and uniqueness enforcement at build time.

Active development: Phase 40 — Clustered engine performance optimizations (40.1 ClusteredInsertBatch done; 40.1b CREATE INDEX on clustered tables done; statement plan cache, transaction write set, vectorized scan next)

Next milestone: 40.2 — Statement plan cache (per-session CachedPlanSource with OID-based invalidation)

Concurrency note: the current server already supports concurrent read-only queries, but mutating statements are still serialized through a database-wide Arc<RwLock<Database>> write guard. The next concurrency milestone is Phase 13.7 row-level locking, followed by deadlock detection and explicit locking clauses.

Phase Progress

Block 1 — Core Engine

Phase	Name	Status	Key deliverables
1.1	Workspace setup	✅	Cargo workspace, crate structure
1.2	Page format	✅	16 KB pages, header, CRC32c checksum
1.3	MmapStorage	✅	mmap-backed storage engine
1.4	MemoryStorage	✅	In-memory storage for tests
1.5	FreeList	✅	Bitmap page allocator
1.6	StorageEngine trait	✅	Unified interface + heap pages
2.1	B+ Tree insert/split	✅	CoW insert with recursive splits
2.2	B+ Tree delete	✅	Rebalance, redistribute, merge
2.3	B+ Tree range scan	✅	RangeIter with tree traversal
2.4	Prefix compression	✅	CompressedNode for internal keys
3.1	WAL entry format	✅	Binary format, CRC32c, backward scan
3.2	WAL writer	✅	WalWriter with file header
3.3	WAL reader	✅	Forward and backward iterators
3.4	TxnManager	✅	BEGIN/COMMIT/ROLLBACK, snapshot
3.5	Checkpoint	✅	5-step checkpoint protocol
3.6	Crash recovery	✅	CRASHED→RECOVERING→REPLAYING→VERIFYING→READY
3.7	Durability tests	✅	9 crash scenarios
3.8	Post-recovery checker	✅	Heap structural + MVCC invariants
3.9	Catalog bootstrap	✅	axiom_tables, axiom_columns, axiom_indexes
3.10	Catalog reader	✅	MVCC-aware schema lookup
3.17	WAL batch append	✅	`record_insert_batch()`: O(1) `write_all` for N entries via `reserve_lsns+write_batch`
3.18	WAL PageWrite	✅	`EntryType::PageWrite=9`: 1 WAL entry/page vs N/row; 238× fewer for 10K-row insert
3.19	WAL Group Commit	✅	`CommitCoordinator`: batches fsyncs across connections; up to 16× concurrent throughput
4.1	SQL AST	✅	All statement types
4.2	SQL lexer	✅	logos DFA, ~85 tokens, zero-copy
4.3	DDL parser	✅	CREATE/DROP/ALTER TABLE, CREATE/DROP INDEX
4.4	DML parser	✅	SELECT (all clauses), INSERT, UPDATE, DELETE
4.17	Expression evaluator	✅	Three-valued NULL logic, all operators
4.18	Semantic analyzer	✅	BindContext, col_idx resolution
4.18b	Type coercion matrix	✅	coerce(), coerce_for_op(), CoercionMode strict/permissive
4.23	QueryResult type	✅	Row, ColumnMeta, QueryResult (Rows/Affected/Empty)
4.5b	Table engine	✅	TableEngine scan/insert/delete/update over heap; later generalized by Phase 39 table-root metadata
4.5 + 4.5a	Basic executor	✅	SELECT/INSERT/UPDATE/DELETE, DDL, txn control, SELECT without FROM
4.25 + 4.7	Error handling framework	✅	Complete SQLSTATE mapping; ErrorResponse{sqlstate,message,detail,hint}
4.8	JOIN (nested loop)	✅	INNER/LEFT/RIGHT/CROSS; USING; multi-table; FULL→NotImplemented
4.9a+4.9c+4.9d	GROUP BY + Aggregates + HAVING	✅	COUNT/SUM/MIN/MAX/AVG; hash-based; HAVING; NULL grouping
4.10+4.10b+4.10c	ORDER BY + LIMIT/OFFSET	✅	Multi-column; NULLS FIRST/LAST; LIMIT/OFFSET pagination
4.12	DISTINCT	✅	HashSet dedup on output rows; NULL=NULL; pre-LIMIT
4.24	CASE WHEN	✅	Searched + simple form; NULL semantics; all contexts
4.6	INSERT … SELECT	✅	Reuses execute_select; MVCC prevents self-reads
6.1–6.3	Secondary indexes + planner	✅	CREATE INDEX, index maintenance, B-Tree point/range lookup
6.4	Bloom filter per index	✅	BloomRegistry; zero B-Tree reads for definite-absent keys (1% FPR)
6.5/6.6	Foreign key constraints	✅	REFERENCES, ALTER TABLE FK; INSERT/DELETE/CASCADE/SET NULL enforcement
6.7	Partial UNIQUE index	✅	CREATE INDEX … WHERE predicate; soft-delete uniqueness pattern
6.8	Fill factor	✅	WITH (fillfactor=N) on CREATE INDEX; B-Tree leaf split at ⌈FF×ORDER_LEAF/100⌉
6.9	FK + Index improvements	✅	PK B-Tree population; FK composite key index; composite index planner
6.10–6.12	Index statistics + ANALYZE	✅	Per-column NDV/row_count; planner cost gate (sel > 20% → Scan); ANALYZE command; staleness tracking
6.16	PK SELECT planner parity	✅	PRIMARY KEY equality/range now participate in single-table SELECT planning; PK equality bypasses the scan-biased cost gate
6.17	Indexed UPDATE candidate path	✅	UPDATE now discovers PK / indexed candidates through B-Tree access before entering the 5.20 write path
6.18	Indexed multi-row INSERT batch path	✅	Immediate multi-row VALUES statements now reuse grouped heap/index apply on indexed tables while preserving strict same-statement UNIQUE semantics
6.19	WAL fsync pipeline	🔄	Server commits now use an always-on leader-based fsync pipeline and the old timer-based `CommitCoordinator` path is gone, but the single-connection `insert_autocommit` benchmark still misses target throughput
6.20	UPDATE apply fast path	✅	PK-range UPDATE now batches candidate heap reads, skips no-op rows, batches `UpdateInPlace` WAL writes, and groups per-index delete+insert/root persistence
5	Executor (advanced)	✅	JOIN, GROUP BY, ORDER BY, index lookup, aggregate
6.8+	Index statistics, FK improvements	⚠️ Planned	Fill factor, composite FKs, ON UPDATE CASCADE, ANALYZE, index-only scans
7	Full MVCC	⚠️ Planned	SSI, write-write conflicts, epoch reclamation
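
As a worked example of the fill factor row (6.8) in the table above, the split threshold ⌈FF×ORDER_LEAF/100⌉ can be evaluated with integer arithmetic. This is only a sketch: ORDER_LEAF = 217 is taken from the Phase 2 summary, while the function name `leaf_split_threshold` and the clamping of out-of-range values are assumptions made for illustration, not AxiomDB's actual code.

```rust
/// Leaf capacity quoted in the Phase 2 summary.
const ORDER_LEAF: usize = 217;

/// Number of entries at which a leaf splits for a given fill factor,
/// i.e. ceil(fillfactor * ORDER_LEAF / 100) in integer arithmetic.
/// Clamping to 1..=100 is an assumption of this sketch.
fn leaf_split_threshold(fillfactor: usize) -> usize {
    let ff = fillfactor.clamp(1, 100);
    // Adding 99 before dividing by 100 rounds the quotient up.
    (ff * ORDER_LEAF + 99) / 100
}
```

Under these assumptions, fillfactor=90 gives a threshold of 196 entries, and fillfactor=100 gives the full 217.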

Block 2 — SQL Completeness

Phase	Name	Status	Key deliverables
8	Advanced SQL	⚠️ Planned	Window functions, CTEs, recursive queries
9	VACUUM / GC	⚠️ Planned	Dead row cleanup, freelist compaction
10	MySQL wire protocol	⚠️ Planned	COM_QUERY, result set packets, handshake
11	TOAST	⚠️ Planned	Out-of-line storage for large values
12	Full-text search	⚠️ Planned	Inverted index, BM25 ranking
13	Foreign key checks	⚠️ Planned	Constraint validation on insert/delete
14	Vectorized execution	⚠️ Planned	SIMD scans, morsel-driven pipeline

Block 3 — Production Hardening

Phase	Name	Status
15	Connection pooling	⚠️ Planned
16	Replication (primary-replica)	⚠️ Planned
17	Point-in-time recovery (PITR)	⚠️ Planned
18	Online DDL	⚠️ Planned
19	Partitioning	⚠️ Planned
20	Column store (HTAP)	⚠️ Planned
21	VECTOR index (ANN)	⚠️ Planned
22–34	Distributed, cloud-native, AI/ML	⚠️ Future

Block 4 — Platform Surfaces and Storage Evolution

Phase	Name	Status	Key deliverables
35	Deployment and DevEx	⚠️ Planned	Docker, config tooling, release UX
36	AxiomQL Core	⚠️ Planned	Alternative read query language over the same AST/executor
37	AxiomQL Write + DDL + Control	⚠️ Planned	AxiomQL DML, DDL, control flow, maintenance
38	AxiomDB-Wasm	⚠️ Planned	Browser runtime, OPFS backend, sync, live queries
39	Clustered index storage engine	🔄 In progress	Inline PK rows, clustered internal/leaf pages, PK bookmarks in secondary indexes, logical clustered WAL/rollback, clustered crash recovery, clustered-aware CREATE TABLE

Completed Phases — Summary

Phase 1 — Storage Engine

A generic storage layer with two implementations: MmapStorage for production disk use and MemoryStorage for tests. Every higher-level component uses only the StorageEngine trait — storage is pluggable. Pages are 16 KB with a 64-byte header (magic, page type, CRC32c checksum, page_id, LSN, free pointers). Heap pages use a slotted format: slots grow from the start, tuples grow from the end toward the center.
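
The slotted layout described above can be sketched as follows. The 16 KB page size and 64-byte header come from the text; the 4-byte slot encoding (u16 offset plus u16 length), the struct name `SlottedPage`, and its methods are hypothetical and chosen only to show how the slot array and the tuple area grow toward each other.

```rust
const PAGE_SIZE: usize = 16 * 1024; // 16 KB page (from the text)
const HEADER_SIZE: usize = 64;      // 64-byte header (from the text)
const SLOT_SIZE: usize = 4;         // assumed encoding: u16 offset + u16 length

struct SlottedPage {
    data: Vec<u8>,
    slot_count: usize,
    tuple_start: usize, // lowest byte offset used by tuple data
}

impl SlottedPage {
    fn new() -> Self {
        Self { data: vec![0; PAGE_SIZE], slot_count: 0, tuple_start: PAGE_SIZE }
    }

    /// First byte after the slot array, which grows upward from the header.
    fn slots_end(&self) -> usize {
        HEADER_SIZE + self.slot_count * SLOT_SIZE
    }

    /// Bytes left between the slot array and the tuple area.
    fn free_space(&self) -> usize {
        self.tuple_start - self.slots_end()
    }

    /// Place the tuple at the end of the free area and append a slot for it.
    fn insert(&mut self, tuple: &[u8]) -> Option<u16> {
        if tuple.len() + SLOT_SIZE > self.free_space() {
            return None; // page full
        }
        let offset = self.tuple_start - tuple.len();
        self.data[offset..self.tuple_start].copy_from_slice(tuple);
        let pos = self.slots_end();
        self.data[pos..pos + 2].copy_from_slice(&(offset as u16).to_le_bytes());
        self.data[pos + 2..pos + 4].copy_from_slice(&(tuple.len() as u16).to_le_bytes());
        self.tuple_start = offset;
        self.slot_count += 1;
        Some((self.slot_count - 1) as u16)
    }

    /// Read a tuple back through its slot entry.
    fn get(&self, slot: u16) -> Option<&[u8]> {
        let slot = slot as usize;
        if slot >= self.slot_count {
            return None;
        }
        let pos = HEADER_SIZE + slot * SLOT_SIZE;
        let offset = u16::from_le_bytes([self.data[pos], self.data[pos + 1]]) as usize;
        let len = u16::from_le_bytes([self.data[pos + 2], self.data[pos + 3]]) as usize;
        Some(&self.data[offset..offset + len])
    }
}
```

Because slots and tuples share one free region, the page is full only when the two areas meet, regardless of whether it holds many small tuples or a few large ones.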

Phase 2 — B+ Tree CoW

A persistent, Copy-on-Write B+ Tree over StorageEngine. Keys up to 64 bytes; ORDER_INTERNAL = 223, ORDER_LEAF = 217 (derived to fill exactly one 16 KB page). Root is an AtomicU64 — readers are lock-free by design. Supports insert (with recursive split), delete (with rebalance/redistribute/merge), and range scan via RangeIter. Prefix compression for internal nodes in memory.
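
To illustrate why an atomic root makes Copy-on-Write readers simple, here is a deliberately tiny model. It is not AxiomDB's tree: each "page" is just a sorted vector in a `HashMap`, there is a single level, and the names `CowTree`, `snapshot_root`, and `read` are hypothetical. What it does show is the publish step: a writer copies, modifies the copy, and only then stores the new root id, so a reader that captured the old root keeps seeing the old version.

```rust
use std::collections::HashMap;
use std::sync::atomic::{AtomicU64, Ordering};

/// Toy page store: page id -> sorted keys. A stand-in for real B+ Tree pages.
struct CowTree {
    pages: HashMap<u64, Vec<u32>>,
    root: AtomicU64,
    next_page_id: u64,
}

impl CowTree {
    fn new() -> Self {
        let mut pages = HashMap::new();
        pages.insert(1, Vec::new());
        Self { pages, root: AtomicU64::new(1), next_page_id: 2 }
    }

    /// A reader loads the root id once and works from that version only.
    fn snapshot_root(&self) -> u64 {
        self.root.load(Ordering::Acquire)
    }

    fn read(&self, root: u64) -> &[u32] {
        &self.pages[&root]
    }

    /// Copy the current root page, modify the copy, then publish its id.
    /// Old pages are never freed in this sketch.
    fn insert(&mut self, key: u32) {
        let old_root = self.root.load(Ordering::Acquire);
        let mut copy = self.pages[&old_root].clone();
        if let Err(pos) = copy.binary_search(&key) {
            copy.insert(pos, key);
        }
        let new_id = self.next_page_id;
        self.next_page_id += 1;
        self.pages.insert(new_id, copy);
        self.root.store(new_id, Ordering::Release);
    }
}
```

The sketch is single-threaded and keeps every old version forever; reclaiming unreachable pages safely is a separate problem (the roadmap lists epoch reclamation under Phase 7).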

Phase 3 — WAL and Transactions ✅ 100% complete

Append-only Write-Ahead Log with binary entries, CRC32c checksums, and forward/backward scan iterators. TxnManager coordinates BEGIN/COMMIT/ROLLBACK with snapshot assignment. Five-step checkpoint protocol. Crash recovery state machine (five states). Catalog bootstrap creates the three system tables on first open. CatalogReader provides MVCC-consistent schema reads. Nine crash scenario tests with a post-recovery integrity checker.
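
The five recovery states listed in the progress table (CRASHED→RECOVERING→REPLAYING→VERIFYING→READY) form a linear state machine. A minimal sketch is shown below; the enum and function names are hypothetical, and the real implementation presumably carries data and error paths in each state that are omitted here.

```rust
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum RecoveryState {
    Crashed,
    Recovering,
    Replaying,
    Verifying,
    Ready,
}

impl RecoveryState {
    /// The successor state, or None once the database is ready.
    fn next(self) -> Option<RecoveryState> {
        use RecoveryState::*;
        match self {
            Crashed => Some(Recovering),
            Recovering => Some(Replaying),
            Replaying => Some(Verifying),
            Verifying => Some(Ready),
            Ready => None,
        }
    }
}

/// Walk the state machine from `start` and return every state visited.
fn run_recovery(start: RecoveryState) -> Vec<RecoveryState> {
    let mut path = vec![start];
    let mut state = start;
    while let Some(n) = state.next() {
        path.push(n);
        state = n;
    }
    path
}
```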

Phase 3 late additions (3.17–3.19):

3.17 WAL batch append — record_insert_batch() uses WalWriter::reserve_lsns(N) + write_batch() to write N Insert WAL entries in a single write_all call. Reduces BufWriter overhead from O(N rows) to O(1) for bulk inserts.
3.18 WAL PageWrite — EntryType::PageWrite = 9. One WAL entry per affected heap page instead of one per row. new_value = post-modification page bytes (16 KB) + embedded slot IDs for crash recovery undo. For a 10K-row bulk insert: 42 WAL entries instead of 10,000 — 238× fewer serializations and 30% smaller WAL file.
3.19 WAL Group Commit — CommitCoordinator batches DML commits from concurrent connections. DML commits write to the WAL BufWriter, register with the coordinator, and release the Database lock before awaiting fsync confirmation. A background Tokio task performs one flush+fsync per batch window (group_commit_interval_ms), then notifies all waiting connections. Enables near-linear concurrent write scaling.
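
The PageWrite figures in 3.18 follow from simple arithmetic, sketched below. The rows-per-page value of 240 is an assumption picked so that the result matches the 42 entries quoted above; the actual number depends on row size and is not stated in this document. The function names are hypothetical.

```rust
/// WAL entries when every inserted row gets its own entry.
fn entries_per_row(rows: u64) -> u64 {
    rows
}

/// WAL entries when each affected heap page gets one PageWrite entry.
/// `rows_per_page` is an input assumption, not a documented constant.
fn entries_per_page(rows: u64, rows_per_page: u64) -> u64 {
    // Ceiling division: a partially filled last page still needs an entry.
    (rows + rows_per_page - 1) / rows_per_page
}

/// How many times fewer entries the page-level format writes (rounded down).
fn reduction_factor(rows: u64, rows_per_page: u64) -> u64 {
    entries_per_row(rows) / entries_per_page(rows, rows_per_page)
}
```

With 10,000 rows and an assumed 240 rows per page, the insert touches 42 pages, and 10,000 / 42 ≈ 238, which is the factor quoted in 3.18.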

Phase 4 — SQL Processing

SQL AST covering all DML (SELECT, INSERT, UPDATE, DELETE) and DDL (CREATE/DROP/ALTER TABLE, CREATE/DROP INDEX). logos-based lexer with ~85 tokens, case-insensitive keywords, zero-copy identifiers. Recursive descent parser with full expression precedence. Expression evaluator with three-valued NULL logic (AND, OR, NOT, IS NULL, BETWEEN, LIKE, IN). Semantic analyzer with BindContext, qualified/unqualified column resolution, ambiguity detection, and subquery support. Row codec with null bitmap, u24 string lengths, and O(n) encoded_len().
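
Three-valued NULL logic is easiest to see in code. The sketch below models a SQL boolean as `Option<bool>`, with `None` standing for NULL; this representation and the function names are assumptions for illustration and say nothing about how AxiomDB's evaluator stores values internally. The truth tables themselves are standard SQL semantics.

```rust
/// SQL boolean: Some(true), Some(false), or None for NULL (unknown).
type Tri = Option<bool>;

/// AND: false dominates, then NULL, then true.
fn and3(a: Tri, b: Tri) -> Tri {
    match (a, b) {
        (Some(false), _) | (_, Some(false)) => Some(false),
        (Some(true), Some(true)) => Some(true),
        _ => None,
    }
}

/// OR: true dominates, then NULL, then false.
fn or3(a: Tri, b: Tri) -> Tri {
    match (a, b) {
        (Some(true), _) | (_, Some(true)) => Some(true),
        (Some(false), Some(false)) => Some(false),
        _ => None,
    }
}

/// NOT: NULL stays NULL.
fn not3(a: Tri) -> Tri {
    a.map(|v| !v)
}

/// IS NULL always yields a definite true or false.
fn is_null(a: Tri) -> Tri {
    Some(a.is_none())
}

/// A WHERE clause keeps a row only when the predicate is definitely true.
fn where_keeps(predicate: Tri) -> bool {
    predicate == Some(true)
}
```

Note that `NULL AND FALSE` is FALSE and `NULL OR TRUE` is TRUE, while a bare NULL predicate filters the row out just like FALSE does.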

Near-Term Priorities

Phase 13 — Row-Level Writer Concurrency

The current implementation uses Arc<tokio::sync::RwLock<Database>>: reads can overlap, but mutating statements are still serialized at whole-database scope. Phase 13.7 removes that bottleneck with row-level locking. Phase 13.8 adds deadlock detection, and 13.8b adds SELECT ... FOR UPDATE, NOWAIT, and SKIP LOCKED.
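
The behaviour of a database-wide read/write lock can be demonstrated in a few lines. To stay self-contained, this sketch uses `std::sync::RwLock` rather than the `tokio::sync::RwLock` named above, and the `Database` struct and helper functions are hypothetical stand-ins. The non-blocking `try_read` and `try_write` calls are used only to probe whether the lock could be taken.

```rust
use std::sync::{Arc, RwLock};

/// Hypothetical stand-in for the real database state.
#[derive(Default)]
struct Database {
    rows: Vec<i64>,
}

/// Two readers can hold the lock at the same time.
fn readers_overlap(db: &Arc<RwLock<Database>>) -> bool {
    let _first = db.read().unwrap();
    let ok = db.try_read().is_ok();
    ok
}

/// A writer cannot enter while any reader holds the lock.
fn writer_blocked_by_reader(db: &Arc<RwLock<Database>>) -> bool {
    let _reader = db.read().unwrap();
    let blocked = db.try_write().is_err();
    blocked
}

/// A second writer cannot enter while a first one holds the lock.
fn writers_serialize(db: &Arc<RwLock<Database>>) -> bool {
    let _writer = db.write().unwrap();
    let blocked = db.try_write().is_err();
    blocked
}

/// Every mutation takes the single database-wide write guard.
fn append_row(db: &Arc<RwLock<Database>>, value: i64) {
    db.write().unwrap().rows.push(value);
}
```

The third function is the bottleneck the paragraph describes: two writers touching unrelated rows still queue behind one guard, which is what row-level locking is meant to remove.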

Phase 5

Phase 5 is now complete. The last subphase to close was:

5.15 DSN parsing — AxiomDB-owned surfaces now accept typed DSNs: AXIOMDB_URL for server bootstrap plus Db::open_dsn, AsyncDb::open_dsn, and axiomdb_open_dsn for embedded mode. mysql:// and postgres:// are parse aliases only; the server still speaks MySQL wire only and embedded mode still accepts only local-path DSNs.

Phase 5 also closed the recent runtime/perf subphases:

5.11c Explicit connection state machine — the MySQL server now has an explicit CONNECTED → AUTH → IDLE → EXECUTING → CLOSING transport lifecycle with fixed auth timeout, wait_timeout vs interactive_timeout behavior, net_write_timeout for packet writes, and socket keepalive configured separately from SQL session state.
5.19a Executor decomposition — the SQL executor now lives in a responsibility-based executor/ module tree instead of one monolithic file, which lowers the cost of later DML and planner work.
5.19 B+Tree batch delete — DELETE WHERE and the old-key half of UPDATE now stage exact encoded keys per index and remove them with one ordered delete_many_in(...) pass per tree instead of one delete_in(...) traversal per row.
5.19b Eval decomposition — the expression evaluator now lives under a responsibility-based eval/ module tree with the same public API, which lowers the cost of future built-in and collation work without changing SQL behavior.
5.20 Stable-RID UPDATE fast path — UPDATE can now rewrite rows in the same heap slot when the new encoded row fits, preserve the RecordId, and skip unnecessary index maintenance for indexes whose logical key membership is unchanged.
5.21 Transactional INSERT staging — explicit transactions now buffer consecutive INSERT ... VALUES statements per table and flush them together on COMMIT or the next barrier statement, preserving savepoint semantics by flushing before the next statement savepoint whenever the batch cannot continue.
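
The decision at the heart of the 5.20 fast path can be sketched roughly as follows. Everything here is hypothetical: the `Slot` struct, the use of a fixed `capacity`, and the rule that a relocated row forces maintenance of every index are assumptions made to keep the example small, not a description of AxiomDB's heap format.

```rust
#[derive(Debug, PartialEq)]
enum UpdateOutcome {
    /// Row rewritten in the same heap slot; its record id is unchanged.
    InPlace,
    /// Row does not fit; it would have to move and receive a new record id.
    Relocated,
}

/// Hypothetical heap slot with a fixed byte capacity.
struct Slot {
    capacity: usize,
    bytes: Vec<u8>,
}

/// Rewrite the row in place when the new encoding fits the slot.
fn apply_update(slot: &mut Slot, new_row: &[u8]) -> UpdateOutcome {
    if new_row.len() <= slot.capacity {
        slot.bytes.clear();
        slot.bytes.extend_from_slice(new_row);
        UpdateOutcome::InPlace
    } else {
        UpdateOutcome::Relocated
    }
}

/// Positions of the indexes that need delete+insert maintenance.
/// Assumption: an in-place update touches only indexes whose key changed,
/// while a relocation touches all of them because the record id changes.
fn indexes_to_maintain(
    outcome: &UpdateOutcome,
    old_keys: &[&str],
    new_keys: &[&str],
) -> Vec<usize> {
    old_keys
        .iter()
        .zip(new_keys)
        .enumerate()
        .filter(|(_, (old, new))| *outcome == UpdateOutcome::Relocated || old != new)
        .map(|(i, _)| i)
        .collect()
}
```

For an update that changes only the second indexed column and still fits the slot, the sketch rewrites the bytes in place and reports that only index 1 needs work.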

Phase 6 closing note — Integrity and recovery

Phase 6 now closes with startup index integrity verification:

every catalog-visible index is compared against heap-visible rows after WAL recovery
readable divergence is repaired automatically from heap contents
unreadable index trees cause the database open to fail with IndexIntegrityFailure

SQL REINDEX remains deferred to the later diagnostics / administration phases.
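
The compare-and-repair step for a readable index can be sketched as a set comparison. This is a simplified model under stated assumptions: the index is a `BTreeSet` of (key, record id) pairs, the heap is a slice of visible rows, and the names `verify_and_repair` and `IndexCheck` are hypothetical. The unreadable-tree case, which the text says ends in IndexIntegrityFailure, is not modelled.

```rust
use std::collections::BTreeSet;

type Rid = u64;
type Entry = (String, Rid); // (index key, record id)

#[derive(Debug, PartialEq)]
enum IndexCheck {
    Consistent,
    Repaired { missing: usize, stale: usize },
}

/// Compare the index with what the heap-visible rows imply and, on
/// divergence, rebuild the index contents from the heap.
fn verify_and_repair(heap_rows: &[(Rid, String)], index: &mut BTreeSet<Entry>) -> IndexCheck {
    let expected: BTreeSet<Entry> = heap_rows
        .iter()
        .map(|(rid, key)| (key.clone(), *rid))
        .collect();
    if *index == expected {
        return IndexCheck::Consistent;
    }
    // Entries the heap implies but the index lacks.
    let missing = expected.difference(&*index).count();
    // Entries in the index that no visible heap row backs.
    let stale = index.difference(&expected).count();
    *index = expected;
    IndexCheck::Repaired { missing, stale }
}
```

Treating the heap as the source of truth is what makes automatic repair possible: the index can always be regenerated from rows, but not the other way around.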

Phase 6 closing note — Indexed multi-row INSERT on indexed tables

Phase 6 also closes the remaining immediate multi-row VALUES debt on indexed tables:

shared batch-apply helpers are now reused by both 5.21 staging flushes and the immediate INSERT ... VALUES (...), (...) path
PRIMARY KEY and secondary indexes no longer force a per-row fallback for multi-row VALUES statements
same-statement UNIQUE detection remains strict because the immediate path does not reuse the staged committed_empty shortcut

Index range scan — range predicate via RangeIter.
Projection — evaluate SELECT expressions over rows from the scan.
Filter — apply WHERE expression using the evaluator from Phase 4.17.
Nested loop join — INNER JOIN, LEFT JOIN.
Sort — ORDER BY with NULLS FIRST/LAST.
Limit/Offset — LIMIT n OFFSET m.
Hash aggregate — GROUP BY with COUNT, SUM, AVG, MIN, MAX.
INSERT / UPDATE / DELETE — write path with WAL integration.

The executor will be a simple volcano-model interpreter in Phase 5. Vectorized execution (morsel-driven, SIMD) is planned for Phase 14.
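
For readers unfamiliar with the volcano model, the idea is that every operator exposes a "give me the next row" call and pulls from its child on demand. The sketch below wires a scan, a filter, and a limit/offset operator together in that style. The trait `Operator`, the method `next_row`, and the `Row` alias are hypothetical names for illustration and are not AxiomDB's executor API.

```rust
type Row = Vec<i64>;

/// Volcano-style operator: each call yields at most one row.
trait Operator {
    fn next_row(&mut self) -> Option<Row>;
}

/// Leaf operator that yields rows from an in-memory table.
struct Scan {
    rows: std::vec::IntoIter<Row>,
}

impl Operator for Scan {
    fn next_row(&mut self) -> Option<Row> {
        self.rows.next()
    }
}

/// WHERE: pull from the child until a row passes the predicate.
struct Filter {
    child: Box<dyn Operator>,
    pred: fn(&Row) -> bool,
}

impl Operator for Filter {
    fn next_row(&mut self) -> Option<Row> {
        while let Some(row) = self.child.next_row() {
            if (self.pred)(&row) {
                return Some(row);
            }
        }
        None
    }
}

/// LIMIT n OFFSET m: discard `offset` rows, then emit at most `limit`.
struct Limit {
    child: Box<dyn Operator>,
    offset: usize,
    limit: usize,
}

impl Operator for Limit {
    fn next_row(&mut self) -> Option<Row> {
        while self.offset > 0 {
            self.child.next_row()?; // child exhausted while skipping
            self.offset -= 1;
        }
        if self.limit == 0 {
            return None;
        }
        self.limit -= 1;
        self.child.next_row()
    }
}

/// Pull every row out of the root operator.
fn drain(mut root: Box<dyn Operator>) -> Vec<Row> {
    let mut out = Vec::new();
    while let Some(row) = root.next_row() {
        out.push(row);
    }
    out
}
```

Because rows are pulled one at a time, the limit operator stops asking its child for input as soon as it has enough rows, so the scan never reads further than needed. A vectorized design would instead pass batches of rows between operators.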

AxiomQL — Alternative Query Language (Phases 36-37)

AxiomDB will support two query languages sharing one AST and executor:

SQL stays as the primary language with full wire protocol compatibility. Every ORM, client, and tool works without changes.

AxiomQL is an optional method-chain alternative designed to be learned in minutes by any developer who already uses .filter().sort().take() in JavaScript, Python, Rust, or C#:

users
  .filter(active, age > 18)
  .join(orders)
  .group(country, total: count())
  .sort(total.desc)
  .take(10)

Both languages compile to the same Stmt AST — zero executor overhead, every SQL feature automatically available in AxiomQL. Planned after Phase 8 (wire protocol).

Phase	Scope
36	AxiomQL parser: SELECT, filter, join, group, subqueries, let bindings
37	AxiomQL write + DDL: insert, update, delete, create, transaction, proc

