[Pre-PR Contribution] Structured Rustonomicon Dataset (JSONL + SQLite) for Intelligent Tooling #490

@Superuser666-Sigil

Hi team,

I wanted to share a side project that may be useful to the Rust ecosystem, especially for documentation tools, learning platforms, or AI-assisted programming environments.

I’ve built a structured, machine-usable version of the Rustonomicon, derived directly from the Markdown source.

What It Includes

nomicon_canon_nodes.jsonl: Line-delimited JSON with canonical IDs, titles, tags, trust levels (safe or hazardous), and SHA-256 content hashes.

nomicon.sqlite3: SQLite version of the same data for fast querying, tooling, and offline analysis.

Fully compliant with the original licensing (MIT OR Apache-2.0).

No modifications to the core content; only structure is added to enable automated use.
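
As a rough illustration of how a tool might consume the JSONL file, here is a minimal sketch of parsing line-delimited records and re-checking each SHA-256 content hash. The field names (`id`, `title`, `tags`, `trust`, `content`, `sha256`), the sample records, and the idea that the hashed text is stored alongside the hash are all assumptions made for the example; the actual schema of nomicon_canon_nodes.jsonl may differ.

```python
import hashlib
import json


def content_hash(text: str) -> str:
    """Return the hex SHA-256 digest of the UTF-8 encoded text."""
    return hashlib.sha256(text.encode("utf-8")).hexdigest()


def make_node(node_id, title, tags, trust, content):
    """Build one record. All field names here are hypothetical."""
    return {
        "id": node_id,
        "title": title,
        "tags": tags,
        "trust": trust,  # "safe" or "hazardous"
        "content": content,
        "sha256": content_hash(content),
    }


# Placeholder records standing in for real dataset entries.
SAMPLE_JSONL = "\n".join(
    json.dumps(node)
    for node in [
        make_node("example/safe", "Example safe section",
                  ["example"], "safe", "Placeholder safe text."),
        make_node("example/hazardous", "Example hazardous section",
                  ["example", "unsafe"], "hazardous", "Placeholder hazardous text."),
    ]
)


def parse_jsonl(text: str) -> list:
    """Parse line-delimited JSON, skipping blank lines."""
    return [json.loads(line) for line in text.splitlines() if line.strip()]


def is_intact(node: dict) -> bool:
    """Check that the stored hash still matches the stored content."""
    return content_hash(node["content"]) == node["sha256"]


nodes = parse_jsonl(SAMPLE_JSONL)
hazardous_ids = [n["id"] for n in nodes if n["trust"] == "hazardous"]
```

A consumer could run `is_intact` on every record at load time and refuse to use any node whose content no longer matches its recorded hash.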

Intended Use Cases

LSP plugins or rust-analyzer enhancements

Static/dynamic analysis tools (e.g., unsafe call tracing)

Educational tools and intelligent code tutors

Fuzzing and verification harnesses that require canonical unsafe examples
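
For the tooling use cases above, a lookup against the SQLite file might look something like the sketch below. It builds a small in-memory database rather than opening nomicon.sqlite3, because the real table and column names are not described here; the `nodes` table, its columns, and the sample rows are hypothetical.

```python
import sqlite3


def build_demo_db() -> sqlite3.Connection:
    """Create an in-memory database with a hypothetical `nodes` table."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE nodes ("
        "id TEXT PRIMARY KEY, title TEXT NOT NULL, trust TEXT NOT NULL)"
    )
    conn.executemany(
        "INSERT INTO nodes (id, title, trust) VALUES (?, ?, ?)",
        [
            ("example/safe", "Example safe section", "safe"),
            ("example/hazardous", "Example hazardous section", "hazardous"),
        ],
    )
    conn.commit()
    return conn


def titles_by_trust(conn: sqlite3.Connection, trust: str) -> list:
    """Return section titles at the given trust level, sorted by title."""
    rows = conn.execute(
        "SELECT title FROM nodes WHERE trust = ? ORDER BY title", (trust,)
    )
    return [title for (title,) in rows]


conn = build_demo_db()
hazardous_titles = titles_by_trust(conn, "hazardous")
```

An editor plugin or analysis tool could issue the same kind of parameterized query against the shipped database once the actual schema is known.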

Live Repo

https://github.com/DaveTmire85/nomicon-codex

Right now the files are loose in /src, so I'm moving them to /src/codex for clarity.
If this is useful, I’d be happy to upstream the codex/ directory into this repo, or maintain it separately with your blessing.

Appreciate your work and the community’s time. Happy to hear any feedback or suggestions.

Thanks!
