High-Performance JSON library for Mojo🔥

Python-like API — loads, dumps, load, dump
GPU accelerated — 2-4x faster than cuJSON on large files
Cross-platform — NVIDIA, AMD, and Apple Silicon GPUs
Streaming & lazy parsing — Handle files larger than memory
JSONPath & Schema — Query and validate JSON documents
RFC compliant — JSON Patch, Merge Patch, JSON Pointer

Installation

Add mojson to your project's pixi.toml:

[workspace]
channels = ["https://conda.modular.com/max-nightly", "conda-forge", "https://prefix.dev/modular-community"]
preview = ["pixi-build"]

[dependencies]
mojson = { git = "https://github.com/ehsanmok/mojson.git" }

Then run:

pixi install

Note: mojo-compiler and simdjson are automatically installed as dependencies.

Quick Start

from mojson import loads, dumps, load, dump

# Parse & serialize strings
var data = loads('{"name": "Alice", "scores": [95, 87, 92]}')
print(data["name"].string_value())  # Alice
print(data["scores"][0].int_value())  # 95
print(dumps(data, indent="  "))  # Pretty print

# File I/O (auto-detects .ndjson)
var config = load("config.json")
var logs = load("events.ndjson")  # Returns array of values

# Explicit GPU parsing
var big = load[target="gpu"]("large.json")

Development Setup

To contribute or run tests:

git clone https://github.com/ehsanmok/mojson.git && cd mojson
pixi install
pixi run tests-cpu

Requirements

pixi package manager

GPU (optional): NVIDIA CUDA 7.0+, AMD ROCm 6+, or Apple Silicon. See GPU requirements.

Performance

GPU (804MB `twitter_large_record.json`)

Platform	Throughput	vs cuJSON
AMD MI355X	13 GB/s	3.6x faster
NVIDIA B200	8 GB/s	1.8x faster
Apple M3 Pro	3.9 GB/s	—

GPU only beneficial for files >100MB.

# Download large dataset first (required for meaningful GPU benchmarks)
pixi run download-twitter-large

# Run GPU benchmark (only use large files)
pixi run bench-gpu benchmark/datasets/twitter_large_record.json

API

Everything through 4 functions: loads, dumps, load, dump

# Parse strings (default: pure Mojo backend - fast, zero FFI)
loads(s)                              # JSON string -> Value
loads[target="cpu-simdjson"](s)       # Use simdjson FFI backend
loads[target="gpu"](s)                # GPU parsing
loads[format="ndjson"](s)             # NDJSON string -> List[Value]
loads[lazy=True](s)                   # Lazy parsing (CPU only)

# Serialize strings
dumps(v)                              # Value -> JSON string
dumps(v, indent="  ")                 # Pretty print
dumps[format="ndjson"](values)        # List[Value] -> NDJSON string

# File I/O (auto-detects .ndjson from extension)
load("data.json")                     # JSON file -> Value (CPU)
load("data.ndjson")                   # NDJSON file -> Value (array)
load[target="gpu"]("large.json")      # GPU parsing
load[streaming=True]("huge.ndjson")   # Stream (CPU, memory efficient)
dump(v, f)                            # Write to file

# Value access
value["key"], value[0]                # By key/index
value.at("/path")                     # JSON Pointer (RFC 6901)
value.set("key", val)                 # Mutation

# Advanced
jsonpath_query(doc, "$.users[*]")     # JSONPath queries
validate(doc, schema)                 # JSON Schema validation
apply_patch(doc, patch)               # JSON Patch (RFC 6902)

Feature Matrix

Feature	CPU	GPU	Notes
`loads(s)`	✅ default	✅ `target="gpu"`
`load(path)`	✅ default	✅ `target="gpu"`	Auto-detects .ndjson
`loads[format="ndjson"]`	✅ default	✅ `target="gpu"`
`loads[lazy=True]`	✅	—	CPU only
`load[streaming=True]`	✅	—	CPU only
`dumps` / `dump`	✅	—	CPU only

CPU Backends

Backend	Target	Speed	Dependencies
Mojo (native)	`loads()` (default)	1.31 GB/s	Zero FFI
simdjson (FFI)	`loads[target="cpu-simdjson"]()`	0.48 GB/s	libsimdjson

The pure Mojo backend is the default and is ~2.7x faster than the FFI approach with zero external dependencies.

Full API: docs/api.md

Examples

pixi run mojo -I . examples/01_basic_parsing.mojo

Example	Description
01_basic_parsing	Parse, serialize, type handling
02_file_operations	Read/write JSON files
03_value_types	Type checking, value extraction
04_gpu_parsing	GPU-accelerated parsing
05_error_handling	Error handling patterns
06_struct_serde	Struct serialization
07_ndjson	NDJSON parsing & streaming
08_lazy_parsing	On-demand lazy parsing
09_jsonpath	JSONPath queries
10_schema_validation	JSON Schema validation
11_json_patch	JSON Patch & Merge Patch

Documentation

API Reference — Complete function reference
Architecture — CPU/GPU backend design
Performance — Optimization deep dive
Benchmarks — Reproducible benchmarks

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
benchmark		benchmark
docs		docs
examples		examples
mojson		mojson
tests		tests
.gitattributes		.gitattributes
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
README.md		README.md
pixi.toml		pixi.toml
recipe.yaml		recipe.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

High-Performance JSON library for Mojo🔥

Installation

Quick Start

Development Setup

Requirements

Performance

GPU (804MB `twitter_large_record.json`)

API

Feature Matrix

CPU Backends

Examples

Documentation

License

About

Uh oh!

Languages

License

ehsanmok/mojson

Folders and files

Latest commit

History

Repository files navigation

High-Performance JSON library for Mojo🔥

Installation

Quick Start

Development Setup

Requirements

Performance

GPU (804MB twitter_large_record.json)

API

Feature Matrix

CPU Backends

Examples

Documentation

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Languages

GPU (804MB `twitter_large_record.json`)