mozjpeg-rs

Pure Rust JPEG encoder based on Mozilla's mozjpeg, featuring trellis quantization for optimal compression.

Encoder Only

mozjpeg-rs is a JPEG encoder only. It does not decode JPEG files.

For decoding, use one of these excellent crates:

Crate	Type	Notes
jpeg-decoder	Pure Rust	Widely used, reliable
zune-jpeg	Pure Rust	Fast, SIMD-optimized
mozjpeg-sys	C bindings	Full mozjpeg (encode + decode)

Why mozjpeg-rs?

	mozjpeg-rs	C mozjpeg	libjpeg-turbo
Language	Pure Rust	C	C/asm
Memory safety	Compile-time guaranteed (`unsafe` only in SIMD intrinsics)	Manual	Manual
Trellis quantization	Yes	Yes	No
Build complexity	`cargo add`	cmake + nasm + C toolchain	cmake + nasm

Choose mozjpeg-rs when you want:

Memory-safe JPEG encoding without C dependencies
Smaller files than libjpeg-turbo (trellis quantization)
Simple integration via Cargo

Choose C mozjpeg when you need:

Smallest possible files at high quality (Q85+)
Maximum baseline encoding speed (SIMD-optimized entropy coding)
Established C ABI for FFI

Compression Results vs C mozjpeg

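Tested on the full Kodak corpus (24 images) with trellis quantization, Huffman optimization, and 4:2:0 subsampling. Percentages give the mozjpeg-rs file size relative to C mozjpeg; negative values mean the Rust output is smaller.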

Max Compression Mode (`Encoder::max_compression()`)

Progressive mode with optimize_scans=true - each AC scan gets its own optimal Huffman table.

Quality	Rust vs C	Notes
Q50	-0.39%	Rust produces smaller files
Q60	-0.26%	Rust smaller
Q70	-0.38%	Rust smaller
Q75	-0.14%	Rust smaller
Q80	+0.17%	Near-identical
Q85	+0.42%	Near-identical
Q90	+0.97%	Slight gap
Q95	+1.59%
Q97	+2.13%
Q100	+0.98%

All Modes Comparison

Quality	Baseline	Progressive	Max Compression
Q50	+0.15%	-1.23%	-0.39%
Q60	+0.47%	-0.70%	-0.26%
Q70	+0.54%	-0.35%	-0.38%
Q75	+0.87%	+0.22%	-0.14%
Q80	+1.34%	+0.90%	+0.17%
Q85	+1.75%	+1.44%	+0.42%
Q90	+2.73%	+2.63%	+0.97%
Q95	+3.87%	+3.64%	+1.59%
Q97	+5.36%	+4.90%	+2.13%
Q100	+3.53%	+2.59%	+0.98%

Summary:

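Max Compression: Rust produces smaller files than C at Q50-Q75, is within 0.2% at Q80, and stays within 2.2% at all quality levels
Progressive: Rust beats C at Q50-Q70 and stays within 5% at all levels
Baseline: Rust files are larger at every level, with the biggest gap at high quality (up to +5.36% at Q97) due to trellis quantization differences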

Visual quality (SSIMULACRA2, Butteraugli) is virtually identical at all quality levels.
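
The percentages in the tables above can be read as a relative size difference. The sketch below shows one plausible way to compute such a figure. The function names are illustrative, and aggregating by total corpus bytes is an assumption, since the README does not say how per-image results were combined.

```rust
/// Relative size difference of `rust_bytes` versus `c_bytes`, in percent.
/// Negative means the Rust output is smaller.
fn size_delta_percent(rust_bytes: u64, c_bytes: u64) -> f64 {
    assert!(c_bytes > 0, "reference size must be non-zero");
    (rust_bytes as f64 - c_bytes as f64) / c_bytes as f64 * 100.0
}

/// Aggregate over a corpus of (rust_bytes, c_bytes) pairs by comparing totals.
/// Averaging per-image percentages would be another reasonable choice.
fn corpus_delta_percent(pairs: &[(u64, u64)]) -> f64 {
    let rust_total: u64 = pairs.iter().map(|p| p.0).sum();
    let c_total: u64 = pairs.iter().map(|p| p.1).sum();
    size_delta_percent(rust_total, c_total)
}
```

With this convention, a Rust file of 990 bytes against a C file of 1000 bytes yields -1.0%.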

Usage

use mozjpeg_rs::{Encoder, Subsampling};

// Default: trellis quantization + Huffman optimization
let jpeg = Encoder::new()
    .quality(85)
    .encode_rgb(&pixels, width, height)?;

// Maximum compression: progressive + trellis + deringing + optimize_scans
let jpeg = Encoder::max_compression()
    .quality(85)
    .encode_rgb(&pixels, width, height)?;

// Fastest: no optimizations (libjpeg-turbo compatible output)
let jpeg = Encoder::fastest()
    .quality(85)
    .encode_rgb(&pixels, width, height)?;

// Custom configuration
let jpeg = Encoder::new()
    .quality(75)
    .progressive(true)
    .subsampling(Subsampling::S420)
    .optimize_huffman(true)
    .encode_rgb(&pixels, width, height)?;
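
The snippets above leave `pixels`, `width`, and `height` undefined. A minimal sketch of preparing test input follows, assuming `encode_rgb` expects tightly packed 8-bit RGB in row-major order (3 bytes per pixel). That layout is an assumption, not something the README states, and both helper names are hypothetical.

```rust
/// Build a tightly packed RGB8 buffer (3 bytes per pixel, row-major)
/// containing a red/green gradient with a constant blue channel.
fn gradient_rgb(width: u32, height: u32) -> Vec<u8> {
    let mut pixels = Vec::with_capacity(width as usize * height as usize * 3);
    // Avoid division by zero for 1-pixel-wide or 1-pixel-tall images.
    let x_max = u64::from(width.saturating_sub(1).max(1));
    let y_max = u64::from(height.saturating_sub(1).max(1));
    for y in 0..height {
        for x in 0..width {
            pixels.push((u64::from(x) * 255 / x_max) as u8); // red ramps left to right
            pixels.push((u64::from(y) * 255 / y_max) as u8); // green ramps top to bottom
            pixels.push(128); // constant blue
        }
    }
    pixels
}

/// Check that a buffer has exactly width * height * 3 bytes.
fn is_valid_rgb_len(pixels: &[u8], width: u32, height: u32) -> bool {
    pixels.len() == width as usize * height as usize * 3
}
```

Checking the buffer length before encoding catches the most common mistake, which is passing RGBA or padded rows.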

Features

Trellis quantization - Rate-distortion optimized coefficient selection (AC + DC)
Progressive JPEG - Multi-scan encoding with spectral selection
Huffman optimization - 2-pass encoding for optimal entropy coding
Overshoot deringing - Reduces ringing artifacts at sharp edges
Chroma subsampling - 4:4:4, 4:2:2, 4:2:0 modes
Safe Rust - #![deny(unsafe_code)] with exceptions only for SIMD intrinsics
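
To illustrate the idea behind 2-pass Huffman optimization, here is a minimal, std-only sketch: the first pass counts symbol frequencies, and the second derives code lengths from those counts. It is not the crate's implementation. In particular, JPEG limits Huffman codes to 16 bits, and this sketch does not enforce that limit.

```rust
use std::cmp::Reverse;
use std::collections::BinaryHeap;

/// Pass 1: count how often each byte-valued symbol occurs.
fn count_symbols(symbols: &[u8]) -> [u32; 256] {
    let mut freq = [0u32; 256];
    for &s in symbols {
        freq[s as usize] += 1;
    }
    freq
}

/// Pass 2: derive Huffman code lengths from the counts (0 = symbol unused).
/// Note: no 16-bit length limiting, which a real JPEG encoder needs.
fn code_lengths(freq: &[u32; 256]) -> [u8; 256] {
    // Nodes 0..256 are leaves; internal nodes are appended after them.
    let mut parent: Vec<usize> = vec![usize::MAX; 256];
    let mut heap = BinaryHeap::new();
    for (sym, &f) in freq.iter().enumerate() {
        if f > 0 {
            heap.push(Reverse((u64::from(f), sym)));
        }
    }
    let mut lengths = [0u8; 256];
    if heap.len() == 1 {
        // A lone symbol still needs a 1-bit code.
        let Reverse((_, sym)) = heap.pop().unwrap();
        lengths[sym] = 1;
        return lengths;
    }
    // Repeatedly merge the two lightest nodes under a new parent.
    while heap.len() > 1 {
        let Reverse((w1, a)) = heap.pop().unwrap();
        let Reverse((w2, b)) = heap.pop().unwrap();
        let node = parent.len();
        parent.push(usize::MAX);
        parent[a] = node;
        parent[b] = node;
        heap.push(Reverse((w1 + w2, node)));
    }
    // A leaf's code length is its depth below the root.
    for sym in 0..256 {
        if freq[sym] == 0 {
            continue;
        }
        let mut n = sym;
        let mut depth = 0u8;
        while parent[n] != usize::MAX {
            n = parent[n];
            depth += 1;
        }
        lengths[sym] = depth;
    }
    lengths
}
```

Frequent symbols end up with short codes, which is why tables built from the actual image data give smaller files than the default tables.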

Encoder Settings Matrix

All combinations of settings are supported and tested, except that optimize_scans applies only to progressive mode:

Setting	Baseline	Progressive	Notes
Subsampling
├─ 4:4:4	✅	✅	No chroma subsampling
├─ 4:2:2	✅	✅	Horizontal subsampling
└─ 4:2:0	✅	✅	Horizontal and vertical subsampling (default)
Trellis Quantization
├─ AC trellis	✅	✅	Rate-distortion optimized AC coefficients
└─ DC trellis	✅	✅	Cross-block DC optimization
Huffman
├─ Default tables	✅	✅	Fast, slightly larger files
└─ Optimized tables	✅	✅	2-pass, smaller files
Progressive-only
└─ optimize_scans	❌	✅	Per-scan Huffman tables
Other
├─ Deringing	✅	✅	Reduce overshoot artifacts
├─ Grayscale	✅	✅	Single-component encoding
├─ EOB optimization	✅	✅	Cross-block EOB runs (opt-in)
└─ Smoothing	✅	✅	Noise reduction filter (for dithered images)
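
As a rough illustration of what the three subsampling modes mean for the chroma planes, the sketch below computes chroma plane dimensions from the luma dimensions. `ChromaMode` is a hypothetical stand-in, not the crate's `Subsampling` type, and rounding up for odd dimensions is the usual JPEG convention rather than something taken from this crate's source.

```rust
/// Hypothetical stand-in for a subsampling setting.
#[derive(Clone, Copy, Debug, PartialEq)]
enum ChromaMode {
    S444, // no chroma subsampling
    S422, // chroma halved horizontally
    S420, // chroma halved horizontally and vertically
}

/// Dimensions of each chroma plane for a given luma size.
/// Odd sizes round up so every luma sample is covered.
fn chroma_dims(width: u32, height: u32, mode: ChromaMode) -> (u32, u32) {
    let (h, v) = match mode {
        ChromaMode::S444 => (1, 1),
        ChromaMode::S422 => (2, 1),
        ChromaMode::S420 => (2, 2),
    };
    ((width + h - 1) / h, (height + v - 1) / v)
}
```

For a 512x768 image, 4:2:0 gives two 256x384 chroma planes, so the chroma data is a quarter of its 4:4:4 size.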

Presets:

Encoder::new() - Trellis (AC+DC) + Huffman optimization + Deringing
Encoder::max_compression() - Above + Progressive + optimize_scans
Encoder::fastest() - No optimizations (libjpeg-turbo compatible)

Quantization Tables

Table	Description
`Robidoux`	Default. Nicolas Robidoux's psychovisual tables (used by ImageMagick)
`JpegAnnexK`	Standard JPEG tables (libjpeg default)
`Flat`	Uniform quantization
`MssimTuned`	MSSIM-optimized on Kodak corpus
`PsnrHvsM`	PSNR-HVS-M tuned
`Klein`	Klein, Silverstein, Carney (1992)
`Watson`	DCTune (Watson, Taylor, Borthwick 1997)
`Ahumada`	Ahumada, Watson, Peterson (1993)
`Peterson`	Peterson, Ahumada, Watson (1993)

use mozjpeg_rs::{Encoder, QuantTableIdx};

let jpeg = Encoder::new()
    .qtable(QuantTableIdx::Robidoux)  // or .quant_tables()
    .encode_rgb(&pixels, width, height)?;
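
For context on how a quality setting interacts with a base table, the sketch below applies the classic libjpeg quality scaling to a 64-entry table. Whether mozjpeg-rs scales its tables in exactly this way is an assumption, and the flat base value of 16 used in the usage note is purely illustrative.

```rust
/// libjpeg-style mapping from quality (1..=100) to a percentage scale factor.
fn scale_factor(quality: u8) -> u32 {
    let q = u32::from(quality.clamp(1, 100));
    if q < 50 {
        5000 / q
    } else {
        200 - 2 * q
    }
}

/// Scale a base quantization table and clamp entries to the 8-bit range 1..=255.
fn scale_table(base: &[u16; 64], quality: u8) -> [u16; 64] {
    let scale = scale_factor(quality);
    let mut out = [0u16; 64];
    for (o, &b) in out.iter_mut().zip(base.iter()) {
        let v = (u32::from(b) * scale + 50) / 100;
        *o = v.clamp(1, 255) as u16;
    }
    out
}
```

With a flat base table of 16s, quality 50 leaves the table unchanged, quality 75 halves each entry to 8, and quality 100 clamps every entry to 1.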

Method Aliases

For CLI-style naming (compatible with rimage conventions):

Alias	Equivalent
`.baseline(true)`	`.progressive(false)`
`.optimize_coding(true)`	`.optimize_huffman(true)`
`.chroma_subsampling(mode)`	`.subsampling(mode)`
`.qtable(idx)`	`.quant_tables(idx)`
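
Aliases like these are typically thin wrappers that forward to the primary method. The sketch below shows the pattern on a hypothetical `SketchEncoder` builder. It is not the crate's `Encoder`, and treating `.baseline(false)` as `.progressive(true)` is an assumption that goes beyond the table above.

```rust
/// Hypothetical builder used only to illustrate the alias pattern.
#[derive(Debug, Default, PartialEq)]
struct SketchEncoder {
    progressive: bool,
    optimize_huffman: bool,
}

impl SketchEncoder {
    /// Primary setter for progressive mode.
    fn progressive(mut self, on: bool) -> Self {
        self.progressive = on;
        self
    }

    /// Alias: baseline is the inverse of progressive.
    fn baseline(self, on: bool) -> Self {
        self.progressive(!on)
    }

    /// Primary setter for Huffman table optimization.
    fn optimize_huffman(mut self, on: bool) -> Self {
        self.optimize_huffman = on;
        self
    }

    /// Alias: CLI-style name that forwards to `optimize_huffman`.
    fn optimize_coding(self, on: bool) -> Self {
        self.optimize_huffman(on)
    }
}
```

Because each alias forwards to one setter, the two spellings cannot drift apart in behavior.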

Performance

Benchmarked on 512x768 image, 20 iterations, release mode:

Configuration	Rust	C mozjpeg	Ratio
Baseline (huffman opt)	7.1 ms	26.8 ms	3.8x faster
Trellis (AC + DC)	19.7 ms	25.3 ms	1.3x faster
Progressive + trellis	20.0 ms	-	-

Note: C mozjpeg's baseline encoding is typically faster thanks to its hand-optimized SIMD entropy coding. The numbers above were measured against mozjpeg-sys from crates.io, which may not have all optimizations enabled.
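
A timing loop in the spirit of the benchmark above might look like the following sketch. It is not the project's benchmark harness: `mean_time` and `speedup` are hypothetical helpers, and the single warm-up call is a design choice of this example.

```rust
use std::time::{Duration, Instant};

/// Run `work` once as a warm-up, then `iterations` more times,
/// and return the mean wall-clock time per timed iteration.
fn mean_time<F: FnMut()>(iterations: u32, mut work: F) -> Duration {
    assert!(iterations > 0, "need at least one iteration");
    work(); // warm-up, not timed
    let start = Instant::now();
    for _ in 0..iterations {
        work();
    }
    start.elapsed() / iterations
}

/// Ratio as reported in the table: how many times faster the Rust
/// encoder is than the C encoder (values below 1.0 mean slower).
fn speedup(rust_ms: f64, c_ms: f64) -> f64 {
    c_ms / rust_ms
}
```

In practice the closure passed to `mean_time` would wrap one full encode call, and the run would use `--release` with 20 iterations to match the setup described above.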

SIMD Support

mozjpeg-rs uses multiversion for automatic vectorization by default. Optional hand-written SIMD intrinsics are available:

[dependencies]
mozjpeg-rs = { version = "0.2", features = ["simd-intrinsics"] }

In benchmarks, the difference is minimal (~2%) as multiversion autovectorization works well for DCT and color conversion.
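
Color conversion is a good fit for autovectorization because it is a straight per-pixel loop with no cross-pixel dependencies. The sketch below uses the standard JFIF RGB to YCbCr equations in floating point. It is only an illustration and does not reflect how the crate implements the conversion (a production encoder would more likely use fixed-point arithmetic).

```rust
/// Convert one RGB pixel to full-range YCbCr using the JFIF equations.
fn rgb_to_ycbcr(r: u8, g: u8, b: u8) -> (u8, u8, u8) {
    let (r, g, b) = (f32::from(r), f32::from(g), f32::from(b));
    let y = 0.299 * r + 0.587 * g + 0.114 * b;
    let cb = -0.168736 * r - 0.331264 * g + 0.5 * b + 128.0;
    let cr = 0.5 * r - 0.418688 * g - 0.081312 * b + 128.0;
    // Round to nearest and clamp into the 8-bit range.
    let to_u8 = |v: f32| v.round().clamp(0.0, 255.0) as u8;
    (to_u8(y), to_u8(cb), to_u8(cr))
}

/// Convert a packed RGB row into three separate planes.
/// The output slices must each hold at least `rgb.len() / 3` samples.
fn convert_row(rgb: &[u8], y: &mut [u8], cb: &mut [u8], cr: &mut [u8]) {
    for (i, px) in rgb.chunks_exact(3).enumerate() {
        let (yy, cbb, crr) = rgb_to_ycbcr(px[0], px[1], px[2]);
        y[i] = yy;
        cb[i] = cbb;
        cr[i] = crr;
    }
}
```

Gray pixels map to Cb = Cr = 128, which is why chroma planes of near-gray images compress so well.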

Differences from C mozjpeg

mozjpeg-rs aims for compatibility with C mozjpeg but has some differences:

Feature	mozjpeg-rs	C mozjpeg
Progressive scan script	Simple 4-scan (or optimize_scans)	9-scan with successive approximation
optimize_scans	Per-scan Huffman tables	Per-scan Huffman tables
Trellis EOB optimization	Available (opt-in)	Available (rarely used)
Smoothing filter	Available	Available
Multipass trellis	Not implemented (poor tradeoff)	Available
Arithmetic coding	Not implemented	Available (rarely used)
Grayscale progressive	Yes	Yes

Why multipass (`use_scans_in_trellis`) is not implemented

C mozjpeg's multipass option makes trellis quantization "scan-aware" for progressive encoding by optimizing low and high frequency AC coefficients separately. Benchmarks on the Kodak corpus (Q85, progressive) show this is a poor tradeoff:

Metric	Without Multipass	With Multipass	Difference
File size	1,760 KB	1,770 KB	+0.52% larger
Quality (butteraugli)	2.59	2.54	-0.05 (imperceptible)
Encoding time	~7ms	~8.5ms	~20% slower

Multipass produces larger files, is slower, and provides no perceptible quality improvement.

Why the file size gap at high quality?

At quality levels above Q85, mozjpeg-rs files are slightly larger than C mozjpeg's (about 1-2% in max compression mode and 2.6-4.9% in progressive mode), due to differences in the progressive scan structure:

C mozjpeg uses a 9-scan successive approximation (SA) script that splits coefficient bits into coarse and fine layers
mozjpeg-rs uses a 4-scan script (DC + full AC for each component) with per-scan optimal Huffman tables

With optimize_scans=true (enabled in max_compression()), mozjpeg-rs produces smaller files than C mozjpeg at Q50-Q75 and is within 0.2% at Q80.
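
To make the scan structure concrete, the sketch below models a progressive scan with the standard JPEG scan parameters (spectral range `ss..=se` and successive approximation bits `ah`/`al`) and builds a 4-scan script of the shape described above. The `Scan` type and function names are hypothetical and are not taken from the crate.

```rust
/// One progressive scan, described by the standard JPEG scan parameters.
#[derive(Debug, Clone, PartialEq)]
struct Scan {
    components: Vec<u8>, // component indices covered by this scan
    ss: u8,              // first coefficient in the spectral band (0 = DC)
    se: u8,              // last coefficient in the spectral band (max 63)
    ah: u8,              // successive approximation: previous bit position
    al: u8,              // successive approximation: current bit position
}

/// One interleaved DC scan, then one full-band AC scan per component.
/// Progressive AC scans must each cover a single component.
fn simple_script(num_components: u8) -> Vec<Scan> {
    let mut scans = vec![Scan {
        components: (0..num_components).collect(),
        ss: 0,
        se: 0,
        ah: 0,
        al: 0,
    }];
    for c in 0..num_components {
        scans.push(Scan { components: vec![c], ss: 1, se: 63, ah: 0, al: 0 });
    }
    scans
}

/// A script uses successive approximation if any scan sends
/// coefficients in more than one bit-plane pass.
fn uses_successive_approximation(script: &[Scan]) -> bool {
    script.iter().any(|s| s.ah != 0 || s.al != 0)
}
```

For a 3-component image this yields 4 scans with no successive approximation. A script like C mozjpeg's adds scans with nonzero `al` and later refinement scans with nonzero `ah`.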

Matching C mozjpeg output exactly

For exact byte-identical output to C mozjpeg, you would need to:

Use baseline (non-progressive) mode
Match all encoder settings exactly
Use the same quantization tables (Robidoux/ImageMagick tables)

The FFI comparison tests in tests/ffi_comparison.rs verify component-level parity.
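
When checking for byte-identical output, it helps to know where two files first diverge. The helper below is a small std-only sketch. It is not part of the crate or its test suite, and the name `first_mismatch` is hypothetical.

```rust
/// Return the offset of the first differing byte, or `None` if the
/// two buffers are identical. A length difference counts as a
/// mismatch at the end of the shorter buffer.
fn first_mismatch(a: &[u8], b: &[u8]) -> Option<usize> {
    match a.iter().zip(b.iter()).position(|(x, y)| x != y) {
        Some(i) => Some(i),
        None if a.len() != b.len() => Some(a.len().min(b.len())),
        None => None,
    }
}
```

An early mismatch usually points at header or table differences (quantization or Huffman tables), while a late one points at the entropy-coded data.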

Development

Running CI Locally

# Format check
cargo fmt --all -- --check

# Clippy lints
cargo clippy --workspace --all-targets -- -D warnings

# Build
cargo build --workspace

# Unit tests
cargo test --lib

# Codec comparison tests
cargo test --test codec_comparison

# FFI validation tests (requires mozjpeg-sys from crates.io)
cargo test --test ffi_validation

Reproduce Benchmarks

# Fetch test corpus (Kodak images, ~15MB)
./scripts/fetch-corpus.sh

# Run full corpus comparison
cargo run --release --example full_corpus_test

# Run pareto benchmark
cargo run --release --example pareto_benchmark

Test Coverage

# Install cargo-llvm-cov
cargo install cargo-llvm-cov

# Generate coverage report
cargo llvm-cov --lib --html

# Open report
open target/llvm-cov/html/index.html

License

BSD-3-Clause - Same license as the original mozjpeg.

Acknowledgments

Based on Mozilla's mozjpeg, which builds on libjpeg-turbo and the Independent JPEG Group's libjpeg.

AI-Generated Code Notice

This crate was developed with significant assistance from Claude (Anthropic). While the code has been tested against the C mozjpeg reference implementation and passes 248 tests including FFI validation, not all code has been manually reviewed or human-audited.

Before using in production:

Review critical code paths for your use case
Run your own validation against expected outputs
Consider the encoder's test suite coverage for your specific requirements

The FFI comparison tests in tests/ffi_comparison.rs and tests/ffi_validation.rs provide confidence in correctness by comparing outputs against C mozjpeg.
