PDFium4j

Compact Java PDF engine on top of Java 25 FFM.

What it does

Open PDF from file path, byte array, or InputStream
Probe and diagnose unreadable, corrupt, or password protected files
Render pages to RGBA buffers at custom DPI
Render bounded previews and thumbnails with memory caps
Extract plain text and per character bounds
Read page size, page count, labels, rotation, permissions, encryption state
Read and write metadata from Info dictionary
Extract raw XMP and parse structured XMP metadata
Generate KOReader-compatible partial MD5 checksums for lightweight file identity
Read bookmarks and links
Read page annotations
Delete pages, insert blank pages, import pages, save documents
Save to path, byte array, or OutputStream

Reliability and safety

No JNI bridge
Direct FFM calls with explicit native resource ownership
Deterministic close semantics for document and page handles
Thread confinement for document and page operations
Strict and recover policy modes
Hard limits for document bytes, render pixels, and render worker count
Typed exceptions with error code mapping

Quick start

import java.nio.file.Path;
import java.io.InputStream;

import org.grimmory.pdfium4j.PdfDocument;
import org.grimmory.pdfium4j.PdfPage;
import org.grimmory.pdfium4j.model.MetadataTag;
import org.grimmory.pdfium4j.model.PdfProcessingPolicy;
import org.grimmory.pdfium4j.model.RenderResult;

PdfProcessingPolicy policy = PdfProcessingPolicy.defaultPolicy()
    .withMaxDocumentBytes(512L * 1024 * 1024)
    .withMaxRenderPixels(40_000_000L)
    .withMaxParallelRenderThreads(8);

// Open from Path, byte[], or InputStream
try (PdfDocument doc = PdfDocument.open(Path.of("book.pdf"), null, policy)) {
  try (PdfPage page = doc.page(0)) {
    RenderResult cover = page.renderThumbnail(512);

    // Zero-allocation text access
    page.withText((segment, length) -> {
        // process raw UTF-16LE text data
    });
  }

  // Zero-allocation metadata and XMP access via streams
  try (InputStream in = doc.metadataStream(MetadataTag.TITLE)) {
      // process raw UTF-16LE bytes from TITLE tag
  }
  try (InputStream xmp = doc.xmpMetadataStream()) {
      // stream large XMP metadata without heap allocation
  }

  var diagnostics = PdfDocument.diagnose(Path.of("book.pdf"));
  var checksum = PdfDocument.koReaderPartialMd5(Path.of("book.pdf"));
  var xmp = doc.xmpMetadataString();
  doc.save(Path.of("output.pdf"));
}

Installation

Note

Release Distribution: PDFium4j does not distribute binary releases or raw .jar files via GitHub Releases. Instead, releases are published exclusively as Maven packages directly to GitHub Packages and Maven Central.

Maven Central (Default)

No special repository configuration is needed if your project already uses mavenCentral().

Gradle (Kotlin DSL)

dependencies {
    implementation("org.grimmory:pdfium4j:1.2.0")
    runtimeOnly("org.grimmory:pdfium4j:1.2.0:natives-linux-x64")
}

Auto-detect platform

val osName = System.getProperty("os.name").lowercase()
val osArch = System.getProperty("os.arch")

val pdfiumNatives = when {
    "linux" in osName && osArch == "amd64"    -> "natives-linux-x64"
    "linux" in osName && osArch == "aarch64"  -> "natives-linux-arm64"
    "mac" in osName   && osArch == "aarch64"  -> "natives-darwin-arm64"
    "mac" in osName   && osArch == "x86_64"   -> "natives-darwin-x64"
    "windows" in osName && osArch == "amd64"  -> "natives-windows-x64"
    else -> error("Unsupported platform: $osName/$osArch")
}

dependencies {
    implementation("org.grimmory:pdfium4j:1.2.0")
    runtimeOnly("org.grimmory:pdfium4j:1.2.0:$pdfiumNatives")
}

Gradle (Groovy DSL)

dependencies {
    implementation 'org.grimmory:pdfium4j:1.2.0'
    runtimeOnly 'org.grimmory:pdfium4j:1.2.0:natives-linux-x64'
}

Maven

<dependency>
    <groupId>org.grimmory</groupId>
    <artifactId>pdfium4j</artifactId>
    <version>1.2.0</version>
</dependency>
<dependency>
    <groupId>org.grimmory</groupId>
    <artifactId>pdfium4j</artifactId>
    <version>1.2.0</version>
    <classifier>natives-linux-x64</classifier>
</dependency>

Available classifiers

Platform	Classifier
Linux x86_64	`natives-linux-x64`
Linux aarch64	`natives-linux-arm64`
macOS aarch64	`natives-darwin-arm64`
macOS x86_64	`natives-darwin-x64`
Windows x86_64	`natives-windows-x64`

Runtime requirements

Java 25+
JVM flags:

--enable-preview --enable-native-access=ALL-UNNAMED

Native libraries

PDFium4j ships prebuilt PDFium binaries as classified JAR artifacts. When the correct native classifier JAR is on the classpath, the library extracts the shared library to a temp directory at startup and loads it automatically.

If classpath loading is unavailable, PDFium4j falls back to System.loadLibrary("pdfium").

Supported platforms

Platform	Classifier
Linux x86_64	`natives-linux-x64`
Linux aarch64	`natives-linux-arm64`
macOS aarch64	`natives-darwin-arm64`
macOS x86_64	`natives-darwin-x64`
Windows x86_64	`natives-windows-x64`

Project structure

pdfium4j/                     Root (Java module)
  src/main/java/              FFM bindings and public API
  src/test/java/              Tests
  build/generated-natives/    Downloaded PDFium binaries (build-time)

Build

./gradlew build


## Native Build Requirements

To build the native shim from source, the following system dependencies are required:

- **CMake** 3.20+
- **C++23 Compiler** (Clang, GCC, or MSVC)
- **ZLIB** (System library)
- **JPEG** (System library, e.g., libjpeg-turbo)

### Installing dependencies:

- **macOS**: `brew install cmake zlib jpeg`
- **Linux (Debian/Ubuntu)**: `sudo apt-get install cmake zlib1g-dev libjpeg-dev`
- **Windows**: Use `vcpkg` or manual installation. For `vcpkg`: `vcpkg install zlib:x64-windows libjpeg-turbo:x64-windows`


## Quality Workflow

Use Java 25 and run the same verification path as CI:

```bash
./gradlew clean check --warning-mode all

This runs formatting and static analysis gates (Spotless, Checkstyle, PMD, SpotBugs), plus tests.

For local iteration:

./gradlew spotlessApply
./gradlew pmdMain pmdTest spotbugsMain spotbugsTest

License

Apache License 2.0. See LICENSE.

Name		Name	Last commit message	Last commit date
Latest commit History 120 Commits
.github		.github
benchmarks		benchmarks
config		config
gradle/wrapper		gradle/wrapper
shim		shim
src		src
.gitattributes		.gitattributes
.gitignore		.gitignore
.sdkmanrc		.sdkmanrc
LICENSE		LICENSE
README.md		README.md
build.gradle.kts		build.gradle.kts
gradlew		gradlew
gradlew.bat		gradlew.bat
settings.gradle.kts		settings.gradle.kts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PDFium4j

What it does

Reliability and safety

Quick start

Installation

Maven Central (Default)

Gradle (Kotlin DSL)

Auto-detect platform

Gradle (Groovy DSL)

Maven

Available classifiers

Runtime requirements

Native libraries

Supported platforms

Project structure

Build

License

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

PDFium4j

What it does

Reliability and safety

Quick start

Installation

Maven Central (Default)

Gradle (Kotlin DSL)

Auto-detect platform

Gradle (Groovy DSL)

Maven

Available classifiers

Runtime requirements

Native libraries

Supported platforms

Project structure

Build

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages