PDF Java Toolkit ™

A native Java library for mission critical PDF workflows

  • Build applications that are interoperable with Adobe products
  • Unparalleled form support
  • Manage digital signatures
  • Redact and sanitize
  • Simplified document creation

Automate business workflow with PDF Java Toolkit

Datalogics PDF Java Toolkit is a native Java library that provides high-level PDF APIs for automating PDF workflows like processing PDF forms, verifying digital signatures, and extracting text. It also offers low-level PDF APIs for working directly with the structure of the PDF for those times you need it. While written with Java developers in mind, Datalogics PDF Java Toolkit can be used with any JVM language (Clojure, Scala, Groovy, etc.) and if you ever need help, our enterprise support team is only an email or phone call away!


Developers can use the PDF Java Toolkit to build applications with functionality, such as:

  • Generate barcode appearances
  • Create text based PDFs
  • Digitally sign a PDF
  • Merge PDFs
  • PDF portfolios
  • Reader enabling/extending PDFs
  • Redaction of PDFs
  • Use Javascript inside of a PDF
  • Process PDF forms
  • Overlay page content onto another page
  • Sanitize PDF documents
  • Create page spreads in a PDF

Reader Extensions and Forms Processing

Adobe Experience Manager Forms can help extend your document and form-based processes with streamlined data capture workflows that reduce errors, processing time, and cost for increased security, accuracy, and compliance. Enable core functions for your end users and customers to:

  • Easily collect and process forms data
  • Unlock forms and additional document features in Adobe Reader
  • Implement barcode creation and data capture workflows

Why Do You Need PDF Java Toolkit?

Redaction is the process of censoring or obscuring a portion of text in a document for legal or security purposes. A typical redaction process may include searching for and redacting specific words or phrases within a PDF document.

Here is an example of a redaction workflow:

1. Identify words or patterns
2. Locate the specific words or patterns
3. Add “blackout” rectangles to obfuscate the content
4. Completely remove the underlying text

A redaction process might also need to remove content contained in document metadata, file attachments, annotations and Javascript. This process is often referred to as sanitization.

It is also important to note that when done correctly, redaction is permanent and inaccessible. Redaction is often used in industries where sensitive personal information needs to be protected. Common redaction-heavy sectors include medical, legal, finance, and government agencies.

Learn More:


One of the advantages of PDF Java Toolkit is that it is entirely Java based and has no native code dependency allowing it to run on any platform that Java is supported on.

  • Windows 32-bit
  • Windows 64-bit
  • Mac 32-bit
  • Mac 64-bit
  • Linux 32-bit
  • Linux 64-bit
  • Solaris Sparc 32-bit
  • Solaris Sparc 64-bit
  • Solaris Intel 32-bit
  • Solaris Intel 64-bit
  • AIX 32-bit
  • AIX 64-bit
  • HP/UX PA-RISC 32-bit
  • HP/UX PA-RISC 64-bit
  • HP/UX Itanium 32-bit
  • HP/UX Itanium 64-bit

What's Inside?

Create and Edit PDFs

Programmatically create PDF documents that meet the strictest standards of the PDF specification. In addition to the basic PDF marking operators used to add text and images to pages, the PDF Java Toolkit includes the RichTextContentGenerator, a lightweight layout engine that accepts HTML formatted rich text as input, formats it, and draws it on the page handling word wrap and font styling automatically.

Regardless of the various versions of PDF you require, including PDF/A, the PDF Java Toolkit can be used to easily place text and images on the page, add marked content operators, embed fonts, and perform all of the other complex functions required to result in an optimized, well performing, highly portable PDF document.

The PDF Java Toolkit can help you quickly extract text, data and images from PDF documents for transfer to other documents or systems. The ReadingOrderTextExtractor performs a vertical and horizontal projection of words in a document.

It then creates a statistical analysis of the common word starting points in documents to find column breaks resulting in the ability to programmatically read the text in the proper order regardless of how the PDF file was generated and when no document structure is available.

For form data, the FDFServices and XFAService provide interfaces to manage and export field and annotation data required for processing by other systems. Images can be easily located, resampled and exported.

Form workflows is where the PDF Java Toolkit truly differentiates itself, it includes support for all three PDF form types: AcroForm, the original PDF interactive forms technology, Dynamic XFA, the XML Forms Architecture, and Static-XFA, an XML-based forms architecture.

Not only are there high level classes to make it easy to import and export data, but more importantly, when data is imported, you won’t need to replicate or even understand the business rules built into the form. The PDF Java Toolkit can behave as a proxy for Acrobat and run the JavaScript calculation and format scripts embedded in the document automatically.

You can even tell the PDF Java Toolkit what version of Acrobat to imitate. There’s also support to allow for plug-in proxies; if the JavaScript requires a particular plug-in before it will execute, “ppklite” for example, the PDF Java Toolkit can tell the document that it’s installed.

In addition, the PDF Java Toolkit can flatten AcroForm and Static XFA and add Reader Enablement for users of older versions of Adobe Reader. The PDF Java Toolkit offers the most comprehensive set of tools for working with PDF forms available.

Use your server to programmatically sign or certify PDF documents or validate signed PDF files. The PDF Java Toolkit supports the creation of signatures compliant with PDF Advanced Electronic Signatures (PAdES), an extension to the digital signatures functionality that allows for Long Term Validation (LTV).

The PDF Java Toolkit supports PDF Packages and PDF Portfolios, multiple files assembled into a single PDF envelope. For example, a PDF Portfolio can include text documents, e-mail messages, spreadsheets, CAD drawings, and PowerPoint presentations.

The original files retain their individual identities but are assembled into one PDF Portfolio file. The PDF Java Toolkit can programmatically create easy-to-use forms that can be submitted by users via web sites, extract data for use in other processes, and prepopulate form data.

The PDF Java Toolkit can read or add notes, stamps and other annotations as well as programmatically sign PDF documents. If you are looking for a Java SDK for your PDF document or forms application, adopting the tools developed by the inventors of PDF itself is the safer choice to future-proof your application and ensure interoperability with Adobe Reader.

The PDF Java Toolkit can add Reader Extensions to your PDF documents which allows users of the free Adobe Reader to better collaborate with your organization while reducing processing costs and errors.

Your users can easily interact with PDF forms and documents without purchasing Acrobat, while your organization improves security, archiving, and compliance. In addition to local save, Reader Extensions lets your document integrate with SOAP-based web services and update or decode 2D barcodes in your forms.


For all other PDF Java Toolkit Developer Resources, visit our Developer Resources area.


Our toolkits can be licensed to software developers who embed the technology into their applications (OEM), as well as for enterprise customers looking to build applications for internal use. Technical Support and regular updates are provided via our Support and Maintenance program – this enables you to keep your application current and compatible with the latest versions of Acrobat/Reader and the PDF Specification as they are released.


Our customers have integrated our toolkits into a vast array of applications with a wide variety of deployment configurations. Pricing for our products is highly situation-dependent but generally includes an initial license fee and per-platform annual maintenance and support fees. In addition, royalties are incurred for those applications intended for sale. Some companies may also qualify for special small business pricing.


All the information you need to make your decision to purchase PDF Java Toolkit is here. Click on the icons to learn more.


We have a tool that allows for automating and extending forms through Adobe Reader.