A native Java library for mission-critical PDF workflows
Datalogics PDF Java Toolkit is a native Java library that provides high-level PDF APIs for automating PDF workflows such as processing PDF forms, verifying digital signatures, and extracting text. It also offers low-level PDF APIs for working directly with the structure of the PDF when you need to. While written with Java developers in mind, Datalogics PDF Java Toolkit can be used with any JVM language (Clojure, Scala, Groovy, etc.). If you ever need help, our enterprise support team is only an email or phone call away!
Developers can use the PDF Java Toolkit to build applications with functionality, such as:
Adobe Experience Manager Forms can help extend your document and form-based processes with streamlined data capture workflows that reduce errors, processing time, and cost for increased security, accuracy, and compliance. Enable core functions for your end users and customers to:
Redaction is the process of censoring or obscuring a portion of text in a document for legal or security purposes. A typical redaction process may include searching for and redacting specific words or phrases within a PDF document.
Here is an example of a redaction workflow:
1. Identify words or patterns
2. Locate the specific words or patterns
3. Add “blackout” rectangles to obfuscate the content
4. Completely remove the underlying text
When done correctly, redaction is permanent: the redacted content is removed from the file and cannot be recovered. Redaction is often used in industries where sensitive personal information needs to be protected. Common redaction-heavy sectors include medical, legal, finance, and government agencies.
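The four steps above can be sketched in plain Java. This is a minimal illustration of the workflow only, not the PDF Java Toolkit API: the Word, Box, and Result types and the redact method are hypothetical names introduced for this example, and the sketch assumes the words and their bounding boxes have already been extracted from a page.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Pattern;

// Illustrative only: every type here is hypothetical and is NOT part of the PDF Java Toolkit API.
final class RedactionSketch {

    /** A word taken from a page, with its bounding box in page units (hypothetical type). */
    record Word(String text, double x, double y, double width, double height) {}

    /** A blackout rectangle to paint over redacted content (hypothetical type). */
    record Box(double x, double y, double width, double height) {}

    /** Outcome for one page: the rectangles to draw and the words that survive. */
    record Result(List<Box> blackouts, List<Word> remainingWords) {}

    static Result redact(List<Word> words, Pattern sensitive) {
        List<Box> blackouts = new ArrayList<>();
        List<Word> remaining = new ArrayList<>();
        for (Word w : words) {
            // Steps 1 and 2: identify the pattern and locate the words that match it.
            if (sensitive.matcher(w.text()).find()) {
                // Step 3: record a blackout rectangle covering the matched word.
                blackouts.add(new Box(w.x(), w.y(), w.width(), w.height()));
                // Step 4: the word is not copied to the output, so its text is gone.
            } else {
                remaining.add(w);
            }
        }
        return new Result(blackouts, remaining);
    }

    public static void main(String[] args) {
        List<Word> page = List.of(
            new Word("SSN:", 72, 700, 30, 12),
            new Word("123-45-6789", 106, 700, 70, 12),
            new Word("on", 180, 700, 12, 12),
            new Word("file", 196, 700, 20, 12));
        Pattern ssn = Pattern.compile("\\d{3}-\\d{2}-\\d{4}");
        Result result = redact(page, ssn);
        System.out.println("Blackout boxes: " + result.blackouts());
        System.out.println("Remaining text: "
            + result.remainingWords().stream().map(Word::text).toList());
    }
}
```

The key design point is that step 4 is not optional: drawing a rectangle alone only hides the text visually, so the sketch drops the matched word from the output rather than keeping it underneath the box.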
One of the advantages of the PDF Java Toolkit is that it is entirely Java-based and has no native code dependencies, so it runs on any platform that supports Java.
Programmatically create PDF documents that meet the strictest standards of the PDF specification. In addition to the basic PDF marking operators used to add text and images to pages, the PDF Java Toolkit includes the RichTextContentGenerator, a lightweight layout engine that accepts HTML-formatted rich text as input, formats it, and draws it on the page, handling word wrap and font styling automatically.
Whichever version of PDF you require, including PDF/A, the PDF Java Toolkit can be used to easily place text and images on the page, add marked content operators, embed fonts, and perform all of the other complex functions required to produce an optimized, well-performing, highly portable PDF document.
The PDF Java Toolkit can help you quickly extract text, data and images from PDF documents for transfer to other documents or systems. The ReadingOrderTextExtractor performs a vertical and horizontal projection of words in a document.
It then creates a statistical analysis of the common word starting points in the document to find column breaks. This makes it possible to programmatically read the text in the proper order regardless of how the PDF file was generated, even when no document structure is available.
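To make the idea of projection-based column detection concrete, here is a simplified sketch. It is not the ReadingOrderTextExtractor implementation: it projects word boxes onto the horizontal axis only, treats any wide empty gutter as a column break, and skips the statistical analysis of word starting points. The Word type, the method names, and the pageWidth and minGap parameters are assumptions made for this example.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

// Illustrative only: a simplified stand-in, NOT the toolkit's ReadingOrderTextExtractor.
final class ReadingOrderSketch {

    /** A word with its left edge, baseline height, and width in page units (hypothetical type). */
    record Word(String text, double x, double y, double width) {}

    /** Projects words onto the x-axis and returns the midpoint of each empty gutter at least minGap wide. */
    static List<Integer> findColumnBreaks(List<Word> words, int pageWidth, int minGap) {
        int[] coverage = new int[pageWidth];
        for (Word w : words) {
            int from = Math.max(0, (int) Math.floor(w.x()));
            int to = Math.min(pageWidth, (int) Math.ceil(w.x() + w.width()));
            for (int i = from; i < to; i++) coverage[i]++;
        }
        // Ignore the page margins: only gaps between covered regions count as gutters.
        int first = 0;
        while (first < pageWidth && coverage[first] == 0) first++;
        int last = pageWidth - 1;
        while (last >= 0 && coverage[last] == 0) last--;

        List<Integer> breaks = new ArrayList<>();
        int gapStart = -1;
        for (int i = first; i <= last; i++) {
            if (coverage[i] == 0) {
                if (gapStart < 0) gapStart = i;
            } else if (gapStart >= 0) {
                if (i - gapStart >= minGap) breaks.add((gapStart + i) / 2);
                gapStart = -1;
            }
        }
        return breaks;
    }

    /** Index of the column a word falls in, counting breaks to its left. */
    static int columnOf(Word w, List<Integer> breaks) {
        int column = 0;
        for (int b : breaks) if (w.x() >= b) column++;
        return column;
    }

    /** Orders words column by column, top to bottom, then left to right. */
    static List<String> readingOrder(List<Word> words, int pageWidth, int minGap) {
        List<Integer> breaks = findColumnBreaks(words, pageWidth, minGap);
        Comparator<Word> order = Comparator
            .comparingInt((Word w) -> columnOf(w, breaks))
            .thenComparingDouble(w -> -w.y())   // PDF y grows upward, so larger y comes first
            .thenComparingDouble(Word::x);
        return words.stream().sorted(order).map(Word::text).toList();
    }

    public static void main(String[] args) {
        // Two-column page, listed in naive left-to-right raster order.
        List<Word> page = List.of(
            new Word("Alpha", 10, 100, 40), new Word("beta", 55, 100, 30),
            new Word("Gamma", 120, 100, 40), new Word("delta", 165, 100, 25),
            new Word("epsilon", 10, 80, 50), new Word("zeta", 65, 80, 25),
            new Word("eta", 120, 80, 25), new Word("theta", 150, 80, 35));
        System.out.println(readingOrder(page, 200, 20));
    }
}
```

In the sample page, a naive extraction would interleave the two columns line by line. The sketch instead reads the whole left column before the right one. A production extractor has to cope with headings that span columns, tables, and ragged gutters, which is where the statistical analysis described above comes in.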
For form data, the FDFServices and XFAService provide interfaces to manage and export field and annotation data required for processing by other systems. Images can be easily located, resampled and exported.
Form workflows are where the PDF Java Toolkit truly differentiates itself. It includes support for all three PDF form types: AcroForm, the original PDF interactive forms technology; Dynamic XFA, the XML Forms Architecture; and Static XFA, an XML-based forms architecture.
In addition, the PDF Java Toolkit can flatten AcroForm and Static XFA and add Reader Enablement for users of older versions of Adobe Reader. The PDF Java Toolkit offers the most comprehensive set of tools for working with PDF forms available.
Use your server to programmatically sign or certify PDF documents or validate signed PDF files. The PDF Java Toolkit supports the creation of signatures compliant with PDF Advanced Electronic Signatures (PAdES), an extension to the digital signatures functionality that allows for Long Term Validation (LTV).
The PDF Java Toolkit supports PDF Packages and PDF Portfolios, multiple files assembled into a single PDF envelope. For example, a PDF Portfolio can include text documents, e-mail messages, spreadsheets, CAD drawings, and PowerPoint presentations.
The original files retain their individual identities but are assembled into one PDF Portfolio file. The PDF Java Toolkit can programmatically create easy-to-use forms that can be submitted by users via web sites, extract data for use in other processes, and prepopulate form data.
The PDF Java Toolkit can read or add notes, stamps and other annotations as well as programmatically sign PDF documents. If you are looking for a Java SDK for your PDF document or forms application, adopting the tools developed by the inventors of PDF itself is the safer choice to future-proof your application and ensure interoperability with Adobe Reader.
The PDF Java Toolkit can add Reader Extensions to your PDF documents, which allow users of the free Adobe Reader to better collaborate with your organization while reducing processing costs and errors.
Your users can easily interact with PDF forms and documents without purchasing Acrobat, while your organization improves security, archiving, and compliance. In addition to local save, Reader Extensions let your documents integrate with SOAP-based web services and update or decode 2D barcodes in your forms.