Datalogics PDF Java Toolkit is a native Java library that provides high-level APIs for automating PDF workflows like processing PDF forms, verifying digital signatures, and extracting text. It also offers low-level APIs for working directly with the structure of the PDF for those times you need it. While written with Java developers in mind, Datalogics PDF Java Toolkit can be used with any JVM language (Clojure, Scala, Groovy, etc.) and if you ever need help, our enterprise support team is only an email or phone call away!
Developers can use the PDF Java Toolkit to build applications with functionality, such as:
One of the advantages of PDF Java Toolkit is that it is entirely Java based and has no native code dependency allowing it to run on any platform that Java is supported on.
Programmatically create PDF documents that meet the strictest standards of the PDF specification. In addition to the basic PDF marking operators used to add text and images to pages, the PDF Java Toolkit includes the RichTextContentGenerator, a lightweight layout engine that accepts HTML formatted rich text as input, formats it, and draws it on the page handling word wrap and font styling automatically. Regardless of the various versions of PDF you require, including PDF/A, the PDF Java Toolkit can be used to easily place text and images on the page, add marked content operators, embed fonts, and perform all of the other complex functions required to result in an optimized, well performing, highly portable PDF document.
Extract Text and Images
The PDF Java Toolkit can help you quickly extract text, data and images from PDF documents for transfer to other documents or systems. The ReadingOrderTextExtractor performs a vertical and horizontal projection of words in a document. It then creates a statistical analysis of the common word starting points in documents to find column breaks resulting in the ability to programmatically read the text in the proper order regardless of how the PDF file was generated and when no document structure is available. For form data, the FDFServices and XFAService provide interfaces to manage and export field and annotation data required for processing by other systems. Images can be easily located, resampled and exported.
PDF Forms Support
Digital Signatures, PDF Portfolios, and More
Use your server to programmatically sign or certify PDF documents or validate signed PDF files. The PDF Java Toolkit supports the creation of signatures compliant with PDF Advanced Electronic Signatures (PAdES), an extension to the digital signatures functionality that allows for Long Term Validation (LTV). The PDF Java Toolkit supports PDF Packages and PDF Portfolios, multiple files assembled into a single PDF envelope. For example, a PDF Portfolio can include text documents, e-mail messages, spreadsheets, CAD drawings, and PowerPoint presentations. The original files retain their individual identities but are assembled into one PDF Portfolio file. The PDF Java Toolkit can programmatically create easy-to-use forms that can be submitted by users via web sites, extract data for use in other processes, and prepopulate form data. The PDF Java Toolkit can read or add notes, stamps and other annotations as well as programmatically sign PDF documents. If you are looking for a Java SDK for your PDF document or forms application, adopting the tools developed by the inventors of PDF itself is the safer choice to future-proof your application and ensure interoperability with Adobe Reader.
The PDF Java Toolkit can add Reader Extensions to your PDF documents which allows users of the free Adobe Reader to better collaborate with your organization while reducing processing costs and errors. Your users can easily interact with PDF forms and documents without purchasing Acrobat, while your organization improves security, archiving, and compliance. In addition to local save, Reader Extensions lets your document integrate with SOAP-based web services and update or decode 2D barcodes in your forms.