JasperReports logo version 7.0.1 Home - Samples - Configuration - Functions - FAQ - API (Javadoc)

JasperReports - PDF/A Conformance Sample

Shows how to generate PDF/A compliant documents.

Main Features in This Sample

Generating PDF/A compliant documents

Generating PDF/A compliant documents

Documented by Sanda Zaharia

Description / Goal
How to use PDF/A conformance settings to generate various levels of PDF/A documents.

Since: 3.1.2

Other Samples
/demo/samples/accessible
/demo/samples/jasper
/demo/samples/table
/demo/samples/tabular

PDF/A conformance overview

In general, PDF/A refers to an ISO standardized set of guidance rules that a rendered PDF document must follow in order to ensure the archiving and long-term safeguarding of that document. The PDF/A standard was necessary in order to address the expanding need to ensure that documents intended for preservation over a long period of time could be retrieved and rendered in a consistent mode in the future. This requirement is essential for a large variety of documents produced in the government, industry and academic environments worldwide.

Some features are mandatory to met the PDF/A standard. For instance: specific color management guidelines, support for embedded fonts, support for reading embedded annotations. All features that are unsuitable for long-term archiving are prohibited. For instance, unembedded fonts, PDF encryption, audio and video content, etc.

The PDF/A standards attempts to maximize:

The constraints include:

PDF/A conformance levels

  1. PDF/A-1b (basic PDF/A-1 conformance level represents the minimal compliance required to ensure the preservability over long terms of the rendered visual appearance of a document). Level B is a lower level of conformance, including the general requirements of ISO 19005-1:2005 standard regarding the visual appearance of electronic documents, but not their internal structure or semantic properties. All constraints described in the section above must be applied to PDF/A-1b compliant documents.
    Also, PDF/A-1b documents may contain only scanned images without any representation of text. OCR text that supports indexing or extraction of text is permitted, but only if it is invisible. This conformance level requires the use of PDF 1.4
  2. PDF/A-1a (accessible PDF/A-1 conformance level includes all the PDF/A-1b rules, plus a set of rules regarding the document accessibility and reading order). PDF/A-1a requires a representation of the logical structure of the document as specified in section 6.8 of ISO 19005-1:2005. It is recommended to capture a document's logical structure hierarchy to the finest granularity possible. The standard also indicates that "PDF/A-1 writers should not add structural or semantic information that is not explicitly or implicitly present in the source material solely for the purpose of achieving conformance." The logical structure of a document will only include the minimal relevant structural tagging.
    In order to obtain persistent accessible documents one should:
    • make use of simple layouts
    • use alternate text for graphics
    • never use color or contrast alone to indicate meaning
    • never convey meaning through design in a way that can't be expressed with the document's text
    • avoid spanning table cells and complex table structures
    • avoid table tags without tabular data
    • avoid illustrations consisted of many small vector graphics
    • avoid elements overlapping
    • avoid background images
      PDF/A-1a requires the use of PDF 1.4.
  3. PDF/A-2b (basic PDF/A-2 conformance level) requires the use of PDF 1.7 (see also ISO 32000-1:2008). Conformance rules are described in ISO 19005-2:2011 and encapsulates all rules for PDF/A-1b. Additionally includes support for:
    • compressed Object and XRef streams (for smaller file sizes)
    • PDF/A-compliant file attachments, portable collections and PDF packages, archiving of sets of documents as individual documents in one file
    • transparency for graphical elements
    • JPEG 2000 compression
      PDF/A-2b imposes a few additional constraints on number of color channels, bit depth, and colorspaces for compatibility with versions of PDF/X and PDF/E current in 2010.
  4. PDF/A-2u (unicode PDF/A-2 conformance level) represents the PDF/A-2b conformance level with the additional requirement that all text in the document have Unicode equivalents.
  5. PDF/A-2a (accessible PDF/A-2 conformance level) defines rules for enhanced accessibility and includes the representation of the logical structure of the document, as specified in section 6.8 of ISO 19005-2:2011.
  6. PDF/A-3b (basic PDF/A-2 conformance level) requires the use of PDF 1.7 (see also ISO 32000-1:2008). Conformance rules are described in ISO 19005-2:2011 and encapsulates all rules for PDF/A-2b. PDF/A-3 is equivalent to PDF/A-2, except for allowing files in any format to be embedded. Each embedded file in a PDF/A-3 document must be identified in a file specification dictionary (as described in section 7.11.3 of ISO 32000-1:2008). The inclusion of a human-readable description of the file is recommended.
  7. PDF/A-3u (unicode PDF/A-3 conformance level) represents the PDF/A-34b conformance level with the additional requirement that all text in the document have Unicode equivalents.
  8. PDF/A-3a (accessible PDF/A-3 conformance level) rules are similar to PDF/A-2a.

For more information and guidance related to PDF/A conformance levels, also see the following:

The conformance level for the generated PDF report can be set in JasperReports using the following configuration properties:

Color Management Guidelines

In order to ensure the same appearance for the document on every device, we need to associate a color management profile to the report. An ICC profile is a set of data that characterizes a color input or output device, or a color space, according to standards promulgated by the International Color Consortium (ICC).
ICC profiles are available for both RGB and CMYK color spaces. They can also be embedded in the generated PDF/A compliant document.

In order to customize the color spaces we can use the following properties:

Additional Properties for PDF/A-1a

There are 2 more properties to be considered in order to ensure the accessibility requirements for PDF/A level "a" documents:

Section 508 conformance in Tagged PDF Reports

A PDF document can be internally structured in order to enable accessibility tools in PDF readers. Based on specific hidden tags in their content, documents can be read aloud to people with disabilities. Tagged PDF files comply with the requirements of the Section 508 of the U.S. Rehabilitation Act (see https://www.section508.gov). JasperReports supports Section 508 compliant PDF documents based on the PDF tags. According to a set of built-in configuration properties, PDF tags required for accessibility are incorporated into the structure of generated documents.

Tagged PDF Output

To enable accessibility in generated documents a set of export parameters are available in JasperReports along with their related exporter hint properties:

Accessibility Tags

The following property can be set per element in JRXML:

net.sf.jasperreports.export.accessibility.tag

This property that specifies the type of accessible content introduced by the element. Possible values are:

All the next consecutive properties can be set per element and share the same set of possible values (excepting the column span and row span properties, which accept numeric values only):

Available properties related to PDF accessibility tags are enumerated below:

The use of these individual tag properties is illustrated in the TabularPdfA1Report JRXML sample report.

Alternate Text for Images and Reading Order

Accessible documents must provide also an alternate text for images and an appropriate reading order. In JasperReports the alternate text for images is specified in the <hyperlinkTooltipExpression/> of the image element.

The reading order is processed using the Z-order of elements as present in the report template. No additional processing is necessary here.

To find out more about tagged PDF output see the "Section 508 conformance" chapter in the JasperReports Ultimate Guide.

Sample Reports

To illustrate various levels of PDF/A conformance, this sample contains the following reports: - PdfA1Report - a PDF/A-1a compliant report. It contains an embedded true type font (LiberationSans), a non transparent image of the JasperReports logo and a SVG image. All images provide alternate texts. It embeds a RGB type ICC profile - PdfA2Report - a PDF/A-2a compliant report. It contains an embedded true type font (LiberationSans), 2 transparent images of the JasperReports logo and some overlapped elements. All images provide alternate texts. It embeds a RGB type ICC profile - PdfA3Report - a PDF/A-3a compliant report, which is in fact similar to PdfA2Report, since there are no other document attachments. - TabularPdfA1Report - a PDF/A-1a compliant report. It contains a table-like structure, with elements tagged individually to generate an accessible PDF table - TaggedPdfA1Report - a PDF/A-1a compliant report. It contains several elements configured for accessibility, and an accessible table element that automatically generates table tags when it is exported to PDF

Running the Sample

Running the sample requires the Apache Maven library. Make sure that maven is already installed on your system (version 3.6 or later). In a command prompt/terminal window set the current folder to demo/samples/pdfa within the JasperReports source project and run the following command:

> mvn clean compile exec:exec@all

It will generate all supported document types containing the sample report in the demo/samples/pdfa/target/reports directory.




© 2001- Cloud Software Group, Inc. www.jaspersoft.com