This project is not covered by Drupal’s security advisory policy.

Drupal 8 can do document management, too :).

This module provides a media source plugin so PDF files can be uploaded as media entities.

Why? It can extract the raw text from a PDF and map it to a text field, so you can index the PDFs with Search API for full-text search. And it comes with nice thumbnails you can present in your media collection.

Requirements: You need to install the Poppler utilities to extract the text and thumbnails from the PDF files.
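Before enabling the module, it can help to confirm that the Poppler command-line tools are on the server's PATH. The following is a minimal sketch, not part of the module: the helper name missing_tools is hypothetical, and it only checks for pdftotext, the one command this page names.

```python
import shutil


def missing_tools(tools=("pdftotext",)):
    """Return the names in `tools` that cannot be found on PATH.

    Hypothetical helper for illustration; it is not part of the module.
    """
    return [name for name in tools if shutil.which(name) is None]


if __name__ == "__main__":
    missing = missing_tools()
    if missing:
        print("Missing Poppler tools:", ", ".join(missing))
    else:
        print("Poppler tools found.")
```

Pass additional names to missing_tools if your setup relies on other Poppler commands for the thumbnails; which one the module calls for thumbnails is not stated here.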

The plugin uses Poppler's pdftotext command with the -layout switch. In this mode you get a text version of the original PDF layout, filled with whitespace to position the text.
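To see what the -layout output looks like outside of Drupal, you can run the same command by hand. The sketch below wraps it in Python; the function names and the file name in the usage comment are assumptions for illustration, and the module's own invocation may pass further options.

```python
import subprocess


def pdftotext_command(pdf_path, output="-"):
    """Build the pdftotext call with -layout; "-" sends the text to stdout."""
    return ["pdftotext", "-layout", pdf_path, output]


def extract_text(pdf_path):
    """Run pdftotext and return the layout-preserving text.

    Raises subprocess.CalledProcessError if pdftotext exits with an error,
    and FileNotFoundError if Poppler is not installed.
    """
    result = subprocess.run(
        pdftotext_command(pdf_path),
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout


# Example usage (assumes a file named invoice.pdf exists):
# print(extract_text("invoice.pdf"))
```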

This module comes with an additional field formatter you can use on the text field to render it in <pre> tags, which keeps the whitespace-based layout intact. This is handy for extracting data by hand, e.g. from PDF invoices.
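The idea behind such a formatter can be sketched in a few lines: escape the extracted text and wrap it in <pre> so the browser preserves the column alignment. This is only an illustration of the concept in Python, with a hypothetical function name; the module's actual formatter is a Drupal plugin and may differ.

```python
import html


def render_pre(text):
    """Escape the text for HTML and wrap it in <pre> to keep its whitespace."""
    return "<pre>" + html.escape(text) + "</pre>"


if __name__ == "__main__":
    sample = "Item        Qty   Price\nWidget        2    9.50"
    print(render_pre(sample))
```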

Project information

  • Module categories: Media
  • Created by shenzhuxi
  • This project is not covered by the security advisory policy.
    Use at your own risk! It may have publicly disclosed vulnerabilities.

Releases