Cannot extract PDF metadata [#314582]

Hello,

File Framework is working great on my intranet, but it cannot extract any information from PDF files, such as author or title. I tried it with a PDF file that does contain this information.
It may be a problem with the handler: on the handlers page (admin/settings/file/handler), the PDF line shows no associated MIME type, although it shows that it is handled by the file_slideshow module. On the MIME types configuration page, however, I can see the application/pdf MIME type associated with file_module.

Do you know how I could retrieve this information?

Thank you.

Comments

Comment #1

miglius commented 1 October 2008 at 23:27

There should not be a MIME type next to the PDF line. So the handlers page is correct.

You have to install "pdfinfo" on your server, and it must be in the PATH of the user the web server runs as. The module checks whether it can find pdfinfo in the PATH; if it does, it executes it and extracts the PDF information from the file.
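
To check this outside Drupal, here is a minimal Python sketch of the same idea: look for pdfinfo on the PATH, run it, and read its "Key: value" output. It is not the module's actual code, and the function names parse_pdfinfo and pdf_metadata are made up for illustration.

```python
import shutil
import subprocess


def parse_pdfinfo(text):
    """Turn pdfinfo's "Key: value" output lines into a dict."""
    info = {}
    for line in text.splitlines():
        # Split on the first colon only, so values such as dates
        # that contain colons stay intact.
        key, sep, value = line.partition(":")
        if sep:
            info[key.strip()] = value.strip()
    return info


def pdf_metadata(path):
    """Return the PDF metadata as a dict, or None if pdfinfo is not on the PATH."""
    exe = shutil.which("pdfinfo")
    if exe is None:
        # Same situation as in this issue: the binary is missing or
        # not visible to the current user.
        return None
    result = subprocess.run(
        [exe, path], capture_output=True, text=True, check=True
    )
    return parse_pdfinfo(result.stdout)
```

Run it as the same user the web server runs as: if pdf_metadata() returns None there, the module will most likely not find pdfinfo either.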

Comment #2

Arto commented 18 February 2009 at 12:12

Issue tags:

+PDF

Comment #3

miglius commented 24 March 2009 at 20:25

Status:

Active

» Postponed (maintainer needs more info)

Have you installed 'pdfinfo' on your server, and do you still have this issue?

"xpdf is gone from CentOS 5. Install poppler:

yum install poppler poppler-utils

as root.

Poppler, a PDF rendering library, it's a fork of the xpdf PDF..."

Comment #7

johanneshahn commented 2 March 2012 at 17:54

Status:

Postponed (maintainer needs more info)

» Closed (cannot reproduce)

Try the latest stable release.
