Package Details: pdfminer 20191125-1

Git Clone URL: https://aur.archlinux.org/pdfminer.git (read-only, click to copy)
Package Base: pdfminer
Description: python3 utils to extract, analyze text data of PDF files. Includes pdf2txt, dumppdf, and latin2ascii
Upstream URL: http://www.unixuser.org/~euske/python/pdfminer/
Licenses: MIT/X
Conflicts: pdfminer, python-pdfminer, python2-pdfminer
Provides: python-pdfminer
Submitter: severach
Maintainer: severach
Last Packager: severach
Votes: 26
Popularity: 0.006158
First Submitted: 2015-09-09 23:57 (UTC)
Last Updated: 2020-01-21 04:30 (UTC)

Dependencies (3)

Required by (20)

Sources (1)

Latest Comments

dreieck commented on 2023-12-05 11:29 (UTC)

Please remove python-pdfminer from the provides array. It only provides python2-pdfminer.

dreieck commented on 2023-12-05 11:28 (UTC) (edited on 2023-12-05 11:28 (UTC) by dreieck)

This one is no longer a replacement for python-pdfminer.

Having this package, instead of python-pdfminer, installed, makes ocrmypdf fail with:

Traceback (most recent call last):
  File "/usr/bin/ocrmypdf", line 5, in <module>
    from ocrmypdf.__main__ import run
  File "/usr/lib/python3.11/site-packages/ocrmypdf/__init__.py", line 10, in <module>
    from ocrmypdf import helpers, hocrtransform, pdfa, pdfinfo
  File "/usr/lib/python3.11/site-packages/ocrmypdf/pdfinfo/__init__.py", line 9, in <module>
    from ocrmypdf.pdfinfo.info import Colorspace, Encoding, PageInfo, PdfInfo
  File "/usr/lib/python3.11/site-packages/ocrmypdf/pdfinfo/info.py", line 44, in <module>
    from ocrmypdf.pdfinfo.layout import LTStateAwareChar, get_page_analysis, get_text_boxes
  File "/usr/lib/python3.11/site-packages/ocrmypdf/pdfinfo/layout.py", line 23, in <module>
    from pdfminer.pdfdevice import PDFTextSeq
ImportError: cannot import name 'PDFTextSeq' from 'pdfminer.pdfdevice' (/usr/lib/python3.11/site-packages/pdfminer/pdfdevice.py)

Regards!

igormp commented on 2019-11-07 14:17 (UTC)

FWIW, there has been some activity on the project. Looks like euske decided to work on it again.

oriba commented on 2019-09-20 22:19 (UTC)

Looks, like this package does not make sense anymore. Last update in 2017. Last comment from 2017 says, it's broken.

And there is https://aur.archlinux.org/packages/python-pdfminer.six/ which seems to be the newest pdfminer-stuff.

So, wouldn't it make sense to remove this package?

untitaker commented on 2017-11-19 19:17 (UTC)

Package is broken because HTTPS is enforced. Please fix.