Package Details: ocrmypdf 16.7.0-1

Git Clone URL: https://aur.archlinux.org/ocrmypdf.git (read-only)
Package Base: ocrmypdf
Description: A tool to add an OCR text layer to scanned PDF files, allowing them to be searched
Upstream URL: https://github.com/ocrmypdf/OCRmyPDF
Licenses: MPL2
Submitter: dreuter
Maintainer: fbrennan (pigmonkey)
Last Packager: pigmonkey
Votes: 125
Popularity: 3.53
First Submitted: 2014-01-27 11:36 (UTC)
Last Updated: 2024-12-10 05:10 (UTC)

Pinned Comments

fbrennan commented on 2023-05-12 22:54 (UTC)

The flag was invalid and has been removed with no action taken as no new version was released. There's nothing to do for this package; no new release has been made. Rebuild, as @eclairevoyant has said.

Latest Comments

jbarlow commented on 2018-08-17 18:49 (UTC)

Aww, thanks everyone. :)

v7.0.3 has been on PyPI for a few days now. Normally GitHub and PyPI are nearly in lockstep, but Travis was having network problems last weekend and failed to deploy v7.0.3 to PyPI (which it normally does for me). PyPI releases are just distributions of the tagged releases on GitHub. It's a little better to use PyPI's sdist since it is smaller than a GitHub checkout.
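
For packagers who want to point a PKGBUILD source at the sdist, here is a minimal sketch of how the URL could be built. The function name is hypothetical, and the sketch assumes the sdist follows the usual name-version.tar.gz naming under files.pythonhosted.org; check the project's files page on PyPI before relying on it.

```shell
# Hypothetical helper: print the conventional PyPI sdist URL for a project.
# Assumption: the sdist is published as <name>-<version>.tar.gz.
pypi_sdist_url() {
    name=$1
    version=$2
    # The path segment after "source/" is the first letter of the project name.
    first=$(printf '%s' "$name" | cut -c1)
    printf 'https://files.pythonhosted.org/packages/source/%s/%s/%s-%s.tar.gz\n' \
        "$first" "$name" "$name" "$version"
}

# Usage:
#   pypi_sdist_url ocrmypdf 7.0.3
```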

bsdice commented on 2018-08-17 16:44 (UTC)

Check https://pypi.org/project/ocrmypdf/#files

fbrennan commented on 2018-08-17 16:36 (UTC)

Thank you everyone. Maintaining the package is the least I could do because I use ocrmypdf a lot, and I found the developer extremely cordial and helpful when I had a problem with it while OCR'ing an Esperanto PDF.

Regarding version 7.0.3: someone flagged the package over this, but that version is not yet on PyPI. As far as I know, what's on GitHub is development, while what's on PyPI is stable, so I'm assuming 7.0.3 is beta since it's not yet on PyPI. As soon as it is on PyPI I will update the PKGBUILD.

If my understanding of this is wrong, feel free to enlighten me.
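
One way to check whether a given version has reached PyPI before flagging or updating is to ask PyPI's JSON endpoint for that release. This is only a sketch: the function names are hypothetical, it assumes curl is installed, and a network failure is reported the same way as a missing release.

```shell
# Hypothetical helper: build the per-release JSON URL for a project and version.
pypi_release_url() {
    printf 'https://pypi.org/pypi/%s/%s/json\n' "$1" "$2"
}

# Hypothetical helper: succeed only if PyPI serves metadata for that release.
# curl --fail returns non-zero on an HTTP error such as 404 (and also on
# network errors, so "non-zero" does not strictly prove the release is absent).
pypi_has_release() {
    curl --fail --silent --location --output /dev/null \
        "$(pypi_release_url "$1" "$2")"
}

# Usage (needs network access):
#   if pypi_has_release ocrmypdf 7.0.3; then
#       echo "on PyPI"
#   else
#       echo "not on PyPI (or the request failed)"
#   fi
```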

sagittarius commented on 2018-08-17 10:24 (UTC) (edited on 2018-08-17 14:33 (UTC) by sagittarius)

Thanks to the maintainers and jbarlow, because this utility is clearly ULTIMATE (necessary, indispensable, decisive for manipulating PDF files).

I've used the git version of img2pdf, rebuilt some AUR packages (python-pikepdf, pybind11, pngquant...), and it works great :D

jbarlow commented on 2018-08-13 20:33 (UTC)

I'm the author of ocrmypdf and pikepdf. Great to see the community here working away on the update. Several changes here are due to deprecated features being removed in Python 3.7.

As of pikepdf 0.3.1, just released today, pybind11.patch will be unnecessary.

A few comments on dependencies compared to https://pastebin.com/84Tb6K6S:

- jbig2enc should be added
- leptonica should be added explicitly (>= 1.76.0, implied by tesseract)
- qpdf should be added explicitly (>= 8.1.0, implied by pikepdf)
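
In PKGBUILD terms, those three suggestions might look roughly like the fragment below. It is a sketch only: it shows just the additions named above, the entries already in the pastebin PKGBUILD are not reproduced, and the package names are assumed to match the Arch/AUR names.

```shell
# PKGBUILD fragment (bash array syntax); not a complete depends list.
depends=(
  # entries already present in the PKGBUILD stay as they are
  'jbig2enc'
  'leptonica>=1.76.0'   # otherwise only pulled in via tesseract
  'qpdf>=8.1.0'         # otherwise only pulled in via pikepdf
)
```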

bsdice commented on 2018-08-13 02:55 (UTC) (edited on 2018-08-13 02:58 (UTC) by bsdice)

@fbrennan Thanks for adopting it! Glad I could help out the community.

Here are two fixes that escaped my attention:

(1) PKGBUILD of ocrmypdf is missing one depends=( ... 'python-reportlab>=3.3.0' ... )

(2) PKGBUILD of python-xmp-toolkit similarly is missing one depends=(... 'python-pytz')

Everything should be checked with namcap -i <PKGBUILD|final .xz> anyhow.

May I also suggest that you ask the pikepdf developer on GitHub why pybind11.patch is needed, and whether it is the correct fix?

fbrennan commented on 2018-08-13 02:22 (UTC) (edited on 2018-08-13 07:20 (UTC) by fbrennan)

Thank you for the guide @bsdice ...

I adopted the package and will push a revised package for 7.0.2. (Unfortunately, I have to wait for the python-ruffus package to either be disowned or updated. I will update as soon as that's done.)

mutantmonkey commented on 2018-08-12 19:34 (UTC)

Unfortunately, I haven't had much time to maintain this package as of late. I'm orphaning it so that someone with more time can take over.

bsdice commented on 2018-08-07 23:42 (UTC)

Finally, another new package called "python-xmp-toolkit" is needed, PKGBUILD: https://pastebin.com/xcngPwUq

I have based the PKGBUILD on the git-package: https://aur.archlinux.org/packages/python-xmp-toolkit-git/

In the end, the software will work again:

$ ocrmypdf --version

7.0.2

bsdice commented on 2018-08-07 23:36 (UTC) (edited on 2018-08-07 23:45 (UTC) by bsdice)

Next, update package "python-ruffus" https://aur.archlinux.org/pkgbase/python-ruffus/ and install the updated package python-ruffus-2.7.0-1-any.pkg.tar.xz (you can imho skip the python2 package).

PKGBUILD diff: https://pastebin.com/z9Zs1wZ7