@mutantmonkey. I'm the author of ocrmypdf. Thanks for maintaining this for Arch Linux and glad you find it useful.
When you have a chance please consider updating the dependencies to match setup.py. For v5.2 the requirements for several packages are higher. At the minimum versions listed it won't work anymore (Tesseract 3.04 is now required, for one thing).
I also suggest qpdf >= 7.0.0 because older versions have known security holes when handling malicious or malformed PDFs.
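As a rough illustration of the minimum-version point above, here is a minimal Python sketch that compares dotted version strings against the two minimums named in this comment (Tesseract 3.04 and qpdf 7.0.0). The helper names `parse_version` and `meets_minimum` and the `MINIMUMS` table are hypothetical, and the comparison is a simplified numeric one, not pacman's own version-comparison logic.

```python
import re

# Minimums taken from the comment above; the dict itself is illustrative.
MINIMUMS = {"tesseract": "3.04", "qpdf": "7.0.0"}


def parse_version(text):
    """Split a dotted version such as '3.04.01' into a tuple of ints: (3, 4, 1)."""
    return tuple(int(part) for part in re.findall(r"\d+", text))


def meets_minimum(installed, minimum):
    """Return True if `installed` is at least `minimum` (numeric comparison only)."""
    a = parse_version(installed)
    b = parse_version(minimum)
    # Pad the shorter tuple with zeros so that '3.04' and '3.04.0' compare equal.
    width = max(len(a), len(b))
    a += (0,) * (width - len(a))
    b += (0,) * (width - len(b))
    return a >= b
```

For example, `meets_minimum("6.0.0", MINIMUMS["qpdf"])` is false, so a packager could treat that version as too old under this simplified rule.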
Package Details: ocrmypdf 16.7.0-1
Git Clone URL: | https://aur.archlinux.org/ocrmypdf.git (read-only) |
---|---|
Package Base: | ocrmypdf |
Description: | A tool to add an OCR text layer to scanned PDF files, allowing them to be searched |
Upstream URL: | https://github.com/ocrmypdf/OCRmyPDF |
Licenses: | MPL2 |
Submitter: | dreuter |
Maintainer: | fbrennan (pigmonkey) |
Last Packager: | pigmonkey |
Votes: | 125 |
Popularity: | 3.53 |
First Submitted: | 2014-01-27 11:36 (UTC) |
Last Updated: | 2024-12-10 05:10 (UTC) |
Dependencies (21)
- ghostscript
- img2pdf (img2pdf-git [AUR])
- pngquant
- python (python37 [AUR], python311 [AUR], python310 [AUR])
- python-deprecation
- python-importlib_resources
- python-packaging
- python-pdfminer (pdfminer [AUR])
- python-pikepdf
- python-pillow
- python-pluggy
- python-reportlab
- python-rich
- python-tqdm
- tesseract (tesseract-git [AUR])
- unpaper (unpaper-git [AUR])
- python-build (make)
- python-hatch-vcs (make)
- python-installer (python-installer-git [AUR]) (make)
- python-wheel (make)
- jbig2enc [AUR] (jbig2enc [AUR], jbig2enc-git [AUR]) (optional) – Better compression algorithm; results in smaller PDF files
Required by (6)
- docspell-joex (optional)
- dpsprep-git (optional)
- phoronix-test-suite-git (optional)
- python-ocrmypdf-papermerge
- riven-original-soundtrack (make)
- stirling-pdf-bin
Latest Comments
jbarlow commented on 2017-11-09 08:15 (UTC)
mutantmonkey commented on 2017-08-12 18:20 (UTC)
The version of img2pdf in AUR is now up-to-date, so using img2pdf-git is no longer necessary.
Both python-ruffus and python-pypdf2 are already listed as dependencies and available in the AUR. If you are having trouble, you should try rebuilding them because you may already have older versions on your system built against an earlier version of Python, which will not work.
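To make the "built against an earlier version of Python" point concrete, here is a minimal Python sketch that lists non-empty `site-packages` directories belonging to other Python 3 minor versions. The function name `stale_site_packages` is hypothetical, and the `python3.X/site-packages` layout under a library root such as `/usr/lib` is an assumption based on the paths shown in the traceback in this thread.

```python
import sys
from pathlib import Path


def stale_site_packages(lib_root, current=None):
    """Return non-empty site-packages dirs under lib_root for other Python versions.

    `current` is a directory name such as 'python3.6'; it defaults to the
    interpreter running this script.
    """
    if current is None:
        current = f"python{sys.version_info.major}.{sys.version_info.minor}"
    stale = []
    for site_dir in sorted(Path(lib_root).glob("python3.*/site-packages")):
        if not site_dir.is_dir():
            continue  # ignore stray files that happen to match the pattern
        # A leftover directory that still holds files suggests packages
        # that were built for that older interpreter.
        if site_dir.parent.name != current and any(site_dir.iterdir()):
            stale.append(site_dir)
    return stale
```

Calling `stale_site_packages("/usr/lib")` and then asking pacman which packages own the reported directories (for instance with `pacman -Qo`) is one way to find candidates for a rebuild.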
rabarrett commented on 2017-08-12 17:20 (UTC)
I got it to run, but only after installing python-pypdf2.
Should this be a dependency?
rabarrett commented on 2017-08-08 18:46 (UTC)
I also tried to clone the git repository for ocrmypdf and install it with makepkg (using the PKGBUILD from here).
In doing so, I had to install 2 packages manually:
[code]
$ makepkg -i
==> WARNING: A package has already been built, installing existing package...
==> Installing package ocrmypdf with pacman -U...
loading packages...
resolving dependencies...
warning: cannot resolve "python-ruffus>=2.6.3", a dependency of "ocrmypdf"
warning: cannot resolve "img2pdf>=0.2.1", a dependency of "ocrmypdf"
:: The following package cannot be upgraded due to unresolvable dependencies:
ocrmypdf
:: Do you want to skip the above package for this upgrade? [y/N]
[/code]
(I installed those two it warned about)
But I still got the same error:
[code]
$ ocrmypdf
Traceback (most recent call last):
  File "/usr/bin/ocrmypdf", line 6, in <module>
    from pkg_resources import load_entry_point
  File "/usr/lib/python3.6/site-packages/pkg_resources/__init__.py", line 3049, in <module>
    @_call_aside
  File "/usr/lib/python3.6/site-packages/pkg_resources/__init__.py", line 3033, in _call_aside
    f(*args, **kwargs)
  File "/usr/lib/python3.6/site-packages/pkg_resources/__init__.py", line 3062, in _initialize_master_working_set
    working_set = WorkingSet._build_master()
  File "/usr/lib/python3.6/site-packages/pkg_resources/__init__.py", line 658, in _build_master
    ws.require(__requires__)
  File "/usr/lib/python3.6/site-packages/pkg_resources/__init__.py", line 972, in require
    needed = self.resolve(parse_requirements(requirements))
  File "/usr/lib/python3.6/site-packages/pkg_resources/__init__.py", line 858, in resolve
    raise DistributionNotFound(req, requirers)
pkg_resources.DistributionNotFound: The 'PyPDF2>=1.26' distribution was not found and is required by ocrmypdf
[/code]
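The error above means the Python interpreter cannot find metadata for a distribution named `PyPDF2`. As a hedged sketch, the check below reports which named distributions the current interpreter cannot see. It uses `importlib.metadata` from the standard library (Python 3.8 and later), not the `pkg_resources` code path in the traceback, and the function name `missing_distributions` is hypothetical.

```python
from importlib import metadata


def missing_distributions(names):
    """Return the subset of `names` with no installed distribution metadata."""
    missing = []
    for name in names:
        try:
            metadata.version(name)  # raises PackageNotFoundError if absent
        except metadata.PackageNotFoundError:
            missing.append(name)
    return missing
```

Running `missing_distributions(["PyPDF2", "ruffus", "img2pdf"])` with the same interpreter that launches `ocrmypdf` would show which of those are not visible to it. This only checks presence, not the version constraints such as `>=1.26`.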
rabarrett commented on 2017-08-08 18:29 (UTC)
It appeared to install correctly for me, but then when I just tried to run it to see the help info, it said there were files "not found." I followed sagittarius's suggestion to install img2pdf-git and it failed to build, complaining:
250 passed
4 xpassed
4 failed (all in cherry/test dir)
3 xfailed
4 skipped
sagittarius commented on 2017-07-26 09:46 (UTC)
With this latest version, I had to replace img2pdf by img2pdf-git to make it work.
mutantmonkey commented on 2017-07-24 02:39 (UTC)
I've updated the dependency for python-img2pdf to img2pdf, as python-img2pdf is a duplicate and img2pdf seems like a more logical name. However, the current version of img2pdf is out-of-date and older than what python-img2pdf provided, so if you run into any weird issues, it may be due to that.
sagittarius commented on 2017-01-23 19:11 (UTC) (edited on 2017-01-23 19:12 (UTC) by sagittarius)
For info, I had to recompile python-ruffus (aur) to make it work.
hason commented on 2016-10-21 08:27 (UTC) (edited on 2016-10-21 08:28 (UTC) by hason)
Please update to the new version 4.2.5.
mutantmonkey commented on 2016-02-23 05:02 (UTC)
I'm aware that this package is out of date. However, after building the new package, it does not appear to work properly on a test PDF file. I'll leave it at the working version for now until I get this figured out.
Pinned Comments
fbrennan commented on 2023-05-12 22:54 (UTC)
The flag was invalid and has been removed with no action taken as no new version was released. There's nothing to do for this package; no new release has been made. Rebuild, as @eclairevoyant has said.