Package Details: paperless-ngx-venv 2.13.5-2

Git Clone URL: https://aur.archlinux.org/paperless-ngx-venv.git (read-only)
Package Base: paperless-ngx-venv
Description: A supercharged version of paperless: scan, index and archive all your physical documents (version with bundled dependencies)
Upstream URL: https://docs.paperless-ngx.com/
Licenses: GPL-3.0-or-later
Conflicts: paperless, paperless-ng, paperless-ngx
Provides: paperless, paperless-ngx-venv
Replaces: paperless-ngx
Submitter: AlphaJack
Maintainer: AlphaJack (atomicfs, shtrophic)
Last Packager: shtrophic
Votes: 27
Popularity: 1.17
First Submitted: 2024-11-02 15:19 (UTC)
Last Updated: 2024-11-28 09:30 (UTC)

Required by (0)

Sources (12)

Latest Comments

shtrophic commented on 2024-11-26 20:17 (UTC)

Yeah, nltk-data is out of date. Even better: it is orphaned, while in extra! :(

pipep commented on 2024-11-26 17:52 (UTC) (edited on 2024-11-26 17:52 (UTC) by pipep)

After switching from paperless-ngx 2.11.6-1 to paperless-ngx-venv 2.13.5-1, I encountered the following error:

[2024-11-26 17:07:48,861] [WARNING] [paperless.tasks] Classifier error:
**********************************************************************
  Resource punkt_tab not found.
  Please use the NLTK Downloader to obtain the resource:
  >>> import nltk
  >>> nltk.download('punkt_tab')

  For more information see: https://www.nltk.org/data.html
  Attempted to load tokenizers/punkt_tab/italian/
  Searched in:
PosixPath('/usr/share/nltk_data')
**********************************************************************

I resolved the issue by downloading the folder punkt_tab from https://www.nltk.org/nltk_data/ and saving it to /usr/share/nltk_data/tokenizers.
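
For anyone wanting to confirm the fix before restarting the services, here is a minimal sketch that lists which punkt_tab languages are present under a given NLTK data directory. The helper name `punkt_tab_languages` is made up for illustration; the directory layout (`tokenizers/punkt_tab/<language>/`) is taken from the error message above.

```python
from pathlib import Path


def punkt_tab_languages(data_dir: Path) -> list[str]:
    """Return the language folders found under <data_dir>/tokenizers/punkt_tab.

    Hypothetical helper: it only inspects the filesystem and does not
    import or call nltk.
    """
    punkt_tab = data_dir / "tokenizers" / "punkt_tab"
    if not punkt_tab.is_dir():
        return []
    return sorted(p.name for p in punkt_tab.iterdir() if p.is_dir())


if __name__ == "__main__":
    # /usr/share/nltk_data is the search path shown in the log above.
    found = punkt_tab_languages(Path("/usr/share/nltk_data"))
    print("punkt_tab languages:", found or "none (resource missing)")
```

An empty result would match the "Resource punkt_tab not found" error. The log itself suggests `nltk.download('punkt_tab')` as an alternative to the manual download; whether that writes to `/usr/share/nltk_data` depends on permissions and on nltk's download directory settings, so treat that as something to verify.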

AlexBocken commented on 2024-11-15 15:51 (UTC)

When running this behind a proxy with header authentication, it seems I get a ModuleNotFoundError:

journalctl -xeu paperless-webserver results in:

Nov 15 16:41:03 seenas gunicorn[46190]:   File "<frozen importlib._bootstrap>", line 1360, in _find_and_load
Nov 15 16:41:03 seenas gunicorn[46190]:   File "<frozen importlib._bootstrap>", line 1331, in _find_and_load_unlocked
Nov 15 16:41:03 seenas gunicorn[46190]:   File "<frozen importlib._bootstrap>", line 935, in _load_unlocked
Nov 15 16:41:03 seenas gunicorn[46190]:   File "<frozen importlib._bootstrap_external>", line 995, in exec_module
Nov 15 16:41:03 seenas gunicorn[46190]:   File "<frozen importlib._bootstrap>", line 488, in _call_with_frames_removed
Nov 15 16:41:03 seenas gunicorn[46190]:   File "/usr/share/paperless/src/paperless/asgi.py", line 16, in <module>
Nov 15 16:41:03 seenas gunicorn[46190]:     from paperless.urls import websocket_urlpatterns  # noqa: E402
Nov 15 16:41:03 seenas gunicorn[46190]:     ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Nov 15 16:41:03 seenas gunicorn[46190]:   File "/usr/share/paperless/src/paperless/urls.py", line 20, in <module>
Nov 15 16:41:03 seenas gunicorn[46190]:     from documents.views import AcknowledgeTasksView
Nov 15 16:41:03 seenas gunicorn[46190]:   File "/usr/share/paperless/src/documents/views.py", line 165, in <module>
Nov 15 16:41:03 seenas gunicorn[46190]:     from paperless_mail.oauth import PaperlessMailOAuth2Manager
Nov 15 16:41:03 seenas gunicorn[46190]:   File "/usr/share/paperless/src/paperless_mail/oauth.py", line 7, in <module>
Nov 15 16:41:03 seenas gunicorn[46190]:     from httpx_oauth.clients.google import GoogleOAuth2
Nov 15 16:41:03 seenas gunicorn[46190]: ModuleNotFoundError: No module named 'httpx_oauth'

A bit confusing to me since it seems like httpx-oauth is explicitly listed in the requirements.txt. Any idea what this could be? I'd love to help you out in triage if needed.
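
One way to narrow this down is to check whether the module is importable from the interpreter the service actually uses. The following is a rough sketch, assuming it is run with the venv's own Python (the path `/usr/lib/paperless/bin/python` is a guess based on the paths elsewhere in this thread); `missing_modules` is a hypothetical helper name.

```python
import importlib.util
import sys


def missing_modules(names: list[str]) -> list[str]:
    """Return the top-level module names that this interpreter cannot locate."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    # httpx_oauth is the module named in the traceback above.
    print("interpreter:", sys.executable)
    print("missing:", missing_modules(["httpx_oauth"]))
```

If the module shows up as missing here, the venv was probably built without it; if it is found, the service may be starting a different interpreter than expected.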

atomicfs commented on 2024-11-05 09:53 (UTC)

nevermind, my oversight

atomicfs commented on 2024-11-05 08:20 (UTC)

@shtrophic sorry, but what the heck? Did you just force-push stuff? Why?

shtrophic commented on 2024-11-03 18:41 (UTC) (edited on 2024-11-03 18:41 (UTC) by shtrophic)

I pushed some patches about the warnings caused by whoosh. OK?

atomicfs commented on 2024-11-02 15:52 (UTC)

I suspect it is an issue with whoosh; its last release was in 2016 and it seems to be abandoned.

shtrophic commented on 2024-11-02 15:46 (UTC) (edited on 2024-11-02 15:46 (UTC) by shtrophic)

Not really... The only possibly breaking changes in the patched PKGBUILD are:

find "$pkgdir/usr/lib/paperless" -type f -exec sed -i "s|$srcdir/venv|/usr/lib/paperless|g" {} +
find "$pkgdir/usr/lib/paperless" -type d -name "__pycache__" | xargs rm -rf

in package().
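
For readers less familiar with `find`/`sed`, the two commands can be sketched roughly in Python as below. This is an illustration only, not what the PKGBUILD runs: `relocate_venv` and its parameters are hypothetical, and like the `sed` call it rewrites the prefix in every regular file, binary or not.

```python
import os
import shutil
from pathlib import Path


def relocate_venv(root: Path, build_prefix: str, final_prefix: str) -> int:
    """Replace build_prefix with final_prefix in all regular files under root
    and delete every __pycache__ directory. Returns the number of files changed.
    """
    old, new = build_prefix.encode(), final_prefix.encode()
    changed = 0
    for dirpath, dirnames, filenames in os.walk(root):
        # Remove bytecode caches and do not descend into them.
        if "__pycache__" in dirnames:
            shutil.rmtree(Path(dirpath) / "__pycache__")
            dirnames.remove("__pycache__")
        for name in filenames:
            path = Path(dirpath) / name
            if path.is_symlink():
                continue  # find -type f skips symlinks as well
            data = path.read_bytes()
            if old in data:
                path.write_bytes(data.replace(old, new))
                changed += 1
    return changed
```

The point of both steps is that a venv records the absolute path it was created in (for example in its activation scripts), and cached bytecode can carry the build path too, so both need fixing once the files move to `/usr/lib/paperless`.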

atomicfs commented on 2024-11-02 15:27 (UTC)

I just pushed 2 fixes

AlphaJack commented on 2024-11-02 15:14 (UTC)

@shtrophic I tried your patched version and I can launch paperless, but I get a lot of

/usr/lib/paperless/lib/python3.12/site-packages/whoosh/analysis/filters.py:50: SyntaxWarning: invalid escape sequence '\w'
  url_pattern = rcompile("""
/usr/lib/paperless/lib/python3.12/site-packages/whoosh/analysis/filters.py:148: SyntaxWarning: invalid escape sequence '\S'
  """Interleaves the results of two or more filters (or filter chains).
/usr/lib/paperless/lib/python3.12/site-packages/whoosh/analysis/intraword.py:37: SyntaxWarning: invalid escape sequence '\S'
  """Given a set of words (or any object with a ``__contains__`` method),
/usr/lib/paperless/lib/python3.12/site-packages/whoosh/analysis/intraword.py:224: SyntaxWarning: invalid escape sequence '\S'
  """Splits words into subwords and performs optional transformations on
/usr/lib/paperless/lib/python3.12/site-packages/whoosh/analysis/intraword.py:285: SyntaxWarning: invalid escape sequence '\|'
  def __init__(self, delims=u("-_'\"()!@#$%^&*[]{}<>\|;:,./?`~=+"),

before I see paperless output. Do you know why?
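
For context, these warnings appear when Python compiles a normal (non-raw) string literal containing a backslash sequence it does not recognise, such as `\w` or `\S`. Python 3.12 reports this as a SyntaxWarning, where older versions used a DeprecationWarning that is hidden by default. A small sketch to reproduce it outside of whoosh follows; `escape_warnings` is a name made up for this example.

```python
import warnings


def escape_warnings(source: str) -> list[str]:
    """Compile source and return the messages of any warnings it triggers.

    Accepts both SyntaxWarning (Python 3.12+) and DeprecationWarning
    (older versions), since the category changed between releases.
    """
    with warnings.catch_warnings(record=True) as caught:
        warnings.simplefilter("always")
        compile(source, "<demo>", "exec")
    return [
        str(w.message)
        for w in caught
        if issubclass(w.category, (SyntaxWarning, DeprecationWarning))
    ]


plain = 'pattern = "\\w+"'   # source text: pattern = "\w+"
raw = 'pattern = r"\\w+"'    # source text: pattern = r"\w+"

if __name__ == "__main__":
    print("plain string:", escape_warnings(plain))
    print("raw string:  ", escape_warnings(raw))
```

The raw-string variant compiles without the warning, which is the usual fix for regex patterns and docstrings like the ones in the whoosh files listed above.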