Package Details: python-vllm-cuda 0.6.4.post1-3

Git Clone URL: https://aur.archlinux.org/python-vllm-cuda.git (read-only, click to copy)
Package Base: python-vllm-cuda
Description: faster implementation for TTS models, to be used in highly async environment - cpu version
Upstream URL: https://github.com/vllm-project/vllm
Licenses: Apache-2.0
Conflicts: python-vllm
Provides: python-vllm
Submitter: envolution
Maintainer: envolution
Last Packager: envolution
Votes: 0
Popularity: 0.000000
First Submitted: 2024-12-01 16:12 (UTC)
Last Updated: 2024-12-01 16:12 (UTC)

Pinned Comments

envolution commented on 2024-12-01 16:15 (UTC)

increase $_jobs to use more compilation threads - defaults to 3 which is as many as you'd want with ~16gb RAM

Latest Comments

envolution commented on 2024-12-01 16:15 (UTC)

increase $_jobs to use more compilation threads - defaults to 3 which is as many as you'd want with ~16gb RAM