@petronny Thanks for the suggestion. I Updated the PKGBUILD.
Search Criteria
Package Details: python-apex-git 22.03.r245.g59b80ee8-1
Package Actions
Git Clone URL: | https://aur.archlinux.org/python-apex-git.git (read-only, click to copy) |
---|---|
Package Base: | python-apex-git |
Description: | A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch |
Upstream URL: | https://github.com/NVIDIA/apex |
Keywords: | pytorch |
Licenses: | BSD |
Conflicts: | python-apex |
Provides: | python-apex |
Submitter: | leomao |
Maintainer: | petronny |
Last Packager: | petronny |
Votes: | 0 |
Popularity: | 0.000000 |
First Submitted: | 2018-12-14 06:07 (UTC) |
Last Updated: | 2024-08-02 07:00 (UTC) |
Dependencies (8)
- cuda (cuda11.1AUR, cuda-12.2AUR, cuda12.0AUR, cuda11.4AUR, cuda11.4-versionedAUR, cuda12.0-versionedAUR)
- python (python37AUR, python311AUR, python310AUR)
- python-pytorch-cuda (python-pytorch-mkl-cuda-gitAUR, python-pytorch-cxx11abi-opt-cudaAUR, python-pytorch-opt-cuda)
- python-sympy (python-sympy-gitAUR)
- git (git-gitAUR, git-glAUR) (make)
- numactl (numactl-gitAUR) (make)
- python-pip (make)
- python-setuptools (make)
Required by (4)
- fairseq (requires python-apex) (optional)
- fairseq-git (requires python-apex) (optional)
- python-pytorch-lightning (requires python-apex) (optional)
- python-pytorch-lightning-git (requires python-apex) (optional)
Sources (2)
Latest Comments
« First ‹ Previous 1 2
leomao commented on 2019-08-05 04:03 (UTC) (edited on 2019-08-05 04:04 (UTC) by leomao)
petronny commented on 2019-08-05 03:59 (UTC)
Also, git
should be in makedepends
.
petronny commented on 2019-08-04 05:48 (UTC)
It shouldn't be an any
package since it depends on cuda
.
Please set arch
to ('x86_64')
.
leomao commented on 2019-04-12 09:55 (UTC) (edited on 2019-04-12 09:57 (UTC) by leomao)
Please check https://github.com/NVIDIA/apex/issues/212. Currently, I don't have a solution with pytorch/pytorch-cuda in the community repo...
For now, I compile pytorch master myself...
drr21 commented on 2019-04-09 16:05 (UTC)
I get this warning when I use apex.amp:
'Warning: multi_tensor_applier fused unscale kernel is unavailable, possibly because apex was installed without --cuda_ext --cpp_ext. Using Python fallback. Original ImportError was: ImportError('/usr/lib/python3.7/site-packages/amp_C.cpython-37m-x86_64-linux-gnu.so: undefined symbol: _ZN3c105ErrorC1ENS_14SourceLocationERKSs')'
hottea commented on 2019-03-22 08:41 (UTC)
@leomao see syncbn for syncbn example. Actually, see this issue. It seems that pytorch appends -D_GLIBCXX_USE_CXX11_ABI=0
to compiler flags by default. I don't see a way to override it. And according to pytorch's PKGBUILD, there is no modify related to this flag. I believe that pytorch is build with -D_GLIBCXX_USE_CXX11_ABI=0
, which is the default behavior of pytorch official configuration. So it would be OK to build apex extension with the same flag, aka -D_GLIBCXX_USE_CXX11_ABI=0
. However, it's not. I try to build build apex with -D_GLIBCXX_USE_CXX11_ABI=1
by manually replace all -D_GLIBCXX_USE_CXX11_ABI=0
to -D_GLIBCXX_USE_CXX11_ABI=1
in /usr/lib/python3.7/site-packages/torch/utils/cpp_extension.py
, and it works as expected. However, one should not expect to modify this cpp_extension.py
during building apex with devtools, right?
leomao commented on 2019-02-25 03:10 (UTC)
Hi @hottea, thanks for reporting the issue. Could you provide a code snippet for testing? I just checked that the examples and tests ran without errors.
hottea commented on 2019-02-25 02:42 (UTC)
c++filt _ZN3c105ErrorC1ENS_14SourceLocationERKSs
gives me:
c10::Error::Error(c10::SourceLocation, std::basic_string<char, std::char_traits<char>, std::allocator<char> > const&)
and then I found https://github.com/pytorch/pytorch/issues/13541, it seems we need to add -D_GLIBCXX_USE_CXX11_ABI=0
when compiling apex.
hottea commented on 2019-02-25 02:04 (UTC)
I got this warning:
Warning: using Python fallback for SyncBatchNorm, possibly because apex was ins
talled without --cuda_ext. The exception raised when attempting to import the c
uda backend was: /usr/lib/python3.7/site-packages/syncbn.cpython-37m-x86_64-lin
ux-gnu.so: undefined symbol: _ZN3c105ErrorC1ENS_14SourceLocationERKSs
It seems that it failed to build with --cuda_ext
? Or maybe there is something wrong with libs?
Pinned Comments
petronny commented on 2022-08-31 06:43 (UTC)
Prebuilt binaries of this package can be found in the arch4edu repository.