@brikler you have to understand the package. check ZZ-lto.sh.in makeflags.conf
after installation you have to check the makepkg.conf and you have the optimizations commented out. so you can replace your original file with the optimizations.
RTM
Git Clone URL: | https://aur.archlinux.org/makepkg-optimize.git (read-only, click to copy) |
---|---|
Package Base: | makepkg-optimize |
Description: | Supplemental build and packaging optimizations for makepkg |
Upstream URL: | https://wiki.archlinux.org/index.php/Makepkg-optimize |
Licenses: | GPL |
Submitter: | quequotion |
Maintainer: | quequotion |
Last Packager: | quequotion |
Votes: | 17 |
Popularity: | 0.000006 |
First Submitted: | 2016-03-20 15:08 (UTC) |
Last Updated: | 2022-01-09 15:05 (UTC) |
@brikler you have to understand the package. check ZZ-lto.sh.in makeflags.conf
after installation you have to check the makepkg.conf and you have the optimizations commented out. so you can replace your original file with the optimizations.
RTM
if found this options perhaps it is interesting for makepkg-optimize?
If you specify the optional n, the optimization and code generation done at link time is executed in parallel using n parallel jobs by utilizing an installed make program. The environment variable MAKE may be used to override the program used.
You can also specify -flto=jobserver to use GNU make’s job server mode to determine the number of parallel jobs. This is useful when the Makefile calling GCC is already executing in parallel. You must prepend a ‘+’ to the command recipe in the parent Makefile for this to work. This option likely only works if MAKE is GNU make. Even without the option value, GCC tries to automatically detect a running GNU make’s job server.
Use -flto=auto to use GNU make’s job server, if available, or otherwise fall back to autodetection of the number of CPU threads present in your system.
@quequotion: I saw this first in Alads commit that strips parallel
off of aur-srcver
particular this line showing wait -n
usage.
Fix the case statement in upx
and add multiprocessing
to upx
optipng
and svgo
git am < <(curl -s 'https://github.com/bartoszek/AUR-makepkg-optimize/commit/f1964eb728aee09524357d58585037b18ea50725.patch')
Inspired by aurutils
pacman=5.2
just landed in [core]
, you can update depends[]
@brikler
Finally had a chance to test this in the wild; checking $CC
and $CXX
only really works if they are set in makepkg.conf. Packages (see pcsx2-git) may set them in build()
, after the libmakepkg scripts have run; the only solution seems to be having distinct options for gcc and clang; ie lto-clang
and lto-thin-clang
...
EDIT: There are now separate options for clang: lto-clang, lto-thin-clang, polly, rice-clang. I'm having a heck of a time proving they work, but they should. polly
will not, because it requires the polly package for llvm, which is only available in llvm-git (aur).
@brikler
You still need to follow the instructions on the wiki to enable the clang compiler (set CC
and CXX
, etc), but ZZ-lto.sh.in will now set the appropriate flags depending on which compiler you have enabled.
I've also added a new build environment option lto-thin. This does -flto=thin
for clang or uses the default behavior for gcc (formerly -fwhopr
).
The build environment option lto will do monolithic link time optimization for both; this can result in greater optimization, but requires greater resources and takes more time.
I looked into the compatibility of the pgo build environment option's flags, and it seems like they should be fine as is; clang claims to support the gcc flags in addition to its own.
@quequotion i don't know how makepkg decide the compiler to use but it should be possible to catch this choice before the compiler works because some packages where compiled with clang by default, as example mesa
@brikler, I read it!
clang's -flto=thin
attempts to make lto more scalable by breaking up compilation (using less memory at the cost of somewhat less thorough optimization), similar to gcc's -fwhopr
. This is sometimes necessary when compiling very large programs with lto. This may also decrease compile time, as less effort is made to find optimization opportunities in distant parts of the code. EDIT: I have since learned that clang's lto uses a number of threads equal to the number of cores by default, and the flag to control this is --thinlto-jobs=N
.
gcc's -flto=n
parallelizes the optimization threads, which may also decrease compile time but requires a significantly greater amount of free memory as it searches through larger sections of code (and in multiple threads). EDIT: I have since learned that -fwhopr
was scrapped and incorporated as the default behavior of -flto=n
; the original behavior can be restored by additionally specifying -flto-partition=none
They are both lto methods, but their goals and results differ.
-flto=thin
is not supported by gcc, and -flto=n
is not supported by clang. In order to support both, I'd have to come up with some means (in ''lto.sh.in'') of distinguishing which compiler is intended to be used. Perhaps there is something in makepkg's code I could use.
There are actually a lot of other lto-related flags to consider as well. I might make a few other changes to the lto macro, but it's troublesome that the compilers are using mutually incompatible options (makes my job harder).
On a hunch, I went looking for pgo in clang, and found that they are using different flags on the command line and different terms in their documentation. Rather than "profile guided optimization", clang's devs are calling it "profiling with instrumentation".
At the moment makepkg-optimize's macros can only be said to support gcc. I'll look into adding support for clang, and making sure gcc-specific flags aren't getting sent to non-compliant compilers.
Edit: Looking at the archwiki page for clang, in the header it mentions having to edit makepkg.conf to set up the compiler. Doesn't seem like makepkg has any built-in mechanism to set the compiler. This could be an opportunity for libmakepkg--perhaps something like a supplemental buildenv script could be used to switch compilers and filter flags. Not yet sure if I'm talking about a few scripts (maybe another aur package, ie 'makepkg-compilers') or a massive change to makepkg itself (to be proposed, revised, argued, and fought over on the mailing list; then perhaps, someday, implemented).
Pinned Comments
quequotion commented on 2019-02-27 07:49 (UTC) (edited on 2022-01-09 13:20 (UTC) by quequotion)
makepkg-optimize is a collection of libmakepkg tidy, buildenv, and executable extensions, and a supplement to pacman. These enable various optimization routines for building and packaging such as upx compression, profile guided optimization, polyhedral model optimization, etc..
Note: Over-optimization is a thing, and it is not good.
See the wiki article for details.
Note to packagers: makepkg-optmize's macros may be enabled or disabled in options() as well!