From Fedora Project Wiki
(→‎Scope: more precise statistic over whole distribution)
Line 47: Line 47:
A small statistic on Fedora rawhide x86_64 (2009-01-22) to give an idea how many packages/files/bytes could be affected:
A small statistic on Fedora rawhide x86_64 (2009-01-22) to give an idea how many packages/files/bytes could be affected:


The files are split into 3 groups:
All files where put into one of the following categories:  
* bin32: 32 bit binaries including libraries(!) (as known to rpm, file color==1)
* bin64: 64 bin binaries including libraries (file color==2)
* lib32: other files in /lib or /usr/lib
* lib64: other files in /lib64 or /usr/lib64
* noarch: everything else


# binary: files rpm knows that they are arch dependent
Sizes are (uncompressed) bytes in files and though do not directly map to the size of packages nor to used disk space.
# libdir: files that are not binaries but reside in (/usr)/lib(64)
# noarch: everything else


Libdir files should be noarch in most cases. Sizes are (uncompressed) bytes in files and though do not directly map to the size of packages nor to used disk space.
<pre>
15560 packages (44 GB in 2.0 M files)
        11 k bin32 files (2.1 GB)
        31 k bin64 files (6.8 GB)
      142 k lib32 files (1.7 GB)
      161 k lib64 files (5.4 GB)
      1.7 M noarch files (28 GB)
 
8906 x86_64 packages (25 GB in 1.0 M files)
        31 k bin64 files (6.8 GB)
        21 k lib32 files (503 MB)
      161 k lib64 files (5.4 GB)
      828 k noarch files (13 GB)


<pre>
3489 noarch packages (14 GB in 763 k files)
15204 packages (44 GB in 2.0 M files, 100%)
        88  bin32 files (2.3 MB)
        41 k binary files (8.8 GB, ~20%)
        87 k lib32 files (648 MB)
      298 k libdir files (7.1 GB, ~16%)
       676 k noarch files (13 GB)
       1.7 M noarch files (28 GB, ~64%)


8762 x86_64 packages (25 GB in 1.0 M files, 100%)
3163 i386 packages (5.4 GB in 282 k files)
        31 k binary files (6.7 GB, ~27%)
         10 k bin32 files (2.1 GB)
      182 k libdir files (5.9 GB, ~24%)
         34 k lib32 files (571 MB)
      826 k noarch files (12 GB, ~48%)
       237 k noarch files (2.7 GB)
3132 i386 packages (5.3 GB in 280 k files, 100%)
         10 k binary files (2.0 GB, ~38%)
         32 k libdir files (551 MB, ~11%)
       237 k noarch files (2.7 GB, ~51%)
3308 noarch packages (13 GB in 755 k files, 100%)
        88  binary files (2.3 MB, ~0.2%)
        84 k libdir files (635 MB, ~5%)
      671 k noarch files (12 GB, 95%)
</pre>
</pre>
903 (sub) packages in 571 source packages could be directly switched to noarch (filtering out 32 bit packages): [[media:NoarchCandidates.txt]]. These are all x86_64 packages that do neither contain binary files (as known to rpm) nor files in (/usr)/lib64/.


== Test Plan ==
== Test Plan ==

Revision as of 11:49, 14 February 2009

Support Noarch Sub Packages in Fedora

Summary

Since some months RPM supports sub packages being noarch. Right now the Fedora infrastructure does not support this feature. This feature will provide the technical abilities to use noarch sub packages and also provide help to use them within packages all over the distribution.

Owner

Current status

  • Targeted release: Fedora 11
  • Last updated: --Ffesti 20:40, 12 February 2009 (UTC)--Ffesti 19:17, 29 January 2009 (UTC)
  • Percentage of completion: 33%

Detailed Description

There are several steps needed:

  • Support in rpm (100%)
  • Support in koji (75%)
    • see Ticket
    • Fedora infrastructure still needs to be updated
  • Support in other parts of the infrastructure (unknown)
  • Support in test/verification tools (unknown)
    • rpmlint (?)
    • ... (?)
  • Get a list of possible candidates (sub packages) (100%)
  • Write a mail to f-d-l and package owners (33%)
  • Write best practise documentation (0%)
  • Get packaging policy adjusted (see /PolicyChanges) (10%)
  • Get the /PackagesChanged

Benefit to Fedora

Noarch packages have several benefits over arch dependent packages:

  • They can be shared between different architectures and thus use up less disk space and bandwidth on both the Fedora infrastructure and our mirrors
  • They avoid double installation of data for multilib packages.
  • They tell the user that the content of the package is arch independent.

By increasing the use of noarch packages we also increase the effect of these benefits.

Additionally we can get rid of some hacks that are used to generate noarch sub packages for very few packages right now.

Scope

A small statistic on Fedora rawhide x86_64 (2009-01-22) to give an idea how many packages/files/bytes could be affected:

All files where put into one of the following categories:

  • bin32: 32 bit binaries including libraries(!) (as known to rpm, file color==1)
  • bin64: 64 bin binaries including libraries (file color==2)
  • lib32: other files in /lib or /usr/lib
  • lib64: other files in /lib64 or /usr/lib64
  • noarch: everything else

Sizes are (uncompressed) bytes in files and though do not directly map to the size of packages nor to used disk space.

15560 packages (44 GB in 2.0 M files)
        11 k bin32 files (2.1 GB)
        31 k bin64 files (6.8 GB)
       142 k lib32 files (1.7 GB)
       161 k lib64 files (5.4 GB)
       1.7 M noarch files (28 GB)

8906 x86_64 packages (25 GB in 1.0 M files)
        31 k bin64 files (6.8 GB)
        21 k lib32 files (503 MB)
       161 k lib64 files (5.4 GB)
       828 k noarch files (13 GB)

3489 noarch packages (14 GB in 763 k files)
         88  bin32 files (2.3 MB)
        87 k lib32 files (648 MB)
       676 k noarch files (13 GB)

3163 i386 packages (5.4 GB in 282 k files)
        10 k bin32 files (2.1 GB)
        34 k lib32 files (571 MB)
       237 k noarch files (2.7 GB)

Test Plan

  1. Create one noarch subpackage by adding BuildArch: noarch to the subpackage section
  2. Scratch build the package to see whether there are any problems with koji
  3. Build package for rawhide - check that it correctly shows up in the repository and is shown as noarch package in the metadata
  4. See if the package installs correctly via yum
  5. Check if updating from a arch dependent previous version to the new noarch package works

User Experience

  • Slightly improved mirrors due to less transfer size
  • Only packages containing binaries will be arch dependent

Dependencies

  • rpm >= 4.6.0 (is in Fedora since months when counting release candidates)
  • the steps listed in the #Detailed Description.

Contingency Plan

  • Move target to Fedora 12
  • As soon as the technical problems have been fixed moving more sub packages to noarch can be a continuing process.

Documentation

What's this all about?

With version 4.6.0 RPM supports subpackages being noarch by just adding "BuildArch: noarch" to their subpackage section in the spec file.

The noarch subpackages built on the different arches are going to be compared by koji with rpmdiff ignoring time stamp, size and md5 sums of files. If any other differences are found the build will be rejected. Even with those automatic checks in place it is the responsibility of the packager to make sure that the package is really arch independent - as for regular noarch packages, too.

Candidates for being switched to noarch

To get a list with good candidates all x86_64 packages that contain no binaries/libs (as known to rpm) and no files in /lib64 or /usr/lib64 were selected as a starting point. To further refine the selection and get an idea what can go wrong rpmdiff was run against the i386 sister packages - both with the relaxed koji and the strict -t settings. This showed a small number of false positives - mostly development packages that put files in different locations or undetected binary packages. Subpackages are marked by one surrounding '*' if they only fail the more strict rpmdiff -t check and by two if they also fail the rpmdiff check as used by koji. It is assumed that packages without '*' can be directly switched to noarch (assuming they don't do weird stuff on other arches). One '*' will require a more detailed look but should be OK in most cases and two '*'s is most likely a sign for a false positive. The diffs can be found below in a full and a hand shortened version.

Candidates for splitting off noarch subpackages

To search for more data that could be moved into noarch sub packages all files in the distributions where put into one of the following categories:

  • bin32: 32 bit binaries including libraries(!) (as known to rpm, file color==1)
  • bin64: 64 bin binaries including libraries (file color==2)
  • lib32: other files in /lib or /usr/lib
  • lib64: other files in /lib64 or /usr/lib64
  • noarch: everything else

To be able to detect arch independent files in (/usr)/lib x86_64 packages have been examined. It is assumed that lib32 and noarch files can be moved to noarch sub packes, bin64 and lib64 can't and bin32 should not be found. This is only a very rough estimate and must be checked for each packages and doesn't take other architectures into account. Nevertheless it gives a good idea of what packages should be considered and what results can be expected.

Note that it is probably not worth splitting all possible or even just the 1000 packages above. But the first few dozen have a very strong impact.

For some packages it might be better to just change the borders among the subpackages instead of blindly splitting them. Such situations are not reflected well in the above lists.

What about other packages?

A lot of other packages could also make use of this feature. When considering to split up your package please avoid too complicated spec files. We still have to develop packaging strategies to be applied throughout the distribution and it doesn't look like this is going to happen in the F11 time frame.

What can you do as a packager?

There are still fixes for koji that must hit the Fedora build system first. So noarch subpackages DO NOT WORK within Fedora yet. We hope that this can be solved soon.

If you are interested you can already play with noarch subpackages by building with mock and comparing the results on different arches with rpmdiff -t (Files differing in S and 5 are ok). There is going to be little time between support in koji and the feature freeze. So being prepared for this short time slot is a good thing.

Please add the packages you changed or plan to change to /PackagesChanged. Put the later in parenthesis. Thanks!

What if you don't want to change your packages?

That's perfectly fine. There is no plan to force packager to use noarch subpackages. I hope we can develop a more detailed plan on how to make use of this feature in future Fedora releases. You might be interested in taking part in this discussion.

What does that mean for the Packaging Policy?

The packaging policy will require a few additions. See /PolicyChanges. Any comments and help is welcome.

Release Notes

Not applicable as visibility for the users is low and developers need to know before the release.