From Fedora Project Wiki
(Early draft)
 
(→‎Release Notes: Bugzilla bug 1836108)
 
(27 intermediate revisions by 2 users not shown)
Line 1: Line 1:
{{admon/important | Comments and Explanations | The page source contains comments providing guidance to fill out each section. They are invisible when viewing this page. To read it, choose the "view source" link.<br/> '''Copy the source to a ''new page'' before making changes!  DO NOT EDIT THIS TEMPLATE FOR YOUR CHANGE PROPOSAL.'''}}
<!-- Self Contained or System Wide Change Proposal?
Use this guide to determine to which category your proposed change belongs to.
Self Contained Changes are:
* changes to isolated/leaf package without the impact on other packages/rest of the distribution
* limited scope changes without the impact on other packages/rest of the distribution
* coordinated effort within SIG with limited impact outside SIG functional area, accepted by the SIG
System Wide Changes are:
* changes that does not fit Self Contained Changes category touching
* changes that require coordination within the distribution (for example mass rebuilds, release engineering or other teams effort etc.)
* changing system defaults
For Self Contained Changes, sections marked as "REQUIRED FOR SYSTEM WIDE CHANGES" are OPTIONAL but FESCo/Wrangler can request more details (especially in case the change proposal category is improper or updated to System Wide category). For System Wide Changes all fields on this form are required for FESCo acceptance (when applies). 
We request that you maintain the same order of sections so that all of the change proposal pages are uniform.
-->
<!-- The actual name of your proposed change page should look something like: Changes/Your_Change_Proposal_Name.  This keeps all change proposals in the same namespace -->
= Sqlite RpmDB =
= Sqlite RpmDB =


Line 27: Line 5:


== Owner ==
== Owner ==
<!--
For change proposals to qualify as self-contained, owners of all affected packages need to be included here. Alternatively, a SIG can be listed as an owner if it owns all affected packages.
This should link to your home wiki page so we know who you are.
-->
* Name: [[User:pmatilai| Panu Matilainen]] [[User:ffesti|Florian Festi]]
* Name: [[User:pmatilai| Panu Matilainen]] [[User:ffesti|Florian Festi]]
* Email: pmatilai@redhat.com ffesti@redhat.com
* Email: pmatilai@redhat.com ffesti@redhat.com


== Current status ==
== Current status ==
[[Category:ChangePageIncomplete]]
[[Category:ChangeAcceptedF33]]
<!-- When your change proposal page is completed and ready for review and announcement -->
<!-- remove Category:ChangePageIncomplete and change it to Category:ChangeReadyForWrangler -->
<!-- The Wrangler announces the Change to the devel-announce list and changes the category to Category:ChangeAnnounced (no action required) -->
<!-- After review, the Wrangler will move your page to Category:ChangeReadyForFesco... if it still needs more work it will move back to Category:ChangePageIncomplete-->
 


[[Category:SystemWideChange]]
[[Category:SystemWideChange]]
Line 46: Line 15:
* Targeted release: [[Releases/33 | Fedora 33 ]]  
* Targeted release: [[Releases/33 | Fedora 33 ]]  
* Last updated: <!-- this is an automatic macro — you don't need to change this line -->  {{REVISIONYEAR}}-{{REVISIONMONTH}}-{{REVISIONDAY2}}  
* Last updated: <!-- this is an automatic macro — you don't need to change this line -->  {{REVISIONYEAR}}-{{REVISIONMONTH}}-{{REVISIONDAY2}}  
* FESCo issue: <will be assigned by the Wrangler>
* FESCo issue: [https://pagure.io/fesco/issue/2360 #2360]
* Tracker bug: <will be assigned by the Wrangler>
* Tracker bug: [https://bugzilla.redhat.com/show_bug.cgi?id=1818910 #1818910]
* Release notes tracker: <will be assigned by the Wrangler>
* Release notes tracker: [https://pagure.io/fedora-docs/release-notes/issue/462 #462]


== Detailed Description ==
== Detailed Description ==
Line 63: Line 32:
== Scope ==
== Scope ==
* Proposal owners:
* Proposal owners:
<!-- What work do the feature owners have to accomplish to complete the feature in time for release?  Is it a large change affecting many parts of the distribution or is it a very isolated change? What are those changes?-->
** Once [[Changes/RPM-4.16|RPM 4.16]] lands and passes initial shakedown, change the default rpmdb configuration to sqlite
** Arrange for automatic database conversion with opt-out possibility (one-shot service on next reboot or similar)
** Address any bugs and issues in the database backend found by wider testing base
** Help other developers to address Berkeley DB dependencies


* Other developers: N/A (not a System Wide Change) <!-- REQUIRED FOR SYSTEM WIDE CHANGES -->
* Other developers:
<!-- What work do other developers have to accomplish to complete the feature in time for release?  Is it a large change affecting many parts of the distribution or is it a very isolated change? What are those changes?-->
** Test for hidden Berkeley DB dependencies in other software, address them as found and needed


* Release engineering: [https://pagure.io/releng/issues #Releng issue number] (a check of an impact with Release Engineering is needed) <!-- REQUIRED FOR SYSTEM WIDE CHANGES -->
* Release engineering: [https://pagure.io/releng/issue/9308 #9308]  
<!-- Does this feature require coordination with release engineering (e.g. changes to installer image generation or update package delivery)?  Is a mass rebuild required?  include a link to the releng issue.
The issue is required to be filed prior to feature submission, to ensure that someone is on board to do any process development work and testing, and that all changes make it into the pipeline; a bullet point in a change is not sufficient communication -->


* Policies and guidelines: N/A (not a System Wide Change) <!-- REQUIRED FOR SYSTEM WIDE CHANGES -->
* Policies and guidelines: Policies and guidelines are not affected
<!-- Do the packaging guidelines or other documents need to be updated for this feature?  If so, does it need to happen before or after the implementation is done?  If a FPC ticket exists, add a link here. -->


* Trademark approval: N/A (not needed for this Change)
* Trademark approval: N/A (not needed for this Change)
<!-- If your Change may require trademark approval (for example, if it is a new Spin), file a ticket ( https://fedorahosted.org/council/ ) requesting trademark approval from the Fedora Council. This approval will be done via the Council's consensus-based process. -->


== Upgrade/compatibility impact ==
== Upgrade/compatibility impact ==
<!-- What happens to systems that have had a previous versions of Fedora installed and are updated to the version containing this change? Will anything require manual configuration or data migration? Will any existing functionality be no longer supported? -->


<!-- REQUIRED FOR SYSTEM WIDE CHANGES -->
=== Upgrading ===
N/A (not a System Wide Change)
* Ability to upgrade is not affected
* After update, systems will be converted to the sqlite format (on next reboot or similar) unless user overrides configuration to stay on BDB for now
 
=== Compatibility ===
* Container/chroot use-cases will be affected: older rpm versions will be unable to query/manipulate the rpmdb from outside the chroot
* Koji/COPR may need to override the database format (back to) BDB for the time being. Better option would be to use mock bootstrap container, which would solve a whole class of issues going forward.


== How To Test ==
== How To Test ==
<!-- This does not need to be a full-fledged document. Describe the dimensions of tests that this change implementation is expected to pass when it is done.  If it needs to be tested with different hardware or software configurations, indicate them.  The more specific you can be, the better the community testing can be.
* Rpmdb gets thoroughly exercised as a matter of normal system operation, performing installs, updates, package builds etc
 
* Of specific interest here is torture testing: forcibly killing rpm in various stages of execution - database should stay consistent and operational (other system state is out of scope)
Remember that you are writing this how to for interested testers to use to check out your change implementation - documenting what you do for testing is OK, but it's much better to document what *I* can do to test your change.
* Test database conversions from one backend to another (rpmdb --rebuilddb --define "_db_backend <backend>")
 
* Install/upgrade scenarios:
A good "how to test" should answer these four questions:
** A fresh install has sqlite enabled by default, no database warnings/errors emitted during rpm operation
 
** System upgraded from Fedora < 33 has been converted to sqlite rpmdb after the post-upgrade reboot without specifc user intervention, no database warnings/errors emitted during normal operation
0. What special hardware / data / etc. is needed (if any)?
1. How do I prepare my system to test this change? What packages
need to be installed, config files edited, etc.?
2. What specific actions do I perform to check that the change is
working like it's supposed to?
3. What are the expected results of those actions?
-->
 
<!-- REQUIRED FOR SYSTEM WIDE CHANGES -->
N/A (not a System Wide Change)


== User Experience ==
== User Experience ==
<!-- If this change proposal is noticeable by users, how will their experiences change as a result?
* In normal operation, users should see little or no change
 
* Behavior in error situations is much more robust: forcibly killed transaction no longer causes database inconsistency or corruption
This section partially overlaps with the Benefit to Fedora section above. This section should be primarily about the User Experience, written in a way that does not assume deep technical knowledge. More detailed technical description should be left for the Benefit to Fedora section.
 
Describe what Users will see or notice, for example:
  - Packages are compressed more efficiently, making downloads and upgrades faster by 10%.
  - Kerberos tickets can be renewed automatically. Users will now have to authenticate less and become more productive. Credential management improvements mean a user can start their work day with a single sign on and not have to pause for reauthentication during their entire day.
- Libreoffice is one of the most commonly installed applications on Fedora and it is now available by default to help users "hit the ground running".
- Green has been scientifically proven to be the most relaxing color. The move to a default background color of green with green text will result in Fedora users being the most relaxed users of any operating system.
-->


== Dependencies ==
== Dependencies ==
<!-- What other packages (RPMs) depend on this package?  Are there changes outside the developers' control on which completion of this change depends?  In other words, completion of another change owned by someone else and might cause you to not be able to finish on time or that you would need to coordinate? Other upstream projects like the kernel (if this is not a kernel change)? -->
* This change depends on [[Changes/RPM-4.16|RPM 4.16]], support for sqlite rpmdb is not present in older versions
 
* RPM will grow a new dependency on sqlite-libs
<!-- REQUIRED FOR SYSTEM WIDE CHANGES -->
* Technically the rpmdb format is an internal implementation detail of RPM and the data is only accessible through the librpm API, but some software is making assumptions both about the format and/or in particular, file naming. These are being tracked at https://bugzilla.redhat.com/show_bug.cgi?id=1766120
N/A (not a System Wide Change)
* A new systemd service (rpmdb-rebuild) is introduced, it can be used to flag the database for a maintenance rebuild at boot and is used to perform the automatic BDB -> Sqlite conversion on first boot after upgrade.


== Contingency Plan ==
== Contingency Plan ==


* Contingency mechanism:
* Contingency mechanism:
 
** Revert the default database back to Berkeley DB backend in the package. The actual conversion can use the same mechanism as in the other direction.
Revert the default database back to Berkeley DB backend in the package. Running 'rpmdb --rebuilddb' on hosts is currently required to actually convert the database, but means to automate conversion in specific conditions is being discussed upstream.
** The rpm-team does not expect problems with the database backend itself, but we are aware that postponing may be needed due to infrastructure or other tooling not being ready, primarily due to inability to access the database from older releases.


* Contingency deadline: Beta freeze
* Contingency deadline: Beta freeze
Line 130: Line 84:


== Documentation ==
== Documentation ==
<!-- Is there upstream documentation on this change, or notes you have written yourself?  Link to that material here so other interested developers can get involved. -->
* [https://rpm.org/wiki/Releases/4.16.0 | RPM 4.16 release notes]


<!-- REQUIRED FOR SYSTEM WIDE CHANGES -->
== Release Notes ==
N/A (not a System Wide Change)


== Release Notes ==
* RPM database default changes from Berkeley DB to Sqlite in this release. The conversion will take place automatically on the first boot after distribution upgrade via rpmdb-rebuild systemd service.
<!-- The Fedora Release Notes inform end-users about what is new in the release. Examples of past release notes are here: http://docs.fedoraproject.org/release-notes/ -->
* Users who have a particular need to stay on Berkeley DB backend can do still so in this release by overriding the configuration manually (eg. `echo %_db_backend bdb > /etc/rpm/macros.db`) before rebooting, or convert back at any later time. This is discouraged however, support for Berkeley DB will be reduced to read-only in the next release.
<!-- The release notes also help users know how to deal with platform changes such as ABIs/APIs, configuration or data file formats, or upgrade concerns. If there are any such changes involved in this change, indicate them here. A link to upstream documentation will often satisfy this need. This information forms the basis of the release notes edited by the documentation team and shipped with the release.  
* In some circumstances [*] users may see messages like "warning: Found bdb Packages database while attempting sqlite backend: using bdb backend." This is a harmless indication that rpm configuration and what's on disk disagree. It can be silenced either by running `rpmdb --rebuilddb` to convert the database to match configuration, or by overriding configuration to match what is on disk (see above).  
** Unfortunately, this warning appears after pretty normal upgrade, see https://bugzilla.redhat.com/show_bug.cgi?id=1836108


Release Notes are not required for initial draft of the Change Proposal but has to be completed by the Change Freeze.  
[*] At least non-bootstrap mock roots where inside and outside rpm database default differs, (rawhide) users who haven't rebooted since the default changed.
-->

Latest revision as of 10:01, 20 November 2020

Sqlite RpmDB

Summary

Change format of the RPM database from Berkeley DB to a new Sqlite format.

Owner

Current status

Detailed Description

The current rpm database implementation is based on Berkeley DB 5.x, a version which is unmaintained upstream for several years now. Berkeley DB 6.x is license incompatible so moving to that is not an option. In addition, the existing rpmdb implementation is notoriously unreliable as it's not transactional and has no other means to detect inconsistencies either.

Changing to a more sustainable database implementation is long overdue. We propose to change the default rpmdb format to the new sqlite based implementation. Support for current BDB format will be retained in Fedora 33, and phased out to read-only support in Fedora 34.

Benefit to Fedora

  • A far more robust rpm database implementation
  • Getting rid of Berkeley DB dependency in one of the core components

Scope

  • Proposal owners:
    • Once RPM 4.16 lands and passes initial shakedown, change the default rpmdb configuration to sqlite
    • Arrange for automatic database conversion with opt-out possibility (one-shot service on next reboot or similar)
    • Address any bugs and issues in the database backend found by wider testing base
    • Help other developers to address Berkeley DB dependencies
  • Other developers:
    • Test for hidden Berkeley DB dependencies in other software, address them as found and needed
  • Release engineering: #9308
  • Policies and guidelines: Policies and guidelines are not affected
  • Trademark approval: N/A (not needed for this Change)

Upgrade/compatibility impact

Upgrading

  • Ability to upgrade is not affected
  • After update, systems will be converted to the sqlite format (on next reboot or similar) unless user overrides configuration to stay on BDB for now

Compatibility

  • Container/chroot use-cases will be affected: older rpm versions will be unable to query/manipulate the rpmdb from outside the chroot
  • Koji/COPR may need to override the database format (back to) BDB for the time being. Better option would be to use mock bootstrap container, which would solve a whole class of issues going forward.

How To Test

  • Rpmdb gets thoroughly exercised as a matter of normal system operation, performing installs, updates, package builds etc
  • Of specific interest here is torture testing: forcibly killing rpm in various stages of execution - database should stay consistent and operational (other system state is out of scope)
  • Test database conversions from one backend to another (rpmdb --rebuilddb --define "_db_backend <backend>")
  • Install/upgrade scenarios:
    • A fresh install has sqlite enabled by default, no database warnings/errors emitted during rpm operation
    • System upgraded from Fedora < 33 has been converted to sqlite rpmdb after the post-upgrade reboot without specifc user intervention, no database warnings/errors emitted during normal operation

User Experience

  • In normal operation, users should see little or no change
  • Behavior in error situations is much more robust: forcibly killed transaction no longer causes database inconsistency or corruption

Dependencies

  • This change depends on RPM 4.16, support for sqlite rpmdb is not present in older versions
  • RPM will grow a new dependency on sqlite-libs
  • Technically the rpmdb format is an internal implementation detail of RPM and the data is only accessible through the librpm API, but some software is making assumptions both about the format and/or in particular, file naming. These are being tracked at https://bugzilla.redhat.com/show_bug.cgi?id=1766120
  • A new systemd service (rpmdb-rebuild) is introduced, it can be used to flag the database for a maintenance rebuild at boot and is used to perform the automatic BDB -> Sqlite conversion on first boot after upgrade.

Contingency Plan

  • Contingency mechanism:
    • Revert the default database back to Berkeley DB backend in the package. The actual conversion can use the same mechanism as in the other direction.
    • The rpm-team does not expect problems with the database backend itself, but we are aware that postponing may be needed due to infrastructure or other tooling not being ready, primarily due to inability to access the database from older releases.
  • Contingency deadline: Beta freeze
  • Blocks release? Yes

Documentation

Release Notes

  • RPM database default changes from Berkeley DB to Sqlite in this release. The conversion will take place automatically on the first boot after distribution upgrade via rpmdb-rebuild systemd service.
  • Users who have a particular need to stay on Berkeley DB backend can do still so in this release by overriding the configuration manually (eg. echo %_db_backend bdb > /etc/rpm/macros.db) before rebooting, or convert back at any later time. This is discouraged however, support for Berkeley DB will be reduced to read-only in the next release.
  • In some circumstances [*] users may see messages like "warning: Found bdb Packages database while attempting sqlite backend: using bdb backend." This is a harmless indication that rpm configuration and what's on disk disagree. It can be silenced either by running rpmdb --rebuilddb to convert the database to match configuration, or by overriding configuration to match what is on disk (see above).

[*] At least non-bootstrap mock roots where inside and outside rpm database default differs, (rawhide) users who haven't rebooted since the default changed.