Introduction and Purpose
There have been several frameworks proposed for use in Fedora QA. This is an attempt to document an evaluation of the various possible solutions for our use. Some of these would be a part of the taskbot proposal, others would pretty much replace the taskbot proposal.
Tools We're Potentially Interested In
Buildbot
Homepage: http://buildbot.net/
While buildbot was designed as a continuous integration system, it is flexible enough that it can be used as a more generic task scheduling and execution system. It is currently being used in taskbot's proof-of-concept implementation and has been functioning well in that capacity.
Advantages to using buildbot
- Written in python which is where most of our experience is and what most of our tools are already written in
- Configuration is 100% python code: not squirrelled away in the application and its database. everything can be stored in git and easily reviewed for changes if something breaks
- Very flexible: you can write new modules in the configuration file without dirty hacks by design
Disadvantages/Roadblocks to using buildbot
- It's not what the cool kids are using
- There are some complexities in buildbot that aren't hidden as well as they are in jenkins. Note that this isn't always a bad thing
Autotest
Homepage: http://autotest.github.io/
Fedora QA has been using autotest for some time now as part of AutoQA. While autotest was originally designed as a tool for kernel testing, it's capabilities are broader than that and is continuing to expand..
Advantages to using autotest
- codebase is a decent balance of simple and having features. it doesn't try to do everything but solves some difficult problems as well
- we're already using it in fedora QA
- upstream is friendly and interested in many of the directions that we want to go in (ie, good patches are very likely to be accepted upstream)
Disadvantages/Roadblocks to using autotest
- frontend: it's written in GWT which is not packagable in Fedora and has few methods for auth/authz. The upstream devs want to port it to django which should clear up both issues but it's something to keep in mind.
- scaling: we're already having issues with database performance on our production instances - this will likely be mitigated by recent changes to autotest and some work on tuning the database but it is a concern
- disposable clients: at the moment, there isn't much support for disposable clients in autotest. it's something they want to do but the code hasn't been written yet
- tests in git: with the current autotest structure, it expects all tests to be stored in a specific path on the master instead of in git repos or something else which is easier to use with distributed test maintenance. the maintainers are interested in changing this but again, the code hasn't been written
Beaker
- Homepage: http://beaker-project.org/
- Evaluation Details: User:Tflink/AutomationFrameworkEvaluation/beaker
Beaker is a full stack lab automation system developed by Red Hat for use in testing Red Hat Enterprise Linux. It has many listed capabilities (ovirt/openstack integration, bare metal and vm provisioning support etc.)
Advantages to using beaker
- Red Hat is using it internally, there is a possibility of using some of their tests without having to write everything on our own
- Has dedicated development resources
- Likely has fewer scaling issues than autotest
- Solves some complicated problems like system provisioning and multi-host tests
Disadvantages/Roadblocks to using beaker
- Execution model doesn't fit with where we are or where we want to go right now - this will get better with openstack integration but that's not done yet and doesn't address the limitations of recipes (written to test a single package or generic; nothing in between and very little ability to pass data or parameters into jobs)
- It's huge and has many moving parts. While beaker is very powerful, it is also rather complex
- Beah. Tasks as RPMs, bash-ish syntax. It works but isn't an ideal solution if you're not already using it. Autotest runner support could help with this but will have to wait and see the implementation details
Igor
Homepage: https://gitorious.org/ovirt/igord
Igor was developed as a method for running tests on oVirt node, it is based on a client/server architecture with communication via http. In contrast to both Autotest and Beaker, it is not a full-stack solution but instead focuses more on running tests and gathering status (execution result, logs)
Criteria
In an attempt to compare the various frameworks, I'm listing a set of criteria which I think are important to look at. The criteria are not going to be relevant to all solutions but that's not necessarily a bad thing.
Communication and Community
- how many projects are using it
- how old is it
- how many active devs from how many orgs
- quality of docs
- how much mailing list traffic is there?
- what is the bug tracker?
- what is the patch process?
- what is the RFE process?
High level stuff
- how tightly integrated are the components
- what license is the project released under
- how much is already packaged in fedora
API
- what mechanism does the api use (xmlrpc, json-rpc, restful-ish etc.)
- can you schedule jobs through the api
- what scheduling params are available through the api
Results
- how flexible is the schema for the built in results store
- what data is stored in the default result
- is there a difference between failed execution and status based on result analysis
- what kinds of analysis are supported
VM management
- does it work with any external systems (ovirt, openstack etc.)
- does it support rapid cloning
- how are vms configured post-spawn
- control over vm configuration (vnc/spice, storage type etc.)
- ephemeral client support?
Test harness
- base language
- how tightly integrated is it with the system as a whole
- are any non-primary harnesses supported
Test execution
- how are tests stored
- support for storing tests in vcs
- method for passing data into test for execution
- how are parameters stored for post-failure analysis
- support for replaying a test
- can tests be executed locally in a dev env with MINIMAL setup
- external log shipping?
- how tightly integrated is result reporting
- what kind of latency is there between tests?