Preliminary launch of ci.rocm.debian.net
Hi list,
I'm happy to announce that ci.rocm.debian.net [1] is up and running.
Currently, it has one worker, ckk01 which is my box with an RX 6800 XT
(gfx1030). I've explained how this works in my previous email [2]. You
can see an example result for rocrand here [3].
This is more of an alpha release. While performing end-to-end tests over
the last two days, I noticed the following issues that still need to be
addressed:
(1) There's no automatic scheduling yet. Jobs can only be submitted
manually. This will be addressed very soon.
(2) There seems to be a RabbitMQ issue in bookworm; both readers and
writers seem to block occasionally.
(3) I've underestimated the resources requirements. Last Sunday,
I naively scheduled jobs for unstable+testing for all packages
with autopkgtests. Well, some tests run for hours, they even
timed out (3h). I need to increase the limit.
(4) I've not looked into getting tests done in experimental, which
debci treats as unstable + an extra APT source.
(5) I've deliberately postponed stats (munin) and self-service (API)
to a later point in time.
I expect to address (1) to (4) sometime this week. This might lead to
occasional outages while I debug stuff. This is also the reason why I've
schedule so few jobs, as they block my host.
(3) is going to be interesting mid-term, I honestly did not expect that,
even though it's obvious in hindsight. An update of e.g. rocm-hipamd or
any other common dependency will trigger a few days' worth of tests. But
I'm confident we can optimize this.
Best,
Christian
[1] https://ci.rocm.debian.net
[2] https://lists.debian.org/debian-ai/2023/07/msg00162.html
[3] https://ci.rocm.debian.net/packages/r/rocrand/
Reply to: