19:59:19 #startmeeting 19:59:19 Meeting started Wed Dec 11 19:59:19 2024 UTC. The chair is ukleinek. Information about MeetBot at http://wiki.debian.org/MeetBot. 19:59:19 Useful Commands: #action #agreed #help #info #idea #link #topic. 19:59:26 #chair bwh 19:59:26 Current chairs: bwh ukleinek 20:00:50 bwh: anything before we start with the bug list? 20:01:05 No 20:01:15 #topic #1089521 linux-image-6.12.3-amd64 fails to boot due to Oops: invalid opcode: 0000 error 20:01:33 That's a new one, list corruption in drm/i915 20:02:23 Seems like it should be fixed by https://lore.kernel.org/all/20241211114905.368044-1-stanislaw.gruszka@linux.intel.com/ 20:02:26 ah right, someone pointed out this is a duplicate 20:02:44 but then they referred to the same bug number 20:03:22 oh, indeed, still this reminded me on some other bug 20:03:52 I guess Nico Orru wanted to say "me too" 20:04:35 should probably be forwarded upstream 20:04:55 I'll note the patch on the bug report. I don't think we need to forward unless someone says the patch doesn't work for them 20:05:29 ack 20:05:45 I'm very behind on tasks so I am not volunteering to walk reporters through testing the patch 20:06:20 #action ukleinek asks reporters of #1089521 to look at/test https://lore.kernel.org/all/20241211114905.368044-1-stanislaw.gruszka@linux.intel.com/ 20:06:42 #topic 1087807 linux-image-6.1.0-27-amd64: Unable to boot: i40e swiotlb buffer is ful 20:07:05 I still have an action to look at this 20:07:12 ack 20:07:25 #action bwh cares for #1087807 20:07:53 #topic #1076372 Corruption of Longsys/Lexar NM790 NVMe drive with Linux 6.5+ 20:08:09 That's one of the bugs we talk about each week. 20:08:39 :-/ 20:08:44 carnil asked reporter to forward one of the problems identified there 20:09:31 We should probably clone this bug to separate the 2 issues 20:10:18 I don't have an overview there, maybe carnil can better care there? I'll ask him. 20:10:32 OK 20:10:35 #action ukleinek talks to carnil about #1076372 20:10:56 # topic #1063754 fat-modules: SD corruption upon opening file on Linux desktop 20:11:00 #topic #1063754 fat-modules: SD corruption upon opening file on Linux desktop 20:11:23 That's an old one, IIRC we hope to close it for inactivity? 20:11:27 Right, I think I have an action to close this (letting the reporter know they can reopen if necessary) 20:11:49 There is no action since 2 month, so maybe now is the time. 20:12:10 #action ukleinek to close 1063754 nicely 20:12:24 Thank you 20:12:37 #topic #1087616 linux-image-6.11.7-amd64: Can't boot (boot freezes after \[or maybe during\] cryptsetup unlocks drive 20:13:17 The sysrq trick helped, however I'm unsure if this really helps. On a quick glance I think systemd waits in a poll 20:13:44 The backtrace shows systemd reading from a tty... 20:14:04 because the sysrq keycombo was pressed? 20:14:32 I don't think so. It's a regular read() system call 20:15:21 but this is already post exit_to_user_mode 20:15:37 hmm, not sure 20:15:57 The ? lines, are, well, questionable 20:16:10 probably left over from the previous system call 20:17:26 maybe ask in #d-systemd for help? 20:17:35 I think we should reassign this over to systemd so they can at least try to figure out why it's reading from the tty 20:17:46 ok 20:18:01 #action ukleinek reassigns #1087616 to systemd 20:18:31 #topic #1021245 linux-image-5.10.0-18-rt-amd64: can't access EFIVARS when using rt version of kernel 20:18:38 We have that 3x 20:18:53 Yes, and someone reminded me of this in Toulouse 20:19:45 it seems to be like that by design, waldi mentioned a cmdline to fix that and wondered if the Debian kernel should default to that behaviour. 20:20:09 Yes, for a proper real-time system the unpredictable latency of EFI run-time services is a problem 20:20:50 Maybe "don't do it then" is a more sensible approach than refusing to do it? 20:21:01 I don't believe the real-time kernels we provide are suitable for production real-time systems so I'm happy with changing the default 20:21:03 I tend to agree to waldi 20:21:10 yes 20:22:09 #action let efi=runtime be the default, ukleinek asks waldi to support 20:22:43 #topic #1085178 linux-signed-amd64: Some BPF fentry hooks silently fail 20:22:50 This is where my preparation stopped. 20:23:31 ah, seems to be tag + fixed-upstream 20:24:05 Well there's a patch but was it ever applied? 20:25:00 yup, discussion continued just a few days ago. I suggest waiting. 20:25:25 Oh, right, I didn't notice the dates 20:25:40 I'll mark it forwarded 20:25:55 #action keep an eye on upstream discussion of #1085178 20:26:39 #topic #1089158 Kernel Panic on Shutdown/Restart During RAID Resync with mdadm on Debian 12.7 20:28:06 This at least seems like it should be hardware-independent 20:29:19 and is presumably a regression from 12.6, since I would expect interrupting RAID syncs during installation to be pretty common 20:30:16 so maybe 6.1.115 was good? 20:30:27 the bug is about 6.1.119 20:30:46 Is it? The found version is 6.1.112-1 20:31:06 then I looked too quick 20:32:15 Hmm, Version: 6.1.0-28-amd64 is 6.1.119 20:32:45 if I did something wrong in the git history interpretation I still don't see it 20:33:05 also Debian 12.7 had linux 6.1.106-3, so it's a bit confusing 20:33:35 no version info in the photo of the oops 20:33:59 6.1.0-28 is from bookworm-security 20:34:17 OK, I can see 6.1.106-3 and 6.1.112-1 in photos attached to the other report 20:35:03 I've added that as a found version 20:35:19 so was ask the group of reporters to identify a good kernel version? 20:35:39 I think we can be a bit smarter than that 20:35:40 s/was/we/ 20:36:47 bookworm-backports is ok according to the initial mail in #1086175 20:37:17 There are only 4 commits to drivers/md between v6.1.94 (as released in Debian 12.6) and v6.1.106 20:38:34 so I can look to see if there are any later fixes referencing those 20:39:11 nothing for 305a517 and a8768a13 20:39:25 (I used git log a8768a13..next/master --grep a8768a13) 20:39:33 yeah, nothing 20:40:14 also nothing for the other two. 20:41:20 I guess this needs to be bisected then 20:41:49 #action ukleinek asks reporters of #1089158 and its merged bug for bisection 20:41:57 I can take an action to do that but probably won't get to it soon 20:42:28 I'll try to do it, if I fail we can still schedule to someone else 20:42:51 #topic #1089323 linux-image-6.1.0-28-arm64: systemd can login, sysvinit hangs fatal before login 20:44:11 I don't see any kernel bug here 20:44:31 only the subject points to a real problem 20:45:49 is lima a staging module? 20:46:24 No, they are marked with (C) 20:46:50 ** Tainted: C (1024) * staging driver was loaded 20:47:03 The error messages seems like the usual slightly buggy device tree 20:48:14 It's a bad bug report and I don't think it's worth spending time on 20:48:14 or missing drivers 20:48:49 so we reply and ask for a more detailed report? Or just nothing? 20:48:50 I'll take an action to close and suggest opening a separate bug report on sysvinit if that's actually something they care about 20:49:27 #action bwh closes #1089323 and asks to report to sysvinit (which seems to be the actual problem) 20:49:49 #956752 is one of the duplicates 20:50:34 #topic #1075770 linux-image-6.9.7-amd64: adlp_tc_phy_connect [i915] fills logs with drm_WARN_ON(tc->mode == TC_PORT_LEGACY) call traces 20:51:00 It seems like this is progresing upstream 20:51:54 bts forwarded 1075770 https://gitlab.freedesktop.org/drm/i915/kernel/-/issues/12246 ? 20:52:11 Yes and there are responses there 20:52:21 It is already marked forwarded 20:52:23 ah, it is already forwarded. 20:52:40 (I expected a flag in the agenda for that) 20:52:59 so no action for that one, right? 20:53:05 right 20:53:26 #topic #1089045 linux-image-6.11.10-amd64: kernel task hang when connecting bluetooth](https://bugs.debian.org/1089045 20:54:29 the kernel log there seems to be only a symptom, not the root cause 20:55:09 maybe the hang is a wait for a mutex? 20:55:53 Well the one commit in net/bluetooth between v6.11.7 and v6.11.10 looks a bit sus 20:56:24 let them test 6.12? 20:56:47 That's an option yes 20:57:42 That would at least let us know if we still have a bug to fix at all 20:58:18 huh: $ git lg v6.11.. --grep=333b4fd11e89 20:58:18 55abbd148dfb Bluetooth: hci_core: Fix calling mgmt_device_connected 20:58:18 7967dc8f797f Bluetooth: hci_core: Fix calling mgmt_device_connected 20:58:55 why was it applied twice? 20:59:09 uh that's interesting 20:59:52 Probably went through 2 different branches (1 for next and 1 for fixes in current cycle) 21:00:35 yeah, one is in 6.12, the other is only reachable from 6.13-rc1 21:02:03 hmm, ok, so let's ask them to test 6.12 21:02:19 Right 21:02:27 #action ukleinek asks to reproduce #1089045 on 6.12 21:02:58 It's > 22:00 CET now, should we skip the rest of the bugs? 21:03:22 The same fix on 2 different branches usually merges together successfully, but can result in the patch being applied to stable twice (the second time in a different function). But that didn't happene here. 21:03:26 Yes 21:03:46 #1071562 would be good to solve, this affects Debian build servers IIRC 21:05:21 Probably should be severity important? 21:05:36 ack 21:06:51 #action ukleinek looks into #1071562, maybe increases severity. 21:07:08 * ukleinek has enough on his plate now and suggests to end the meeting. 21:07:20 Yes, please 21:07:25 #endmeeting