
Bug#1036644: linux-image-6.1.0-9-amd64: System crashes. Netconsole reports CPUs not responding to MCE broadcast



Control: tags -1 + moreinfo

Hi Olivier,

On Tue, May 23, 2023 at 06:49:00PM +0200, Olivier Berger wrote:
> Package: src:linux
> Version: 6.1.27-1
> Severity: normal
> 
> Hi.
> 
> I'm experiencing crashes (computer reset or completely shutting down) without much detail available on why. It used to work fine with 6.1.0-7 but has had problems with the 2 later updates of the testing kernel.
> 
> I've managed to get a log of the kernel panic with netconsole (otherwise I wouldn't get any hints whatsoever in the logs on disk after restarting), below.
> 
> I guess this is nasty as being close to the freeze. I've had the issue for a few days now, but only managed to test a netconsole remote log today.
> 
> It seems to me that the crashes mainly happen when I'm away from the laptop for several minutes, so they may be related to some kind of energy-saving feature...
> 
> Hope this provides enough details to help.
> 
> [  394.735702] netpoll: netconsole: local port 6666
> [  394.735711] netpoll: netconsole: local IPv4 address 192.168.0.23
> [  394.735715] netpoll: netconsole: interface 'enp2s0'
> [  394.735717] netpoll: netconsole: remote port 6666
> [  394.735719] netpoll: netconsole: remote IPv4 address 192.168.0.47
> [  394.735722] netpoll: netconsole: remote ethernet address 38:2c:4a:b1:63:94
> [  394.735819] printk: console [netcon0] enabled
> [  394.735825] netconsole: network logging started
> [  463.655009] usb 3-6: new high-speed USB device number 8 using xhci_hcd
> [  463.659448] systemd-journald[428]: Sent WATCHDOG=1 notification.
> [  463.943099] usb 3-6: New USB device found, idVendor=1307, idProduct=0190, bcdDevice= 1.00
> [  463.943133] usb 3-6: New USB device strings: Mfr=1, Product=2, SerialNumber=3
> [  463.943144] usb 3-6: Product: USB Mass Storage Device
> [  463.943153] usb 3-6: Manufacturer: USBest Technology
> [  463.943160] usb 3-6: SerialNumber: 0000000000027F
> [  463.974560] systemd-journald[428]: Successfully sent stream file descriptor to service manager.
> [  463.974717] systemd-journald[428]: Successfully sent stream file descriptor to service manager.
> [  463.987184] SCSI subsystem initialized
> [  463.990687] usb-storage 3-6:1.0: USB Mass Storage device detected
> [  463.990771] scsi host0: usb-storage 3-6:1.0
> [  463.990859] usbcore: registered new interface driver usb-storage
> [  463.992482] usbcore: registered new interface driver uas
> [  464.995952] scsi 0:0:0:0: Direct-Access     Ut190    USB2FlashStorage 0.00 PQ: 0 ANSI: 2
> [  464.996613] scsi 0:0:0:1: Direct-Access     Ut190    SD0StorageDevice 0.00 PQ: 0 ANSI: 2
> [  465.008300] scsi 0:0:0:0: Attached scsi generic sg0 type 0
> [  465.008343] scsi 0:0:0:1: Attached scsi generic sg1 type 0
> [  465.014353] sd 0:0:0:0: [sda] 7897088 512-byte logical blocks: (4.04 GB/3.77 GiB)
> [  465.014619] sd 0:0:0:1: [sdb] Media removed, stopped polling
> [  465.014756] sd 0:0:0:0: [sda] Write Protect is off
> [  465.014764] sd 0:0:0:0: [sda] Mode Sense: 00 00 00 00
> [  465.014804] sd 0:0:0:1: [sdb] Attached SCSI removable disk
> [  465.014951] sd 0:0:0:0: [sda] Asking for cache data failed
> [  465.014957] sd 0:0:0:0: [sda] Assuming drive cache: write through
> [  465.284600] GPT:Primary header thinks Alt. header is not at the end of the disk.
> [  465.284627] GPT:2590719 != 7897087
> [  465.284634] GPT:Alternate GPT header not at the end of the disk.
> [  465.284640] GPT:2590719 != 7897087
> [  465.284645] GPT: Use GNU Parted to correct GPT errors.
> [  465.284659]  sda: sda1
> [  465.285144] sd 0:0:0:0: [sda] Attached SCSI removable disk
> [  474.111368] systemd-journald[428]: Successfully sent stream file descriptor to service manager.
> [  497.264500] sda: detected capacity change from 7897088 to 0
> [  502.045711] usb 3-6: USB disconnect, device number 8
> [  519.695345] systemd-journald[428]: Successfully sent stream file descriptor to service manager.
> [  535.857315] EXT4-fs (dm-0): recovery complete
> [  535.858056] EXT4-fs (dm-0): mounted filesystem with ordered data mode. Quota mode: none.
> [  543.576681] systemd-journald[428]: Sent WATCHDOG=1 notification.
> [  551.263395] systemd-journald[428]: Successfully sent stream file descriptor to service manager.
> [  634.375963] systemd-journald[428]: Sent WATCHDOG=1 notification.
> [  725.578095] systemd-journald[428]: Sent WATCHDOG=1 notification.
> [  845.577721] systemd-journald[428]: Sent WATCHDOG=1 notification.
> [  871.117193] systemd-journald[428]: Successfully sent stream file descriptor to service manager.
> [  905.577391] systemd-journald[428]: Sent WATCHDOG=1 notification.
> [  905.620289] systemd-journald[428]: Successfully sent stream file descriptor to service manager.
> [  905.623541] systemd-journald[428]: Successfully sent stream file descriptor to service manager.
> [  995.577111] systemd-journald[428]: Sent WATCHDOG=1 notification.
> [ 1085.576193] systemd-journald[428]: Sent WATCHDOG=1 notification.
> [ 1205.575316] systemd-journald[428]: Sent WATCHDOG=1 notification.
> [ 1265.574866] systemd-journald[428]: Sent WATCHDOG=1 notification.
> [ 1305.267119] mce: CPUs not responding to MCE broadcast (may include false positives): 0-1,3-5,7
> [ 1305.267121] mce: CPUs not responding to MCE broadcast (may include false positives): 0-1,3-5,7
> [ 1305.267130] Kernel panic - not syncing: Timeout: Not all CPUs entered broadcast exception handler
> [ 1306.436494] Shutting down cpus with NMI
> [ 1306.448150] Kernel Offset: 0x1b600000 from 0xffffffff81000000 (relocation range: 0xffffffff80000000-0xffffffffbfffffff)
> [ 1306.450068] ------------[ cut here ]------------
> [ 1306.450070] WARNING: CPU: 6 PID: 1146 at arch/x86/kernel/fpu/core.c:60 irq_fpu_usable+0x39/0x50
> [ 1306.450084] Modules linked in: dm_crypt sd_mod sg uas usb_storage scsi_mod scsi_common netconsole xt_conntrack nft_chain_nat xt_MASQUERADE nf_nat nf_conntrack_netlink nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 xfrm_user xfrm_algo xt_addrtype nft_compat nf_tables nfnetlink br_netfilter bridge stp llc vboxnetadp(OE) vboxnetflt(OE) vboxdrv(OE) ctr ccm rfcomm snd_seq_dummy snd_hrtimer snd_seq cmac algif_hash algif_skcipher af_alg squashfs qrtr overlay cpufreq_ondemand cpufreq_conservative cpufreq_powersave cpufreq_userspace bnep binfmt_misc nls_ascii nls_cp437 vfat fat snd_ctl_led snd_soc_skl_hda_dsp snd_soc_intel_hda_dsp_common snd_soc_hdac_hdmi snd_sof_probes hid_logitech_hidpp snd_hda_codec_hdmi iwlmvm snd_hda_codec_realtek btusb btrtl btbcm snd_hda_codec_generic snd_soc_dmic btintel ledtrig_audio mac80211 btmtk snd_sof_pci_intel_tgl libarc4 bluetooth snd_sof_intel_hda_common soundwire_intel soundwire_generic_allocation mei_hdcp iwlwifi soundwire_cadence x86_pkg_temp_thermal
> [ 1306.450173]  snd_sof_intel_hda intel_powerclamp snd_sof_pci snd_sof_xtensa_dsp joydev jitterentropy_rng pmt_telemetry coretemp intel_rapl_msr snd_sof pmt_class drbg cfg80211 snd_sof_utils ansi_cprng snd_soc_hdac_hda snd_hda_ext_core kvm_intel snd_soc_acpi_intel_match hp_wmi ecdh_generic snd_soc_acpi platform_profile kvm snd_soc_core uvcvideo snd_compress soundwire_bus snd_hda_intel snd_usb_audio snd_intel_dspcfg snd_intel_sdw_acpi snd_hda_codec videobuf2_vmalloc irqbypass videobuf2_memops snd_usbmidi_lib snd_hda_core videobuf2_v4l2 rapl snd_rawmidi videobuf2_common snd_hwdep intel_cstate snd_seq_device intel_uncore snd_pcm videodev iTCO_wdt pcspkr snd_timer intel_pmc_bxt wmi_bmof mei_me iTCO_vendor_support snd watchdog ee1004 mei mc soundcore ecc rfkill processor_thermal_device_pci_legacy processor_thermal_device ucsi_acpi processor_thermal_rfim typec_ucsi processor_thermal_mbox roles processor_thermal_rapl intel_rapl_common int3403_thermal intel_vsec intel_soc_dts_iosf igen6_edac typec
> [ 1306.450244]  int340x_thermal_zone intel_hid int3400_thermal sparse_keymap acpi_thermal_rel intel_pmc_core acpi_pad ac hid_logitech_dj hid_multitouch serio_raw evdev nfsd auth_rpcgss nfs_acl lockd msr parport_pc grace ppdev lp parport sunrpc fuse loop dm_mod efi_pstore configfs ip_tables x_tables autofs4 usbhid ext4 crc16 mbcache jbd2 btrfs blake2b_generic zstd_compress efivarfs raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c crc32c_generic raid1 raid0 multipath linear md_mod i915 drm_buddy nvme i2c_algo_bit nvme_core drm_display_helper t10_pi crc32_pclmul crc32c_intel crc64_rocksoft_generic cec hid_generic crc64_rocksoft rc_core crc_t10dif xhci_pci rtsx_pci_sdmmc ghash_clmulni_intel crct10dif_generic xhci_hcd r8169 ttm crct10dif_pclmul mmc_core usbcore drm_kms_helper aesni_intel crc64 realtek intel_lpss_pci i2c_i801 i2c_hid_acpi mdio_devres i2c_hid intel_lpss crypto_simd drm cryptd libphy rtsx_pci i2c_smbus crct10dif_common idma64 vmd
> [ 1306.450340]  usb_common battery hid video wmi button sha512_ssse3 sha512_generic
> [ 1306.450350] CPU: 6 PID: 1146 Comm: Xorg Tainted: G           OE      6.1.0-9-amd64 #1  Debian 6.1.27-1
> [ 1306.450356] Hardware name: HP HP ProBook 450 G8 Notebook PC/87E1, BIOS T70 Ver. 01.13.01 03/30/2023
> [ 1306.450358] RIP: 0010:irq_fpu_usable+0x39/0x50
> [ 1306.450364] Code: 00 f0 00 75 25 65 8a 0d bd 46 9e 63 31 c0 84 c9 75 13 b8 01 00 00 00 f7 c2 00 00 0f 00 74 06 80 e6 ff 0f 94 c0 c3 cc cc cc cc <0f> 0b 31 c0 c3 cc cc cc cc 66 66 2e 0f 1f 84 00 00 00 00 00 0f 1f
> [ 1306.450368] RSP: 0018:fffffe00001759f0 EFLAGS: 00010006
> [ 1306.450372] RAX: 0000000000110004 RBX: 0000000000000003 RCX: ffff9ed6401c4000
> [ 1306.450375] RDX: 0000000080110004 RSI: ffff9ed6401c4fe0 RDI: 0000000000000001
> [ 1306.450377] RBP: 0000000000000007 R08: ffff9ed652599000 R09: 0000000000000001
> [ 1306.450379] R10: abcc77118461cefd R11: fffffe0000175bbf R12: fffffe0000175a68
> [ 1306.450381] R13: fffffe0000175a70 R14: fffffe0000175a78 R15: 0000000000000000
> [ 1306.450383] FS:  00007fd13533ba80(0000) GS:ffff9eddcf980000(0000) knlGS:0000000000000000
> [ 1306.450386] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> [ 1306.450388] CR2: 00007ff98cf90000 CR3: 00000001091c4003 CR4: 0000000000770ee0
> [ 1306.450391] PKRU: 55555554
> [ 1306.450392] Call Trace:
> [ 1306.450397]  <#MC>
> [ 1306.450398]  kernel_fpu_begin_mask+0x2b/0xe0
> [ 1306.450410]  virt_efi_query_variable_info_nonblocking+0x5e/0x130
> [ 1306.450419]  efi_query_variable_store+0x1a2/0x1e0
> [ 1306.450427]  efivar_set_variable_locked+0x9f/0xf0
> [ 1306.450438]  efi_pstore_write+0x152/0x1a0 [efi_pstore]
> [ 1306.450451]  ? pstore_dump+0x174/0x370
> [ 1306.450460]  pstore_dump+0x174/0x370
> [ 1306.450470]  kmsg_dump+0x43/0x60
> [ 1306.450476]  panic+0x186/0x2ed
> [ 1306.450484]  ? irq_work_queue+0x35/0x50
> [ 1306.450490]  mce_panic+0x113/0x1d0
> [ 1306.450498]  mce_timed_out+0x70/0xb0
> [ 1306.450502]  mce_start+0x98/0x130
> [ 1306.450506]  do_machine_check+0x68b/0x880
> [ 1306.450514]  ? fwtable_read32+0x79/0x220 [i915]
> [ 1306.450643]  exc_machine_check+0x70/0xb0
> [ 1306.450648]  asm_exc_machine_check+0x1a/0x40
> [ 1306.450656] RIP: 0010:fwtable_read32+0x79/0x220 [i915]
> [ 1306.450765] Code: 8b 43 08 8b 90 54 1a 00 00 85 d2 0f 85 67 01 00 00 41 8d 84 24 00 00 fc ff 3d ff ff 17 00 0f 87 92 00 00 00 48 03 2b 8b 6d 00 <48> 8b 43 08 8b 80 54 1a 00 00 85 c0 0f 85 66 01 00 00 4c 89 fe 4c
> [ 1306.450767] RSP: 0018:ffffafe9c431b920 EFLAGS: 00000082
> [ 1306.450770] RAX: 0000000000030008 RBX: ffff9ed682f01c20 RCX: 0000012f1323d3ce
> [ 1306.450772] RDX: 0000000000000000 RSI: 0000000000070008 RDI: 0000000000000000
> [ 1306.450774] RBP: 0000000000000000 R08: 0000000000000000 R09: 0000000000000000
> [ 1306.450776] R10: 0000000000000006 R11: 0000000000000001 R12: 0000000000070008
> [ 1306.450777] R13: 0000000000000000 R14: ffff9ed682f01c40 R15: 0000000000000202
> [ 1306.450785]  </#MC>
> [ 1306.450786]  <TASK>
> [ 1306.450789]  __intel_wait_for_register+0x1e1/0x220 [i915]
> [ 1306.450899]  intel_disable_transcoder+0x1b6/0x2c0 [i915]
> [ 1306.451062]  intel_ddi_post_disable+0x374/0x4d0 [i915]
> [ 1306.451206]  intel_encoders_post_disable+0x7b/0x90 [i915]
> [ 1306.451350]  intel_old_crtc_state_disables+0x38/0xa0 [i915]
> [ 1306.451487]  intel_atomic_commit_tail+0x392/0xe30 [i915]
> [ 1306.451625]  intel_atomic_commit+0x34f/0x390 [i915]
> [ 1306.451757]  drm_atomic_commit+0x93/0xc0 [drm]
> [ 1306.451827]  ? drm_plane_get_damage_clips.cold+0x1c/0x1c [drm]
> [ 1306.451881]  drm_atomic_connector_commit_dpms+0xcb/0xf0 [drm]
> [ 1306.451937]  drm_mode_obj_set_property_ioctl+0x193/0x3d0 [drm]
> [ 1306.451995]  ? drm_connector_set_obj_prop+0x90/0x90 [drm]
> [ 1306.452050]  drm_connector_property_set_ioctl+0x39/0x60 [drm]
> [ 1306.452101]  drm_ioctl_kernel+0xc6/0x170 [drm]
> [ 1306.452154]  drm_ioctl+0x22f/0x410 [drm]
> [ 1306.452202]  ? drm_connector_set_obj_prop+0x90/0x90 [drm]
> [ 1306.452254]  __x64_sys_ioctl+0x8d/0xd0
> [ 1306.452261]  do_syscall_64+0x58/0xc0
> [ 1306.452267]  ? fpregs_assert_state_consistent+0x22/0x50
> [ 1306.452271]  ? exit_to_user_mode_prepare+0x139/0x1d0
> [ 1306.452277]  ? syscall_exit_to_user_mode+0x17/0x40
> [ 1306.452282]  ? do_syscall_64+0x67/0xc0
> [ 1306.452284]  ? do_syscall_64+0x67/0xc0
> [ 1306.452287]  ? do_syscall_64+0x67/0xc0
> [ 1306.452289]  ? syscall_exit_to_user_mode+0x17/0x40
> [ 1306.452293]  ? do_syscall_64+0x67/0xc0
> [ 1306.452296]  entry_SYSCALL_64_after_hwframe+0x63/0xcd
> [ 1306.452303] RIP: 0033:0x7fd13591cafb
> [ 1306.452307] Code: 00 48 89 44 24 18 31 c0 48 8d 44 24 60 c7 04 24 10 00 00 00 48 89 44 24 08 48 8d 44 24 20 48 89 44 24 10 b8 10 00 00 00 0f 05 <89> c2 3d 00 f0 ff ff 77 1c 48 8b 44 24 18 64 48 2b 04 25 28 00 00
> [ 1306.452309] RSP: 002b:00007ffe51ce3ae0 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
> [ 1306.452313] RAX: ffffffffffffffda RBX: 0000000000000003 RCX: 00007fd13591cafb
> [ 1306.452315] RDX: 00007ffe51ce3b70 RSI: 00000000c01064ab RDI: 0000000000000010
> [ 1306.452317] RBP: 00007ffe51ce3b70 R08: 0000000000000002 R09: 0000000000000001
> [ 1306.452319] R10: 0000000000000000 R11: 0000000000000246 R12: 00000000c01064ab
> [ 1306.452321] R13: 0000000000000010 R14: 000055cc63c6bbe0 R15: 000055cc65358fd0
> [ 1306.452325]  </TASK>
> [ 1306.452326] ---[ end trace 0000000000000000 ]---
> [ 1306.452329] ------------[ cut here ]------------
> [ 1306.452330] WARNING: CPU: 6 PID: 1146 at arch/x86/kernel/fpu/core.c:424 kernel_fpu_begin_mask+0xc4/0xe0
> [ 1306.452337] Modules linked in: dm_crypt sd_mod sg uas usb_storage scsi_mod scsi_common netconsole xt_conntrack nft_chain_nat xt_MASQUERADE nf_nat nf_conntrack_netlink nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 xfrm_user xfrm_algo xt_addrtype nft_compat nf_tables nfnetlink br_netfilter bridge stp llc vboxnetadp(OE) vboxnetflt(OE) vboxdrv(OE) ctr ccm rfcomm snd_seq_dummy snd_hrtimer snd_seq cmac algif_hash algif_skcipher af_alg squashfs qrtr overlay cpufreq_ondemand cpufreq_conservative cpufreq_powersave cpufreq_userspace bnep binfmt_misc nls_ascii nls_cp437 vfat fat snd_ctl_led snd_soc_skl_hda_dsp snd_soc_intel_hda_dsp_common snd_soc_hdac_hdmi snd_sof_probes hid_logitech_hidpp snd_hda_codec_hdmi iwlmvm snd_hda_codec_realtek btusb btrtl btbcm snd_hda_codec_generic snd_soc_dmic btintel ledtrig_audio mac80211 btmtk snd_sof_pci_intel_tgl libarc4 bluetooth snd_sof_intel_hda_common soundwire_intel soundwire_generic_allocation mei_hdcp iwlwifi soundwire_cadence x86_pkg_temp_thermal
> [ 1306.452396]  snd_sof_intel_hda intel_powerclamp snd_sof_pci snd_sof_xtensa_dsp joydev jitterentropy_rng pmt_telemetry coretemp intel_rapl_msr snd_sof pmt_class drbg cfg80211 snd_sof_utils ansi_cprng snd_soc_hdac_hda snd_hda_ext_core kvm_intel snd_soc_acpi_intel_match hp_wmi ecdh_generic snd_soc_acpi platform_profile kvm snd_soc_core uvcvideo snd_compress soundwire_bus snd_hda_intel snd_usb_audio snd_intel_dspcfg snd_intel_sdw_acpi snd_hda_codec videobuf2_vmalloc irqbypass videobuf2_memops snd_usbmidi_lib snd_hda_core videobuf2_v4l2 rapl snd_rawmidi videobuf2_common snd_hwdep intel_cstate snd_seq_device intel_uncore snd_pcm videodev iTCO_wdt pcspkr snd_timer intel_pmc_bxt wmi_bmof mei_me iTCO_vendor_support snd watchdog ee1004 mei mc soundcore ecc rfkill processor_thermal_device_pci_legacy processor_thermal_device ucsi_acpi processor_thermal_rfim typec_ucsi processor_thermal_mbox roles processor_thermal_rapl intel_rapl_common int3403_thermal intel_vsec intel_soc_dts_iosf igen6_edac typec
> [ 1306.452453]  int340x_thermal_zone intel_hid int3400_thermal sparse_keymap acpi_thermal_rel intel_pmc_core acpi_pad ac hid_logitech_dj hid_multitouch serio_raw evdev nfsd auth_rpcgss nfs_acl lockd msr parport_pc grace ppdev lp parport sunrpc fuse loop dm_mod efi_pstore configfs ip_tables x_tables autofs4 usbhid ext4 crc16 mbcache jbd2 btrfs blake2b_generic zstd_compress efivarfs raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c crc32c_generic raid1 raid0 multipath linear md_mod i915 drm_buddy nvme i2c_algo_bit nvme_core drm_display_helper t10_pi crc32_pclmul crc32c_intel crc64_rocksoft_generic cec hid_generic crc64_rocksoft rc_core crc_t10dif xhci_pci rtsx_pci_sdmmc ghash_clmulni_intel crct10dif_generic xhci_hcd r8169 ttm crct10dif_pclmul mmc_core usbcore drm_kms_helper aesni_intel crc64 realtek intel_lpss_pci i2c_i801 i2c_hid_acpi mdio_devres i2c_hid intel_lpss crypto_simd drm cryptd libphy rtsx_pci i2c_smbus crct10dif_common idma64 vmd
> [ 1306.452524]  usb_common battery hid video wmi button sha512_ssse3 sha512_generic
> [ 1306.452530] CPU: 6 PID: 1146 Comm: Xorg Tainted: G        W  OE      6.1.0-9-amd64 #1  Debian 6.1.27-1
> [ 1306.452534] Hardware name: HP HP ProBook 450 G8 Notebook PC/87E1, BIOS T70 Ver. 01.13.01 03/30/2023
> [ 1306.452535] RIP: 0010:kernel_fpu_begin_mask+0xc4/0xe0
> [ 1306.452539] Code: 37 48 83 c4 10 5b c3 cc cc cc cc 48 8b 07 f6 c4 40 75 af f0 80 4f 01 40 48 81 c7 80 15 00 00 e8 b2 fe ff ff eb 9c db e3 eb c7 <0f> 0b e9 68 ff ff ff 0f 0b e9 70 ff ff ff e8 b9 53 9e 00 66 0f 1f
> [ 1306.452542] RSP: 0018:fffffe00001759f8 EFLAGS: 00010046
> [ 1306.452544] RAX: 0000000000000000 RBX: 0000000000000003 RCX: ffff9ed6401c4000
> [ 1306.452546] RDX: 0000000080110004 RSI: ffff9ed6401c4fe0 RDI: 0000000000000001
> [ 1306.452548] RBP: 0000000000000007 R08: ffff9ed652599000 R09: 0000000000000001
> [ 1306.452549] R10: abcc77118461cefd R11: fffffe0000175bbf R12: fffffe0000175a68
> [ 1306.452551] R13: fffffe0000175a70 R14: fffffe0000175a78 R15: 0000000000000000
> [ 1306.452553] FS:  00007fd13533ba80(0000) GS:ffff9eddcf980000(0000) knlGS:0000000000000000
> [ 1306.452556] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> [ 1306.452558] CR2: 00007ff98cf90000 CR3: 00000001091c4003 CR4: 0000000000770ee0
> [ 1306.452560] PKRU: 55555554
> [ 1306.452561] Call Trace:
> [ 1306.452562]  <#MC>
> [ 1306.452564]  virt_efi_query_variable_info_nonblocking+0x5e/0x130
> [ 1306.452569]  efi_query_variable_store+0x1a2/0x1e0
> [ 1306.452576]  efivar_set_variable_locked+0x9f/0xf0
> [ 1306.452584]  efi_pstore_write+0x152/0x1a0 [efi_pstore]
> [ 1306.452596]  ? pstore_dump+0x174/0x370
> [ 1306.452604]  pstore_dump+0x174/0x370
> [ 1306.452611]  kmsg_dump+0x43/0x60
> [ 1306.452614]  panic+0x186/0x2ed
> [ 1306.452620]  ? irq_work_queue+0x35/0x50
> [ 1306.452626]  mce_panic+0x113/0x1d0
> [ 1306.452630]  mce_timed_out+0x70/0xb0
> [ 1306.452634]  mce_start+0x98/0x130
> [ 1306.452638]  do_machine_check+0x68b/0x880
> [ 1306.452643]  ? fwtable_read32+0x79/0x220 [i915]
> [ 1306.452761]  exc_machine_check+0x70/0xb0
> [ 1306.452765]  asm_exc_machine_check+0x1a/0x40
> [ 1306.452771] RIP: 0010:fwtable_read32+0x79/0x220 [i915]
> [ 1306.452879] Code: 8b 43 08 8b 90 54 1a 00 00 85 d2 0f 85 67 01 00 00 41 8d 84 24 00 00 fc ff 3d ff ff 17 00 0f 87 92 00 00 00 48 03 2b 8b 6d 00 <48> 8b 43 08 8b 80 54 1a 00 00 85 c0 0f 85 66 01 00 00 4c 89 fe 4c
> [ 1306.452881] RSP: 0018:ffffafe9c431b920 EFLAGS: 00000082
> [ 1306.452883] RAX: 0000000000030008 RBX: ffff9ed682f01c20 RCX: 0000012f1323d3ce
> [ 1306.452885] RDX: 0000000000000000 RSI: 0000000000070008 RDI: 0000000000000000
> [ 1306.452887] RBP: 0000000000000000 R08: 0000000000000000 R09: 0000000000000000
> [ 1306.452888] R10: 0000000000000006 R11: 0000000000000001 R12: 0000000000070008
> [ 1306.452890] R13: 0000000000000000 R14: ffff9ed682f01c40 R15: 0000000000000202
> [ 1306.452894]  </#MC>
> [ 1306.452895]  <TASK>
> [ 1306.452896]  __intel_wait_for_register+0x1e1/0x220 [i915]
> [ 1306.453006]  intel_disable_transcoder+0x1b6/0x2c0 [i915]
> [ 1306.453162]  intel_ddi_post_disable+0x374/0x4d0 [i915]
> [ 1306.453302]  intel_encoders_post_disable+0x7b/0x90 [i915]
> [ 1306.453446]  intel_old_crtc_state_disables+0x38/0xa0 [i915]
> [ 1306.453585]  intel_atomic_commit_tail+0x392/0xe30 [i915]
> [ 1306.453725]  intel_atomic_commit+0x34f/0x390 [i915]
> [ 1306.453858]  drm_atomic_commit+0x93/0xc0 [drm]
> [ 1306.453917]  ? drm_plane_get_damage_clips.cold+0x1c/0x1c [drm]
> [ 1306.453970]  drm_atomic_connector_commit_dpms+0xcb/0xf0 [drm]
> [ 1306.454022]  drm_mode_obj_set_property_ioctl+0x193/0x3d0 [drm]
> [ 1306.454077]  ? drm_connector_set_obj_prop+0x90/0x90 [drm]
> [ 1306.454129]  drm_connector_property_set_ioctl+0x39/0x60 [drm]
> [ 1306.454180]  drm_ioctl_kernel+0xc6/0x170 [drm]
> [ 1306.454230]  drm_ioctl+0x22f/0x410 [drm]
> [ 1306.454279]  ? drm_connector_set_obj_prop+0x90/0x90 [drm]
> [ 1306.454330]  __x64_sys_ioctl+0x8d/0xd0
> [ 1306.454334]  do_syscall_64+0x58/0xc0
> [ 1306.454338]  ? fpregs_assert_state_consistent+0x22/0x50
> [ 1306.454342]  ? exit_to_user_mode_prepare+0x139/0x1d0
> [ 1306.454346]  ? syscall_exit_to_user_mode+0x17/0x40
> [ 1306.454351]  ? do_syscall_64+0x67/0xc0
> [ 1306.454353]  ? do_syscall_64+0x67/0xc0
> [ 1306.454355]  ? do_syscall_64+0x67/0xc0
> [ 1306.454358]  ? syscall_exit_to_user_mode+0x17/0x40
> [ 1306.454362]  ? do_syscall_64+0x67/0xc0
> [ 1306.454364]  entry_SYSCALL_64_after_hwframe+0x63/0xcd
> [ 1306.454371] RIP: 0033:0x7fd13591cafb
> [ 1306.454373] Code: 00 48 89 44 24 18 31 c0 48 8d 44 24 60 c7 04 24 10 00 00 00 48 89 44 24 08 48 8d 44 24 20 48 89 44 24 10 b8 10 00 00 00 0f 05 <89> c2 3d 00 f0 ff ff 77 1c 48 8b 44 24 18 64 48 2b 04 25 28 00 00
> [ 1306.454375] RSP: 002b:00007ffe51ce3ae0 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
> [ 1306.454378] RAX: ffffffffffffffda RBX: 0000000000000003 RCX: 00007fd13591cafb
> [ 1306.454380] RDX: 00007ffe51ce3b70 RSI: 00000000c01064ab RDI: 0000000000000010
> [ 1306.454382] RBP: 00007ffe51ce3b70 R08: 0000000000000002 R09: 0000000000000001
> [ 1306.454383] R10: 0000000000000000 R11: 0000000000000246 R12: 00000000c01064ab
> [ 1306.454385] R13: 0000000000000010 R14: 000055cc63c6bbe0 R15: 000055cc65358fd0
> [ 1306.454389]  </TASK>
> [ 1306.454390] ---[ end trace 0000000000000000 ]---

Would you be able to bisect the changes between 6.1.20 and 6.1.27 to
identify the culprit, even though the crash is not instantly
triggerable? Maybe focus on the i915 changes: I stumbled over
a2b6e99d8a62 ("drm/i915: Disable DC states for all commits"), which
was backported to 6.1.23.
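
As a rough sketch only, a bisect limited to the i915 driver could be
started as below. It assumes a local clone of the linux-stable tree
that carries the v6.1.20 and v6.1.27 tags; the function name and the
repository path argument are purely illustrative, and restricting the
search to drivers/gpu/drm/i915 rests on the guess that the regression
lives there.

```shell
#!/bin/sh
# Hypothetical helper: start a bisect between the last known-good and
# the first known-bad stable release, looking only at commits that
# touch the i915 driver.
# $1 = path to a linux-stable clone (assumption, adjust as needed).
start_i915_bisect() {
    repo="$1"
    # Syntax: git bisect start <bad> <good> -- <path>
    git -C "$repo" bisect start v6.1.27 v6.1.20 -- drivers/gpu/drm/i915
}
```

After each step, build and boot the checked-out kernel, then report
the result with "git bisect good" or "git bisect bad". Since the crash
does not trigger instantly, a kernel should only be marked good after
it has run for clearly longer than the crash usually takes to appear.
If the path-limited bisect ends without a convincing result, dropping
the "-- drivers/gpu/drm/i915" part bisects all changes instead.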

Regards,
Salvatore

