Bug#636797: Followup on debian bug #636797
Hi,
I got the following oops on 2.6.32-41 (Linux version 2.6.32-5-amd64
(Debian 2.6.32-41) (ben@decadent.org.uk) (gcc version 4.3.5 (Debian
4.3.5-4) ) #1 SMP Mon Jan 16 16:22:28 UTC 2012). The machine is a
dedicated MySQL database server with a pretty high I/O load.
The oops is hard to read because it was captured via netconsole, but
the crash is a divide by zero error in find_busiest_group, like the
original report. So this doesn't seem to be fixed in -41. Would
upgrading to -45 help?
Regards,
Ronald.
Sep 11 13:24:17 db03 [9141932.763480] divide error: 0000 [#1] SMP
Sep 11 13:24:17 db03 [9141932.763584] last sysfs file:
/sys/devices/platform/host2/session1/target2:0:0/2:0:0:5/state
Sep 11 13:24:17 db03 [9141932.763673] CPU 4
Sep 11 13:24:17 db03 [9141932.763701] Modules linked in: netconsole
configfs btrfs zlib_deflate libcrc32c ufs qnx4 hfsplus hfs minix ntfs
vfat msdos fat jfs xfs exportfs reiserfs ext2 ext4 jbd2 crc16 sd_mod
crc_t10dif crc32c ib_iser rdma_cm ib_cm iw_cm ib_sa ib_mad ib_core
ib_addr iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi
ipmi_devintf acpi_cpufreq loop snd_pcm snd_timer ipmi_si radeon hpilo
ttm drm_kms_helper ipmi_msghandler drm i2c_algo_bit i2c_core snd
soundcore snd_page_alloc hpwdt pcspkr psmouse serio_raw container evdev
power_meter processor button ext3 jbd mbcache dm_mod sg usbhid sr_mod
hid cdrom ata_generic uhci_hcd ata_piix ehci_hcd hpsa libata usbcore
nls_base thermal cciss bnx2 e1000e scsi_mod thermal_sys
[last unloaded: scsi_wait_scan]
Sep 11 13:24:17 db03 [9141932.764377] Pid: 16396, comm: mysqld
Tainted: G W 2.6.32-5-amd64 #1 ProLiant DL360 G6
Sep 11 13:24:17 db03 [9141932.764428] RIP: 0010:[<ffffffff810453e8>]
[<ffffffff810453e8>] find_busiest_group+0x97a/0xa4e
Sep 11 13:24:17 db03 [9141932.764492] RSP: 0018:ffff880073b29b88
EFLAGS: 00010006
Sep 11 13:24:17 db03 [9141932.764524] RAX: 0000000000100000 RBX:
0000000000100000 RCX: 0000000000000000
Sep 11 13:24:17 db03 [9141932.764592] RDX: 0000000000000000 RSI:
0000000000000400 RDI: 0000000000000000
Sep 11 13:24:17 db03 [9141932.764661] RBP: 0000000000000400 R08:
000000000000000a R09: ffffffff813c71cc
Sep 11 13:24:17 db03 [9141932.764711] R10: 00007f43e1e99200 R11:
ffff880073b29ec8 R12: ffff88000548fa20
Sep 11 13:24:17 db03 [9141932.764760] R13: ffff88000548fae0 R14:
0000000000015700 R15: 0000000000000000
Sep 11 13:24:17 db03 [9141932.764810] FS: 00007f43dd757700(0000)
GS:ffff880005480000(0000) knlGS:0000000000000000
Sep 11 13:24:17 db03 [9141932.764861] CS: 0010 DS: 0000 ES: 0000 CR0:
0000000080050033
Sep 11 13:24:17 db03 [9141932.764892] CR2: 00007f4426793000 CR3:
000000011c1ee000 CR4: 00000000000006e0
Sep 11 13:24:17 db03 [9141932.764942] DR0: 0000000000000000 DR1:
0000000000000000 DR2: 0000000000000000
Sep 11 13:24:17 db03 [9141932.764991] DR3: 0000000000000000 DR6:
00000000ffff0ff0 DR7: 0000000000000400
Sep 11 13:24:17 db03 [9141932.765041] Process mysqld (pid: 16396,
threadinfo ffff880073b28000, task ffff88011bd2dbd0)
Sep 11 13:24:17 db03 [9141932.765092] Stack:
Sep 11 13:24:17 db03 [9141932.765115] 0000000000015788 0000000000015780
0000000000000008 0000000000015780
Sep 11 13:24:17 db03 [9141932.765162] <0> 0000000000015780
0000000000015780 0000000000000000 ffff88000544fbf0
Sep 11 13:24:17 db03 [9141932.765230] <0> 0000000000000400
000000033b4196b3 0000000000000000 ffff88000548f9e0
Sep 11 13:24:17 db03 [9141932.765319] Call Trace:
Sep 11 13:24:17 db03 [9141932.765349] [<ffffffff812fada6>] ?
schedule+0x2b3/0x7b4
Sep 11 13:24:17 db03 [9141932.765385] [<ffffffff8105ae6e>] ?
__mod_timer+0x141/0x153
Sep 11 13:24:17 db03 [9141932.765422] [<ffffffff8111b9ee>] ?
aio_read_evt+0x26/0xe5
Sep 11 13:24:17 db03 [9141932.765469] [<ffffffff8111cdd3>] ?
sys_io_getevents+0x2aa/0x37f
Sep 11 13:24:17 db03 [9141932.765508] [<ffffffff8104a4cc>] ?
default_wake_function+0x0/0x9
Sep 11 13:24:17 db03 [9141932.765545] [<ffffffff8111bcc4>] ?
timeout_func+0x0/0x10
Sep 11 13:24:17 db03 [9141932.765576] [<ffffffff81010b42>] ?
system_call_fastpath+0x16/0x1b
Sep 11 13:24:17 db03 [9141932.765608] Code: 83 bc 24 2c 01 00 00 00 75 27
48 8b 94 24 b0 00 00 00 48 8b 84 24 10 01 00 00 48 2b 84 24 18 01 00 00
8b 7a 08 31 d2 48 c1 e0 14
Sep 11 13:24:17 db03
Sep 11 13:24:17 db03 f7 f7 48 89 c7 48 8b 94 24 b0 00 00 00 48 89 f0 48
29 c8 48
Sep 11 13:24:17 db03 [9141932.765945] RIP [<ffffffff810453e8>]
find_busiest_group+0x97a/0xa4e
Sep 11 13:24:17 db03 [9141932.765986] RSP <ffff880073b29b88>
Sep 11 13:24:17 db03 [9141932.766272] ---[ end trace b9f3c525f1dad71e ]---
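
Because netconsole hands the oops to syslog in small pieces, a raw capture
like this one arrives with one syslog message per fragment. It can be
rejoined mechanically. The following is only a minimal Python sketch and not
part of the original capture; the function name rejoin_netconsole is made up
for illustration. It assumes that every kernel line starts with a
[seconds.micros] timestamp, that fragments carry the syslog date and
optionally the host name (db03 here), and that a "last message repeated N
times" line stands for N further copies of the preceding fragment.

```python
import re


def rejoin_netconsole(lines, host="db03"):
    """Rejoin syslog-fragmented netconsole output into kernel log records.

    Hypothetical helper: the name and the heuristics are assumptions,
    tuned to the shape of the capture in this report.
    """
    # Syslog prefix such as "Sep 11 13:24:17", optionally followed by the host.
    prefix = re.compile(
        r"^[A-Z][a-z]{2} +\d+ \d\d:\d\d:\d\d(?: %s)?(?: |$)" % re.escape(host)
    )
    # Kernel timestamp such as "[9141932.763480]" marks the start of a record.
    stamp = re.compile(r"^\[\s*\d+\.\d+\]")
    # Syslog duplicate suppression (assumed: N further copies of the last one).
    repeat = re.compile(r"last message repeated (\d+) times$")

    records = []
    last = None  # most recent syslog fragment appended to the current record
    for raw in lines:
        line = raw.rstrip("\n")
        m = prefix.match(line)
        payload = (line[m.end():] if m else line).strip()
        if not payload:
            continue  # empty fragment (typically a bare newline)
        if m and stamp.match(payload):
            records.append([payload])  # a new kernel log line begins
            last = None
            continue
        if not records:
            records.append([])
        r = repeat.search(payload)
        if r and last is not None:
            records[-1].extend([last] * int(r.group(1)))
            continue
        # Lines without a syslog prefix are treated as mail-wrapped text.
        records[-1].append(payload)
        last = payload if m else None
    return [" ".join(parts) for parts in records]


if __name__ == "__main__":
    import sys

    for record in rejoin_netconsole(sys.stdin):
        print(record)
```

Saved as a script, it reads the raw capture on standard input and prints one
kernel log line per record. Empty fragments are dropped, so a byte that was
lost in capture stays lost and has to be checked by hand.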