kernel BUG at mm/rmap.c:438 on fedora core 2
I got a server running fedora core 2 and last night it crashed and had to be rebooted. This is the last entry in the syslog:=======================================
Oct 23 02:44:00 server kernel: ------------[ cut here ]------------
Oct 23 02:44:00 server kernel: kernel BUG at mm/rmap.c:438!
Oct 23 02:44:00 server kernel: invalid operand: 0000 [#1]
Oct 23 02:44:00 server kernel: SMP
Oct 23 02:44:00 server kernel: Modules linked in: iptable_filter ip_tables md5 ipv6 parport_pc lp parport autofs4 sunrpc
r8169 floppy sg microcode dm_mod uhci_hcd ehci_hcd video button battery ac ext3 jbd ata_piix libata sd_mod scsi_mod
Oct 23 02:44:00 server kernel: CPU: 0
Oct 23 02:44:00 server kernel: EIP: 0060:[<c014a80a>] Not tainted VLI
Oct 23 02:44:00 server kernel: EFLAGS: 00010202 (2.6.10-1.771_FC2smp)
Oct 23 02:44:00 server kernel: EIP is at page_add_anon_rmap+0xe/0x66
Oct 23 02:44:00 server kernel: eax: 20000804 ebx: c11c7080 ecx: 0c75a000 edx: d6a9e284
Oct 23 02:44:00 server kernel: esi: f54b329c edi: ce205ad0 ebp: c11c7080 esp: dcdb4e48
Oct 23 02:44:00 server kernel: ds: 007b es: 007b ss: 0068
Oct 23 02:44:00 server kernel: Process httpd (pid: 24794, threadinfo=dcdb4000 task=f5669540)
Oct 23 02:44:00 server kernel: Stack: 0e384067 00000000 c01460ca ce384000 0e384025 00000000 d6a9e284 f54fc300
Oct 23 02:44:00 server kernel: 00000000 ce205ad0 00000000 0c75a000 c0146162 dad04318 00000001 0c75a000
Oct 23 02:44:00 server kernel: 0000009d 00000000 00000000 00000000 00000000 00000000 0c75a000 d6a9e284
Oct 23 02:44:00 server kernel: Call Trace:
Oct 23 02:44:00 server kernel: [<c01460ca>] do_anonymous_page+0x19c/0x1cb
Oct 23 02:44:00 server kernel: [<c0146162>] do_no_page+0x69/0x2d4
Oct 23 02:44:00 server kernel: [<c0146540>] handle_mm_fault+0xbe/0x176
Oct 23 02:44:00 server kernel: [<c0116f60>] do_page_fault+0x19d/0x593
Oct 23 02:44:00 server kernel: [<c012ecee>] autoremove_wake_function+0x0/0x2d
Oct 23 02:44:00 server kernel: [<c017bfdf>] dnotify_parent+0x1b/0x6c
Oct 23 02:44:00 server kernel: [<c0152bb3>] vfs_read+0xdc/0xe4
Oct 23 02:44:00 server kernel: [<c0152da4>] sys_read+0x3c/0x62
Oct 23 02:44:00 server kernel: [<c0116dc3>] do_page_fault+0x0/0x593
Oct 23 02:44:00 server kernel: [<c01047e7>] error_code+0x2b/0x30
Oct 23 02:44:00 server kernel: Code: 7b 10 00 74 0b 89 ca 89 d8 e8 ee fe ff ff 01 c7 89 d8 e8 32 e5 fe ff 5b 89 f8 5e 5f
c3 56 53 89 c3 8b 72 44 8b 00 f6 c4 08 74 08 <0f> 0b b6 01 80 6a 2d c0 85 f6 75 08 0f 0b b7 01 80 6a 2d c0 8b
Oct 23 02:44:00 server kernel: <3>Debug: sleeping function called from invalid context at include/linux/rwsem.h:43
Oct 23 02:44:00 server kernel: in_atomic():1, irqs_disabled():0
Oct 23 02:44:00 server kernel: [<c011bd64>] __might_sleep+0x80/0x8a
Oct 23 02:44:00 server kernel: [<c011e8a0>] profile_task_exit+0x18/0x43
Oct 23 02:44:00 server kernel: [<c0120170>] do_exit+0x17/0x3e9
Oct 23 02:44:00 server kernel: [<c0104e42>] do_trap+0x0/0xa2
Oct 23 02:44:00 server kernel: [<c0104eb7>] do_trap+0x75/0xa2
Oct 23 02:44:00 server kernel: [<c0105032>] do_invalid_op+0x0/0x8b
Oct 23 02:44:00 server kernel: [<c01050b1>] do_invalid_op+0x7f/0x8b
Oct 23 02:44:00 server kernel: [<c014a80a>] page_add_anon_rmap+0xe/0x66
Oct 23 02:44:00 server kernel: [<c026988a>] sock_def_readable+0xf/0x5e
Oct 23 02:44:00 server kernel: [<c013c0d5>] __rmqueue+0xc0/0x10f
Oct 23 02:44:00 server kernel: [<c013c189>] rmqueue_bulk+0x65/0x6f
Oct 23 02:44:00 server kernel: [<c013c534>] buffered_rmqueue+0x185/0x1ad
Oct 23 02:44:00 server kernel: [<c01047e7>] error_code+0x2b/0x30
Oct 23 02:44:00 server kernel: [<c014a80a>] page_add_anon_rmap+0xe/0x66
Oct 23 02:44:00 server kernel: [<c01460ca>] do_anonymous_page+0x19c/0x1cb
Oct 23 02:44:00 server kernel: [<c0146162>] do_no_page+0x69/0x2d4
Oct 23 02:44:00 server kernel: [<c0146540>] handle_mm_fault+0xbe/0x176
Oct 23 02:44:00 server kernel: [<c0116f60>] do_page_fault+0x19d/0x593
Oct 23 02:44:00 server kernel: [<c012ecee>] autoremove_wake_function+0x0/0x2d
Oct 23 02:44:00 server kernel: [<c017bfdf>] dnotify_parent+0x1b/0x6c
Oct 23 02:44:00 server kernel: [<c0152bb3>] vfs_read+0xdc/0xe4
Oct 23 02:44:00 server kernel: [<c0152da4>] sys_read+0x3c/0x62
Oct 23 02:44:00 server kernel: [<c0116dc3>] do_page_fault+0x0/0x593
Oct 23 02:44:00 server kernel: [<c01047e7>] error_code+0x2b/0x30
Oct 23 02:44:00 server kernel: note: httpd[24794] exited with preempt_count 1
=======================================
When checking the cacti stats for the server I could see that both the traffic and load on the server went up high just before the crash.
Any ideas on what the cause is? The hardware, software?
Thanks...