Post new topic   Reply to topic
View previous topic Printable version Log in to check your private messages View next topic
Author Message
LeTuXOffline
Post subject: Kernel Bug (?)  PostPosted: 23.12.2011, 20:07



Joined: 2011-05-15
Posts: 13
Location: Germany
Status: Offline
On my 64bit aptosid 3.1-5.slh.3-aptosid-686, I get black screens reading "KERNEL BUG" every day.
/var/log/messages is quoted below, it doesn't speak of a kernel bug any more.

This happens only when I run video-processing processes, namely a script that sequencially runs mplex, spumux, dvadauthor and HandBrakeCLI. The system hangs randomly in either process. This once also happened half a second after script-start, so I don't think this a temperature problem. The same happens when I start the PC with the previously installed kernel 3.0-6.slh.3-aptosid-686.

When I start with the older kernel 2.6.39-3.slh.1-aptosid-686, the script runs smoothly until I attempt an additional action like logging in from a remote pc via nfs and opening a directory. Then the system just freezes without giving the black screen, but /var/log/messages looks similar.

Below, I give excerpts from /var/log/messages showing different events with different kernels, which are stated in the messages.

I already did memtest without result.

Can anyone tell me what might be wrong?
LeTuX


-----------
      Code:
Dec 20 18:41:04 alexpc kernel: [ 6717.223367] ------------[ cut here ]------------
Dec 20 18:41:04 alexpc kernel: [ 6717.223380] WARNING: at /tmp/buildd/linux-aptosid-3.1/debian/build/source_i386_none/mm/truncate.c:286 truncate_inode_pages_range+0x234/0x27b()
Dec 20 18:41:04 alexpc kernel: [ 6717.223383] Hardware name: GA-MA74GM-S2H
Dec 20 18:41:04 alexpc kernel: [ 6717.223385] Modules linked in: powernow_k8 mperf cpufreq_stats cpufreq_powersave cpufreq_conservative bnep rfcomm ppdev bluetooth rfkill lp fuse nfsd nfs lockd fscache auth_rpcgss nfs_acl sunrpc ext3 jbd dm_crypt snd_hda_codec_hdmi snd_seq snd_hda_codec_realtek radeon ttm snd_hda_intel snd_hda_codec snd_hwdep snd_pcm snd_seq_device snd_timer drm_kms_helper drm i2c_algo_bit shpchp snd soundcore snd_page_alloc parport_pc parport ati_agp sp5100_tco i2c_piix4 k8temp pci_hotplug evdev button pcspkr processor ext4 mbcache jbd2 crc16 dm_mod raid10 raid456 async_raid6_recov async_pq raid6_pq async_xor xor async_memcpy async_tx raid1 raid0 multipath linear md_mod sr_mod cdrom sd_mod crc_t10dif usbhid ata_generic pata_acpi hid pata_atiixp ahci libahci libata ohci_hcd ehci_hcd r8169 mii usbcore scsi_mod ssb mmc_core pcmcia pcmcia_core [last unloaded: scsi_wait_scan]
Dec 20 18:41:04 alexpc kernel: [ 6717.223450] Pid: 2463, comm: mv Not tainted 3.1-5.slh.3-aptosid-686 #1
Dec 20 18:41:04 alexpc kernel: [ 6717.223452] Call Trace:
Dec 20 18:41:04 alexpc kernel: [ 6717.223459]  [<c012cdb4>] ? warn_slowpath_common+0x7c/0x8f
Dec 20 18:41:04 alexpc kernel: [ 6717.223463]  [<c0183408>] ? truncate_inode_pages_range+0x234/0x27b
Dec 20 18:41:04 alexpc kernel: [ 6717.223466]  [<c0183408>] ? truncate_inode_pages_range+0x234/0x27b
Dec 20 18:41:04 alexpc kernel: [ 6717.223469]  [<c012cde2>] ? warn_slowpath_null+0x1b/0x1f
Dec 20 18:41:04 alexpc kernel: [ 6717.223472]  [<c0183408>] ? truncate_inode_pages_range+0x234/0x27b
Dec 20 18:41:04 alexpc kernel: [ 6717.223478]  [<c0183466>] ? truncate_inode_pages+0x17/0x1b
Dec 20 18:41:04 alexpc kernel: [ 6717.223490]  [<f86a2593>] ? ext4_evict_inode+0xd1/0x2a1 [ext4]
Dec 20 18:41:04 alexpc kernel: [ 6717.223494]  [<c01bc9ae>] ? d_delete+0xb6/0xd9
Dec 20 18:41:04 alexpc kernel: [ 6717.223498]  [<c01bf2d5>] ? evict+0x82/0x121
Dec 20 18:41:04 alexpc kernel: [ 6717.223502]  [<c01b867c>] ? do_unlinkat+0xca/0x107
Dec 20 18:41:04 alexpc kernel: [ 6717.223506]  [<c01d3f22>] ? fsnotify_find_inode_mark_locked+0xe/0x36
Dec 20 18:41:04 alexpc kernel: [ 6717.223509]  [<c01d4a4c>] ? dnotify_flush+0x27/0x9d
Dec 20 18:41:04 alexpc kernel: [ 6717.223514]  [<c01ad533>] ? filp_close+0x54/0x5b
Dec 20 18:41:04 alexpc kernel: [ 6717.223517]  [<c01ad59c>] ? sys_close+0x62/0x9b
Dec 20 18:41:04 alexpc kernel: [ 6717.223521]  [<c039d89f>] ? sysenter_do_call+0x12/0x28
Dec 20 18:41:04 alexpc kernel: [ 6717.223524] ---[ end trace 9735c6c19f55e03d ]---
Dec 20 19:01:09 alexpc kernel: [ 7922.640544] ------------[ cut here ]------------



      Code:
Dec 20 19:01:09 alexpc kernel: [ 7922.640544] ------------[ cut here ]------------
Dec 20 19:01:09 alexpc kernel: [ 7922.640556] WARNING: at /tmp/buildd/linux-aptosid-3.1/debian/build/source_i386_none/mm/truncate.c:286 truncate_inode_pages_range+0x234/0x27b()
Dec 20 19:01:09 alexpc kernel: [ 7922.640559] Hardware name: GA-MA74GM-S2H
Dec 20 19:01:09 alexpc kernel: [ 7922.640561] Modules linked in: powernow_k8 mperf cpufreq_stats cpufreq_powersave cpufreq_conservative bnep rfcomm ppdev bluetooth rfkill lp fuse nfsd nfs lockd fscache auth_rpcgss nfs_acl sunrpc ext3 jbd dm_crypt snd_hda_codec_hdmi snd_seq snd_hda_codec_realtek radeon ttm snd_hda_intel snd_hda_codec snd_hwdep snd_pcm snd_seq_device snd_timer drm_kms_helper drm i2c_algo_bit shpchp snd soundcore snd_page_alloc parport_pc parport ati_agp sp5100_tco i2c_piix4 k8temp pci_hotplug evdev button pcspkr processor ext4 mbcache jbd2 crc16 dm_mod raid10 raid456 async_raid6_recov async_pq raid6_pq async_xor xor async_memcpy async_tx raid1 raid0 multipath linear md_mod sr_mod cdrom sd_mod crc_t10dif usbhid ata_generic pata_acpi hid pata_atiixp ahci libahci libata ohci_hcd ehci_hcd r8169 mii usbcore scsi_mod ssb mmc_core pcmcia pcmcia_core [last unloaded: scsi_wait_scan]
Dec 20 19:01:09 alexpc kernel: [ 7922.640628] Pid: 2463, comm: mv Tainted: G        W    3.1-5.slh.3-aptosid-686 #1
Dec 20 19:01:09 alexpc kernel: [ 7922.640630] Call Trace:
Dec 20 19:01:09 alexpc kernel: [ 7922.640636]  [<c012cdb4>] ? warn_slowpath_common+0x7c/0x8f
Dec 20 19:01:09 alexpc kernel: [ 7922.640640]  [<c0183408>] ? truncate_inode_pages_range+0x234/0x27b
Dec 20 19:01:09 alexpc kernel: [ 7922.640643]  [<c0183408>] ? truncate_inode_pages_range+0x234/0x27b
Dec 20 19:01:09 alexpc kernel: [ 7922.640646]  [<c012cde2>] ? warn_slowpath_null+0x1b/0x1f
Dec 20 19:01:09 alexpc kernel: [ 7922.640649]  [<c0183408>] ? truncate_inode_pages_range+0x234/0x27b
Dec 20 19:01:09 alexpc kernel: [ 7922.640655]  [<c0183466>] ? truncate_inode_pages+0x17/0x1b
Dec 20 19:01:09 alexpc kernel: [ 7922.640668]  [<f86a2593>] ? ext4_evict_inode+0xd1/0x2a1 [ext4]
Dec 20 19:01:09 alexpc kernel: [ 7922.640671]  [<c01bc9ae>] ? d_delete+0xb6/0xd9
Dec 20 19:01:09 alexpc kernel: [ 7922.640675]  [<c01bf2d5>] ? evict+0x82/0x121
Dec 20 19:01:09 alexpc kernel: [ 7922.640679]  [<c01b867c>] ? do_unlinkat+0xca/0x107
Dec 20 19:01:09 alexpc kernel: [ 7922.640683]  [<c01d3f22>] ? fsnotify_find_inode_mark_locked+0xe/0x36
Dec 20 19:01:09 alexpc kernel: [ 7922.640687]  [<c01d4a4c>] ? dnotify_flush+0x27/0x9d
Dec 20 19:01:09 alexpc kernel: [ 7922.640691]  [<c01ad533>] ? filp_close+0x54/0x5b
Dec 20 19:01:09 alexpc kernel: [ 7922.640693]  [<c01ad59c>] ? sys_close+0x62/0x9b
Dec 20 19:01:09 alexpc kernel: [ 7922.640698]  [<c039d89f>] ? sysenter_do_call+0x12/0x28
Dec 20 19:01:09 alexpc kernel: [ 7922.640700] ---[ end trace 9735c6c19f55e03e ]---
Dec 20 19:08:46 alexpc kernel: [ 8378.916915] ------------[ cut here ]------------





      Code:
Dec 21 12:33:35 alexpc kernel: [ 1583.838586] Pid: 3271, comm: java Not tainted 3.0-6.slh.3-aptosid-686 #1
Dec 21 12:33:35 alexpc kernel: [ 1583.838591] Call Trace:
Dec 21 12:33:35 alexpc kernel: [ 1583.838608]  [<c01a54bd>] ? bad_page+0x8d/0xe0
Dec 21 12:33:35 alexpc kernel: [ 1583.838616]  [<c01a5602>] ? free_pages_prepare+0xf2/0x100
Dec 21 12:33:35 alexpc kernel: [ 1583.838624]  [<c01a6d28>] ? free_hot_cold_page+0x28/0x140
Dec 21 12:33:35 alexpc kernel: [ 1583.838632]  [<c01a701f>] ? __pagevec_free+0x1f/0x30
Dec 21 12:33:35 alexpc kernel: [ 1583.838640]  [<c01a98db>] ? release_pages+0x13b/0x1f0
Dec 21 12:33:35 alexpc kernel: [ 1583.838650]  [<c01c71fb>] ? free_pages_and_swap_cache+0x7b/0x90
Dec 21 12:33:35 alexpc kernel: [ 1583.838660]  [<c01b8363>] ? tlb_flush_mmu+0x53/0x80
Dec 21 12:33:35 alexpc kernel: [ 1583.838667]  [<c01b8399>] ? tlb_finish_mmu+0x9/0x40
Dec 21 12:33:35 alexpc kernel: [ 1583.838673]  [<c01bdc1d>] ? unmap_region+0xcd/0xe0
Dec 21 12:33:35 alexpc kernel: [ 1583.838681]  [<c01beb33>] ? do_munmap+0x223/0x2a0
Dec 21 12:33:35 alexpc kernel: [ 1583.838687]  [<c01bef4f>] ? sys_brk+0xef/0x100
Dec 21 12:33:35 alexpc kernel: [ 1583.838695]  [<c044ba98>] ? sysenter_do_call+0x12/0x28
Dec 21 12:33:35 alexpc kernel: [ 1583.838700] Disabling lock debugging due to kernel taint





      Code:
Dec 23 13:10:54 alexpc kernel: [  330.236543] Pid: 331, comm: kswapd0 Not tainted 2.6.39-3.slh.1-aptosid-686 #1
Dec 23 13:10:54 alexpc kernel: [  330.236549] Call Trace:
Dec 23 13:10:54 alexpc kernel: [  330.236563]  [<c01a1f3b>] ? bad_page+0x8b/0xd0
Dec 23 13:10:54 alexpc kernel: [  330.236571]  [<c01a2072>] ? free_pages_prepare+0xf2/0x100
Dec 23 13:10:54 alexpc kernel: [  330.236579]  [<c01a3828>] ? free_hot_cold_page+0x28/0x140
Dec 23 13:10:54 alexpc kernel: [  330.236586]  [<c01a3b1f>] ? __pagevec_free+0x1f/0x30
Dec 23 13:10:54 alexpc kernel: [  330.236593]  [<c01a76c8>] ? free_page_list+0x68/0xa0
Dec 23 13:10:54 alexpc kernel: [  330.236602]  [<c01a8790>] ? shrink_page_list+0x120/0x710
Dec 23 13:10:54 alexpc kernel: [  330.236610]  [<c01a7985>] ? update_isolated_counts.isra.46+0x135/0x160
Dec 23 13:10:54 alexpc kernel: [  330.236618]  [<c01a90e8>] ? shrink_inactive_list+0x148/0x2d0
Dec 23 13:10:54 alexpc kernel: [  330.236626]  [<c01a9706>] ? shrink_zone+0x496/0x550
Dec 23 13:10:54 alexpc kernel: [  330.236639]  [<c01a9cdf>] ? kswapd+0x51f/0x720
Dec 23 13:10:54 alexpc kernel: [  330.236647]  [<c01a97c0>] ? shrink_zone+0x550/0x550
Dec 23 13:10:54 alexpc kernel: [  330.236655]  [<c0151dc9>] ? kthread+0x69/0x70
Dec 23 13:10:54 alexpc kernel: [  330.236661]  [<c0151d60>] ? kthread_worker_fn+0x150/0x150
Dec 23 13:10:54 alexpc kernel: [  330.236672]  [<c0437bb6>] ? kernel_thread_helper+0x6/0xd
Dec 23 13:10:54 alexpc kernel: [  330.236676] Disabling lock debugging due to kernel taint



      Code:
Dec 23 13:58:06 alexpc kernel: [  258.081944] Modules linked in: powernow_k8 mperf cpufreq_stats cpufreq_powersave cpufreq_conservative bnep rfcomm bluetooth rfkill ppdev lp fuse nfsd nfs lockd fscache auth_rpcgss nfs_acl sunrpc ext3 jbd dm_crypt snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_intel snd_hda_codec snd_hwdep snd_pcm snd_seq snd_timer snd_seq_device radeon sp5100_tco ttm i2c_piix4 snd drm_kms_helper drm i2c_algo_bit soundcore shpchp pci_hotplug ati_agp k8temp parport_pc evdev pcspkr parport snd_page_alloc button processor ext4 mbcache jbd2 crc16 dm_mod raid10 raid456 async_raid6_recov async_pq raid6_pq async_xor xor async_memcpy async_tx raid1 raid0 multipath linear md_mod sr_mod cdrom sd_mod crc_t10dif usbhid ata_generic hid pata_acpi ahci ohci_hcd libahci pata_atiixp libata ehci_hcd scsi_mod r8169 mii usbcore ssb mmc_core pcmcia pcmcia_core [last unloaded: scsi_wait_scan]
Dec 23 13:58:06 alexpc kernel: [  258.082575]
Dec 23 13:58:06 alexpc kernel: [  258.082575] Pid: 2456, comm: flush-8:64 Not tainted 3.1-5.slh.3-aptosid-686 #1 Gigabyte Technology Co., Ltd. GA-MA74GM-S2H/GA-MA74GM-S2H
Dec 23 13:58:06 alexpc kernel: [  258.082575] EIP: 0060:[<f86ff2b5>] EFLAGS: 00010246 CPU: 1
Dec 23 13:58:06 alexpc kernel: [  258.082575] EIP is at mpage_da_submit_io+0x188/0x3a4 [ext4]
Dec 23 13:58:06 alexpc kernel: [  258.082575] EAX: 5e00083c EBX: 00000000 ECX: 00000000 EDX: 00000000
Dec 23 13:58:06 alexpc kernel: [  258.082575] ESI: f618b3e0 EDI: f25a3ce0 EBP: f25a3e5c ESP: f25a3c34
Dec 23 13:58:06 alexpc kernel: [  258.082575]  DS: 007b ES: 007b FS: 00d8 GS: 00e0 SS: 0068
Dec 23 13:58:06 alexpc kernel: [  258.082575]  0000000e f73a2c00 e4d09558 00155734 00000000 00005f34 00005f34 00000000
Dec 23 13:58:06 alexpc kernel: [  258.082575]  f25a3d64 e4d09558 00001000 00000000 00000000 f8720530 00005f34 0000000e
Dec 23 13:58:06 alexpc kernel: [  258.082575]  00007358 00000000 e4d09620 00000000 00000000 0000000e 00000000 f618b3e0
Dec 23 13:58:06 alexpc kernel: [  258.082575]  [<f8700e18>] ? mpage_da_map_and_submit+0x396/0x3a8 [ext4]
Dec 23 13:58:06 alexpc kernel: [  258.082575]  [<c0236879>] ? __lookup_tag+0x81/0xd9
Dec 23 13:58:06 alexpc kernel: [  258.082575]  [<c0236e3c>] ? radix_tree_gang_lookup_tag_slot+0x79/0x93
Dec 23 13:58:06 alexpc kernel: [  258.082575]  [<c017a6ef>] ? find_get_pages_tag+0x9e/0xb6
Dec 23 13:58:06 alexpc kernel: [  258.082575]  [<f8700ffc>] ? write_cache_pages_da+0x109/0x293 [ext4]
Dec 23 13:58:06 alexpc kernel: [  258.082575]  [<f87013ae>] ? ext4_da_writepages+0x228/0x33c [ext4]
Dec 23 13:58:06 alexpc kernel: [  258.082575]  [<c0181a11>] ? do_writepages+0x12/0x1b
Dec 23 13:58:06 alexpc kernel: [  258.082575]  [<c01c67f7>] ? writeback_single_inode+0xb9/0x1ea
Dec 23 13:58:06 alexpc kernel: [  258.082575]  [<c01c6b9a>] ? writeback_sb_inodes+0x12f/0x1b8
Dec 23 13:58:06 alexpc kernel: [  258.082575]  [<c01c6c76>] ? __writeback_inodes_wb+0x53/0x84
Dec 23 13:58:06 alexpc kernel: [  258.082575]  [<c01c6d69>] ? wb_writeback+0xc2/0x137
Dec 23 13:58:06 alexpc kernel: [  258.082575]  [<c01810a5>] ? determine_dirtyable_memory+0x31/0x43
Dec 23 13:58:06 alexpc kernel: [  258.082575]  [<c01c70a7>] ? wb_do_writeback+0x120/0x131
Dec 23 13:58:06 alexpc kernel: [  258.082575]  [<c01c70ff>] ? bdi_writeback_thread+0x47/0x101
Dec 23 13:58:06 alexpc kernel: [  258.082575]  [<c01c70b8>] ? wb_do_writeback+0x131/0x131
Dec 23 13:58:06 alexpc kernel: [  258.082575]  [<c013fb8f>] ? kthread+0x63/0x68
Dec 23 13:58:06 alexpc kernel: [  258.082575]  [<c013fb2c>] ? kthread_worker_fn+0x10d/0x10d
Dec 23 13:58:06 alexpc kernel: [  258.082575]  [<c039de3e>] ? kernel_thread_helper+0x6/0xd
Dec 23 13:58:06 alexpc kernel: [  258.120141] ---[ end trace 909122826bd02045 ]---
 
 View user's profile Send private message  
Reply with quote Back to top
slhOffline
Post subject: RE: Kernel Bug (?)  PostPosted: 23.12.2011, 20:55



Joined: 2010-08-25
Posts: 724

Status: Offline
The second warning is of no use, given that it is just a follow up to a previous warning and doesn't allow to tell anything on its own (as the kernel has already gone south before).

That said, while all warning are roughly vfs related, they still point at multiple different code areas, which raises the suspicion of hardware problems. I would do a filesystem check of all involved filesystems first and then keep a close look on potential hardware problems - a RAM check (running memtest86+ over night must not produce any errors) is also recommended.
 
 View user's profile Send private message  
Reply with quote Back to top
DeepDayzeOffline
Post subject: RE: Kernel Bug (?)  PostPosted: 23.12.2011, 23:02



Joined: 2010-09-11
Posts: 616
Location: USA
Status: Offline
Don't vfs issues indicate possible disk or RAM problems, as those should indeed be the first places to check for hardware faults
 
 View user's profile Send private message  
Reply with quote Back to top
Display posts from previous:     
Jump to:  
All times are GMT - 12 Hours
Post new topic   Reply to topic
View previous topic Printable version Log in to check your private messages View next topic
Powered by Zafenio