
All times are UTC - 6 hours





PostPosted: Mon Mar 13, 2006 10:10 pm 
Joined: Tue Jan 18, 2005 2:07 am
Posts: 1532
Location: California
I recently upgraded to R5A30.2, moved to new hardware, and added a high-definition tuner (DCT-6200) to my myth setup, and I am now having stability problems. Periodically I find that the entire system is hung: I can't even log in remotely via "putty", and I am forced to hit the reset button. After a recent hang I looked in /var/log/messages and saw the following:
Quote:
Mar 12 11:50:49 mythhd -- MARK --
Mar 12 12:10:49 mythhd -- MARK --
Mar 12 12:30:49 mythhd -- MARK --
Mar 12 15:42:53 mythhd syslogd 1.4.1#16: restart.
Mar 12 15:42:53 mythhd kernel: klogd 1.4.1#16, log source = /proc/kmsg started.

From this I conclude that the system died somewhere around 12:30; I believe the entry at 15:42 was a result of my resetting the system. After doing some more grepping of various logs I have found the following error messages:

Quote:
kern.log:Mar 12 21:56:46 mythhd kernel: APIC error on CPU0: 40(40)
Mar 13 19:53:29 mythhd kernel: hdb: drive_cmd: status=0x51 { DriveReady SeekComplete Error }
Mar 13 19:53:29 mythhd kernel: hdb: drive_cmd: error=0x04 { DriveStatusError }


Note: I mis-copied the "drive_cmd" messages in my original posting -- I've updated them to reflect the two different messages I am getting.

The APIC error is repeated many times. The drive_cmd error messages each occur only once per boot.
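
The hang window read off /var/log/messages earlier in this post can also be found mechanically. The following is only a rough sketch, not something I actually ran: the function name "find_log_gaps" and the 1500-second default threshold are assumptions (the MARK lines quoted above are 20 minutes apart, so anything much longer is suspicious). It also assumes all lines fall in the same month, since it only looks at the day and time fields.

```shell
# find_log_gaps FILE [MAX_GAP_SECONDS]
# Print every place where two consecutive syslog lines are further apart
# than MAX_GAP_SECONDS (default 1500, an assumed value just above the
# 20-minute "-- MARK --" interval seen in the quoted log).
# Assumption: all lines are from the same month; only the day-of-month
# and HH:MM:SS fields are compared.
find_log_gaps() {
    awk -v max="${2:-1500}" '
    {
        split($3, t, ":")
        now = $2 * 86400 + t[1] * 3600 + t[2] * 60 + t[3]
        stamp = $1 " " $2 " " $3
        if (NR > 1 && now - prev > max)
            printf "gap of %d s between \"%s\" and \"%s\"\n", now - prev, prev_stamp, stamp
        prev = now
        prev_stamp = stamp
    }' "$1"
}

# Example call (not executed here):
#   find_log_gaps /var/log/messages
```

A reported gap that ends at a "syslogd ... restart" line is a good hint that the machine was dead for that stretch and then rebooted.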

Does anyone have any experience with these error messages? Do the APIC or "drive_cmd" error messages indicate a root cause for the hang? I ran a quick test of both hard drives (hda and hdb), and both came up clean.

Thanks for any help you can provide.

Marc


PostPosted: Tue Mar 14, 2006 12:43 am 
A brief update. I've made two changes to my configuration and the "drive_cmd" messages no longer appear during boot:

1. I moved the "hdb" drive so that it is now "hdd". My configuration is now "hda = boot drive; hdb = unused; hdc = DVD burner; hdd = second drive".

2. I changed my hdparm commands in bootmisc.sh from
Code:
/sbin/hdparm -c1 -d1 -u1 /dev/hda
/sbin/hdparm -c1 -d1 -u1 /dev/hdd

to
Code:
/sbin/hdparm -c1 -d1 -u0 /dev/hda
/sbin/hdparm -c1 -d1 -u0 /dev/hdd


i.e., I now set the "interrupt unmask" flag (-u) to 0. The drive_cmd error messages are no longer produced at boot. I will need to run for a few days to see whether this has any impact on my stability.
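
For anyone copying this, the two hdparm lines can be wrapped so that a missing drive is skipped instead of producing an error at boot. This is only a sketch: the function name "apply_hdparm" and the "HDPARM" override variable are made up for illustration, and the flags are simply the ones from the post above (-c1 32-bit I/O on, -d1 DMA on, -u0 interrupt unmasking off).

```shell
# apply_hdparm DEVICE...
# Apply the settings from the post (-c1 -d1 -u0) to each listed device,
# skipping anything that is not a block device.
# HDPARM is a hypothetical override so the function can be dry-run,
# e.g. HDPARM=echo apply_hdparm /dev/hda
apply_hdparm() {
    hdparm_bin="${HDPARM:-/sbin/hdparm}"
    for dev in "$@"; do
        if [ ! -b "$dev" ]; then
            echo "skipping $dev: not a block device" >&2
            continue
        fi
        "$hdparm_bin" -c1 -d1 -u0 "$dev"
    done
}

# Example call for the drive layout described above (not executed here):
#   apply_hdparm /dev/hda /dev/hdd
```

Running hdparm with a flag but no value (for example "/sbin/hdparm -u /dev/hda") reports the current setting, which is a quick way to confirm the change took effect.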

Marc


PostPosted: Sat Mar 18, 2006 1:01 am 
Posting the solution in case anyone has the same problem and finds this thread on a search. I did the following:

1. Edited "/etc/lilo.conf" and added "noapic" to the "append" line.

2. Ran "lilo -v"

3. Rebooted.

The APIC errors are gone and hyper-threading still works...
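
To double-check after the reboot, something along these lines should confirm that the kernel really booted with "noapic" and that no new APIC errors are being logged. Treat it as a sketch: the function names are invented for illustration, and which log file holds the kernel messages depends on your syslog setup (mine were in kern.log).

```shell
# has_kernel_param NAME [CMDLINE_FILE]
# Succeeds if NAME appears as a whole word on the kernel command line.
# CMDLINE_FILE defaults to /proc/cmdline.
has_kernel_param() {
    tr ' ' '\n' < "${2:-/proc/cmdline}" | grep -qx -- "$1"
}

# count_apic_errors LOGFILE
# Print how many "APIC error on CPU" lines LOGFILE contains.
count_apic_errors() {
    grep -c 'APIC error on CPU' "$1" || true
}

# Example calls (not executed here; the log path is an assumption):
#   has_kernel_param noapic && echo "booted with noapic"
#   count_apic_errors /var/log/kern.log
```

Since the log keeps the old messages, compare the count before and after the reboot rather than expecting it to be zero.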

Marc

