Over this past weekend I upgraded my server from a P200/RedHat Linux
7.2 to a new Athlon 850/RedHat Linux 7.3. The Athlon was my trusty
workstation for many a year. Everything's great, except for one problem:
the clock goes crazy!
It seems to be a once per second issue:
$ perl -e '$t=time;while(1){$t2=time; warn join("\n",scalar
localtime($t),scalar localtime($t2),"") if ($t2-$t>4); $t=$t2;}'
Wed Aug 21 17:17:28 2002
Wed Aug 21 18:29:03 2002
Wed Aug 21 17:17:29 2002
Wed Aug 21 18:29:04 2002
Wed Aug 21 17:17:30 2002
Wed Aug 21 18:29:05 2002
I use netsaint to monitor system health, and it keeps complaining about
the time jumps:
Aug 21 18:28:03 eclectic netsaint: Warning: A system time change of 4296
seconds (forwards in time) has been detected. Compensating...
Aug 21 17:16:31 eclectic netsaint: Warning: A system time change of 4293
seconds (backwards in time) has been detected. Compensating...
Aug 21 18:29:42 eclectic netsaint: Warning: A system time change of 4296
seconds (forwards in time) has been detected. Compensating...
Aug 21 17:18:10 eclectic netsaint: Warning: A system time change of 4293
seconds (backwards in time) has been detected. Compensating...
Of course, random checks start failing as well since they occasionally
run during the time jump, and if they're checking for time since something
occured (log entry, cron run, query response time, etc) ...
I've tried "hwclock --systohc" to force the hardware clock to the same
time as the system, didn't help. I'm running NTP and thought maybe
it was having problems, so I tried turning that off, didn't help.
I went through BIOS and tweaked memory/IRQ settings -- I didn't think
it would really solve anything, but the setting changes and reboot kept
the behavior away for ~14-15 hours.
So I'm completely stumped as to what causes this problem. Has anyone
seen anything like this before? Any suggestions for things to look at?
I'm at the point where I'm planning to just buy a new bit of hardware,
migrate the drives and such again, and go from there. With my luck the
problem would follow me of course... :(
Thanks in advance. :)
BTW: If anyone is wondering, I'm running SGI's XFS patched 2.4.18 kernel:
2.4.18-SGI_XFS_1.1. I'm thinking about dropping back to a 2.4.9 kernel
w/ XFS support and seeing if that works any better. :( I was running
2.4.18 on the P200 for quite a while though without problem.
--
Randomly Generated Tagline:
I don't want to look like a weirdo. I'll just go with a muumuu.
-- Homer Simpson
King-Size Homer