mcelog problems

pppplus

Verified User
Joined
Dec 19, 2008
Messages
528
Hi

I have this in /var/log/mcelog :

Code:
MCE 28
HARDWARE ERROR. This is *NOT* a software problem!
Please contact your hardware vendor
Sat Sep 18 15:47:49 2010
CPU 1 THERMAL EVENT TSC 2341816662c0 (null)
Processor 1 below trip temperature. Throttling disabled
STATUS 88010282 MCGSTATUS 0
CPUID Vendor Intel Family 6 Model 26
MCE 29
HARDWARE ERROR. This is *NOT* a software problem!
Please contact your hardware vendor
Sat Sep 18 15:47:49 2010
CPU 5 THERMAL EVENT TSC 234181666084 (null)
Processor 5 below trip temperature. Throttling disabled
STATUS 88010282 MCGSTATUS 0
CPUID Vendor Intel Family 6 Model 26
MCE 30
HARDWARE ERROR. This is *NOT* a software problem!
Please contact your hardware vendor
Sat Sep 18 15:52:48 2010
CPU 0 THERMAL EVENT TSC 23fb1765819c (null)
Processor 0 below trip temperature. Throttling disabled
STATUS 88010282 MCGSTATUS 0
CPUID Vendor Intel Family 6 Model 26
MCE 31
HARDWARE ERROR. This is *NOT* a software problem!
Please contact your hardware vendor
Sat Sep 18 15:52:48 2010
CPU 4 THERMAL EVENT TSC 23fb17657f24 (null)
Processor 4 below trip temperature. Throttling disabled
STATUS 88010282 MCGSTATUS 0
CPUID Vendor Intel Family 6 Model 26

When I run #sensors, I have :
Code:
coretemp-isa-0000
Adapter: ISA adapter
Core 0:      +99°C  (high =  +100°C)

coretemp-isa-0001
Adapter: ISA adapter
Core 1:      +99°C  (high =  +100°C)

coretemp-isa-0002
Adapter: ISA adapter
Core 2:      +95°C  (high =  +100°C)

coretemp-isa-0003
Adapter: ISA adapter
Core 3:      +97°C  (high =  +100°C)

coretemp-isa-0004
Adapter: ISA adapter
Core 4:     +100°C  (high =  +100°C)

coretemp-isa-0005
Adapter: ISA adapter
Core 5:      +99°C  (high =  +100°C)

coretemp-isa-0006
Adapter: ISA adapter
Core 6:      +95°C  (high =  +100°C)

coretemp-isa-0007
Adapter: ISA adapter
Core 7:      +97°C  (high =  +100°C)

w83627dhg-isa-0ca0
Adapter: ISA adapter
VCore:     +0.83 V  (min =  +0.92 V, max =  +1.48 V) ALARM
in1:       +5.39 V  (min = +10.72 V, max = +13.20 V) ALARM
AVCC:      +3.28 V  (min =  +2.96 V, max =  +3.63 V)
3VCC:      +3.28 V  (min =  +2.05 V, max =  +2.62 V) ALARM
in4:       +1.12 V  (min =  +1.35 V, max =  +1.65 V) ALARM
in5:       +0.82 V  (min =  +1.13 V, max =  +1.38 V) ALARM
in6:       +2.61 V  (min =  +4.53 V, max =  +4.86 V) ALARM
VSB:       +3.34 V  (min =  +2.96 V, max =  +3.63 V)
VBAT:      +3.28 V  (min =  +2.96 V, max =  +3.63 V)
Case Fan:    0 RPM  (min =  715 RPM, div = 16) ALARM
CPU Fan:     0 RPM  (min =  715 RPM, div = 16) ALARM
Aux Fan:     0 RPM  (min =  715 RPM, div = 16) ALARM
fan4:        0 RPM  (min =  715 RPM, div = 16) ALARM
fan5:        0 RPM  (min =  715 RPM, div = 16) ALARM
Sys Temp:    +40°C  (high =   +75°C, hyst =   +70°C)  [thermistor]
CPU Temp:   +0.0°C  (high = +85.0°C, hyst = +75.0°C)  [CPU diode ]
AUX Temp:   +0.0°C  (high = +80.0°C, hyst = +75.0°C)  [CPU diode ]
vid:      +3.500 V
Alarm are similar with another server (same configuration) but the second one has no problem in mcelog.

So have you some explanations to understand better mcelog ?
Thanks
 
I'm not sure why you're calling it a CentOS problem when the warning is specifically that it's not.

If you google mcelog you'll find a lot of information on it.

Jeff
 
Back
Top