All;
I'm seeing this notification multiple times a day (from 5 - 30 minutes apart):
Unexpected system error The system has encountered a problem and recovered.
Name: kernel-core
Version: 4.4.2.301.fc23.x86_64
Detected: 3 minutes ago
Reported: cannot be reported
The kernel log indicates that hardware errors were detected. This is most likely not a software problem.
I'm running Fedora 23, KDE Plasma Spin with KDE and Cinnamon installed. On a Lenove Thinkpad X1 Carbon (3rd Gen)
I wonder if I should be doing something to trap & report?
Thoughts?
Thanks in advance
On 03/01/2016 02:06 PM, CS DBA wrote:
All;
I'm seeing this notification multiple times a day (from 5 - 30 minutes apart):
Unexpected system error The system has encountered a problem and recovered.
Name: kernel-core
Version: 4.4.2.301.fc23.x86_64
Detected: 3 minutes ago
Reported: cannot be reported
The kernel log indicates that hardware errors were detected. This is most likely not a software problem.
I'm running Fedora 23, KDE Plasma Spin with KDE and Cinnamon installed. On a Lenove Thinkpad X1 Carbon (3rd Gen)
I wonder if I should be doing something to trap & report?
Have a look at the output of dmesg. It may be apparent there what the issue is. Most likely it's some bad memory or possibly some errant firmware--either in the BIOS or a device (e.g. a wifi card or something).
If it's not that, it could be your power supply starting to have issues. Wonky power supplies do lots of odd things. ---------------------------------------------------------------------- - Rick Stevens, Systems Engineer, AllDigital ricks@alldigital.com - - AIM/Skype: therps2 ICQ: 226437340 Yahoo: origrps2 - - - - "If you can't fix it...duct tape it!" -- Tim Allen - ----------------------------------------------------------------------
On 03/01/2016 03:12 PM, Rick Stevens wrote:
On 03/01/2016 02:06 PM, CS DBA wrote:
All;
I'm seeing this notification multiple times a day (from 5 - 30 minutes apart):
Unexpected system error The system has encountered a problem and recovered.
Name: kernel-core
Version: 4.4.2.301.fc23.x86_64
Detected: 3 minutes ago
Reported: cannot be reported
The kernel log indicates that hardware errors were detected. This is most likely not a software problem.
I'm running Fedora 23, KDE Plasma Spin with KDE and Cinnamon installed. On a Lenove Thinkpad X1 Carbon (3rd Gen)
I wonder if I should be doing something to trap & report?
Have a look at the output of dmesg. It may be apparent there what the issue is. Most likely it's some bad memory or possibly some errant firmware--either in the BIOS or a device (e.g. a wifi card or something).
If it's not that, it could be your power supply starting to have issues. Wonky power supplies do lots of odd things.
- Rick Stevens, Systems Engineer, AllDigital ricks@alldigital.com -
- AIM/Skype: therps2 ICQ: 226437340 Yahoo: origrps2 -
"If you can't fix it...duct tape it!" -- Tim Allen -
I see lots of these:
[ 8521.141831] mce: [Hardware Error]: Machine check events logged [ 8521.141834] CPU0: Package temperature above threshold, cpu clock throttled (total events = 280) [ 8521.141837] mce: [Hardware Error]: Machine check events logged [ 8521.142829] CPU1: Core temperature/speed normal [ 8521.142830] CPU2: Package temperature/speed normal [ 8521.142831] CPU3: Package temperature/speed normal [ 8521.142834] CPU0: Core temperature/speed normal [ 8521.142836] CPU0: Package temperature/speed normal [ 8521.142838] CPU1: Package temperature/speed normal
Should I be concerned? Anything I can do?
On 03/01/2016 02:14 PM, CS DBA wrote:
On 03/01/2016 03:12 PM, Rick Stevens wrote:
On 03/01/2016 02:06 PM, CS DBA wrote:
All;
I'm seeing this notification multiple times a day (from 5 - 30 minutes apart):
Unexpected system error The system has encountered a problem and recovered.
Name: kernel-core
Version: 4.4.2.301.fc23.x86_64
Detected: 3 minutes ago
Reported: cannot be reported
The kernel log indicates that hardware errors were detected. This is most likely not a software problem.
I'm running Fedora 23, KDE Plasma Spin with KDE and Cinnamon installed. On a Lenove Thinkpad X1 Carbon (3rd Gen)
I wonder if I should be doing something to trap & report?
Have a look at the output of dmesg. It may be apparent there what the issue is. Most likely it's some bad memory or possibly some errant firmware--either in the BIOS or a device (e.g. a wifi card or something).
If it's not that, it could be your power supply starting to have issues. Wonky power supplies do lots of odd things.
- Rick Stevens, Systems Engineer, AllDigital ricks@alldigital.com -
- AIM/Skype: therps2 ICQ: 226437340 Yahoo: origrps2 -
"If you can't fix it...duct tape it!" -- Tim Allen -
I see lots of these:
[ 8521.141831] mce: [Hardware Error]: Machine check events logged [ 8521.141834] CPU0: Package temperature above threshold, cpu clock throttled (total events = 280) [ 8521.141837] mce: [Hardware Error]: Machine check events logged [ 8521.142829] CPU1: Core temperature/speed normal [ 8521.142830] CPU2: Package temperature/speed normal [ 8521.142831] CPU3: Package temperature/speed normal [ 8521.142834] CPU0: Core temperature/speed normal [ 8521.142836] CPU0: Package temperature/speed normal [ 8521.142838] CPU1: Package temperature/speed normal
Should I be concerned? Anything I can do?
Well, yeah, it appears you have a CPU fan that's dying, or the heatsink is very dirty. I'd recommend opening up the box and looking at the fans (especially the one on the CPU) and verify they're turning at the correct speed and not binding up. Fans are cheap and they're generally built cheaply as well. I've had to replace dozens of them over the years.
I'd also see if you can get a vacuum to clean the interior and a "can o' air" to blow the cruft out of all the heatsinks. When you do that, do it from the SIDE of the heatsinks parallel to the fins, not down on top of them and do it GENTLY or you may force dirt into the sockets--a "bad thing".
I generally open my boxes and give them a real good cleaning at least once a year. I replace the fans every two years. Cheap insurance--I had a CPU cook itself once. ---------------------------------------------------------------------- - Rick Stevens, Systems Engineer, AllDigital ricks@alldigital.com - - AIM/Skype: therps2 ICQ: 226437340 Yahoo: origrps2 - - - - The world is coming to an end ... SAVE YOUR FILES!!! - ----------------------------------------------------------------------
George N. White III On 2016-03-01 6:24 PM, "Rick Stevens" ricks@alldigital.com wrote:
On 03/01/2016 02:14 PM, CS DBA wrote:
On 03/01/2016 03:12 PM, Rick Stevens wrote:
On 03/01/2016 02:06 PM, CS DBA wrote:
All;
I'm seeing this notification multiple times a day (from 5 - 30 minutes apart):
Unexpected system error The system has encountered a problem and recovered.
Name: kernel-core
Version: 4.4.2.301.fc23.x86_64
Detected: 3 minutes ago
Reported: cannot be reported
The kernel log indicates that hardware errors were detected. This is most likely not a software problem.
I'm running Fedora 23, KDE Plasma Spin with KDE and Cinnamon installed. On a Lenove Thinkpad X1 Carbon (3rd Gen)
I wonder if I should be doing something to trap & report?
Have a look at the output of dmesg. It may be apparent there what the issue is. Most likely it's some bad memory or possibly some errant firmware--either in the BIOS or a device (e.g. a wifi card or something).
Or cooling issues -- see below.
If it's not that, it could be your power supply starting to have issues. Wonky power supplies do lots of odd things.
- Rick Stevens, Systems Engineer, AllDigital ricks@alldigital.com -
- AIM/Skype: therps2 ICQ: 226437340 Yahoo: origrps2 -
"If you can't fix it...duct tape it!" -- Tim Allen -
I see lots of these:
[ 8521.141831] mce: [Hardware Error]: Machine check events logged [ 8521.141834] CPU0: Package temperature above threshold, cpu clock
A few years ago we had several ThinkPads that became unstable due to accumulated dust in the CPU cooling ducts, fins, and fans. Opening the up and liberal application of canned air solved the problem until dust built up again. These systems would slow down noticeably before they became unstable as the clock was slowed in an attempt to reduce CPU temps.
throttled (total events = 280) [ 8521.141837] mce: [Hardware Error]: Machine check events logged [ 8521.142829] CPU1: Core temperature/speed normal [ 8521.142830] CPU2: Package temperature/speed normal [ 8521.142831] CPU3: Package temperature/speed normal [ 8521.142834] CPU0: Core temperature/speed normal [ 8521.142836] CPU0: Package temperature/speed normal [ 8521.142838] CPU1: Package temperature/speed normal
Should I be concerned? Anything I can do?
Well, yeah, it appears you have a CPU fan that's dying, or the heatsink is very dirty. I'd recommend opening up the box and looking at the fans (especially the one on the CPU) and verify they're turning at the correct speed and not binding up. Fans are cheap and they're generally built cheaply as well. I've had to replace dozens of them over the years.
I'd also see if you can get a vacuum to clean the interior and a "can o' air" to blow the cruft out of all the heatsinks. When you do that, do it from the SIDE of the heatsinks parallel to the fins, not down on top of them and do it GENTLY or you may force dirt into the sockets--a "bad thing".
I generally open my boxes and give them a real good cleaning at least once a year. I replace the fans every two years. Cheap insurance--I had a CPU cook itself once.
- Rick Stevens, Systems Engineer, AllDigital ricks@alldigital.com -
- AIM/Skype: therps2 ICQ: 226437340 Yahoo: origrps2 -
-
The world is coming to an end ... SAVE YOUR FILES!!! -
-- users mailing list users@lists.fedoraproject.org To unsubscribe or change subscription options: https://admin.fedoraproject.org/mailman/listinfo/users Fedora Code of Conduct: http://fedoraproject.org/code-of-conduct Guidelines: http://fedoraproject.org/wiki/Mailing_list_guidelines Have a question? Ask away: http://ask.fedoraproject.org