Questions tagged [ecc]
Error Correcting Code (ECC memory) is used in most computers where data corruption cannot be tolerated under any circumstances.
82
questions
39
votes
5
answers
47k
views
How do I get notified of ECC errors in Linux?
How do I get notified, when a Linux machine equipped with ECC memory recognizes a memory failure? I'm interested in both correctable and uncorrectable errors.
if a message is written to dmesg/the ...
31
votes
4
answers
28k
views
What is ECC ram and why is it better?
I've seen a dicussion about ECC ram use on servers. Why is it better?
27
votes
10
answers
41k
views
Would you use ECC RAM in a workstation?
Is ECC RAM recommended for use in workstations, or is it something that only gets used in servers? If non-ECC RAM works in PCs, why would we need ECC RAM at all?
21
votes
3
answers
15k
views
Non-ECC memory with ZFS: a stupid idea?
I have a new server and am planning to upgrade the paltry 2 GB of memory to the maximum of 16 GB. (Theoretically 8 GB is the limit, but empirically 16 GB has been shown to work.) Some guides advise ...
20
votes
5
answers
41k
views
How to check if RAM is running in ECC mode?
I updated this post since I replaced the processor, but the core of my question (and unfortunately the results as well) are the same.
I built my first FreeNAS box and wanted to use ECC RAM since I ...
20
votes
2
answers
2k
views
What is the Rowhammer DRAM bug and how should I treat it?
DRAM chips are very tightly packed. Research has shown that neighboring bits can be flipped at random.
What is the probability of the bug triggering at random in a server-grade DRAM chip with ECC (...
19
votes
2
answers
5k
views
What RAM options do I need to know before buying Server RAM?
This is a proposed Canonical Question about Server Memory.
I have to buy a Dell R420 server and there are various combinations (1600 and 1333 MHz RDIMMS and UDIMMS) and Performance Optimized vs. ...
16
votes
1
answer
42k
views
Should I use bios "Advanced ECC" in Dell PowerEdge R710 Bios with ECC DIMMs?
I have a Dell PowerEdge R710 with dual Intel Xeon E5503 CPUs. It has 96GB(12x8GB) of ECC DIMMs.
In its BIOS, memory is configured for "Advanced ECC".
My question is if my DIMMs are already ECC, does ...
11
votes
5
answers
4k
views
The importance of ECC memory
Are ECC memory modules important to have on a non-critical server?
I was thinking about getting myself a toy dedicated server for lots of random, non-critical stuff. Sporadic reboots are no big deal....
10
votes
2
answers
6k
views
What does ECC RAM failure look like
For Non-ECC memory I have a decent idea of what a failure looks like; certain random things start going wrong (e.g. PNG checksums fail validation once and then not the next time), that sort of thing. ...
10
votes
1
answer
1k
views
How to force ECC error [closed]
I'm looking for a way to force an ECC error in a DRAM DIMM to test some code associated with recovering from these errors. I believe Intel makes a test jig for several thousand dollars, but I'm ...
9
votes
1
answer
58k
views
How seriously should I take ECC correctable error warnings?
I have a pile of Sun X2200-M2 servers. These servers have ECC memory.
In some of these servers, I am getting warnings in the eLOM about "correctable ECC errors detected", eg:
# ssh regress11 ...
8
votes
2
answers
32k
views
ECC chipkill errors: which DIMM?
We often get DIMMs in our servers going bad with the following errors in syslog:
May 7 09:15:31 nolcgi303 kernel: EDAC k8 MC0: general bus error: participating processor(local node response), time-...
7
votes
3
answers
13k
views
ECC errors in L3 cache - critical or not?
On a linux server (8x Quad-Core AMD 8378), I'm getting the following errors:
[Hardware Error]: MC4_STATUS[-|CE|MiscV|-|AddrV|CECC]: 0x9c294c00001d018b
[Hardware Error]: Northbridge Error (node 4): ...
6
votes
1
answer
3k
views
Which browsers and OSes supports ECC based SSL certificates?
We are evaluating whether to buy a RSA based certificate or a ECC based certificate.
RSA is older and is supported by all browsers.
ECC is newer, they state it is faster due requiring smaller key ...
5
votes
3
answers
519
views
Is there any such error logged by CentOS somewhere that can conclusively reveal "it is now time to pay for ECC"
I have a 32GB non-ECC RAM dedicated server with CentOS.
Once for day it randomly crashes without any error in /var/log/kern.log, /var/log/messages, mysql, apache.
CPU/RAM/IO are not particularly ...
5
votes
3
answers
7k
views
Evaluating uncorrectable ECC errors and fallback methods
I run a server which has just experienced an error I've not encountered before. It emitted a few beeps, rebooted, and got stuck at the startup screen (the part where the bios shows its logo and begins ...
5
votes
5
answers
2k
views
md5sum of large files gives different results sometimes
I have an AMD quad core, 8 gb RAM, 1 SSD EXT2 (2 months old), 2 HDD EXT4, approximately 1 year old.
I'm using Ubuntu 10.04 x86-64 and when I compute the md5sum of large files (9 GB) sometimes I get ...
4
votes
2
answers
23k
views
What does "single-bit ECC errors were detected on the RAID controller" mean?
I have a Dell T7600 with a Perc H710P RAID controller and 4 attached 3TB drives. Over the past few months the RAID controller has been intermittently reporting errors on boot: "no boot device found", ...
4
votes
4
answers
3k
views
Is an ECC ram enabled GPU necessary for a server, or will a normal gpu work fine in a server?
Is it a requirement for a server to use ECC ram on a GPU while the normal CPU ram is ECC? Im thinking that instead of using a Quadro k6000 or AMD Firepro, we could use a GTX 980 or AMD r9 290...if ...
3
votes
2
answers
392
views
SAS/RAID controller non-ecc ram
I have Adaptec 51245 controller (I know it is old but I got it for free) that I use in my server.
As far as I know it is highly recommended to use ECC RAM as system memory, but what about RAID ...
3
votes
3
answers
5k
views
What to do in response to repeat DRAM ECC error notifications for the same memory location?
I woke up this morning to what's a first for me; one of my systems had logged DRAM ECC error notifications. Three of them, in fact, for as far as I can tell the exact same memory location (obviously, ...
3
votes
1
answer
1k
views
Where are the ECC memory error counters stored?
Where are the ECC memory error counters stored: on the DIMM itself, the motherboard, or the host's disk?
I'm using memtest86+, but it seems that it doesn't recognize ECC on my system, so if ...
3
votes
0
answers
294
views
Correct EDAC driver for Supermicro X10SLL-F
Prompted by Debian upgrade (stretch to buster), I've replaced mcelog with rasdaemon. I'm not sure whether things work as expected though. In dmesg I can find a message like this:
[ 1.871662] EDAC ...
3
votes
0
answers
1k
views
What are the risks of running a database on a server without ECC RAM? [duplicate]
A lot of branded servers come with ECC RAM, but it is expensive.
For a database server or other critical servers, what would be the impact of not using ECC RAM?
Data corruption? (I suppose the ...
2
votes
3
answers
5k
views
ECC vs Non-ECC memory
I am looking for memory for SUPERMICRO MBD-X8DAH+-F-O Dual LGA 1366 (http://www.supermicro.com/products/motherboard/qpi/5500/x8dah_-f.cfm). Basically, I am looking for the difference between ECC and ...
2
votes
2
answers
2k
views
Alternative file system/volume manager for ZFS w/ non-ECC RAM?
It's not recommended to use ZFS for a computer without ECC RAM. So, what's a good alternative then? Or is the risk the same, so it doesn't matter what manager I use, it'll be the same problem if a bit ...
2
votes
2
answers
4k
views
Can I mix ECC RDIMMs with different rank?
I have an Intel S5520SC motherboard with two Intel Xeon E5620 CPUs installed. It currently has six KVR13R9D4/8I DIMMS - I want to add another six DIMMs (48GB of RAM) to upgrade this workstation to ...
2
votes
2
answers
4k
views
ECC CE (Correctable Error) occuring every 5 minutes exactly
On one of our computing nodes I am getting ECC CE (correctable errors). What is a little bit peculiar about is is that errors are not massive, just a single occurrence exactly every 5 minutes.
...
2
votes
1
answer
1k
views
FBDIMM Thermal/TDP Issue
I've got a 2U dual Xeon server with 8x 2GB DDR2 FBdim/ECC ram, on an intel s5000PSL board. It's stable, the ram memtests clean and both CPUs are running cool (35C). Half of the sticks run ~60-65C, ...
2
votes
1
answer
2k
views
Do Kaby Lake Pentiums support ECC?
Looking at setting up a SOHO server with a C236 Chipset and ECC RAM and wondering about the CPU to use.
Skylake Pentiums (e.g., the G4400) support ECC, but prior to Kaby Lake's release, news outlets ...
2
votes
3
answers
18k
views
ECC memory errors causing random server reboots
I'm running ubuntu server 14.04 on Supermicro X10SLM-F / Xeon E3-1271 v3
Memory: SuperTalent 32GB DDR3 1600 ECC
About every 4 days, the logs on Ubuntu will show this:
{1}[Hardware Error]: Hardware ...
2
votes
0
answers
294
views
i7 edac: ecc error - which module?
I'm running a Xeon X3450 on a Supermicro X8SIE-F mainboard. Currently there are 4 reg. ECC DIMMs installed (each 4GB in size; installed as DIMM A-Channel1, DIMM A-Channel2, DIMM B-Channel1, DIMM B-...
2
votes
1
answer
2k
views
What happens to a random bit error in the cache on an Intel CPU?
I have a system with ECC RAM and a Xeon E3 CPU.
My understanding is that ECC circuits on the RAM will detect corruption from random bit errors in the RAM chips.
But what happens to random bit errors ...
1
vote
4
answers
1k
views
Is it safe to use not ECC RAM for cold backup server?
I need home computer for simple backup task (just cronjob on Linux, it will run once per day):
Download file from my production server (in datacenter, it's good server with Xeons & ECC RAM etc.) ...
1
vote
3
answers
33k
views
What and how to check when determining if a memory stick will be compatible with a particular server?
We recently needed to add more RAM to our vCenter Server (Dell PowerEdge 860 server). We checked from the Kingston memory search what kind of memory the server accepted.
Then we found a listing of 4x ...
1
vote
1
answer
1k
views
Why won't this RAM work in my HP Proliant DL380 G4 server?
I tried installing some Crucial 1024MB PC5400 DDR2 667MHz ECC Memory (CT12872AA667) and the system would not start - just post beeps.
The original memory is PC2-3200R ECC and the place I purchased ...
1
vote
3
answers
5k
views
See ECC correction count
I'm curious as to whether or not there's some performance counter that will log the number of ECC corrections required, that could perhaps be tracked as an early indicator of memory failure. I imagine ...
1
vote
1
answer
3k
views
Non-ECC RAM for virtualization?
I'm on a quest of building a virtualization server. However, I was asking myself a question: should I stick with non-ECC RAM for this server or not?
This because I found a Xeon CPU that falls in the ...
1
vote
4
answers
1k
views
Advice on DL380 G5 RAM, and why the DIMMS I have aren't supported
I have 3x DL380 G5 servers that were second hand retired ones. One of the servers, I populated with 4x 4GB DIMMs, and 4x 1GB DIMMs.
Everything was working away merrily, and I decided to purchase some ...
1
vote
1
answer
863
views
can ecc and registered ram be used together
do you know if it is possible to use both ECC and Registered DDR2 SDRAM in a server?
We have a mix of both, but when they are all installed, the server fails to boot.
1
vote
1
answer
4k
views
How to test RAM if memtest86 freezes?
I suspect some faulty RAM and wanted to test it with Memtest86. I'm using a bootable Ubuntu 20.04 USB stick and choose the "Memtest" option from the boot menu.
Unfortunately the test freezes ...
1
vote
1
answer
277
views
Is ECC memory recommended for memcache? [duplicate]
I'm planning to buy new server for memcache/couchbase.
Would you recommend using ECC memory for a memcache server, why/why not?
1
vote
2
answers
3k
views
HP Proliant DL320 G5 Memory Registered or Unregistered?
I have an old HP Proliant DL320 G5 with 2x 1GB of RAM. I would like to swap out the old ram and install 4x 2GB of RAM to bring it to the max capacity. The manual calls for PC2-5300 unbuffered modules.
...
1
vote
2
answers
6k
views
Dell PowerEdge - Advanced ECC vs Optimized for 8GB RAM
We are buying new Dell R410 servers, and I'm trying to figure out the best RAM performance we can get.
Dell offers the following choices:
8GB Memory (8x1GB), 1333MHz Single Ranked UDIMMs for 2 ...
1
vote
1
answer
1k
views
Does fully buffered memory have a different notch position?
I'm working on a server and ordered replacement ECC memory and found that the fully buffered DDR2 to have the notch in a different positions.
I wasn't aware what buffered vs unbuffered memory has a ...
1
vote
1
answer
201
views
Sync ECC archive with non-ECC backup/archive server
For archiving and backup, I created the following strategy:
Server 1 (Local Network in Office) running 24/7, Linux Ubuntu NAS on DIY Odroid XU4 with Cloudshell 2 and Raid-1 (2x 8TB)
this is my ...
1
vote
1
answer
3k
views
Fujitsu server memory modules - registered ECC, but still won't POST
I have a Fujitsu TX150 S7, for which I want to upgrade the memory. I thought that would be simple enough, but apparently that is not so.
The manual for that server states the following:
Memory slots:...
1
vote
2
answers
2k
views
Can ECC Chipkill be used in non-IBM servers?
Can IBM specific ram, such as 41Y2770 with ECC Chipkill, be used in non-IBM servers?
1
vote
1
answer
201
views
CL latency of unbuffered and registered RAM
So AFAIK, registered ram generally has a bit higher latency as going through the registers adds (usually one?) cycle.
If I have two dimms of ecc ram, one registered and one unbuffered, and both say ...