Intel Atom C2000 chips are bricking products
There a serious problem with the Atom C2000 series processors from Intel. After certain amount of time the internal clock generator of the CPU will degrade substantially causing defects in the products they are used in.
That would be NAS servers, small-servers, but also network related equipment and small form factor PCs. The news was uncovered by The Register who noticed in a Q4 2016 earnings call earlier this monththat chief financial officer Robert Swan stated that a product issue limited profitability during the quarter, forcingthem to reserve cash to deal with the problem.
Many technology vendors will be effected as simply many are using the Intel Atom C2000 processors. This list includes Dell and Synology (NAS servers).
Folks with Synology DS1815+ NAS boxes have been reporting complete hardware failures; the DS1815+ is powered by an Intel Atom C2538. Other vendors using Atom C2000 chips include Asrock, Aaeon, HP, Infortrend, Lanner, NEC, Newisys, Netgate, Netgear, Quanta, Supermicro, and ZNYX Networks. The chipset is aimed at networking devices, storage systems, and microserver workloads.
I'll do a paste from the register as, they uncovered this story really intensive:
We were observing a product quality issue in the fourth quarter with slightly higher expected failure rates under certain use and time constraints, and we established a reserve to deal with that," he said. "We think we have it relatively well-bounded with a minor design fix that we're working with our clients to resolve."
Coincidentally, Cisco last week issued an advisory warning that several of its routing, optical networking, security and switch products sold prior to November 16, 2016 contain a faulty clock component that is likely to fail at an accelerated rate after 18 months of operation.
Cisco at the time declined to name the supplier of that component. When asked on Monday whether Intel supplied the faulty electronics, a Cisco spokesperson told that The Register that the networking giant does not intend to publicly name the supplier.
Intel indicated in a January 2017 revision of its Atom C2000 family documentation that the chip line contains a clock flaw. Errata note AVR.54, titled "System May Experience Inability to Boot or May Cease Operation," explains that the Atom C2000 Low Pin Count bus clock outputs (LPC_CLKOUT0 and LPC_CLKOUT1) may stop functioning. Permanently.
An Intel spokesperson in an email to The Register characterized the issue as "a degradation of a circuit element under high use conditions at a rate higher than Intel’s quality goals after multiple years of service."
"If the LPC clock(s) stop functioning the system will no longer be able to boot," Intel's documentation explains.
This consequence is precisely what Cisco says may happen to its devices given enough time. "Once the component has failed, the system will stop functioning, will not boot, and is not recoverable," Cisco's advisory states.
The Register asked Intel whether it could confirm that Cisco's advisory could be attributed to an Intel component. Intel said it could not confirm or deny whether its chip issue is the one affecting Cisco gear, citing a policy of not commenting on customers. We note that the affected Cisco ASA 55xx products use Intel's Atom C2000 system-on-chips at least.
Radio silence
We asked Intel to provide specific details about when it began and stopped shipping Intel Atom C2000 processors with faulty clock outputs. Intel declined to comment. The official errata says the B0 stepping of C2xxx Atoms are vulnerable to failure, and these parts began shipping in 2013. The specific SKUs are:
C2308, C2338, C2350, C2358, C2508, C2518, C2530, C2538, C2550, C2558, C2718, C2730, C2738, C2750, and C2758.We asked Intel how many affected Atom C2000 chips have been shipped and how much fixing the issue will cost the company. Intel declined to comment.
Intel did, however, provide some insight on how the Atom C2000 flaw might be addressed. "A board level workaround exists for the existing production stepping of the product which resolves the issue," a company spokesperson said in an email. "Additionally, Intel will implement and validate a minor silicon fix in a new product stepping that resolves this issue."
Many other technology vendors make products with Intel Atom C2000 processors, including Dell and Synology. The Register pinged Dell via email, and it was not immediately available for comment.
People with Synology DS1815+ storage boxes have been reporting complete hardware failures; the DS1815+ is powered by an Intel Atom C2538.
Other vendors using Atom C2000 chips include Asrock, Aaeon, HP, Infortrend, Lanner, NEC, Newisys, Netgate, Netgear, Quanta, Supermicro, and ZNYX Networks. The chipset is aimed at networking devices, storage systems, and microserver workloads.
According to this Intel data-sheet [PDF], LPC_CLKOUT0 and LPC_CLKOUT1 are driven by the processor to provide to hardware on the board, including the boot ROM. If these signals stop ticking, the rest of the electronics stops, too. ®
Updated to add at 14:29 UTC, February 7
Synology told The Reg it was "investigating this issue together with Intel," saying it would "provide more detailed information in the week as it becomes available."
It added: "At this time Synology has not seen any indication that this issue has caused an increase in failure rates for DiskStation or RackStation models equipped with Intel Atom C2000 series processors. However once additional information is available, Synology will post an advisory on this topic.
"In the meantime it is still safe to continue to use your device, however should you encounter any issues these will be handled via our support teams and all the devices are still covered under the standard warranty provided with all our units. Technical Support can be reached via www.synology.com/ticket."
Updated to add at 15:36 UTC, February 7
Synology subsequently asked to have its statement above revised to omit any mention of Intel.
Intel Atom Apollo Lake Processor E3900 series Launches - 10/26/2016 12:44 PM
The Intel Atom processor E3900 Series delivers performance and a unique set of features for the edge in segments such as industrial, automotive, video, manufacturing, retail, and more. Intel is also o...
Specs Microsoft HoloLens Devkit Appears Online - Runs on Intel Atom - 05/03/2016 06:00 PM
The devkit specs of the Microsoft HoloLens have surfaced on-line. The headset uses a 32-bit version of Windows 10 with an Intel Atom processor from the Braswell generation and 2 GB of ram....
Intel Atom x3 x5 and x7 Series - 03/02/2015 07:24 PM
We already reported about this last week, but Intel announced a series of mobile platforms including the company's new low-cost system-on-chip (SoC) for phones, phablets and tablets, a global LTE so...
Intel Atom naming scheme gets x3, x5 and x7 suffixes - 02/27/2015 10:09 AM
And that's of course to simplify things a bit. You will notice the Atom x3 line for the lower-end market, then the Atom x5 for the top-of-the-line models with the best features and performance are p...
Point of View launches Intel Atom Based Windows 8.1 tablets - 06/24/2014 03:01 PM
Point of View launches a new tablet line based on the latest Intel Atom platform and Microsoft Windows 8.1 Operating system. In the new Mobii Windows tablet line there are three models available: The ...
Senior Member
Posts: 813
Joined: 2009-11-30
in article
it said the chip released on 2013
and it failing after 18months (so its 1.5years)
its said the chip mostly used in NAS / network equipment which should be online most of time if not 24/7
why it detected/reported 3years later?
such this should detected sooner
because most device is start to failing after 1.5years of usage
Senior Member
Posts: 7255
Joined: 2012-11-10
If you want many low-power cores, ARM is definitely the way to go, assuming your software and other hardware is compatible. If size isn't an issue, there are some relatively cheap 6c/12t Xeons that are low-power.
Senior Member
Posts: 7255
Joined: 2012-11-10
in article
it said the chip released on 2013
and it failing after 18months (so its 1.5years)
its said the chip mostly used in NAS / network equipment which should be online most of time if not 24/7
why it detected/reported 3years later?
such this should detected sooner
because most device is start to failing after 1.5years of usage
Most (if not all) Atom CPUs are soldered to their motherboards. Usually when this is the case, manufacturers don't tend to use the latest and greatest. So even though this is a 2013 CPU, there are likely products released in 2015-2016 that are using these. Generally speaking, industrial class hardware is a bit outdated. Ironically, because the outdated stuff is supposed to be considered proven stable.
Senior Member
Posts: 3376
Joined: 2013-03-10
It probably also takes time to pinpoint the exact reason the hardware failed, and what this article is telling about is extremely exact. Intel's own investigation must have taken months as well (and they are still refusing to comment directly, by the looks of it).
Moderator
Posts: 15142
Joined: 2006-07-04
Man I looked at a small c2000 integrated system with 8 cores and now I'm glad I went against it. Those suckers are not cheap.