NVMe High-Speed Data Capture and Recorder
ARCHITECTURAL CONCEPTS NVMe High-Speed Data Capture and Recorder The Data Capture and Recording Challenge There are many streaming data sources customers want to capture or
BittWare’s Data Recorder reference design captures up to 200 Gb/s high-speed sensor data to NVMe solid-state drives (SSDs). This gives users an open architecture to build any data recorder system they require—but a common question we hear is “How many drives are required for a given sustained data rate?”
We want to address this question as it’s often assumed the calculation is a simple one: look at an SSD vendor’s drive specification for write speed, and divide up the required maximum sustained bandwidth for the application, add a bit of margin, and you have your number of RAID0 (striped) drives required. The challenge is that write figure on the SSD spec sheet is never sustainable for the long record times of our high-performance customers.
Let’s look at some real-world performance figures using common SSD drive types and dig into the reasons why sustained write isn’t as simple as read speeds. We’ll then conclude with some recommendations on drive arrays for given speeds using our Data Recorder reference design. The good news is our design can easily scale as required using our TeraBox servers and 250-SoC FPGA card.
Solid-state drives with NVMe interfaces are known for being very fast. Indeed, they are faster than traditional hard disk drives (HDDs) for a certain workload. Specifically, SSDs are optimized for many more reads than writes and random access. Of course, for a data recorder, the focus is the opposite: sequential writes.
Another challenge is that NVMe SSDs often carry specifications on maximum “streaming write” bandwidth that is limited to a small portion of the drive’s capacity when doing sustained writes. The majority of a long sustained write will significantly underperform this figure, and as drives get larger it gets worse. To see why, let’s briefly look at how bits are stored on SSDs.
Increasing SSD density means moving to newer technologies that happen to also make sustained writes slower. These SSD technology generations are called SLC, MLC, TLC, and finally QLC, in the order they appeared in the market, which is also in order from lowest density to highest density. The SSD designers are very aware they are making streaming writes slower over time. Thus they often reserve a portion of even the very newest, densest SSDs to operate in the old SLC mode. This allows write to stream at the old, faster speeds until that section of drive is filled up. After that, write streaming slows down to the speed supported by the rest of the drive.
The first three thresholds are examined further in our Benchmarks section.
There’s yet another reduction that occurs when a recorder begins to overwrite old data with newer data. We’ll cover this in the next section.
Rather than benchmark specific drives, our goal was to choose examples from consumer and enterprise markets:
This is a typical TLC consumer-level drive with an SLC cache.
RAID 0 configuration would require at least 8 drives
This is an MLC drive targeting the consumer market without an SLC cache.
RAID 0 configuration would require at least 6 drives
This is an MLC drive targeting the consumer market without an SLC cache.
RAID 0 configuration would require at least 10 drives
Optane drives use 3D XPoint memory technology and have significantly better endurance than traditional SSD drives. However, they are not available in the same high capacities as traditional NVMe drive technologies.
RAID 0 configuration would require at least 6 drives
Drives using PCIe Gen4 have substantially improved performance. To take advantage of the optimized data-moving engines we used large buffer sizes of 256KB or more.
RAID 0 configuration would require at least 5 drives
The use of multi-cell architectures to get larger drive sizes comes at a cost of sustained writes for long periods. The biggest impact of this comes on consumer SSDs. The best real-world streaming write speed that matches the claimed maximum figure comes from Intel Optane drives; however these are also the most limited in drive size.
For those specifying systems for sustained writes, the key is to obtain or perform real-world benchmarks on the target drives. Our recommendations above for sustained writes at 100 Gb/s are based on such real-world test data.
Even so, there are further considerations in writing to SSDs that should be considered, which we will cover next.
A final consideration in write performance is lifetime of the drives. Writing to an SSD wears it out—and specifically data recorder applications can potentially stress the lifetime of the drives.
An “enterprise” SSD includes extra flash cells (over provisioning) to allow for longer lifetimes (larger DWPD).
However, this example is illustrative, not realistic.
While SSDs bring impressive performance over traditional hard-disk drives, the improvements are application-specific and, in some cases, performance is reduced.
For applications with sustained writes, such as those targeted by our Data Capture and Recorder projects, there are a number of factors to consider beyond the maximum sustained write specification.
Get in touch with BittWare to learn more about our Storage acceleration and Sensor Processing products and solutions.
ARCHITECTURAL CONCEPTS NVMe High-Speed Data Capture and Recorder The Data Capture and Recording Challenge There are many streaming data sources customers want to capture or
Go Back to IP & Solutions Dynamic Neural Accelerator ML Framework EdgeCortix Dynamic Neural Accelerator (DNA), is a flexible IP core for deep learning inference
BittWare On-Demand Webinar High Performance Computing with Next-Generation Intel® Agilex™ FPGAs
PCIe FPGA Card XUP-P3R AMD UltraScale+ 3/4-Length PCIe Board 4x 100GbE and up to 512GB DDR4 Need a Price Quote? Jump to Pricing Form Ready