Is deduplication performance determined by the number of disk drives used?
In any storage system, the disk drives are the slowest component. To increase performance, it is common practice to stripe data across a large number of drives so that they handle I/O in parallel. If a system relies on this method to meet its performance requirements, you need to ask where the right balance between performance and capacity lies, because the whole point of data deduplication is to reduce the number of disk drives. Data Domain's SISL (Stream-Informed Segment Layout) implementation takes an inline, CPU-centric approach that needs very few disk drives, so its deduplication delivers on the expectation of a smaller storage system.
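The trade-off can be made concrete with a rough back-of-envelope sketch. The figures below (target ingest rate, per-drive throughput, deduplication ratio) are illustrative assumptions, not specifications of SISL or of any particular system; the point is only that when duplicates are eliminated inline, before data is written, far fewer spindles are needed to sustain the same logical ingest rate.

```python
import math

# Hypothetical back-of-envelope comparison: every figure below is an
# illustrative assumption, not a measurement of any specific product.
TARGET_THROUGHPUT_MBPS = 800   # assumed backup ingest target
DRIVE_THROUGHPUT_MBPS = 40     # assumed sustained write rate per disk drive
DEDUP_RATIO = 10               # assumed ratio of logical data to unique data

# Spindle-bound design: every logical byte is written to disk, so the
# drive count must scale with the full ingest rate.
drives_striped = math.ceil(TARGET_THROUGHPUT_MBPS / DRIVE_THROUGHPUT_MBPS)

# CPU-centric inline design: duplicates are identified in CPU/memory
# before anything is written, so only unique data reaches the disks.
unique_mbps = TARGET_THROUGHPUT_MBPS / DEDUP_RATIO
drives_inline = math.ceil(unique_mbps / DRIVE_THROUGHPUT_MBPS)

print(f"Striping-based design: ~{drives_striped} drives to sustain ingest")
print(f"Inline, CPU-centric design: ~{drives_inline} drives to sustain ingest")
```

Under these assumed numbers, the striping-based approach would need roughly ten times as many drives as the inline approach, which is why an architecture that deduplicates before writing can meet performance targets with a much smaller disk footprint.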