When deduplication was first introduced for storage, it seemed a difficult
technology to handle. Like any other technology, deduplication needs
processing power and memory, so deduplicating everything as it is written to
primary storage would not be very efficient. The basic premise it started with
was reducing the investment in disk storage by reducing the amount of content
to be stored. In practice, it did not help much: while it reduces the number
of disk spindles required to store the data, fewer spindles mean fewer IOPS,
and so a compromise on performance.
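The spindle trade-off can be put into rough numbers. The sketch below is only an illustration: the capacity, per-disk size, and per-disk IOPS figures are assumed values chosen for the example, not measurements, and the function names are made up for this post.

```python
import math

def disks_needed(logical_tb, dedup_ratio, disk_tb):
    """Number of disks required to hold logical_tb of data after deduplication."""
    physical_tb = logical_tb / dedup_ratio
    return math.ceil(physical_tb / disk_tb)

def aggregate_iops(disk_count, iops_per_disk):
    """Total IOPS the disk set can deliver, assuming IOPS scale with disk count."""
    return disk_count * iops_per_disk

# Assumed, illustrative figures (not from any real array)
LOGICAL_TB = 100      # data to be stored before deduplication
DISK_TB = 2           # usable capacity per spinning disk
HDD_IOPS = 150        # rough IOPS per spinning disk

for ratio in (1.0, 2.0, 4.0):
    count = disks_needed(LOGICAL_TB, ratio, DISK_TB)
    print(f"dedup {ratio}:1 -> {count} disks, {aggregate_iops(count, HDD_IOPS)} IOPS")
```

With these assumed figures, going from no deduplication to 4:1 cuts the disk count from 50 to 13, and the aggregate IOPS from 7500 to 1950. The capacity saving is real, but the performance available to the same workload shrinks with it.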
SSD-based storage requires a huge investment, and deduplication there can help
reduce the number of disks required. Because each SSD is capable of a large
number of IOPS, there is no compromise on IOPS while deduplicating. While
working on one such customer requirement recently, I realized that it does not
end there. Scale-out storage also adds processing power and memory every time
you upgrade, which helps keep performance consistent. Deduplication there also
happens inline, i.e. you write only what is unique, unlike other technologies
that deduplicate at rest, i.e. you write everything, then run a deduplication
process to mark the duplicate content, followed by a cleanup process to remove
it.
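The difference between inline and at-rest deduplication can be sketched in a few lines. This is a toy model, not how any particular array implements it: it assumes fixed-size blocks identified by a SHA-256 fingerprint, and the class names InlineStore and PostProcessStore are hypothetical.

```python
import hashlib

def fingerprint(block: bytes) -> str:
    """Content fingerprint used to detect duplicate blocks."""
    return hashlib.sha256(block).hexdigest()

class InlineStore:
    """Deduplicates on the write path: only unique blocks are written."""
    def __init__(self):
        self.blocks = {}           # fingerprint -> block data
        self.refs = []             # logical layout as a list of fingerprints
        self.physical_writes = 0

    def write(self, block: bytes) -> None:
        fp = fingerprint(block)
        if fp not in self.blocks:  # new content, so it must be stored
            self.blocks[fp] = block
            self.physical_writes += 1
        self.refs.append(fp)       # duplicates only add a reference

class PostProcessStore:
    """Writes everything first, then deduplicates in a later pass."""
    def __init__(self):
        self.raw = []              # every block as originally written
        self.blocks = {}
        self.refs = []
        self.physical_writes = 0

    def write(self, block: bytes) -> None:
        self.raw.append(block)     # duplicates are written in full
        self.physical_writes += 1

    def deduplicate(self) -> int:
        """Mark duplicates, then clean up; returns the number of blocks reclaimed."""
        for block in self.raw:
            fp = fingerprint(block)
            self.blocks.setdefault(fp, block)
            self.refs.append(fp)
        reclaimed = len(self.raw) - len(self.blocks)
        self.raw = []              # cleanup pass frees the duplicate copies
        return reclaimed

# Four 4 KiB blocks, three of them identical (made-up sample data)
data = [b"A" * 4096, b"B" * 4096, b"A" * 4096, b"A" * 4096]

inline, post = InlineStore(), PostProcessStore()
for blk in data:
    inline.write(blk)
    post.write(blk)
reclaimed = post.deduplicate()

print("inline physical writes:", inline.physical_writes)
print("post-process physical writes:", post.physical_writes, "reclaimed:", reclaimed)
```

For this sample data the inline store performs 2 physical writes, while the post-process store performs 4 and reclaims 2 blocks afterwards. Both end up with the same unique blocks; the difference is how much gets written, and how much temporary capacity is needed, before the savings show up.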
Choose deduplication on storage with caution; it may not be as attractive as
it looks.