Data deduplication
@GoodBytes knows lots more about Windows than I do, maybe he can be of help, or
catch any errors I've made in my assumptions about how Windows does volumes etc. ![]()
To me it sounds like the website is saying this:
If I have an identical file/block located on 2 different volumes, then dedupe will delete one of those, and create a pointer that points to the location on the other volume. This works because something somewhere is keeping track of where these pointers are going
Without being absolutely sure, I don't think dedupe would work across different volumes.
I would expect it to work across different disks within the same volume, but not across
volume borders. I have however not yet been able to find a reliable source to confirm
this.
EDIT:
It seems that this is correct:
source: http://www.techrepublic.com/blog/data-center/windows-server-2012-deduplication-how-and-where-to-tweak/Deduplication with Windows works within a single volume (though multiple volumes can have it enabled)
So deduplication would not work across volume boundaries according to that site. Only
within the same volume.
/EDIT
A volume that is under deduplication control is an atomic unit.
Note: volume, singular. Not volumes.
Hence why a volume is an atomic unit, but a disk not necessarily so, unless you have a
volume which consists only of a single disk.
It says you can move a volume to another server, not a single disk:
You can back up the volume and restore it to another server. You can rip it out of one Windows 2012 server and move it to another
At least that's what I make of it.
If you take out one disk from a multi-disk volume, yes, the data should not be available
on the new machine, because obviously it's not on that disk. But if you move an entire
volume over to another machine (consisting of multiple disks), then everything should
be somewhere on those drives, distributed.
Now, if you have volumes which consist of single disks and enable dedupe, then I would
expect it to keep one copy of the data on that one disk, because it wouldn't deduplicate
over volume boundaries (or at least I would not expect it to).
This is just based on some googling and what I know about deduplication in general, if
Windows does something distinctly different, I hope somebody catches any errors I've made.
EDIT2:
This source also says that it's per-volume, not across volumes:
https://redmondmag.com/articles/2014/03/13/data-deduplication-in-windows-server.aspxDeduplication is performed on a per-volume basis.
So you need to make a clear distinction between volumes and disks. Individual disks
can only be moved between systems without problems if they contain an entire volume,
as soon as a volume spans multiple disks, it seems you need to move all associated
drives to the new system. Which makes sense in my book.

Create an account or sign in to comment
You need to be a member in order to leave a comment
Create an account
Sign up for a new account in our community. It's easy!
Register a new accountSign in
Already have an account? Sign in here.
Sign In Now