In my brief performance tests of latest beta, I don't see any difference between 4088 byte and 9016 byte jumbo frames. With either, I was able to get to 550MB/sec reads from cache over 10GbE, and one test when I had two clients testing the same target simultaneously got to 700+ MB/sec.
So, if your clients are using the default windows allocation unit of 4096 bytes (which is more than a 4088 byte frame!) and you use 9k frames, and on the starwind side your disk is also formatted as 4096 byte allocation units, is the ethernet 9016 byte packet padded with zeroes, or will it have two allocation units worth of data in it? Or is the allocation unit size irrelevant because iSCSI groups IOs into larger chunks, or whatever?
Basically I'm looking for a reason for there not being much of a difference between 4088 byte and 9016 byte frames, and wondering what the optimal setting is really going to be given that most IOs are going to be for multiples of 4096 bytes. In the end I actually care more about IOPs than transfer rates, but it's that 4096 byte windows default that I want to optimise for.



