Sounds like he's using "reverse crossfeed" -- in other words, using crossfeed to cancel the sound that leaks from each speaker to the opposite ear, rather than its usual job of mixing a little of each channel into the other.
And yes, that video does a good job of answering my exact question, although he skipped "the third thing", which I suspect is frequency response.
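
For anyone curious what "reverse crossfeed" might look like in practice, here is a rough sketch, not necessarily what the video does. It assumes a toy model where each speaker reaches the opposite ear as a single attenuated, delayed copy; the function names and the `gain` and `delay` values are made up for illustration, and real rooms and heads are far messier.

```python
def cancel_crosstalk(left, right, gain=0.5, delay=3):
    """Pre-process a stereo signal so the leak to the opposite ear cancels.

    Assumed toy model: each speaker reaches the opposite ear scaled by
    `gain` and late by `delay` samples (delay must be at least 1).
    """
    n = len(left)
    out_l = [0.0] * n
    out_r = [0.0] * n
    for i in range(n):
        # What each speaker emitted `delay` samples ago is what now
        # arrives at the opposite ear.
        past_l = out_l[i - delay] if i >= delay else 0.0
        past_r = out_r[i - delay] if i >= delay else 0.0
        # Subtract the predicted leak from the opposite speaker.
        out_l[i] = left[i] - gain * past_r
        out_r[i] = right[i] - gain * past_l
    return out_l, out_r


def at_ears(spk_l, spk_r, gain=0.5, delay=3):
    """Simulate what each ear hears under the same toy model."""
    n = len(spk_l)
    ear_l = [0.0] * n
    ear_r = [0.0] * n
    for i in range(n):
        leak_r = spk_r[i - delay] if i >= delay else 0.0
        leak_l = spk_l[i - delay] if i >= delay else 0.0
        ear_l[i] = spk_l[i] + gain * leak_r
        ear_r[i] = spk_r[i] + gain * leak_l
    return ear_l, ear_r


if __name__ == "__main__":
    # A single click in the left channel only.
    left = [1.0] + [0.0] * 11
    right = [0.0] * 12
    spk_l, spk_r = cancel_crosstalk(left, right)
    ear_l, ear_r = at_ears(spk_l, spk_r)
    print("left ear: ", ear_l)
    print("right ear:", ear_r)
```

The subtraction is recursive because the cancelling signal itself leaks back to the first ear and has to be cancelled in turn. Ordinary crossfeed would add the delayed copy instead of subtracting it.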