That's sounds like a great idea until you actually try to design and run a few tests...
What do you use as an input signal? How are differences measured and weighted? What meaning can be reliably gleaned from the resulting numbers?
If you actually try to do a few, and do it well with the intention of having a meaningful result, you will quickly realize the resulting numbers can vary tremendously with your choices, and can be what you want it to be if you are willing to be manipulative.
Results from someone else's null tests are meaningless, and doing your own is not much better.