Leo Gibson is using LUFS to measure capturing accuracy and not perceived results
Yep. This is the wrong approach if you want objective accuracy measurements.
It simply needs to be done at the spectral level. There is no other way. Basic "level meter" analysis (whatever the weighting is!) is just not fine enough detail to give any kind of realistic impression of how accurate something is.
You need to:
- Record the amp with a source signal. Sine tones or pink noise work best.
- Record the captured profile with a source signal. Sine tones or pink noise work best.
- Run an STFT over the source.
- Run an STFT over the capture.
- Account for any FFT smearing artifacts due to windowing or overlapping.
- Perform a diff across both data sets.
- The diff shows you what and where the two audio signals are different.
- At this stage, if you wanted some kind of weighting... you could do it all again, but apply a log conversion, or a weighting against the Bark/Mel scales to account for human perception. Both data sets are valid and can tell you things.
If anyone is interested in doing this, grab Python, Librosa, Matplotlib, and get to work!!