Archimago's Musings: MUSINGS: Digital Filters Test discussion, and a 192kHz down-sampling setting suggestion...

Thursday, 23 July 2015

MUSINGS: Digital Filters Test discussion, and a 192kHz down-sampling setting suggestion...

Well, I guess I am flattered that the results of my recent Digital Filters Test got attention on AudioStream (Stereophile affiliate). The other day, Michael Lavorgna posted an entry entitled "The Trouble With Audio Tests" including a few quotes originating from my INVITATION to the test as well as my ANALYSIS posts.

I'm going to start today's entry addressing some of his thoughts on the matter and how I view tests like this.

First, let's just start with the quote from Ernest Rutherford (1871-1937) brought up by Mr. Lavorgna: "If your experiment needs statistics, you ought to have done a better experiment."

Well, it is a romantic notion, isn't it, that it could be so simple... That we don't need statistics to understand complex phenomena because somehow results end up being "black or white" or the conclusion just jumps out and declares itself. Perhaps in high school classrooms. Now I don't know enough about nuclear physics (for which Rutherford is known) to provide any direct links, but I'd be surprised if some form of statistical analysis were not used in his famous Geiger-Marsden experiments on alpha particle scattering to make sense of the empirical data, the probabilities of deflection angles, and the margin of acceptable experimental error. I'd be equally surprised if modern particle physicists at CERN did not bother with statistics. But I do know that statistical analysis is essential in the biological and social sciences to make inferences out of observations of natural and social phenomena. It is an acknowledgement that the natural world is "noisy" and that many continuous variables out there in the "real" world follow a "normal curve". In research, we're trying to find answers based on whether a signal can be isolated from the noise. And this of course would include explorations into audibility and evaluation for the presence of "golden ears".

According to his post, Mr. Lavorgna was given the audio test samples in May. He then did "one run-through" and "easily" picked out his preference for 2/3 samples. It now turns out he preferred the minimum phase filter for 2 of the 3 samples and the linear phase filter for 1 of the 3.

I must say, however, that I am disappointed! Why didn't you enter the results in my survey if you listened to these samples in May, Mr. Lavorgna? Remember, the test ended in late June (maybe you did but declined to indicate you were an audio reviewer?). Blind tests and claims of audibility are only good if one takes on the test and commits before the results are shown! You could have even added your subjective description as to what you heard so "easily", indicating strong confidence, and this 2/3 preference in the minimum phase filter could have pushed my results further towards an overall minimum phase preference for some of these samples. Alas, the opportunity is missed and what we end up with is a retrospective report - post hoc testimony.

He even goes so far as to claim that he didn't need to listen to the linear phase version before picking out the minimum phase one in the "second sample". ("So much so that I picked out the minimum phase filter in the second sample without having to hear the linear phase filter version.") If by "second sample" he meant the "GrandPiano" piece, recall that most respondents actually preferred the linear phase filter setting, to the point of statistical significance (p < 0.05) among headphone listeners! Impressive I guess that he was able to hear the difference so strongly. But also consider what this means if an audio reviewer's preference seems to be different from what most listeners seem to prefer...

I do agree with him that experimental results of group data may not apply to any one individual. They're not supposed to. But they do give us clues about trends and tendencies, what is important and what isn't. Sure, some unique individuals in a population will have remarkable abilities that lie standard deviations beyond the "norm". These individuals have a "gift". Does Lavorgna have such a gift of having "golden ears"? I don't know, nor does it necessarily matter. He only has to be true to himself, just as I have to be true to myself and avoid my own preconceptions as best I can. This is just basic insight into the human limitations and imperfections of the ear/mind mechanism. [Note that in this case, since the samples are all 24/176.4 and level matched, an honest ABX test with the foobar plugin will show just how "easily" the samples can be differentiated.]
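For anyone who wants to put a number on "easily" after an ABX session, here is a minimal sketch of the usual guessing-probability calculation. It assumes each trial is an independent coin flip under the "cannot hear a difference" hypothesis; the function names are my own for illustration and are not part of the foobar ABX plugin.

```python
from math import comb


def abx_p_value(correct: int, trials: int) -> float:
    """Probability of scoring at least `correct` out of `trials` by pure guessing."""
    if not 0 <= correct <= trials:
        raise ValueError("correct must be between 0 and trials")
    # Each ABX trial is a 50/50 guess under the null hypothesis.
    favourable = sum(comb(trials, k) for k in range(correct, trials + 1))
    return favourable / 2 ** trials


def min_correct_for_significance(trials: int, alpha: float = 0.05):
    """Smallest score whose guessing probability is below alpha, or None if unreachable."""
    for correct in range(trials + 1):
        if abx_p_value(correct, trials) < alpha:
            return correct
    return None


if __name__ == "__main__":
    for n in (10, 16):
        print(n, "trials ->", min_correct_for_significance(n), "correct needed")
```

Under these assumptions a 16-trial run needs 12 correct answers to get below 0.05, and a run of only 4 trials can never get there, which is why a single "run-through" says very little.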

What I can say is that out of 45 people who spent their time evaluating the test samples blinded (whom I wholeheartedly appreciate), I was not able to find a cohort who consistently selected preference of one filter type over the other (signal) beyond what was expected by chance (noise). The only significant result I could find was a preference for linear phase filtering with headphone users for one of the test samples, and that perhaps there is value in using the minimum phase setting with speaker listeners for 2 of the 3 samples based on subjective preference. This is all in the context of filter settings that exaggerate the amount of ringing due to an extremely steep "brick wall"! [That is, the vast majority of digital filters in use do not have close to this amount of ringing and I would expect audibility to be even less than this for most DAC filters.]
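To make the "signal versus chance" idea concrete, below is a rough sketch of a two-sided exact binomial test for a preference split between two filters. It is only an illustration of the general approach, not necessarily the analysis I ran, and the counts in the example are hypothetical rather than figures from the survey.

```python
from math import comb


def preference_p_value(prefer_a: int, respondents: int) -> float:
    """Two-sided exact binomial p-value for a split at least this lopsided
    if every respondent were choosing at random (p = 0.5)."""
    if not 0 <= prefer_a <= respondents:
        raise ValueError("prefer_a must be between 0 and respondents")
    total = 2 ** respondents
    lower = sum(comb(respondents, k) for k in range(0, prefer_a + 1)) / total
    upper = sum(comb(respondents, k) for k in range(prefer_a, respondents + 1)) / total
    # With p = 0.5 the distribution is symmetric, so doubling the smaller tail is exact.
    return min(1.0, 2 * min(lower, upper))


if __name__ == "__main__":
    # Hypothetical split, NOT data from the test: 30 of 45 listeners prefer filter A.
    print(preference_p_value(30, 45))
```

The test is two-sided because a preference in either direction would count as a finding; a perfectly even split returns 1.0.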

I certainly do not think the "answer" is at all "wishy washy" as Lavorgna claims. This is, I believe, a demonstration of reality, and a limited glimpse of the "forest" rather than individual trees, one that can perhaps generate other hypotheses and further exploration. This is what you find when subjects are put to the test and the audible differences between samples sit at the threshold of acuity: not the remarkable "slam-dunk" subjective descriptions of "obvious", "clear" or "easily heard" made by almost every reviewer doing sighted evaluation, as if "golden ears" were ubiquitous in the world.

One final comment before I end this segment. I do appreciate Mr. Lavorgna for admitting that he picked the linear filter in 1/3 samples. It's a demonstration that even if one believes there is an obvious difference, there could still be preference for the other filter setting in certain situations. Hence based on this empirical test, I would support DACs having switchable filters with both linear and minimum phase settings for audiophiles to choose their preference and do their own experimentation. Note that I'm not saying this should be a necessity since I personally feel the differences are at best subtle for the majority of listeners using typical filters with less ringing than the ones tested. And I think it would be presumptuous for any manufacturer to claim there is such a thing as an ideal filter setting to listen with. "Wishy washy" or realistic respect and appreciation for the subjectivity of human experience?

Mr. Lavorgna gave me a quote to start off discussions. Let me close with another I had mentioned previously courtesy of Karl Popper (1902-1994):

And who shows greater reverence for mystery, the scientist who devotes himself to discovering it step by step, always ready to submit to facts, and always aware that even his boldest achievements will never be more than a stepping-stone for those who come after him, or the mystic who is free to maintain anything because he need not fear any test?... All mystics, as F. Kafka, the mystic poet wrote in despair, "set out to say... that the incomprehensible is incomprehensible, and that we knew before."

(from The Open Society and Its Enemies, 1945, chapter 24 - Oracular Philosophy and the Revolt Against Reason)

---------------

I do not want to end this post with just philosophical debate. Rather, let's be practical and discuss a digital filter setting that I have been using over the last year for all my audio down-conversions.

I suspect many of you, like myself, may be a bit disappointed by all the hype around 192kHz samplerate material. Sadly, the majority of the 192kHz albums I have bought clearly have no "desirable" content at all in the ultrasonic frequency range (i.e. most of the time the last octave is just low-level noise, if anything). And they clearly don't sound any better than a lower samplerate version. Since I don't like to waste storage space, I routinely downsample these to 96kHz if it looks like there is significant natural-looking material above 25kHz, to 48kHz if it looks like the recording may have originated at that rate, and to 44.1kHz if it looks literally like something upsampled from CD resolution. Even though disk space may be cheap, I'm still against wastage.

For a moment, let's not argue about whether we need anything more than 44.1/48kHz :-).

Suppose, then, that I want to down-convert 192kHz music to 44.1kHz. What filter setting would be good?

I've been using the excellent iZotope RX 4 software package for the last while. It's a great toolbox for experimentation and tweaking. I have been using the following setting for the lowpass resampling filter (pre-ringing 1.0, filter steepness 30, cutoff shift 96%):

Pre-ringing at 1.0 means this is a linear phase setting - no phase shift, which I believe is preferable for the audio file itself (especially since it could be played back with a minimum phase filter DAC that would further distort phase). Filter steepness of 30 is moderate and while there is a little bit of aliasing, it's down at -28dB or so and above 20kHz. Finally the "cutoff shift" is down to 96%, lowering the passband slightly but reducing the amount of aliasing.

I find this setting works really well for me intellectually and it sounds just great :-). The reduced steepness correlates with less severe ringing / energy smearing at the Nyquist frequency. Pushing the cutoff shift down reduces the frequency response only above 20kHz, which is inaudible for adults (and likely for everyone).

Impulse response for such a setting (using a 192kHz impulse downsampled to 44.1kHz):

Impulse Response

Just a tiny bit of low-amplitude symmetrical ringing (only about 4 cycles of any significance) with attenuated impulse at 22.05kHz.
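If you want to see for yourself why a linear phase lowpass rings symmetrically around the main peak, a textbook windowed-sinc design is enough. The sketch below is NOT iZotope's algorithm, just a generic illustration; the tap count, the Blackman window and the 21168Hz cutoff (96% of 22050Hz, assuming the cutoff shift scales the output Nyquist frequency) are my own assumptions.

```python
import math


def windowed_sinc_lowpass(num_taps: int, cutoff: float) -> list:
    """Linear phase FIR lowpass taps.

    cutoff is a fraction of the sample rate (0 < cutoff < 0.5);
    num_taps must be odd so that there is a single centre tap.
    """
    if num_taps < 3 or num_taps % 2 == 0:
        raise ValueError("num_taps must be an odd number >= 3")
    if not 0.0 < cutoff < 0.5:
        raise ValueError("cutoff must be between 0 and 0.5")
    mid = (num_taps - 1) // 2
    taps = []
    for n in range(num_taps):
        x = n - mid
        # Ideal (infinitely long) lowpass impulse response: a sinc function.
        ideal = 2 * cutoff if x == 0 else math.sin(2 * math.pi * cutoff * x) / (math.pi * x)
        # Blackman window tapers the sinc so the ringing dies away quickly.
        phase = 2 * math.pi * n / (num_taps - 1)
        window = 0.42 - 0.5 * math.cos(phase) + 0.08 * math.cos(2 * phase)
        taps.append(ideal * window)
    gain = sum(taps)
    return [t / gain for t in taps]  # normalise to unity gain at DC


if __name__ == "__main__":
    # Assumed example: filter running at 192kHz with a 21168Hz cutoff.
    taps = windowed_sinc_lowpass(255, 21168 / 192000)
    centre = len(taps) // 2
    print("peak tap:", taps[centre])
    print("one sample before / after the peak:", taps[centre - 1], taps[centre + 1])
```

The taps before and after the centre are mirror images of each other, which is what gives the equal pre- and post-ringing and the absence of phase shift; a gentler, wider transition band means the ringing decays within fewer cycles.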

And the spectral frequency display:

Spectral frequency display - Audition 3, Blackman-Harris windowing, 128 bands, 100% window width, 132dB range.

Nice linear response without group delays. No significant high-frequency smearing.

A similar setting if you're using the free SoX software would look like this:

sox.exe "Input_File" "Output_File" rate -v -a -b 91 44100

The -v flag selects "very high" quality, -a allows a small amount of aliasing, and -b 91 sets the passband to 91% of the Nyquist frequency, or about 20.1kHz. For 48kHz, this would be pushed up to over 21.8kHz. Anyhow, a suggestion that works well for me, YMMV :-).
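The passband figures above are simple arithmetic, and a small sketch makes it easy to try other percentages. The helper names are hypothetical; the SoX arguments are exactly the ones shown in the command above, and I'm assuming iZotope's cutoff shift is likewise a fraction of the Nyquist frequency.

```python
def passband_hz(sample_rate_hz: float, percent_of_nyquist: float) -> float:
    """Passband edge in Hz for a bandwidth given as a percentage of Nyquist."""
    return sample_rate_hz / 2 * percent_of_nyquist / 100


def sox_rate_args(src: str, dst: str, rate_hz: int = 44100, bandwidth: int = 91) -> list:
    """Build the argument list for the SoX command quoted above (does not run it)."""
    return ["sox", src, dst, "rate", "-v", "-a", "-b", str(bandwidth), str(rate_hz)]


if __name__ == "__main__":
    print(passband_hz(44100, 91))  # 20065.5, i.e. about 20.1kHz
    print(passband_hz(48000, 91))  # 21840.0, i.e. just over 21.8kHz
    print(passband_hz(44100, 96))  # 21168.0, the assumed iZotope 96% case
    print(sox_rate_args("Input_File", "Output_File"))
```

The list returned by sox_rate_args can be handed to subprocess.run if you want to batch-convert a folder, provided SoX is installed and on your path.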

Frequency response of the SoX parameters downsampling 24/192 white noise to 44.1kHz - frequency response only rolls off above 20kHz.

---------------

The other day, Arny K. on the Squeezebox forum suggested having a look at the documentary An Honest Liar (2014) about James Randi. Even without the connection with audiophile cables, it's a fascinating look into an interesting fellow and his battles with charlatans and mystics over the years. Regarding the cable challenge that never happened almost 8 years ago, here's Mike Fremer's recollection of the story, and here's Randi's (long) side of the story [Oct 12, 2007], Oct 20, 2007 followup, and Oct 23, 2007. Not sure if much was said after that...

I guess we'll see what happened to the Ars Technica / Randi ethernet cable test soon... (AudioQuest Vodka teardown now posted.)

Time for summer holidays and a few weeks off with the family. Until next time... Enjoy the music!

28 comments:

Jour 23 July 2015 at 01:23
:-/ Mr. Michael Lavorgna wrote this "lovely" thing: http://www.audiostream.com/content/blind-testing-golden-ears-and-envy-oh-my#gEa97M7Gj0D0EiP3.97 for me that was the end of an otherwise reputable website.
I'm sure he's a really nice person but he needs to level up if he wants to show "the finger" to objectivists, that's why I enjoy M. Fremer so much, he also shows "the finger" to vinyl haters but with a lot of class, I respect that. XD
Anyway Archimago thank you for your blog, for me it's a reference just like NwAvGuy blog was. You are up there with Tyll Hertsens in terms of quality, Tyll is the perfect reviewer, THE benchmark for all others...XD...
I'm being subjective now..OH MY! Soon I'm gonna hear differences in CAT cables and use magic stones...XD
Ok now I'm just trolling...but you got an amazing blog Archi it's a true pleasure to read.

Rafael Lino Aka Journeyman

Anonymous 23 July 2015 at 16:00
"Why didn't you enter the results in my survey if you listened to these samples in May, Mr. Lavorgna?"

Because I saw no value in doing so. Someone I respect asked me to listen to the files so I did. I reported my findings to him on May 28.

"But also consider what this means if an audio reviewer's preference seems to be different from what most listeners seem to prefer..."

This is but one easy example of why I do not take you seriously.

Anonymous 25 July 2015 at 13:46
Cool.

If you ever wonder "Why the heck did he say that?" feel free to send me an email (mlavorgna@enthusiastnetwork.com) or comment directly on AS (unless this becomes an overwhelming task ;-) Some day a phone call might go a long way to clear things up.

Enjoy your summer.

Jim Ambras 29 July 2015 at 10:15
With the rise of computer-based audio systems it's unfortunate that The Enthusiast Network family of online audio publications has not put more effort into creating high quality content for their AudioStream site. Publishing products from Synergistic Research as "Greatest Bits" - seriously? It does provide a great advertising platform for Audioquest and other high end audio manufacturers though, often doing little more than republishing content from the manufacturer.

Fortunately there are much better sources of information on computer audio such as Computer Audiophile - and this blog!

Unknown 29 July 2015 at 19:59
I have been reading through your blog, and I would like to see a comparative test of listeners and their reaction to overly audio compressed tracks. Many of our favorite albums have been audio remastered. Often times to the detriment of sound quality, as they have had their loudness levels elevated. I'd be willing to bet that in a test similar to those you have run on digital compression, that the listeners would very clearly be able to tell the difference. There are many versions of tracks that have been remastered over the years, and if you could select a variety of those through their various remasters, it would be interesting.

MitchEE 30 July 2015 at 17:39
1) I am new to reading your blog, and found it in the last week because I was also interested in the Tascam UH-7000 you measured in January, since I would also like to do similar types of tests at home. So, forgive me in advance if I say things that you may have blogged previously, until I have time to read more of your earlier posts.
2) By way of background, I have enjoyed reading many of your musings, and have enjoyed listening to a number of well-recorded, standard CDs (16 bit, 44.1 kHz sampling) as well as higher resolution recordings.
3) Your test attempts to discover if one or another upsampling method significantly affects the playback quality of initial selections at standard CD quality. I am not surprised by the mixed results that seem to indicate that the upsampling method is not that significant.
4) I think a more interesting test would be to start with well-recorded samples of specifically selected music at 24 bits, 176.4 or 192 kHz, and to test what effective bandwidth, bit depth, noise shaping, etc. significantly degrades the quality of the sound for well-trained listeners in a double blind experiment.
5) I would love to have concrete results from such an experiment. My hypothesis would be that some “simple” music would not have significant degradation from high resolution to standard CD quality, if care is taken. On the other hand, I would expect high dynamic range music like a symphony orchestra with brass and percussion for transients and simultaneous lower level signals that may be masked by poorer effective bandwidth, bit depth and transient response would be easily distinguished from the original recording by symphony conductors and others familiar with the sounds of the instruments.

MitchEE

solderdude 31 July 2015 at 00:33
As expected ... no discernable difference in ethernet cables:

http://arstechnica.com/gadgets/2015/07/even-vegas-strangers-agree-340-audiophile-cables-make-no-difference/

Of course the test + used 'components' and unknown testers (only 7) will never convince the ones claiming differences are 'really' there.
Any audiophile will tell you the used test equipment 2x Grado RS2e directly ? out of a DELL laptop using Win Media player won't 'resolve' the, to them, so obvious differences on high end gear.

tnargs 10 August 2015 at 19:29
The same Rutherford once said "All science is either physics or stamp collecting", then went on to win the Nobel Prize in Stamp Collecting (known in less esteemed quarters as chemistry). Anyway, his statistics quote was made as a pre-quantum physicist, and he was speaking to all scientists, i.e. physicists only (if you get my drift).

Point being, he was a heavyweight as a scientist and a lightweight as a philosopher, perhaps even a joker.

Mr Lavorgna seems to have been on a crusade lately to discredit the scientific method, scientific testing, and, by association, science itself. He has put himself out there, not only in his own forum, but in the forums of some key people who like to discuss audio (and audibility) rationally and sensibly. Unfortunately, he is doing it as a missionary, not as a co-learner. A teacher and a teller, not a listener and a learner. While presenting as suave and sophisticated, one need not look far to see what lies underneath: disrespect. He let the truth out in his first comment on this article, "I do not take you seriously", and promptly tried to paper it over for reputation's sake, but it's everywhere and easy to find, starting with his blind testing article: the sneer.

Mr Lavorgna's fundamental missionary position on blind testing can be found in his article on that topic: "....listening tests of hi-fi gear at best tell you about the listening capabilities of the people taking the test under those specific testing conditions." Whereas the alternative, sighted tests, what do they tell us? Apparently they tell us all about the hi-fi gear itself! And now we see the sad hypocrisy: perform a cold rational critique of the method you wish to discredit (and that's fair), while maintaining an uncritical golden faith in the method you wish to endorse. Dear Sir: LOL! No sale!

Jim Ambras 11 August 2015 at 10:39
I think it's important to always try to look for the motivation behind someone's behavior. In this case, it's pretty easy to guess. Just look at the main advertisers on his site and you'll see familiar names like AudioQuest. Lavorgna writes glowing reviews on $1000+ AudioQuest ethernet cables. Coincidence? I don't think so.

Adhering to the scientific method of testing (both using equipment and proper listening ABX tests) would cause him to have to expose the fact that much of the equipment advertised on his site is snake oil. And Audiostream would be out of business. So he is therefore on a mission to discredit any process that would expose the bogus claims of his advertisers who are paying his salary.

Ulisses Macedo 22 June 2025 at 09:06
Hi dude. I've found your blog randomly on Reddit while looking at SoX configs. Do you still use those settings for downsampling? In my case I want to convert 24/96 or 24/192 files to AAC (yes, lossy, for phone usage) through foobar2000, and by reading around here a bit the ideal seems to be:

intermediate (45) + 91 passband?

Is aliasing still necessary?