Although I believe some of the contents in the article above are debatable, in the years since 1998, high-resolution, high samplerate audio has of course become common-day reality for audiophiles. As I expressed years ago, I do like the 2xCD samplerates like 88 and 96kHz. But as a result of realizing that 176.4 and 192kHz songs were not being streamed properly with my Logitech Media Server with BrutefirDRC set-up described a few months back, I started asking myself, what is it we would be missing if these albums were downsampled to 88.2 and 96kHz?
Put another way, we could ask "Is there something musical about the highest octave in these 4x samplerate files?" This highest octave for 176.4kHz files would be audio containing 44kHz to 88kHz, and in 192kHz files from 48kHz to 96kHz.
Now ideally, if a file contains no audio information in the highest frequencies, we should see a very clean - low and flat - noise floor... Something like this:
|Spectral Frequency Display|
Another example of a relatively clean noise floor would be something like this:
|Spectral Frequency Display|
More often than not however, you see files which are contaminated with noise peaks probably picked up in the analogue recording chain. Something like this:
|Spectral Frequency Display|
That was taken from the Linn 24/192 "Studio Master" release of Carol Kidd's album Tell Me Once Again (the track was "Moon River"). It sounds fine on playback of course but you can see that the ultrasonic spectrum is contaminated with peaks up to -68dB or so especially around the 60-70kHz regions; easily seen as a band across the Spectral Frequency Display. If you consider however that there does not appear to be anything but noise above 30kHz, one certainly should start wondering if it might not be best to just filter out all that extra crud since there is potential that intermodulation distortion could affect the audible frequencies in playback.
I started to look at just how many of these "very high-resolution" albums (that is, 176.4/192kHz albums) I had in my Logitech Media Server library. As expected, relatively few - only 5 24/176 albums and another 35 24/192 albums out of about 8,000 total albums (0.5%) excluding singles and vinyl rips. Many of these I collected back in 2008 to 2012 as high-resolution downloads off HDTracks or my own DVD-A rips purchased in the early 2000's. Remember that it was during that time frame when high-resolution audio started becoming available online and DVD-A Explorer came out for DVD-A ripping (~2008). Well, I loaded the first 14 albums into my audio editor and made notes about whether I thought there was musical content in the highest octave afforded by the higher sampling rate.
- Neil Young - American Stars 'N' Bars - 24/176 DVD-A rip was just an upsample, happily resampled to 88kHz
- Neil Young - On The Beach - 24/176 DVD-A rip was also just an upsample, brought down to 88kHz
- Neil Young - Archives Volume 1 - Blu-Ray 24/192 rips. I didn't go through every song but the ones I looked at all looked like 96kHz upsamples at best.
- Bill Evans Trio - Waltz For Debby - 2011 HDTracks 24/192. Easily resample down to 48kHz and dither to 16-bits without fear of losing anything. High noise floor on old recordings do not benefit from 24-bits.
- Bob Marley - Legend: The Best of Bob Marley & The Wailers - 2012 HDTracks 24/192 download, no content worth keeping so downsampled to 24/96.
- Fleetwood Mac - Tango In The Night - 2011 HDTracks 24/192 download - clearly not worth more than 16/48 due to high noise floor and no high frequency content.
- Fleetwood Mac - Tusk - 2011 HDTracks 24/192 download - like above, no point keeping >16/48.
- Neil Young - Harvest - 2002 DVD-A 24/192 rip - upsampled from 96kHz.
- Carly Simon - No Secrets - DVD-A 24/192 rip - awful DR9 remaster, just get the DR13 first press CD or maybe Audio Fidelity remaster I see is available.
- The Eagles - Hotel California - 2001 DVD-A rip - feel free to go down to 24/96.
- Cat Stevens - Tea For The Tillerman - 2012 HDTracks 24/192, happily downsample to 96kHz.
- Steely Dan - Everything Must Go - DVD-A 24/192 rip. Basically a 96kHz upsample.
- Muddy Waters - Folk Singer - 1999 Classic Records HDAD 24/192 - lovely recording but just noise >25kHz. In fact 16-bits would also be enough for a recording of this vintage with high noise floor.
- Antonio Forcione & Sabina Sciubba - Meet Me In London - 2011 Naim 24/192. Sounds great but even an "audiophile" release like this has a high noise floor and nothing but noise beyond 48kHz as well as some moderately strong high frequency tape bias signals.
... and so it goes; that was just the first 14. Even classical and acoustic music (from 2L and Linn) as suggested above in the graphs looking like genuine high-resolution captures at 192kHz could easily be downsampled to 96kHz and I don't think an audiophile should have concern about content loss. I know that we often joke on forum posts that maybe these very high resolution audio files would be preferred by cats and dogs, but honestly, I suspect one's dog and cat would prefer a downsample taking out all that noise and high-pitched continuous tones in the highest octave found in many of these albums (like the Carol Kidd tune demonstrated above)!
As I went through what I had, the only time I considered maybe 192kHz could capture more musical information was with Trondheimsolistene's In Folk Style 24/192 Blu-Ray rip where the track "Grieg Two Nordic Melodies Op. 63: II. Kulokk and Stabbelaten (Cow Call and Peasant Dance)" encroached up to 48kHz and maybe just barely surpasses. Obviously, any information going above 48kHz is of very low levels and unlikely audible given human hearing limitations, so it's all rather academic and perfectionistic.
Years ago, Monty at Xiph.org already laid out the technical case against 24/192. Some excellent points there, but even just looking at the realtime spectrum analyser while playing the music, one gets the sense that there's just nothing there to even bother with.
Finally, if you look at how we make recordings, it gets down to the basic fact that there are not many microphones with the capability for extended frequency response. For example, the recent Sennheiser MKH8000-series models can go up to 60kHz, but typically to around 50kHz. And as noted by Demian Martin in the comments of a previous post, we need to be careful about drop-off when recording off-axis (he gives the example of the B&K 4133 and off axis drop-off based on the data sheet page 7). Even for Earthworks' own demo material to show off their microphones' high frequency response, the wav audio download is presented as 16/44 files with obviously no content above 22kHz. Are they suggesting that higher microphone frequency response might improve tonality in the audible range?! If we look at the specs for a typical list of "recommended" studio microphones, the vast majority are rated to 15-20kHz; bottom line is that there's not much high frequency material being captured even if one were to argue that some instruments are capable of strong ultrasonic harmonics. Sure, there could be "life above 20kHz" based on a CalTech paper (you can still find it on Google cache), but how many studios actually capture this and of what fidelity? What I have seen suggests that there's little being retained whether by choice or from the limitations of the microphones used.
[On a side note, of course we do have microphone technologies that can record ultrasonics quite well. For example, in the biosciences, a number of research papers on animal vocalizations such as this paper on rat communication uses the Avisoft-UltraSoundGate CM16 which is capable of up to 200kHz within the limitations of the polar diagram. Great for bird chirps, rats, echolocating bats, and understanding porpoises; not quite necessary for music recordings as far as I can tell :-).]
If you're wondering what settings I use to downsample from 192kHz to 96kHz, the obvious answer is that it really doesn't matter with modern sampling rate converters since accuracy would be excellent. My favourite program for this is iZotope RX using a relatively gentle linear filter (compared to the typically steep 44kHz settings where "Filter steepness" would be around 30 to keep frequencies intact to 20kHz) beginning just beyond 40kHz and essentially free of aliasing:
Sounds great and works for me... Of course I could use an even gentler filter but I had this romantic notion that it would be nice to keep frequencies as flat as possible to 40kHz (two times the typical ideal top frequency response for human hearing at 20kHz) and to use linear phase setting ("Pre-ringing" at 1.00) to prevent phase shifting since we don't know if the DAC will further oversample with its own minimum phase algorithm at playback. Not that this likely matters being ultrasonic and all...
In summary: Yes, there is "life above 20kHz". But I see little if any sign of life above 44/48kHz (or 88/96kHz samplerate) in recordings out there. IMO, this is further justification for the music industry, if they choose to be serious about providing truly high resolution quality offerings to consider "keeping it simple" and standardize on 24/96 masterings.
In the comments, let me know what albums you've found to contain recorded harmonic frequencies benefiting from 176 and 192kHz samplerates... I'd love to have a list for demo and assessment purposes!
Before closing off, remember that in this post I'm only reanalyzing my 176.4/192kHz files. In truth, remember that there are many 88.2/96kHz files out there that probably can just as well be downsampled to 44.1/48kHz simply due to lack of actual content. I've many times come across 96kHz files for example that just look like they may have been put through some kind of analogue mixer but the underlying musical content originated at 44.1/48kHz. Apart from noise picked up in the ultrasonic range, and unless on wants to argue that said ultrasonic noise has beneficial and audible effects, I'd be inclined to downsample these as well. This is of course the natural outcome when the music industry cannot standardize on a technically quality controlled product for what constitutes as "high resolution". Also, I believe many audiophiles are tempted to buy the "high resolution" file simply because of the "bigger is better" mindset in the absence of objective standards and analysis (is it any surprise that websites like HDTracks don't allow user review comments for folks to discuss the value of these downloads?).
Finally, remember that I'm not at all saying anything negative about the value of higher technical capabilities like 24-bits or >88/96kHz sample rates. There is of course a time and place where 24-bits resolution is essential (eg. in the studio for dynamic range overhead during production) and high sample rates used to optimize the mastering quality and archiving. It is important to however remember that the needs of the studio does not necessarily apply to the consumer home playback of the final product. Sure, there might be some psychological satisfaction in owning a "studio master" better than CD-resolution despite the technical standard of the content. I look around my collection and see albums like Miles Davis' Kind Of Blue in 24/88 and 24/96, or a copy of 24/96 Harry Belafonte's Belafonte At Carnegie Hall; both from 1959 and clearly neither of which have a low noise floor requiring 16-bits nor frequency extension beyond at the very most 48kHz samplerate. I also see Bob Dylan's Highway 61 Revisited (2015 MFSL SACD) in 24/88 from 1965 - again 16/48 would have been enough. But due to the "classic" status of albums like these, I'm OK with keeping them in a bigger bit "container"... Remember though that it's one thing to accept oneself as human and have idiosyncratic "values" but I would certainly not argue that these albums would sound any worse downsampled to 16/44! This is an example of what it means for me to be "more objective"; maintaining a foundation of decision-making and understanding based on the science, but at the same time allowing oneself a license to take joy in one's own subjective psychological idiosyncrasies.
Have a great week ahead. And of course I hope you're all enjoying the music.
Addendum (06-10-2016): Based on the discussion below with sk16 about Steve Wilson's production, a reader sent me some information about the recent album Hand. Cannot. Erase. (HDTracks 24/96, 2015).
DR11 is quite good for a modern album of course. The lower graphs are for the HDTracks 24/96 download of the first song "First Regret". As you can see there's nothing beyond 29kHz demonstrated on the FFT and Spectral Frequency Display. Based on what I see on playback, I believe resampling and dithering down to 16/48 would have been just fine while saving a good amount of storage space.