Saturday 11 April 2015

ANALYSIS: DSD-to-PCM 2015 - foobar SACD Plug-In, AuI ConverteR, noise & impulse response...

Noise characteristics of PCM vs. DSD - image found here.
In my post last week looking at the various DSD-to-PCM converters, Solderdude Frans made a good suggestion... Let's have a look at the newer SACD plugin which has superseded the DSDIFF plug-in as the converter of choice these days for foobar. Also, it was suggested by Yuri Korzunov, the author of AuI ConverteR 48x44 to have a look at his converter package as well.

I. DSD-to-PCM - foobar SACD plug-in & AuI ConverteR

So, I downloaded the newest SACD plug-in currently at version 0.7.7 dated 2015/03/16. I deleted the DSDIFF plug-in from my computer so there are no interactions, and installed the new files.

Notice that the SACD plugin has a configuration panel for settings:

Because this plug-in does not directly output to 24/96, I figure let's try with the highest output (352.8kHz) and I will use the best samplerate converter (SRC) I have (the excellent iZotope RX 4) to bring it down to 24/96 for analysis as before. Here are the settings I used in iZotope RX 4:
Sharpest "max" filter for 24/96 in iZotope RX 4, linear phase without suppression of pre-ringing - nothing fancy...

The other parameter we can play with in the SACD plug-in is the DSD2PCM mathematics setting. By default, it's the standard fixed-point integer mode. Let us also analyze the result from the highest precision mode "Multistage (Double Precision)".

As well, I downloaded the AuI ConverteR 48x44 software. The current demo is version 4.1.20. Other than setting the output for WAV 24/96, I left the rest of the settings to default.

Using the exact same procedure as last week, here's a summary of what I got:

Interesting... It looks like the SACD plug-in is actually about the same as the old DSDIFF 1.4 (<1dB difference) in terms of noise level and dynamic range. Notice just like last week's results from XLD, that going from fixed-point to double floating-point calculations made no difference here.

AuI ConverteR resulted in some impressive numbers! Let's see what it's doing in detail...

As you can see, AuI ConverteR is using a sharp "brick-wall" filter right at ~20kHz to remove essentially everything after 20kHz. As such, have a look at what it does with the noise profile:

Wow. That is an impressively sharp, precise filter at 20kHz! I can approximate that effect with iZotope RX 4's EQ plug-in with a low-pass at 20kHz, high Q of 25 or so (not shown) but AuI ConverteR looks even cleaner with less noise floor irregularity.

That little bit of high-frequency "rippling" with DSDIFF is probably a result of the resampling algorithm. Otherwise, foobar DSDIFF and the newer SACD plug-ins appear very similar.

Basically, this is what we can say at this point...

1. foobar SACD plug-in works about the same as the old DSDIFF plug-in. I would not be surprised if the algorithm (DSD2PCM) is essentially the same if we look "under the hood".

2. The AuI ConverteR software puts up some impressive numbers. This is done with a very strong low-pass filter. If you feel there is no need to retain frequencies >20kHz, then this will clearly get the job done.

II. All that noise!

But wait, there's more! SACD Plug-in also has a 30kHz lowpass mode - "Direct - (Double Precision, 30kHz LF)". Hmmm, I wonder how that looks?

Engaging the 30kHz lowpass mode really resulted in a step down in calculated accuracy. Here's a look at the graphs:

Indeed, the 30kHz low-pass filter is doing the job (yellow).

We can see the effect of that 30kHz filter on the noise floor... Certainly not the prettiest filter out there! Realize that although the differences are there in these graphs with a synthetic test signal, we're talking about noise down below -150dB (below 20kHz). It's just not an issue in terms of audibility.

Now, let us see if we can do it better by using iZotope RX 4 to do the 30kHz low-pass filtering instead of the algorithm used by SACD plug-in. Here's a simple setting:

Low-pass filter at 30kHz as seen, Q = 5.0 (not too steep), linear-phase FIR with FFT size of 32k.


This is what a good low-pass filter can do for the results. As you can see, when we use the 30kHz iZotope low-pass filtering, the calculated noise level drops substantially on the RightMark analysis... This also tells us that the reason Saracon and JRiver measured so well is because they implement very good quality noise filtering algorithms beyond just the DSD-to-PCM calculations.

The SACD Plug-in's DSD2PCM algorithm is excellent, and when we pair it up with iZotope's SRC and 30kHz low-pass filtering, we get some fantastic results easily on par to what Saracon does:

III. Impulse Response and DSD-to-PCM Converters

Despite the inherent noise in DSD, we can drop overall noise levels substantially with a good low-pass filter. In fact, since a picture is worth a thousand words, this is what a 15kHz (-12dBFS) sine wave looks like comparing the unfiltered foobar SACD plug-in output at 352kHz with the 30kHz iZotope RX 4 low-pass filtering (again, this is with Saracon as the encoding software for PCM-to-DSD):

This is what all that extra high-frequency noise looks like in DSD when you don't filter it out at all. Notice that DSD128 is significantly less noisy. The question is, just how much noise reduction should we actually do? (You can also see the noise through an analogue oscilloscope - as shown here.)

As noted by Juergen in the comments to the previous post, there is this matter about time-domain behaviour as well which can be skewed as we apply various filters.

Let's see what a 24/96 impulse looks like after going through the DSD encoder [Saracon] and most of the decoders I looked at (DSD-to-PCM converter output set to 24/352 for each, AudioGate's max was 192kHz):
(Click to enlarge.)
In the top left panel, this is what a 0dBFS 24/96 "impulse" would look like with a typical linear-phase oversampling interpolation showing symmetrical pre- and post-ringing. Even though Adobe Audition renders the interpolation, the actual PCM data itself is a simple, single "pulse" (see Addendum below for screenshots using Audacity).

When I convert this waveform to DSD64 and DSD128 with Saracon and then back to PCM with the foobar SACD plug-in to 24/352 unfiltered (retaining all that ultrasonic noise), you get the 2nd and 3rd left images. Notice again the amount of noise in the signal and again, we see the superiority of DSD128. From a time domain perspective, the SACD conversion process is excellent. The shape and timing of the impulse would be completely retained since the 2.8224 MHz sampling rate of DSD64 provides ~29 samples within each 96kHz time period.

When we use iZotope RX 4 with 30kHz low-pass filtering (4th left image from the top), the "impulse" amplitude is significantly reduced and we see the corresponding ringing pattern as the high frequency noise is removed and no longer obstructing the picture.

AudioGate and Saracon both look very similar. Both use linear phase filters with characteristic symmetrical pre- and post-ringing. Whereas AudioGate allows high frequencies through (and thus well formed impulse), we see the effect of Saracon's filter (pre- and post-ringing ~30kHz).  JRiver looks like it uses an intermediate phase filter (with 24kHz or 30kHz low-pass) which minimizes but does not remove pre-ringing. Comparatively, we see that DSD Master is using a form of minimum phase filter that removes the pre-ringing but the post-ringing is augmented.

AuI ConverteR is an interesting case. As we saw above with the RightMark tests, it implements a very sharp ~20kHz low-pass filter. This impulse response looks to be linear phase with accentuated pre- and post-ringing due to the sharpness of the filter; the "price" to pay I suppose.

I'll leave you to decide how you feel about this information and whether you think the relative time domain effects resulting from implementation of the filters are audible. Back in 2013 I had a listen to some filter settings off the TEAC DAC and had difficulty noticing much of a difference; again here, I listen and fail to convince myself that I have any clear preferences among the converters including using ABX Comparator. So far I'm using headphones (Sennheiser HD800 + TEAC UD501 DAC, ASIO driver playing DSD64 converted to 24/352kHz) so perhaps I need to try again with the speaker system. You guys up for an internet "blind" test to see if there's a preference towards linear phase vs. minimum phase upsampling???

IV. Conclusion

I hope we can appreciate the compromises we face with DSD to PCM conversion. How much noise can we tolerate from the 1-bit quantization when we move the signal to PCM? What's the best frequency to set a low-pass filter assuming one believes it's necessary? What parameters should we use to filter (minimum / linear / intermediate phase, sharp vs. gradual roll-off...)? What's the best sampling rate to spit out the PCM data (eg. do we need to produce >96kHz files if we roll-off before 48kHz)?

As I noted last week, I really am not convinced that these differences are audible beyond volume level changes and whether the ultrasonic noise causes problems for one's audio system (eg. intermodulation distortions, interaction with tweeter ultrasonic peaks, and other non-linearities). This is why I don't think there's any point in "crowning" any software package as being superior. Although it's interesting to demonstrate and experiment with, I suspect this is all rather obsessive academic results of interest to audio geeks :-).

If I had to choose, I remain partial to Saracon and JRiver because of the excellent results from the low-pass filtering used by default with those programs; one-step easy conversion using very reasonable parameters. As you can see, I can get similar results with the foobar SACD plug-in creating 24/352 output, and running that through iZotope RX 4 with high-precision samplerate conversion and low-pass filtering indicating that the underlying free DSD-to-PCM algorithm works well. AuI ConverteR is interesting in that the default setting I looked at resulted in a very clean output so long as one does not feel there is any need for >20kHz signals to be retained nor concerned about the effect on the impulse tracing.


You can perhaps imagine, after "penning" these last 2 posts, I'm pretty well done with talking about DSD for awhile. The most interesting question for me currently as suggested by the discussions with the previous post is this whole notion of just how much significance we should place on resolution in the time-domain irrespective of audible frequencies.

If it is significant (I hesitant to use the word "important" since that should be obvious by now if it is the case), then how much is enough? Should we take research like this paper by Kunchur (2008) seriously? Or is it possible that for practical purposes, it doesn't really matter that much when we're listening to real music as opposed to test signals? In any case, I have a strong suspicion that we will be revisiting this in the days ahead since this seems like an area that will be brought out when Meridian's MQA becomes available as I suspect they will emphasize time parameters, digital filter types, and samplerate given their apparent satisfaction with 16-bit resolution.


Finally, it has come to my attention that there was much unhappiness regarding a recent blog post on the importance of noise (here also) in digital audio reproduction to the point of using speculation to support an underlying belief that expensive ethernet cables could somehow impart beneficial effects (as you know, I found no evidence of significance in my testing with various types of ethernet cables). As usual, no empirical data or real-life examples were provided and support came from more testimony from the like-minded and some links that are at best tangential to high-fidelity audio. It looks like bans from commenting were issued for what seems like rather fair statements calling out the obvious lack of substance. I guess that's how people not felt to be "true believers in the audiophile experience" are dealt with. IMO, this is unfortunate behaviour for a site reporting on mature audio computing technology.

There is much that can be said, argued and refuted in that article, but I think for most reasonable audiophiles it's rather obvious and many excellent points can be found in the comments... What is of relevance to this blog entry is that if one believes that expensive ethernet cabling can reduce noise in the "system" (in a way that appears difficult for these people to produce empirical evidence for), why would any audiophile who subscribes to this theory even want to listen to DSD64 (where the noise is obviously demonstrable and a potential cause of distortion)? Or even consider DSD64 superior to 24/192 at times? Would it not be just as likely that some folks actually like the ultrasonic noise and what it actually is doing through the system? Perhaps similar to how some tube-lovers talk about certain types of distortion being unobjectionable? In fact, back in late 2013, I posted on my impressions with realtime PCM-to-DSD transcoding with JRiver 19 and felt that DSD64 did impart a subtle change to the sound. I wouldn't say that I felt the sonic difference compelled me to convert all my PCM files to DSD for listening, but it was an interesting effect. Maybe that's why some people would prefer a DAC that purposely converts PCM to DSD like the PS Audio DirectStream DAC (the signal is purposely downsampled to DSD128, and then only noise filtered by 80kHz according to this review).

I hope you enjoyed this exploration into the world of DSD (again)... I got a few projects piled up to work on so might not be around as much for the next couple weeks. I'll also be in Boston in the next little while so if anyone has a recommendation on music store I should check out near downtown, let me know!

I'm also thoroughly enjoying David Byrne's book How Music Works (2012, with 2013 update) - check it out for entertaining reading!

Enjoy the music folks :-).

Note that Adobe Audition renders the PCM data with a linear phase interpolation filter. Here are renderings of some impulse waveforms using Audacity which does not do the fancy interpolation for reference:


  1. A comment regarding Low pass filters , the Sony scarlet book specify some sort of low pass filter for SACD players ( DSD64 i assume ) ,so the intention has always been that some of this noise should be removed .

    What are we doing when converting to PCM ? If we miss to use a filter or don't filter enough we can end up with playback that includes more ultrasonic noise than the creators of DSD ever intended ?

    Is it ok to stick with Sony's proposed filter for hardware players when doing DSD to PCM ? Or can we do better with current knowledge . I liked your idea about the isotope filter at 30k personally I would used 25k .

    I also notice that it can be a bit silly , none or some of this migth not yield any audible differences , but sticking to specifications can be good thing it gives predictable results .

    1. Hi Mnyb...

      Back in the "old" days when I had my Sony SCD-1, the default hardware filter was set at 50kHz I believe. I'm not sure if the SACD specs officially recommends this as the roll-off filter frequency though... Clearly there's significant amount of noise generally seen from 20kHz onward already.

    2. Hi !
      Super Audio CD Signal Properties specifications by Philips and Sony, version 1.3, annex E, advises a low pass filter with a maximum cut off frequency of 50 kHz and a minimum slope of 30 dB/octave. This specifications are described as "informative", not "normative. So it's up to the designer to implement different filter.

  2. Hi
    Are you tested Foobar SACD plug-in in mode "Direct 30 kHz filter" or in mode "Installable 30 kHz filter" ?
    In my mesurements "Installable 30 kHz filter" works much more precise, than "Direct".
    Results is independent from type of calculations (integer, float, double).

  3. (my results)

    small comparison of converters.

    1) generate test files, RMAA 6.4
    24bit 88.2 kHz

    2) convert to dsd64 and dsd128 :

    software : Saracon
    gain: 0db
    8th order

    3) conversion to pcm 24bit 88.2 :
    (all tests in 24 / 88 mode)
    3.1) Saracon
    gain 0db
    tpdf dither

    foobar old version: 1.1.8 (with foobar new: 1.3.5 - some problems)
    sacd plugin: 0.7.3 + filters

    4) analyze in RMAA

    overall results:

    Testing chain: Saracon+foobar dsd64 (88 kHz 24-bit)
    Sampling mode: 24-bit, 88 kHz

    Test Saracon+Saracon dsd64 (88 kHz 24-bit)-p2d-d2p Saracon+foobar-old multistage dsd64 (88 kHz 24-bit)-p2d Saracon+foobar-old direct 30khz dsd64 (88 kHz 24-bit)-p2d Saracon+foobar-old I FIR 30khz dsd64 (88 kHz 24-bit)-p2d Saracon+foobar-old I FIR 40khz dsd64 (88 kHz 24-bit)-p2d
    Frequency response (from 40 Hz to 15 kHz), dB: -1000.00, +1000.00 -1000.00, +1000.00 -1000.00, +1000.00 -1000.00, +1000.00 -1000.00, +1000.00
    Noise level, dB (A): -141.9 -140.9 -119.0 -144.9 -144.0
    Dynamic range, dB (A): 133.1 132.9 119.7 133.3 133.3
    THD, %: 0.0000 0.0000 0.0001 0.0000 0.0000
    IMD + Noise, %: 0.0002 0.0002 0.0004 0.0002 0.0002
    Stereo crosstalk, dB: -140.9 -140.0 -117.9 -145.1 -144.3

    Testing chain: Saracon + foobar-old comparison dsd128 (88-24)
    Sampling mode: 24-bit, 88 kHz


    Test Saracon+Saracon dsd128 (88 kHz 24-bit) Saracon+foobar-old multistage dsd128 (88 kHz 24-bit) Saracon+foobar-old direct 30khz dsd128 (88 kHz 24-bit) Saracon+foobar-old I FIR 30khz dsd128 (88 kHz 24-bit) Saracon+foobar-old I FIR 40khz dsd128 (88 kHz 24-bit)
    Frequency response (from 40 Hz to 15 kHz), dB: -1000.00, +1000.00 -1000.00, +1000.00 -1000.00, +1000.00 -1000.00, +1000.00 -1000.00, +1000.00
    Noise level, dB (A): -142.0 -143.8 -101.2 -95.6 -69.2
    Dynamic range, dB (A): 133.1 133.2 98.1 92.5 66.7
    THD, %: 0.0000 0.0000 0.0010 0.0016 0.017
    IMD + Noise, %: 0.0002 0.0002 0.0075 0.012 0.146
    Stereo crosstalk, dB: -142.2 -142.0 -114.0 -99.4 -64.8

    full data with graph in archive :!AtAiQBJZ!hAO63e...85QU6r_vepFIAU

    1. Conclusion :

      Saracon is all-good, "reference tool" for DSD conversion, but it's usability isn't fine for me.
      Excellent HF suppression filter in both modes, DSD64 and DSD128.

      Tool from one forum for "all-in-one" conversion with Saracon, unfortunately, isn't fine for me.

      Foobar SACD plugin : (

      foobar + convert mode multistage : all good in DSD64 and 128, without distortions,
      but, in case with DSD64, very huge ultrasonic (noise shaping) noise is still present,
      and at DAC/AMP stage this noise will produce additional distortions - it's level depends from DAC class, but it will.
      In case of DSD128 noise shaping still present, but noise levels is much lower, and it starts ~~ at 40 khz.
      You may filter it or not - ultrasonic noise has very low energy.
      I not tested DSD256 files (can't produce test file, my Saracon only can 64/128), but I looked my dsd256-converted files in SoundForge and don't find any sufficient ultrasonic noise from noise shaping.

      foobar + convert mode : direct filter 30khz .
      Yes, ultrasonic noise at 30-44khz successfully suppressed (not as good, as with Saracon).
      But this filter add some noise and distortions, add for full range from bass to treble.
      Why author of SACD plugin use this mode ? Dont' know.

      foobar + convert mode : installable filter 30khz
      DSD64 : ultrasonic noise filtered worse, than with Saracon, slightly worse, than with direct filter, but without any visible noise and distortions. (But I think, that this level of filtration is sufficient for most DACs...)

      foobar + convert mode installable filter 40khz : same result,
      but with 40 khz ноdue to a higher cutoff frequency filter practically filter is practically useless for DSD64 discs / files.

      foobar + convert mode: installable filter 30+40 khz , DSD128:
      Heh, we arrived. What is it ? 8=-)
      With direct filter 30khz we see some noise and distortions, but with installable filter 30/40 khz we see much more noise and distortions.
      may be, convertor + filters writed, optimized and tested only for dsd64 files ?

      Final conclusion : SAracon is excellent for DSD64 / 128, but with low usability.

      for DSD64 conversion you may use Foobar + installable filter 30khz ,
      and may use Foobar multistage, but sound will depend from your DAC, how it can play this ultrasonic noise.

      for DSD128 conversion : use Saracon or Foobar + multistage

      for DSD256 conversion : Foobar multistage. (may be, new Saracon can work with DSD256? )

    2. Hi evgeniy,
      Thanks for the feedback. Yes I was using the "Direct" setting with SACD Plugin hence the lower quality 30kHz filtering results.

      BTW. Also, I've noticed some differences between different versions of the RightMark tool; including differences between the free and PRO versions. There was clearly a change in the test signal between 6.3 and 6.4. We're looking at a few dB here and there which IMO is rather insignificant when looking at noise levels down at -130dB or so...

      Thanks for the tips!

  4. Cambridge, Just off the Red Line, both near Harvard/MIT.

    1. Cool Bill!

      Looks like QAudio is by appointment only so unlikely to visit... But The Audio Lab looks possible when I'm near Harvard.

  5. I've been following your great analysis! Excellent work!

    I request that, should you return to DSD measurements, that you use your 24/96 signal and convert to DSD using Korg's AudioGate 3.XX and burn a DSD Disc (DVD). Place the DSD Disc into a PS3 and measure the HDMI and Analog outputs. The PS3 plays DSD Disc however, converts to 24/192 PCM on the fly.

    This would be excellent in archiving LP should the performance be acceptable.

    1. Interesting idea. It would in essence be measurement of the audio quality output from the PS3.

      Unfortunately my older model PS3 died last year (which sadly also means I'm unable to rip new SACDs for the time being) so I can't run the test at this time.

      Also, I wonder with the different hardware revisions whether there has been significant change in the DAC for the analog outputs? My brother actually has a PS3 hanging around so I'll see if I can compel him to let me borrow it when he's next in town :-)

    2. Right, essentially the sound of the PS3. There is a Wikipedia PS3 article, however I like this one.

      There's also an aftermarket analog cable for the PS3, however HDMI does both stereo and multichannel in the digital domain.

      Used PS3s are very common ~$90USD.

      I'll be watching for the review.

  6. Hi Archimago,

    Thank you for testing my audio converter AuI ConverteR 48x44.

    I want add some details:

    1. Used "brick-wall" filter with cut under 20 kHz allow pass by any possible ultrasound troubles for some hardware and/or playback software (in single system with hardware).

    I suppose any hardware has the best (for itself) features in range 0 … 20 kHz.
    But upper frequencies (ultrasound) may be shifted and mirrored to audible range.

    Thus AuI ConverteR 48x44 do "half work" of hardware. It can considered as optimizing audio stream to apparatus.
    And allow avoid existing of extra distortion by ultrasound of original (non-converted) stream.
    This technology used also for PCM/PCM conversion.

    In general, as I suppose, it must impact even to cheap apparatus in direction sound improving.

    I already not once get suggestions by AuI’s users for extending cut border up to 25 kHz and/or use less steep filter (where it possible).
    I planned it for future experiments.

  7. 2. About high frequency noise for some high resolution records
    As I assume, and how wrote in the article
    DXD records is not pure PCM. DXD has high frequency noise what, as I assume, inheritable from DSD.

    I got feedback from AuI’s users about noise removing by converting with AuI ConverteR without resampling.

    As I assume AuI ConverteR 48x44’s cutting all above 20 kHz allow playback such files at any software and hardware.

    Demonstation of noise removing for hi-res file in video

    After cutting we don’t lost transparency, but got sound without significant noise and open hidden in noise weak details.

    3. Why AuI ConverteR 48x44 has minimum settings

    In AuI used manually tuned resampling filters for each combination of input/output sample rates.
    Via measurements all contradictory features (steepness-ringing-performance, etc.) adjusted to optimal combination of values.
    Regard to resampling, user available only: turn DSD ON/OFF for unchanged sample rates, switch linear/min. phase filter.

  8. 4. AuI ConverteR 48x44’s minimal phase filters

    AuI default use linear phase filters. In Settings window /General tab possible turn to minimal phase filter mode.
    Phase response of AuI’s minimal phase filter very close to linear form.

    Need consider that pre-ringing artifacts for minimal phase filter moved behind to front of output (converted) signal.
    Thus we get 2 times more energy post-ringing than linear phase filter. Same things we can see in pictures of ringing comparing in the article.
    Here all according to theory of conservation of energy :)

    AuI’s minimal phase filter can be applied for any conversion as PCM as DSD.

    During several years since begin applying minimal phase filter, I don’t got any unambigous feedback that anybody prefer minimal phase filters.
    Here I said not only about AuI’s minimal phase filters.
    I got not once feedback about preferring of linear phase filters. Not only AuI’s.

    5. AuI ConverteR 48x44’s and DSD128 (5.6 MHz), D256 (11.2 MHz), D512 (22.5 MHz)

    Indeed D64 has level noise/distortions better CD's -120 … 130 dB (vary by DSD coder/decoder combination).
    I was compared D128 vs. PCM24. D128 he’s significantly lower level of noise -146 vs. -177 dB
    PCM 32 bit float (what I always use for measurements during testing software) superrior D128: -201 vs. -177 dB.
    Details here:

    D256 and D512 has so low noise that (as I assume) it restricted 32 bit floating point precision.
    While for me unavailable enough precise tool for correct estimating D256 and D512, for checking this hypothesis.

    About level of ringing

    In general, I suppose, ringing artefacts, phase and other distortions of final master's offline conversion is not compared by energy with same distortions during mixing and postproduction. Including different effects, infinite impulse response filtrations, real-time algorithm simplifications, …

    However we have possibility get audio stream more optimized for playback at available apparatus, even cheap.

    Best regards,
    Yuri Korzunov

    1. Thanks for the detailed feedback Yuri. Great to hear from developers who get their hands dirty with the actual coding/math/technical details.

      I must say that what you have done is bold :-). Strong filtering at 20kHz (for DSD audio despite all those who claimed in the old days that DSD64 was capable of up to 100kHz!), steep linear phase filtering to boot (forget the audiophile guys and those "apodizing" minimal phase filters!). Of course, you have maintained the high-resolution bit depth nicely within the 20-20kHz range.

      What I find most interesting is that you did not get "any unambiguous feedback that anybody prefer minimal phase filters". Now that I think says something about the importance (or lack of importance) in terms of audibility of these filter types and the fancy impulse response graphs...

    2. Dear Archimago

      Please try the different filter settings of your UD501 again, with minimum miked recordings in real rooms and listen over loudspeakers (instead of headphones).

      In the other blog you mentioned, that you haven't heard any difference of the different DSD to PCM converters, while listening to your HD800. I mentioned, that the audible difference of the different filters is audible as changing of the sound stage (when listening over speakers). Listening with headphones is binaural and that is different to stereo, when it comes to reproduces real room impressions. Sure, wich headphones you do hear also the tonal characteristic of the room, but not how the musicians where actually sitting in that space, during recording.

      Attached I have added some time measurements of your UD501 for your reference (I know you have made similar measurements). I have had the UD501 two years ago for a one week trial, when I was looking for a good headphone amp (but I send it back (but this doesn't matter in this case)). What you will see in my measurements, is not the Dirac impulse, it is a single cycle of a 20 kHz sine wave.

      Please have a second try with my mentioned track from 2L "Quiet Winter Night". Or if you like, I can offer you a link of a excerpt of one of my recordings, done in pure stereo, so only two mics equivalent stereo in real space with real musicians on real instruments. Please have a second try. Thank you.


    3. Hi Juergen,
      I'll have a listen again in the next couple nights when I have time to 'Quiet Winter Nights' on the speakers with different filter settings on the TEAC UD501 as suggested...

      So that I'm clear, you are suggesting that I listen to a 44.1kHz version, right? I'm assuming that with higher sampling rates, the effect from the antialiasing filter would be even higher up and harder to detect... Ringing would be way up above 40kHz. So if I download the 24/96 2L sample, I really should do a downsample to 44/48kHz, then listen with the various filter settings, should I not?

      For those unaware, here are the 2L high-res samples:

    4. Hi Archimago

      Yes, with 44k1 material, the difference between different oversampling filters is more obvious, as with higher sample rates.

      I would give you also two examples from my recordings. Both are made in pure stereo, so only 2 cardioid condenser microphones in equivalent arrangement (similar ORTF, OSS, NOS, ...) without compressors, without limiters, without reverb or anything artificial.

      One track is a mandolin concerto, recorded live in a church. So lots of room information and a bit long decay in the bass section. But fine and nice details.

      The second one is a concerto orchestra playing Vivaldi in a medium sized hall, with all wood ceiling, so a bit less vibrant in the treble compared to the other.

      I have cut both into 1 Minute excerpts. I hope you enjoy it.

      Good luck, and give me / us your feedback. Thanks.


      PS: Both recordings are made in 88k2 24 Bit and for this purpose, rendered down to 16 Bit 44k1 with noise shaped dither and minimum phase filter.

    5. Fantastic Juergen! I'll have a listen on the Teac. I had a quick listen at work and they clearly sound amazingly recorded.

      Here's an idea.

      Would you be OK if I utilized the 2 samples to run a little blind test for the readers here given the quality of the work? Here's the gist:
      Each sample: strings / mandolin will be upsampled with SoX to 24/192.

      One will be with standard linear phase (VHQ Linear), the other with VHQ Minimum Phase.

      As usual, I'll collect data anonymously, asking respondents:
      1. Which of the two samples they "prefer".
      2. If listeners can grade (maybe out of 4) whether they hear a difference... (No difference, small, moderate, clear difference).
      3. As usual, have testers describe the gear including DAC, speakers, headphones...
      4. Ask for subjective description of what difference they heard.

      Of course, the testers need equipment capable of 192kHz. I'd certainly be curious if there is a statistical significance (or even tendency) towards one or the other upsampling filters as preference by listeners.

      What do you think?

    6. Hi Archimago

      Yes you can take my two excerpts in sense of "fair use".

      So with the Upsampling (I would prefer 176k4 instead of 192k to have a more clearer process) you want to get rid of the main effect of the hardware upsampling of the DAC chip and leave mainly the effect of the software oversampling filter. I understand.

      Do you have the impulse response and the frequency response of the above mentioned filters that you want to / will use (as this can be seen at the website, or in the stereophile reviews). This would be nice to see.

      But please: Listening should be done over loudspeakers and not over headphones.

      Thank you

    7. Thank you!

      Absolutely, let's do 176kHz. I will use the newest SoX 14.4.2 and use the settings for VHQ Linear and VHQ Minimum phase as can be seen on

      In fact, here's the command line I plan to use:
      VHQ Linear phase:
      sox xxx.wav -b 24 xxx_out.wav rate -v -s 176400

      VHQ Minimum phase:
      sox xxx.wav -b 24 xxx_out.wav rate -v -s -M 176400

      I'll of course show the impulse response I create myself to verify that the program output was correct.

  9. Hey, Archimago!

    Which Precision setting did you use in foo_sacd on your final experiment (the one where you added the Izotope RX 30kHz filter)?

  10. I have now used EZ CD Music Converter because its Ulta High precision DSD to PCM gives me the best audio quality. Have you tested it ? It Is free download at

    Thanks for the great professional audio blog.