Archimago's Musings: Home Audio Fidelity's (HAF) X-talk Shaper DSP. And is crosstalk correction/cancellation (XTC) just an "effect"?

Saturday, 2 March 2024

Home Audio Fidelity's (HAF) X-talk Shaper DSP. And is crosstalk correction/cancellation (XTC) just an "effect"?

See video and plug-in info at Home Audio Fidelity site.

Let's spend some time talking about X-talk Shaper in this post, a new DSP plug-in that will allow speaker system listeners enjoy crosstalk cancellation.

For those who have not read much about this, perhaps review the post from last year written with STC on crosstalk cancellation (XTC) and Ambiophonics. There's also the reposted article written by Ralph Glasgal that discusses some of the rationale for Ambiophonics you might find interesting and I hope provides good background for the 'hows' and 'whys' of this technique.

My understanding is that for awhile now, Home Audio Fidelity has offered room correction with and without incorporating the crosstalk reduction option into the convolution filtering for their customers. This new plug-in then will only utilize the crosstalk reduction portion itself with the ability to customize the sound based on your speaker and head geometry as per the settings:

As discussed recently, I've been using a fanless Intel i3-N305 MiniPC in the audio room and as part of this, have incorporated crosstalk cancellation DSP playback as one of the outputs available to Roon. I've been doing this through the DSP Studio facility in JRiver Media Center and above we see a screenshot from that set-up.

The interface is simple, we have the "Listening Angle" knob which corresponds to the angle between speakers and the listening position. The "Interaural dist" reflects the width of your head between ears - 15cm is the default although like hat sizes, double check for yourself. The "Effect" knob determines how strong the cancellation effect is. "Volume" is self-explanatory, just whether the DSP applies some attenuation to the signal.

In JRiver, turn on "Process independently of internal volume".
Allow DSP to have access to full signal level for processing.

Obviously, have a listen, you'll be able to hear the significant difference crosstalk cancellation makes. Just take "Bypass" on and off. You don't need objective measurements to know if you need or want this.😉

Having said this, since this blog is about looking a little further under the hood at how things work, we can objectively capture/demonstrate the effect... Despite the beliefs of some subjective-only audiophiles, with the resolution and capabilities of modern computers and ADCs, we can always demonstrate a difference if in fact there is something to be heard! It's a matter of knowing what we're testing for. Whether you personally like/enjoy the demonstrable differences is of course the truly subjective part. The only things in the audiophile hobby that cannot be measured or demonstrated as having an objective effect are snake oil products which play on expectation bias and the purely subjective testimonies typically primed by purveyors of such products and echoed by the faithful (for example this laughable recent Stereophile cable hype article).

As demonstrated with the uBACCH DSP, we can examine the digital filter's effect on the impulse response as it is modified going through the processing. Here's what it looks with X-talk Shaper in "Bypass" (no modification) mode, then the default linear phase setting, and finally minimum phase selected:

NOTE: Impulse responses shown to examine morphology.
Linear phase processing will add latency, not shown.

Notice that I'm using a linear phase low-pass upsampling filter hence the symmetrical pre-/post-ringing in DSP Bypass. The intent of this diagram is obviously just to show the morphological change when we apply the crosstalk cancellation. When we use the Minimum Phase setting, like with minimum phase low-pass filtering, there is less pre-impulse effect. The upshot might be as sense of better "immediacy" with transients, at the expense of potentially less effective crosstalk cancellation plus the usual high frequency phase shift found with non-linear filtering (likely not audible).

Obviously changing the speaker angle will have a significant impact on the impulse response as well - here's an example of what happens when the speaker angle is set 30° vs. 60°:

We can surmise the effect is noticeably less strong when speaker angle higher.
(Default linear phase setting used for both.)

The "Effect" knob changes how strongly the crosstalk cancellation signal is sent to the opposite channel. We can easily see this with a 5kHz left-channel-only sine wave sent through the DSP and examine the amount of cancellation content showing up in the right channel:

The "Effect" knob clearly changes the strength of the cancellation signal sent to the other speaker. Notice the phase/time shift between the left and right channels calculated based on the geometry of the speaker angle + interaural distance (space between the ears). While performed differently, this is the same principle as active noise-cancellation headphones sending out a signal that attenuates the noise from the outside reaching your ears; in this case, the signal is intended to reduce the sound heard by the other ear just inches away. Another analogy might be that of a "reverse crossfeed"; crossfeed being the intentional channel mixing that some headphone listeners will use to simulate what happens when listening with speakers. The big difference between this and simulations with basic crossfeed however is that when listening to speakers with XTC, the sounds are still emanating from external space and we perceive it as such using our personal HRTF (head shape, pinnae), unlike headphones and the typical "inside the head" lateral soundstage.

This is all calculated by the DSP and applied realtime during playback. With today's modern CPUs it's no problem at all. Here's a look at my CPU load while playing a 24/96 stream from Roon (server computer) → Roon Bridge on the MiniPC → JRiver/X-talk Shaper:

This is the i3-N305 MiniPC with power-limited BIOS setting, processed through 32-bit X-talk Shaper DSP (64-bit version would likely be even more efficient). The computer was able to do this at 10% load running at 1.22GHz - not even the typical base 1.8GHz speed, much less activating turbo to 3.8GHz. As audiophiles, know that audio processing for playback typically isn't a big load for modern CPUs unless you're running a bunch of DSP processes concurrently and possibly in multichannel. More likely to be taxing the computer on the studio production side than consumer playback side.

I hope you found this dive into HAF's X-talk Shaper software and its effects interesting! It's great to see another DSP option available for audiophiles to try out. Here's the link to the X-talk Shaper DSP software which will run in demo mode but intrude every minute with a dip in the volume for a couple of seconds (IMO, a great way to implement software demos). It's available for Windows (32/64-bit versions, VST, VST3, standalone) and Mac (Audio Unit, standalone). Registration is linked to your specific computer hardware and OS based on key ID, and costs €129.

[IMO, this is potentially a much better deal than the unreasonable high cost of uBACCH which also implements crosstalk cancellation through convolution DSP. I'll leave you to subjectively listen and decide for yourself which sounds better in your system.]

The fun of music is in the listening of course, so give it a try yourself with your speaker set-up. Play with the settings; feel free to deviate from your speaker "Listening Angle" in order to achieve most pleasurable effect - I do. If it works well, you should hear neutral tonality yet the soundstage widens with improved depth (better sense of "immersion") as crosstalk distortion is reduced and your ears/brain experience possibly a more accurate interaural time and level differences (ITD + ILD) embedded in the recordings we perhaps never knew were in there!

Tip: to test that you're not hearing weird distortions or tonality, try using mono music with distinctive vocals. For example, the gravelly, detailed vocals of Louis Armstrong like the track "A Kiss to Build A Dream On" on Satchmo Serenades should sound well centered, without artificial glare when played through your DSP of choice.

--------------------

But is crosstalk cancellation just an "effect"?

I've heard variants of this question asked online and in videos over the last few months. I think the underlying belief is that perhaps there is something pejorative about adding "effects" to the sound we hear. As if this could damage or even destroy some notion of an idealistic "absolute sound" that we're after.

Well, of course crosstalk cancellation has a very significant effect on the sound, so yes, it is and has an "effect"! Putting a mattress or wall between the speakers as a hardware solution to separate the sounds reaching the contralateral ear will also have a noticeable effect. 😁 [Furthermore, in my mind, choosing to introduce vacuum tubes into the playback chain and selecting vinyl playback are also no less "effects".]

However, the question is really whether that effect is good/beneficial... Whether it serves the purpose of allowing the listener to hear what's actually in the recording - even hearing what's truly in the mix that the artist(s) and studio engineer(s) did not know were there! This idea should not be foreign to us because I think we already know that most musicians and pro audio folks are not obsessive audiophiles.

Crosstalk cancellation (and general Ambiophonics playback) has the potential to remove at least some of the crosstalk distortions from speaker systems which we have all lived with, and experienced as "normal" since the beginning of home 2-channel playback even though we've also all heard the benefits of crosstalk reduction when we wear headphones. Even if you're sitting nearfield listening to excellent speakers (less room effects), notice how much easier it is to use headphones for A/B listening tests to detect subtle changes when we don't hear the distortion from signals intended for the other ear. But other than those who have tried physical partitions between the ears, or DSP like AmbiophonicDSP [note: apparently not being sold anymore], (u)BACCH, X-talk Shaper, etc., probably few music lovers, audiophiles, or even studio production engineers would have heard the expanded immersive soundstage through just 2 speakers on many recordings.

For example audiophiles, despite how often you might have heard a track like "Keith Don't Go" used in audio show demos, I would argue that you have not really heard the best from Nils Lofgren's Acoustic Live (1997, DR11) until you've heard it though good crosstalk cancellation dialed in for your 2-channel speaker system.

Since XTC is not used routinely in music production studios, albums would not have been "checked" for how they sound which means what is heard might not be "as the artist intended".

For example Bob Marley's "Jamming" (off Legend: The Best of Bob Marley and the Wailers, 1990 CD first pressing) has percussion parts that sound like they're sitting right against the ears as if one is virtually wearing headphones. This is pretty weird but an interesting experience I suppose; not something I imagine Bob would have heard in the studio. Would he have wanted this effect? (Perhaps not.)

There are some other older recordings that really get a boost from XTC, for example Jon and Vangelis' The Friends of Mr. Cairo (1981, DR15), check out the title track! Car whizzing by, car crashes, gunshots in the distance, voices at different depths, damsel in distress, alien-type ray-gun effects, synths, distorted vocals, etc. A cute ode to gangster movies and the golden age of Hollywood. Did Jon Anderson and Vangelis intend for the soundstage to be this wide, and "objects" this well delineated? Have they ever heard this recording with XTC on? Who knows unless we ask them. More importantly, should we even care!? This is clearly an artificial studio production which can obviously sound highly immersive but does not reflect "realism" in that this recording was never meant to replicate the sound from any actual performance one could ever attend.

Modern pop recordings already incorporate a ton of DSP and I find that they tend to sound fine with XTC despite many inherent issues like poor dynamic range. Christina Aguilera's AGUILERA (2022, DR6 - yuck!) sounds pretty good - have a listen to "Pa Mis Muchachas" and the more traditional "La Reina". And Dua Lipa's latest single "Training Season" also sounds expansive already, and a bit more so with X-talk Shaper. Her track "New Rules" from the Live Acoustic EP (2017) also sounds great with XTC. I could go on...

Whereas crosstalk cancellation is still stereo playback (assuming not further processed with surround speakers for ambiance extraction) and there are significant limitations for those listening outside the sweet-spot, true multichannel does provide better imaging for those sitting off the ideal position, is able to anchor the center image better with good center speaker, and can provide sounds originating from the rear (or overhead as in Atmos).

The first track on the Hans Zimmer Dune: Part Two (2024, DR11 multichannel) soundtrack, "Beginnings Are Such Delicate Times" (well known quote from the Frank Herbert book) with its deep bass and massive impression of spatial volume right from the start, mixed with an intimate central melody when heard in a multichannel/Atmos system is an example of impressive immersion that simply can't quite be captured with the 2-channel version even with XTC. This massive spatial sonic gestalt returns in the track "Eclipse", and the cacophony of sounds/noises in "Water of Life" sound phenomenal in multichannel surround.

Consider incorporating crosstalk cancellation in your arsenal of audiophile experiences. Have fun folks as you explore the joys of Fidelity, Immersion, and Realism in your own home and over the lifetime of audio playback!

As always, I hope you're enjoying the music...

BTW: I'm in Houston next week. Probably won't have much time, but let me know if there's an awesome music/audio store to check out in case I have some down-time.

30 comments:

Mike59592 March 2024 at 11:33
Hej Arch,
So much to digest! Love your lengthy forays into subjects I need to know more about. I have a multichannel setup and I encourage all 2 channel dinosaurs to at least give this format a proper listen. It is after all the natural evolution of sound reproduction if our audiophile goal is to get as close as possible to a live concert experience. In my naivety I assumed this format would be lauded and embraced by music lovers. Of course I was wrong. I stumbled across a post in one of Roons forums where someone urged people to try the Bacch4macs 3D software as he was completely floored by the experience. He writes, In 38 years of being an Audiophile I’ve never have experienced such an incredible increase in enjoyment in musical realism and added purity through increased dimensional detail and resolution by using distance based or custom (using binaural in-ear microphones) crosstalk cancellation filters. It’s not a frugal endeavor but I do make it a point to mention to audiophiles my experience with it has been revelatory. The response?... I am content with my 2-channel stereo setup, thank you very much. I am here for the music, not for the latest sound-processing gimmick.
History teaches us that many groundbreaking inventions were ridiculed when they first appeared. The light bulb was not immediately accepted, The British claimed” good enough for our Transatlantic friends... but unworthy of the attention of practical or scientific men." And then we have vaccines, personal computers, internet and so on. All met with skepticism and distrust.
This inherent negativity and suspiciousness for anything new and revolutionary is sad and sometimes even harmful. Music will be enjoyed regardless of how it is reproduced. But if there is a way to immerse myself even more, then I am all for it!
Cheers!
Mike
ReplyDelete
Replies
GillesP2 March 2024 at 12:27
Hi Arch. I looked again at Dune 2 on Amazon and now there is no Atmos track at all! I guess like DSOTM it’s Apple paying for some exclusive « spatial audio » version of a popular release for a time. I could stream Apple Music on my Sonos system but I really don’t like their attitude…I’ll wait.

Regarding XTC, I found out that simply putting both your hands on your nose to make a short 10-finger wall, and already there is more channel separation. Take your hands away and the two channels get mixed up again!

Brings back memories of the old three stooges eye poke joke. ;-)
ReplyDelete
Replies
Mikhail3 March 2024 at 06:05
I see you are getting hooked on XTC, I think you can go a long way trying to make the experience perfect by trying to adjust the cross-talk cancellation to your personal head and torso properties. Regarding the question whether it's an "effect" or not—we should start from the fact that stereo recording is already a somewhat artificial attempt to capture or re-create the reality using a very limited representation, much like like a photography maybe. And then we use various ways to extract that reality using our playback system. This is similar to the fact that you can print the same photograph on a different medium, process it using different color profiles, etc—and it will be perceived differently. It's always half engineering, half art.
ReplyDelete
Replies
ST4 March 2024 at 22:07
Hi Archimago, what is the duration of the impulse response? Thanks.
ReplyDelete
Replies
fgk8 March 2024 at 21:34
When I listen to music with headphones, I use 112dB Redline Monitor VST crossfeed plugin. So, if I want to try this X-talk Shaper VST plugin (for headphone listening), does it make sense to use it before the 112dB Redline Monitor?
ReplyDelete
Replies
fgk23 March 2024 at 05:23
Hi Archi,
Do you know Anaglyph (high-definition binaural spatialization engine)?
I would be interested to hear your impressions about it.
http://anaglyph.dalembert.upmc.fr/index.html
https://www.aes.org/tmpFiles/elib/20240323/19544.pdf
ReplyDelete
Replies
Tell7 April 2024 at 14:19
I just tried this out and I can't hear ANY difference whatsoever having this on or bypassed. The volume slider inside the VST seem to be working at least so I guess the rest should be working as well, except that nothing really happens when working the sliders. I think I have a quite good system and my hearing shouldn't be too bad either, so why am I not hearing any difference?
ReplyDelete
Replies
Kirk Boone30 July 2024 at 09:55
For many years I've used a modified (Gundry circuit, new caps and connectors) Carver C-9 for modifying the crosstalk. Now, I'm using the software only version of BACCH on my mac and I must say it is a great improvement. Whether you use a Carver, BACCH or the X-talk (which I have not tried), it's hard to think about doing without once you use it for awhile.
ReplyDelete
Replies
fgk14 March 2025 at 05:48
Foobar now has this component:
LCC (Localization Cue Correction) is a solution for spatialized audio through stereo speakers.
https://www.foobar2000.org/components/view/foo_dsp_lcc
ReplyDelete
Replies