Wednesday, 2 July 2014

24-Bit vs. 16-Bit Audio Test - Part IIa: The 20 Correct Respondents...

As suggested by one of the comments in Part II, I have put together a summary of the respondents who got the 3 sample audio tracks correct (answered B-A-A). Let us see if there are any demographic variables that stand out. Remember, out of 140 respondents, 20 were able to identify all 3 24-bit samples. This is of course not significant (p 0.30) - by chance alone, one would expect approximately 1/8 to be correct (17.5 out of 140).

Here are the graphs direct from the survey site. Feel free to compare / contrast with those in Part II in the "Demographics" section.

95% males with 1 woman in the group. Not unexpected.

Average age calculated from the median age in each category: 45 years. Essentially the same as the 44 years old average for all 140 respondents.

24% musicians as compared to 22.1% for all respondents.

30% does audio "engineering" as compared to 24.3% for all respondents.

None of these who got all the choices correct identified themselves as an audio hardware reviewer. Since this was an optional item, 1 individual did not answer the question.

11/19 (58%) were Windows users (almost exactly the same as the full group at 60%). Again, we see external USB/Firewire DAC being used mainly (13/20 - 65%). The only thing that seemed quite different was a higher proportion of headphone users here at 12/19 (63%) versus about 50% in the total group. Note that there was 1 individual who got all 3 24-bit selections correct who did not go into detail with his reporting, hence the total of 19 since he did not click either the speaker options or headphone option.

Average cost of the systems (based on median price in each price category) used by those who got all 24-bit samples correct: ~$6600. This is lower than the $8160 for all respondents. It's a smaller sample so I wouldn't put too much into this... No evidence in any case that these folks on average had more expensive systems. Like in the main group, the $1000-3000 segment was most common.

The person who used the "$100-$250" system was running what looked like a basic Windows PC with some Sennheiser HD433 (~$25) plugged in. (Remember, I dissuaded folks from listening directly off the computer since often the on-board DACs are poor - worked out well in this case!)

The person using the "$50,000-$100,000" system included the use of a Squeezebox Touch --> Berkeley Alpha DAC, Cary CAD 120S tube amp, Revel Ultima Studio 2s. Not sure about the preamp function. Looks like a nice setup...

Other gear used in this subgroup as I browse through them: balanced cable Sennheiser HD800 headphone, Grado SR80 + Bose QC15 headphones, Parasound amp & preamp, PSB speakers, Senn HD800 connected to Woo Audio WA7 amp, Senn HD600 connected to AudioQuest Dragonfly 1.0, NuForce DDA-100 to KEF Coda 70 bookshelf speakers, Arcam rDAC to Arcam DiVA AVR280 receiver, KEF iQ30 bookshelves, Benchmark DAC2 HGC with Grado RS1i headphones, Wyred4Sound DAC-2 to Senn HD800. Certainly some excellent gear there but not really exotic.

20% (4/20) used a listening tool like the ABX test. Exactly same as total group average.

So... How "confident" were they of the answers picked?
45-50% graded their confidence low - either "guessing" or "2 stars = More than a guess" for each track. Again, overall about the same as the total group. There seemed to be a bit more confidence in the Bozza track however (looked like everyone selected "4" rather than spread out between "3" and "4") and the confidence level dropped off by the time Goldberg was evaluated similar to the previous report. With such small numbers, it's difficult to put strong weight into specific results such as these.

In summary, this demographic doesn't look very different from the general group of all respondents - perhaps proportionally more headphone listeners. Remember however that I was not able to find a significant improvement in general accuracy of identifying 24-bit audio comparing all the headphone-using respondents in the overall group.

Of interest also was the fact that the average cost of the hardware used by these respondents was no higher than that calculated for the total group.

Although this subgroup was accurate in their choice of which sample was the 24-bit audio, confidence overall remained relatively low with 20-25% admitted to being "guesses".

Part III: SUBJECTIVE COMMENTS & FINAL THOUGHTS

Friday, 27 June 2014

24-Bit vs. 16-Bit Audio Test - Part II: RESULTS & CONCLUSIONS

See Part I: PROCEDURE for details around the test samples used and how this study was conducted.

In this installment, let's have a look at the results from the 24-bit vs. 16-bit listening test among respondents.

First I need to remind everyone that the test procedure was not easy. As demonstrated in Part I, the sonic difference between the original 24-bit track and the 16-bit dithered version is down below -90dB. This makes the test much more difficult than the previous high bit-rate MP3 test from last year... Whether you were able to detect the 24-bit version or not, I applaud your efforts and input.

As I noted previously, there were 140 total respondents and looking at the transfer statistics from my FTP server, I know the test was downloaded at least 350 times. Response rate just based on my FTP server transfer was therefore about 40% of all who downloaded. The actual response rate would likely be significantly lower since there were other download sites.

Results

I. Demographics:


First let us consider the characteristics of the respondents taking this blind test. Being that this is an internet test, involves downloading 200MB worth of high-resolution audio data in FLAC, and given the target audiophile forums where the test was advertised, it is reasonable to conclude that many if not most are tech savvy audiophiles rather than the "average" music listener.

Not surprisingly, the vast majority (98%) were men which is expected (just have a look around audio clubs, audio shows, etc.) - thanks to the 2 ladies that responded!:
The age distribution likewise isn't a surprise. Audiophiles tend to be a bit older overall, and the average age if we estimate using the median age in each range comes out to about 44 years old. The distribution looks like this:
Nice to see some teenagers and early 20 year olds with the majority in the 41-50 age category. If one were a computer audio manufacturer, the 40-50 age group would be the one to target for maximal effect in 2014.

The survey also asked if some of the respondents belonged to specific categories such as musicians and those with audio engineering experience. This could be useful  in the sub-analysis to see if there were more "golden ears" in these groups:
 
By self report, there were >20% musicians and audio "engineers". Of course these 2 groups were not exclusive and 17/31 musicians also identified themselves as doing audio recording/mixing/editing.

As for the hardware utilized by the respondents, here is the general layout of the type of gear being used to evaluate:
In terms of operating systems, of the 3 main OSs - Windows, Mac, Linux - it's clear that Windows predominated. 129 respondents used one of these 3 OS's and Windows was 60% of that followed by Mac at 23% and Linux 17%. Among streaming devices the Squeezebox was tops. Most respondents used an external USB/Firewire DAC to conduct the evaluation; not surprising that in the computer audio world, SPDIF interfaces are no longer as common and a few used the HDMI interface (surround receiver devices).

There was an even split in respondents using speakers (bookshelf + tower) of 74 and headphones 72 (a few used both).

Here's how the audio system "cost structure" looked (US$):
Weighted average using the median price in each category yields a system price of around $8160 on account of the number of expensive 5-figure systems reported (22% had systems >$10,000). The median audio system price is in the $1000-$3000 range. This is very reasonable and again speaks to the demographic who would download and try a test like this. Objective >16-bit resolution is easily achieved in a $1000-3000 system as demonstrated with even relatively inexpensive DACs measured here over the last year and by having a look at the Stereophile objective results.

Many respondents went into detail describing their systems in the survey.  The first 25 responses included full Meridian active speakers, Sennheiser HD800 headphones with upgraded cabling, custom amplifiers, tube amplifiers, custom ESS9023 DAC, NAD amp, Lyngdorf TDAI 2170 digital amps into Intonation Terzian speakers, Overdrive SE USB DAC, Parasound Halo JC-1 monoblocks, custom ribbon speakers, Cambridge Azure 840E, Focal 1028BE speakers, Sonus Faber Cremona Auditor M speakers, Sony MDR-7509HD headphones, Grado SR325 headphones, Audiolab M-DAC, Chord Hugo DAC, AKG Q701 headphones, Squeezebox Transporter, PS Audio 4.6 preamp, Pass Aleph 5 amplifier, Devialet 170 integrated DAC/amp, Martin Logan Montis speakers, Geek Out 720. Clearly, many respondents used very high quality equipment for this test.

As a reflection of the technological savvy of the respondents, many utilized ABX testing such as the Foobar ABX tool:
20% utilized listening tools to evaluate (ignore that 3rd bar above since it's just a reflection of how many left a description, 29/140 used an ABX tool). Other than Foobar ABX, Mac ABXTester was common, and others described their own script.

II. Were the 24-bit audio files distinguishable from the same files dithered down to 16-bits (and fed into the DAC in the 24-bit container) by the respondents as a whole?

In total, the final result looked like this:




As you can see, in aggregate there is no evidence to show that the 140 respondents were able to identify the 24-bit sample. In fact it was an exact 50/50 for the Vivaldi and Goldberg! As for the Bozza sample, more respondents actually thought the dithered 16-bit version was the "better" sounding 24-bit file (statistically non-significant however, p-value 0.28).

Looking at the individual responses, there were a total of 20 respondents who correctly identified the B-A-A selection of 24-bit samples, and 21 selected the opposite A-B-B. This too is in line with expectations that 17.5 would pick each of these patterns based on chance alone.

III. How certain were the respondents that they answered correctly (ie. able to identify the 24-bit sample)?

24-32% of respondents felt they were unable to hear a difference (1 star = "Guessing"). If we consider that those who chose "2 Stars = more than a guess" also represent a very low level of certainty, then we can see that 45-52% of respondents really had quite low confidence that they were able to tell the difference.

Fewer respondents were "certain" about the solo piano piece (Goldberg), and in general more seemed confident about the Bozza piece. This could be listening fatigue if one were to progress through Bozza-Vivaldi-Goldberg in sequence to account for this result.

IV. Were the respondents who felt more certain about their answer more likely able to identify the 24-bit audio?

Let us have a look at the results reported by those who rated their confidence level as 4 or 5 ("very confident" to "certain" - 25-30% of all the responses):

"Correct" responses being the ones who were successful in identifying the 24-bit sample. As can be seen, there is no evidence to suggest that even in those respondents with a strong sense of confidence were able to identify the 24-bit sample (as sounding better). In fact, for the Goldberg sample, only 44% of those who were quite "certain" selected the 24-bit version correctly.

V. Were the subgroups (musicians, sound engineers, hardware reviewers) able to identify the 24-bit audio better?

Due to the fact that respondents admitting to "guessing" tended to answer with A-A-A and this would severely impact a small sample size, I decided to not count the "guesses" in these smaller subgroups and see if there was any pattern of higher accuracy compared to all respondents.

Musicians:


As a subgroup (total of 31 respondents), the self identified respondents with a "good amount" of musical background did not do well. In fact, this group of respondents consistently scored worse than the combined result. Curiously, the musician group seemed to select the 16-bit dithered Vivaldi as the "better" sounding version (p-value 0.047).

Sound "Engineers" (those with experience recording, mixing, editing):


As a group the "engineers" faired better than the musicians in terms of accurately identifying the 24-bit tracks. This subgroup surpassed the accuracy of the combined respondents marginally. Again, the number of individuals was small (34). There was an overlap between the "musician" and "engineer" group with 17 individuals identifying themselves as both.

Hardware Reviewers:

This was an optional survey item that could be interesting to look at since audiophiles who provide hardware review opinions can have significant influence on sentiment and purchasing decisions.


With only 8 respondents, it would be difficult to draw any firm conclusion other than there is no evidence to suggest this subgroup was any more able to identify the 24-bit from dithered 16-bit audio.

VI. Were those with more expensive hardware able to identify the 24-bit audio better?

In total, there were 44 (31.4%) respondents using $6000+ equipment to perform this test, let us see if they were more accurate than the group average in identifying the 24-bit sample:


As you can see, the ~30% of respondents utilizing equipment costing >$6000 were not able to accurately identify the 24-bit audio track any better than the group average. The Vivaldi track was exactly at 50% accuracy.

VII. Did Headphone Use Improve Accuracy?

72 respondents used headphones in their evaluation. Since headphones can be potentially more accurate (no room acoustics, better noise isolation) at a lower overall cost, it would be interesting to see if accuracy in determining which was the 24-bit sample was any better.

As you can see, headphone use did not result in any appreciable improvement.

VIII. Did age have any effect on the accuracy?

There were 44 respondents 51+ in age. As a group, this is how they did compared to the overall result:


No evidence again of any significant change in accuracy in identifying the 24-bit audio.

Conclusions:


In a naturalistic survey of 140 respondents using high quality musical samples sourced from high-resolution 24/96 digital audio collected over 2 months, there was no evidence that 24-bit audio could be appreciably differentiated from the same music dithered down to 16-bits using a basic algorithm (Adobe Audition 3, flat triangular dither, 0.5 bits).

This survey was targeted to audiophile enthusiasts who in general reported using equipment beyond typical consumer electronics. The majority (77%) were using audio systems reported in excess of US$1,000 and 22% were listening with systems in excess of $10,000. Furthermore, 20% used an ABX utility in the evaluation process suggesting good effort in trying to discern sonic differences. There were no surprises in terms of demographics with the vast majority being males, with an age distribution centred around 41-50 years old.

Subgroup analysis of "musicians" and those who work with the technical aspects of recording, editing and mixing ("engineers") did not demonstrate evidence of special abilities at discerning the 24-bit audio. The "engineers" group did perform slightly better overall. The small group of individuals who identified themselves as writing hardware reviews did not show an increase in accuracy.

About 50% of respondents admitted that they had low confidence in their ability to discern differences. Conversely, 25-30% (depending on which musical sample) of respondents reported a strong sense of "certainty" that they were correct in identifying the 24-bit sample. Nonetheless, analysis was not able to demonstrate improved accuracy despite claims of increased subjective confidence by the respondents.

Furthermore, analysis of those utilizing more expensive audio systems ($6,000+) did not show any evidence of the respondents being able to identify the 24-bit audio. Those using headphones likewise did not show any stronger preference for the higher bit-depth sample. No difference was noted in the "older" (51+ years) age group data (not surprising if there is no discernible difference even with potential age-related hearing acuity changes).

Limitations of the study includes the fact that this was an open test distributed via the Internet in an uncontrolled fashion. This allowed the opportunity for test subjects to analyze the audio files objectively rather than through pure listening. However, this is also the mechanism of delivery for high-resolution downloads and the test participants would likely be using the same equipment to listen. The benefit of course is that the results may reflect realistic feedback from potential consumers (if not the target audience) of high-resolution audio. Respondents were able to listen in their own home using their own equipment rather than an artificially controlled environment. The fact that there was no time limit (other than a 2 month window to gather survey submissions) should have been a less stressful experience for the testers.

140 participants is not a particularly large number of data points but it was adequate to demonstrate an even 50/50 split in preference across the 3 musical samples; a level of consistency which adds to the idea that listeners were unable to differentiate 24-bit audio from the dithered 16-bit counterpart. Replication of the results is of course advised.

As expressed previously in "High-Resolution Expectations" (See "Good Enough Room?" section), there is no good rationale for a dynamic range of greater than 16-bit digital audio in the home environment. The results of this survey appear to support the notion that high bit-depth music (24-bits) does not provide audible benefits despite the fact that objectively measurable DACs capable of >16-bit resolution are readily available at very reasonable cost these days.

If 24-bit audio imparts no audible benefit when listening to music compared to the same data dithered down to 16-bits, how certain can the audiophile consumer be that higher sampling rates (eg. 88/96/176/192kHz) would make much of any audible difference? This perhaps should be the target for another blind test. Methodologically, it would be extremely difficult to maintain the blind testing condition over the internet since it would be trivial to run the audio files through a spectrum analyser with no easy mechanism to conceal the bandwidth limitation of lower sampling rates (eg. 22kHz frequency headroom for 44kHz sampling). The reader is encouraged therefore to explore the effect of higher sample rates for him/herself.

One final comment in closing. Notice that the Goldberg track was soft and had a peak amplitude of -10.35dB as demonstrated by the DR Meter (see PROCEDURE post). This means that the full potential dynamic range was not being utilize and for the 16-bit dithered sample, the dynamic range can be encapsulated in <15-bits. Even with this limitation, there was no evidence that respondents were significantly able to identify a difference in aggregate or within subgroups.
 

-------------

As usual, I encourage others to do their own testing. Feel free to drop a link especially if there are other controlled, preferably blind tests showing a significant audible difference between 24-bit and 16-bit audio.

I will put up a Part III over the next week as well documenting the subjective comments made by respondents and final observations... Stay tuned.


Saturday, 21 June 2014

24-Bit vs. 16-Bit Audio Test - Part I: PROCEDURE

Disclosure: Just in case anyone is wondering, I want to make it clear that I have no affiliation with any audio company. I do not derive any financial benefit of significance from conducting this survey (a few dollars from the ad revenue I suppose). I enjoy the audio hobby and wanted to do some "reality testing".

Over the course of 2 months (April 19 to June 20, 2014), an invitation was extended from this blog (archimago.blogspot.ca) to various "audiophile" forums on the Internet for participants to submit responses to an anonymous survey to see if they can identify which sample of music was the original 24-bit source versus the same piece of music (exact same mastering) dithered down to 16-bits.

Although the following may seem pedantic, I want to lay out the procedure used transparently and in detail so as to be clear of the nature of this test and what was done to collect the data.

The musical samples were taken from freely available sources on the internet; 2 classical pieces from the Norwegian studio 2L recorded in high resolution digital and 1 from the Open Goldberg Variations. For the purposes of this test, the "high resolution" 24/96 file samples were utilized directly from those sources (ie. I did not want to do any manipulation of the data like resample to 48kHz).

Musical samples from 2L (available here):
1. Eugène Bozza - la Voie Triomphale (performed by The Staff Band of the Norwegian Armed Forces): A well recorded orchestral track originally recorded in DXD (32/352.8).

2. Vivaldi - Recitative and Aria from Cantata RV 679, "Che giova il sospirar, povero core" (performed by Tone Wik & Barokkanerne) - String orchestra with female vocals. Also DXD-recorded originally based on the description from the website.

The third sample is taken from the excellent recent recording off the Open Goldberg Variations. Again, I am using the 24/96 high-resolution download as a starting point:

3. Bach: GoldbergVariations BWV 988 - Aria (performed by Kimiko Ishizaka). The recording was done at Teldex Studio in Berlin using the Bösendorfer 290 Imperial CEUS concert grand piano. It has been said by some audiophiles that the piano is an extremely difficult instrument to reproduce well. It's also a much slower piece which provides an opportunity to listen to the note decay quality. Low-level spatial room acoustics are also easily heard on this recording.

Due to the size of high-resolution downloads, each sample was limited to 1.5-2 minutes (the 2L samples were 2 minutes long, 1.5 minutes for the Bach). Some of the more interesting or dynamic portions of the musical samples were selected. Only fade in and fade outs were added to the beginning and/or end of the tracks of <2 seconds so as not to be too abrupt. FLAC compression was used to decrease file size.

The dithering process was basic. Using an older version of Adobe Audition (version 3.0.1), a flat triangular dither of 0.5 bits was utilized with settings as shown:
The sample rate was kept at 96kHz. These are very conservative settings and no advanced settings like noise shaping was utilized as featured in some of the "better" dithering algorithms like iZotope's MBIT+ or Weiss' POWr, etc. Adobe Audition again was used to convert the dithered 16-bit data back to a 24-bits container.

The 24-bit and (effective) 16-bit versions were randomly assigned as Sample A or B and files were enumerated 1 to 6 in the final package downloaded by the respondents.

Due to the fact that this is an "open" test released on the Internet (rather than a listening test in a lab situation where variables could be easily controlled), some measures were implemented to prevent easy differentiation of 24 vs. 16 bit-depth by other means than just listening. (Thanks to Wombat for giving me some ideas.)

1. Files 2, 4 and 6 (Sample B of each track) had 1 ms cut off from the start and files 1, 3, and 5 (Sample A) had 1 ms truncated from the end. This maintains the exact duration of Sample A and B but shifted them temporally. Doing this confounded simple null tests that did not take into consideration the slight timing offset.

2. A very low level -140dB (average RMS power) white noise was mixed into the 16-bit dithered samples (remember, they were placed in 24-bit containers) to affect the LSB so that a simple program that just checked the bit-depth (by looking for "0" in the least significant bits) will think that this is an actual 24-bit resolution file. This small amount of white noise would be inaudible and well below the dithered 16-bit audio noise floor (and below the objective noise floor of actual DACs).

3. FLAC was consistently LESS EFFICIENT at compressing the dithered (effective 16-bit) files resulting in larger file sizes. As a result, one of the 24-bit files was purposely compressed at FLAC level 2 (versus level 8) to make the file size slightly larger than the respective dithered version.

[Of note: the beta-testers wanted me to implement even more than the above to hide the identity of the 16-bit dithered files! I suppose I had more faith in human nature.]

Knowing the above, if one were to align the files, cut off 2 seconds from the front and end (to account for any slight variation in the fades), we could run the files through a null test and obtain the following amplitude results:
Bozza - La Voie Triomphale
Vivaldi - Recitative & Aria
Bach - Goldberg Aria
As you can see, the null test demonstrates peak amplitude difference down in the -90dB level (and average RMS difference down at -98dB) as a result of dithering from 24 to 16-bits. Also, for those who had a peek, you can see the higher noise floor during quiet portions such as this fade-in portion in the Bach Goldberg (0.501 seconds in):
24-bit
Dithered to 16-bits
The resulting samples were also run through the DR Meter (version 1.1.1) in foobar to ensure that the volume levels were equivalent:

DR         Peak            RMS           Duration Track
--------------------------------------------------------------------------------
DR12     -10.35 dB   -26.90 dB      1:30 05-Sample A - Goldberg Aria
DR12     -10.35 dB   -26.90 dB      1:30 06-Sample B - Goldberg Aria
DR13      -0.17 dB   -17.36 dB      2:00 01-Sample A - Bozza: La Voie Triomphale
DR13      -0.17 dB   -17.36 dB      2:00 02-Sample B - Bozza: La Voie Triomphale
DR14      -4.13 dB   -21.41 dB      2:00 03-Sample A - Vivaldi: Recitative & Aria
DR14      -4.13 dB   -21.41 dB      2:00 04-Sample B - Vivaldi: Recitative & Aria

This also demonstrates that the samples were of good dynamic range - DR12 to 14. No major dynamic range compression, clipping or peak limiting in any of the source material as shown below:
Bozza - La Voie Triomphale
Vivaldi - Recitative & Aria
Bach - Goldberg Aria
These "audiophile" samples should therefore provide a good chance to experience dynamic nuances between 16-bit and 24-bit audio. (Much better than the typical compressed, limited audio of modern rock/pop recordings sold as "high resolution" routinely with <DR10.)

The samples were ZIPped together and distributed in a single file (~200MB in size). My FTP server was the primary download source with secondary download sites at privatebits.net (thanks again Ingemar), Uploaded.net, and FilePost.com.

Here then is the randomization used:

01 - Sample A - Bozza - La Voie Triomphale --- 16-bit
02 - Sample B - Bozza - La Voie Triomphale --- 24-bit
03 - Sample A - Vivaldi - Recitative & Aria --- 24-bit
04 - Sample B - Vivaldi - Recitative & Aria --- 16-bit
05 - Sample A - Goldberg --- 24-bit
06 - Sample B - Goldberg --- 16-bit

The 24-bit original audio files for the test samples are therefore B-A-A.

"Advertising" for this test was done through forum invitations extended to:
A few other smaller forums had invitations advertised as well. Invitations included a request for participants to NOT share their findings so as to affect others, and a warning that this is a 24-bit test, so the participant should try to ensure that the equipment (at least the DAC) is capable of >16-bit resolution. In general, participants were dissuaded from just using a direct computer motherboard/laptop output. I visited the advertisement threads on occasion and also reminded of the closure date on June 20, 2014. "Golden eared" audiophiles and those with high-end audio equipment were encouraged to participate. Due to the 2-month window, participants were asked not to rush the listening evaluation.

Participant results were collected through an active, paid account on: http://freeonlinesurveys.com/. Cookies were used to prevent double entries from the same computer. Participants were asked to:
1. Identify what they believe to be the 24-bit sample. (Presumably the "better sounding" track.)
2. Identify their level of certainty for each test track. Asked to grade on a 5 point scale (1 = "guess", 5 = "certain").
3. Tell me whether an ABX tool or other instantaneous comparison tool was utilized.
4. Provide demographics: gender, age, "musician" background, audio engineering/editing background, audio hardware reviewer status.
5. Describe evaluation hardware: components, cost of equipment.
6. Provide their subjective input: details on the hardware, any surprises in terms of difficulty, and a description of the audible difference (if any).

As suggested by the nature of this test and the data collected, I wished to answer the following questions (as expressed on April 30th on this thread in the Squeezebox forum):

Primary objectives:
1. How "easy" was it for people to detect (or report) a difference?
2. How accurate were the respondents in detecting the 24-bit sample?

It'll be interesting also to have a look at:
1. Which musical piece was it easier to hear a difference in.
2. Whether more expensive gear resulted in more accurate detection.
3. Whether age was a factor (might be hard to generalize unless I can normalize the gear quality).
4. Whether those who felt confident that they got it right actually did. Perhaps a measure of human ability to self-evaluate.
5. Whether there were more successful results from headphones vs. speakers.

Thank you to all the "beta testers" involved before the survey went public! Also, thank you again to all the participants who took the time.


24-bit vs. 16-bit Blind Listening Test Closed...

The day has arrived...



The survey for the blind test ended today! Thank you for everyone with the patience in taking the time to listen to the 3 samples and submitting your results. A few people admitted to only listening "a few times" but it certainly looks like the majority took the time to seriously listen and I certainly appreciate the detailed responses provided.

In total, I received 140 responses over the 2 months. Here's the map of the countries with submissions:



As you can see, not unexpectedly we have 3 main "clusters" of input from audiophiles - N. America, Europe, and the Pacific region (Asia + Australia + New Zealand). Then there's the single South African submission :-). The breakdown looks like this:

North America: 36 USA + 12 Canada = 48

Europe: 14 UK + 1 Spain + 6 France +2 Belgium + 8 Netherlands + 12 Germany + 1 Denmark + 4 Sweden + 4 Norway +1 Finland + 1 Estonia + 2 Austria + 3 Italy + 7 Croatia + 4 Hungary + 1 Bulgaria + 1 Turkey + 1 Cyprus +1 Israel = 74

Asia & Oceania: 1 India + 1 China + 1 Taiwan + 2 Malaysia + 3 Australia + 1 New Zealand = 9

Africa: 1 South Africa

Unknown: (for some reason IP could not be traced to country, I've seen this with Russian IP addresses) 8

I didn't work out the per-capita numbers but 7 from Croatia caught my eyes! Nice.

As in the MP3 Blind Test, I'm going to be posting the results over the next week or two in parts. Coming up in the next 24 hours will be a description of the procedure. This will include the "answers" as to which samples were the 24-bit audio. I'll speak about how the files were created as well as the dithering algorithm used. Following this will be the results and then a discussion of the implications of the findings...

Stay tuned!

Part I: PROCEDURE

Thursday, 12 June 2014

REMINDER: 1 Week Left (24-bit vs. 16-bit blind test)

¡Hola amigos!
Greetings from here:
Swimming with the turtles and stingrays off the coast of the Mayan Riviera...
Thought I'd just put up a little reminder that I'll be closing the blind test on June 20th - approximately 1 week from now. At this point, we're up to 120 responses on the survey (muchas gracias).

Although there are always limits to test methodology, and I certainly do not pretend that all variables have been controlled for, (indeed, it is impossible in cases like this where it's being done "remotely" over the internet!) I do believe this is a valuable test for the audiophile community. It's an opportunity to expose one's expectations (yes - 24-bits provide 16 million "levels" vs. the paltry 65536 "levels" of 16-bits) to reality testing in the comfort of one's home; away from Industry biases, suggestions from audio gurus, and group expectations that may be set-up when one goes to a show room or trade shows. This is about what audio lovers around the world actually hear in the real world...

Please put in your own response and suggest it to audiophile friends who may want to give this a try before the closing date. Feel free to also put it up on audio(phile) forums you may frequent. Just remember - you better have a system that has >16-bit capability.

Golden ears and those with 5+ figure audio systems - I would really love to have your continued survey response! I would also love to get musicians, sound engineers, and reviewers of audio hardware involved.

As usual, test details including procedure and files can be found here:
http://archimago.blogspot.com/2014/04/internet-test-24-bit-vs-16-bit-audio.html

Talk to you all later - likely after the test end date...

Friday, 30 May 2014

MUSINGS: It's Vinyl Time!

Hey there everyone... The 24-bit vs. 16-bit audio survey has now surpassed the 100 responses "milestone" which I unofficially set as the minimum I wanted to achieve for this test. Thanks to all who have already submitted to the test results and please keep them coming - the more results the better the statistical power... You have until June 20th. Still looking for some Russian input to fill out the international effort :-).

---------------------------------

Over the last few weeks, I finally bothered my father enough to drag out the old turntable for a spin. It's a "vintage" Sony PS-T15 from the early 1980's - he bought it with an old Sony receiver, tape deck and Sansui floorstander speakers in 1981. Just a couple years before the CD juggernaut started rolling. I essentially grew up with CD as the primary music collection medium although did play some LP in those early years (along with a collection of cassettes). It's a "middle-of-the-road" model from Sony that's supposedly a step up from the "L" series but below the Sony "audiophile grade" "X" series. Although old, the condition is still VG+ with the only blemishes being the top transparent plastic cover slightly scratched from the moves between homes and cities over the years. The direct drive motor still spins smoothly and quietly, and the tonearm looks and functions decently. Pitch control is functioning accurately. I suspect this works better than many current-model inexpensive consumer turntables.



As you can see, there's a long discontinued Audio-Technica AT70 MM cartridge in place. The stylus still looks good under a loupe.

I reacquainted myself with the turntable setup routine (here's a good AXPONA 2014 video from Michael Fremer - funny interchange around 20 minutes vs. CD). I've still got lots to learn about this but I did reseat the cartridge and cleaned the contacts, reapplied the counterweight and antiskating settings, realigned with a protractor (check out some free ones on Vinyl Engine), leveled the unit, etc. Good enough for now... Grabbed an inexpensive Spin Clean Starter Kit MKII, and went to work getting the gunk off some very old records in boxes which have not seen the light of day in decades!

I must admit, after having not handled LPs in at least 10 years and more than 20 years since using a turntable with any regularity, this has been fun... Without doubt, psychologically the act of setting up even something as simple as this "non-audiophile" gear taps into a different kind of mental "reward center". It's a sense of achievement brought about by the ritual. To hold in hand the 12" vinyl disk, and examine the full-sized LP cover brings with it a different kind of "joy" which for me isn't really about the sound quality, but rather caters to the "collecting" (hopefully not "hoarding"!) habit. Furthermore, the discipline of keeping the gear and vinyl clean and getting the albums organized intensifies a collector's obsessive-compulsive tendencies. This indeed all translates to a different experience which again can be pleasurable and memorable irrespective of the sonic quality.

My kids checking out the spinner - for them it's the first time seeing one of these in real life :-).
Listening to an old Teresa Teng pressing from 1980.
You might be wondering... What about the sound? Well, it's pretty good. Obviously this isn't high-end vinyl gear by any means, and I don't have many audiophile grade LPs on hand, but the sound is definitely enjoyable. Over the years, I have heard some excellent vinyl setups and can attest to fantastic imaging and clarity. However, as alluded to above, for me, vinyl isn't about audio quality being "accurate" or "high-resolution"; there are just too many variables and little objective frame of reference to accurately verify what is "high-fidelity" from a vinyl setup.

Despite the audiophile buzz, turntable & vinyl ownership is still very rare among my friends (late 30's to early 40's). I think many of us also recognize the "coolness" factor associated with being retro. This is heightened further by the positive sentiment perpetuated by folks like Mr. Fremer that somehow LP's in general sound "better" than digital (like this video where he also talks about CDs being "analog" at 43:30). Seriously, if it's only sound quality I'm after, I still think a well produced non-dynamically-compressed digital copy is superior.

In any event, there is fun to be had and that's good.

Now that I have the sound room and space, nothing wrong with putting together a little vinyl collection of cherished recordings. Besides, the stack of LPs could help with room acoustics... :-)

KoB 50th Anniversary box set - fitting 1st LP purchase in >20 years... And I swear this is the last time I spend another dollar on this album! Of course I said this >10 years ago when I bought the SACD... :-)
I was chatting with a friend the other day about maybe doing some "controlled" vinyl rips using my basic gear versus his high-end multi-kilobucks setup to demonstrate what $$$ buy in terms of sonic output. That could be fun and maybe another blind test to try out. I'll keep you guys posted :-). Let's get through the 24-bit vs. 16-bit survey first...

There just happens to be a couple of good used LP stores within 10 minutes of home, so I might check them out between driving the kids to events this weekend. Until next time, enjoy the tunes everyone...

Addendum (June 1, 2014):
I ended up returning that Kind Of Blue (50th Anniversary) Box I showed above. I noticed that Side B had quite a number of light surface abrasions and 2 strange "bumps" about 1cm in diameter. I've not seen these bumps with the black LPs I have... Disappointing since it's quite a nice package and essay book... Maybe I'll just grab the standard black vinyl release later.

Thursday, 15 May 2014

MUSINGS: A few words about Pono... And what is "the finest digital copy"?

Remember everyone, we're not done with the 24-bit vs. 16-bit blind test yet! We're about 1/2 way through before I close off submissions to the survey site. Again, I thank everyone who has tried it thus far and all the awesome responses with detailed descriptions of the gear used and demographic information... Many audiophiles from around the globe have already voiced their "vote" and it's getting close to 100 submissions. I'm somewhat surprised that so far there hasn't been a submission from India, Russia or S. America!

In any case, one more month to go everyone!

----------------------


Up to this point, much has already been written about Pono from both supporters and detractors. As you know, the media show around Pono started anew at SXSW with the announcement in mid-March and concomitant Kickstarter project. They did quite well financially with the Kickstarter campaign so it's an encouraging start for opportunities ahead. A few details were clarified such as the designer of the PonoPlayer being done by Ayre (rather than Meridian as previously thought). But at the end of the day, it's really hard to pin down just what is going to be all that different in the Pono "ecosystem" or why a music lover would choose this over others (eg. HDtracks, Qobuz). Maybe it is as Neil Young said here; "There's nothing, there's nothing new. There is no new thing. It's just available. It's available to everybody". So high-resolution, lossless audio will be available to everybody - at a price... Indeed, that's not new.

So, after all the star-powered hoopla and idealistic catch phrases like "rescuing an art form" (seriously, what's that supposed to mean?), what we know is that Pono is primarily a music service that sells "high-resolution" audio tracks in FLAC much like HDTracks has been doing for years. They are selling the PonoPlayer designed by Ayre to get the ball rolling as a physical device to brand the service with the 'iconic Toblerone' form factor but there's no clear commitment to ongoing hardware development. There will be no DRM which is of course good for compatibility. Since many DACs and devices these days can already do high-resolution decoding and playback, the flexibility is very much appreciated. It's amazing that even the recent May 7 Pono-listening-gathering did not use a prototype Ayre player for demonstration; again, stressing the music software as the prime objective rather than the player hardware. As you know, a number of folks have signed up for the limited-edition Kickstarter autographed PonoPlayers which are nice for collectors and as novelty items.

In the last month, I see that the Pono CEO John Hamm has taken the lead on presentations at shows like AXPONA and demonstrations like the May 7th one above. This seems like a good move since he's clearer and obviously more in tune with the technology (see the AXPONA video) than NY. There's of course still a major sales pitch going on in the video which is understandable if not annoying at this point. For example, those video clips of musicians/producers with testimonies of being "blown away" by Neil's car stereo!? You have to wonder just what kind of music these millionaires have been listening to in the last few years. Their own CDs remastered loud and volume compressed down to DR5 is my guess; using MP3 as the scapegoat of course. Furthermore, the discussion still gets stuck around the technical "container" (24-bit, high sample rate, MP3 as "5% of the data",  etc...) rather than the elephant in the room; the actual quality of the source recording, mixing and mastering process. If you're using a massive shipping container, you better have lots of good stuff in there worth taking up space for!

Encouragingly, it's great to see questions being raised at the end of the presentation about dynamic range compression and some answers being provided; John Hamm addresses this around 25:30 in the video above. He states that there will be no special remastering or remixing although it sounds like they could exert some influence in selection of which mastering is sold... But he does say something intriguing: "The promise of Pono is that we will sell you and play on our PonoPlayers the finest digital copy of that song or album available in the world" (38:35). Elsewhere, we are told that if you buy a copy of an album and a "better" remastering comes up, you get an automatic upgrade... (Right... I'll believe it when I see it.)

Interesting... Consider the question then: How does one know which is the "finest digital copy"? First, it almost goes without saying that to answer that question for any album with remasters over the years requires subjective evaluation; much more complex than just throwing out a number like "192kHz!". Second, I suspect the process whereby that answer is arrived at could allow a fundamental shift in consumer expectations if actually accepted by the public, and implemented professionally. It could make a substantial dent to calling out that proverbial elephant in the room and setting us free from ridiculous over simplifications like focusing on just bitrate / bitdepth / samplerate or simply MP3 vs. lossless. Educating the general public on how to listen for a quality mastering job that encompasses the full dynamic range, natural sound (eg. judicious EQ), with fully detailed resolution would in itself be the most beneficial outcome of this whole "high resolution audio" endeavor even though it may have started with "objective" parameters like pushing 24-bit and 96+kHz... I trust that already many of us have experienced 24/96+ music which is either no better than an equivalent CD (have a look at most of the new pop releases on HDTracks like Beck's Morning Phase), or in some cases the "high-resolution remaster" sounds worse.

There are suggestions in these presentations that Pono somehow represents a "legacy" (1) that Neil Young wants to leave beyond his copious body of work already. There is certainly an opportunity. But I believe it's not going to come from geeky talk about sample rates and meaningless stats like "30x more data than MP3" with a graph to accompany the rhetoric as if this directly correlates to sound quality. It's going to come from choices made and promoted about not just the provenance (ie. where the audio recording came from including true hi-res recording practices - see Mark Waldrep's site), but the technical and artistic bit around adjudicating what actually sounds "best" and which of the versions of many recordings is "best" irrespective of the file type. Does the artist choose which version is best? Is it down to vital stats like dynamic range (eg. something like DR values)? Is it up to the record labels thinking that the most recent "anniversary" remaster is best? In Neil Young's ears we trust? Or a mindless adherence to 24-bits and 192kHz sampling rate?

No my friends, on the first day that the Pono Music site is up, I'm not really going to care whether the PonoPlayer sounds great (it better for essentially a no-frills unconnected music player at the >$300 price point), or if the website is user friendly (2), or how many albums are available to choose from... I mainly want to know whether they're selling the first digital release of Nirvana's Nevermind (16/44), the 1996 MFSL remaster (16/44), the HDTracks 2011 remaster at 24/96, or the Blu-Ray 2013 24/96. Nevermind is of course just one of many albums to keep an eye on for what Pono does... Pono's offering will speak volumes about their commitment and whether this "movement" has any steam - are they walking the talk about "good sound", stuck with fancy specification numbers, or just "nothing new"?

------------------
(1) See around 7:00 into this Esquire interview.

(2) There's time to optimize and streamline the website. A nice website with full provenance data and ability for consumers to leave comments & reviews would be awesome of course.

Saturday, 19 April 2014

INTERNET TEST: 24-bit vs. 16-bit Audio - Can you hear the difference?

24-bits vs. 16-bits Audio - A Visual Analogy?
Those that have been reading this blog for awhile will recall that these pages started with an MP3 vs. Lossless test that was posted here back in 2012. That was kinda fun :-).

As you know, the "High-Resolution Audio" movement is on. A major cornerstone of this is the belief that in the PCM world, 24-bit audio resolution imparts clear audible benefits (as opposed to the standard 16-bits for CD resolution). Indeed, most decent DACs these days are capable of measuring >16-bit dynamic range. Clearly, a push is being made for the subjective "virtues" of 24-bit audio for your headphones, into your home and into your car. The 8-bits of difference between 16-bits and 24-bits of PCM data provides 48dB of extra dynamic range or 256x the number of values to represent each PCM sample! Fantastic advertising buzz. The 24-bit "container" is certainly capable of significantly higher resolution.

Here's the question... Can you hear the difference? Evidently many people believe the difference is audibly significant.

So, you've tried "crowd funding" (and you're probably at this point still waiting for a product), it's time for another round of "crowd testing", folks! Here's what you do to participate:

1. Download the "24-bit Audio Test.zip" file:

WARNING: The test file is big (a taste of the storage demands for those who have not downloaded high-resolution audio). Approximately 200MB for a total of 6 musical samples at 24/96 lasting less than 12 minutes with fully tagged FLAC lossless compression.

Get the file from my FTP server:
ftp://24bit-test.dyndns.org
Login = 24bit
Pass = test

Please have patience if the server load is high... Anyone able to help out with this, please drop me a note below!

Alternate download sites:

Thanks Ingemar:
http://www.privatebits.net/24-bit_Audio_Test_(Hi-Res_24-96-FLAC_2014).zip 

From Uploaded.net: http://ul.to/fqziwlfs
From FilePost.com: https://filepost.com/files/m5b3em11/24-bit_Audio_Test_(Hi-Res_24-96,_FLAC,_2014).zip/

2. Extract that ZIP to wherever you want for playback (computer folder, music archive, server, etc.)

Located within are 3 musical pieces in 24/96 FLAC, each piece with Sample A or Sample B versions. One of them (A or B) is the 24-bit original and the other contains a dithered 16-bit version which has been converted back to 24/96 so your DAC will basically be playing back the same bit/samplerate when you switch between tracks.

The samples are all classical pieces but with variation in instrumentation, vocals, and dynamics. Realize that it's not easy to find good high-resolution audio where the music is recorded and mixed to the highest standards with known provenance. Classical music as a genre is where some of the best recordings can be found.

Here are the tracks:

1. Eugène Bozza - la Voie Triomphale (performed by The Staff Band of the Norwegian Armed Forces): A well recorded orchestral track originally in DXD (32/352.8) freely downloadable as a sample from 2L here. In the interest of download size, I extracted 2 minutes from the track. It features good dynamic range with a DR13.

2. Vivaldi - Recitative and Aria from Cantata RV 679, "Che giova il sospirar, povero core" (performed by Tone Wik & Barokkanerne) - String orchestra with female vocals. Also DXD-recorded and available as a free download from 2L here. Again, I only extracted 2 minutes from the track. DR14 for this track.

3. Goldberg Variations BWV 988 - Aria (performed by Kimiko Ishizaka) as taken from the freely available Open Goldberg Variations 24/96 release. The recording was done at Teldex Studio in Berlin using the Bösendorfer 290 Imperial CEUS concert grand piano. Simple instrumentation for those who love and appreciate the sound of the piano. It's also a much slower piece which provides an opportunity to listen to the decay quality. Low-level spatial room acoustics also easily heard on this recording. Measured dynamic range is a reasonable DR12.

I've included the DR printout from foobar to demonstrate that peak and average volumes are identical for Samples A & B for each piece.

** Note that I'm using the samples under the principle of "fair use" for the purpose of education, research and commentary. I have no ties with Industry and have no financial gain by organizing this. As noted above, these tracks were selected based on excellent quality of the recording and I would highly recommend visiting the source links to sample more high-quality audio tracks in the high-resolution delivery formats (the 2L download page also includes DXD, multichannel and DSD samples).

3. Listen and compare Sample A with Sample B.
Can you hear a difference? Can you tell which one was the original 24-bit audio and which was dithered down to 16-bits?

Needless to say, you need to make sure your DAC / computer / streamer / etc. is capable of >16-bit performance!

I encourage the use of tools like foobar's ABX comparator to switch quickly between A & B. Make note of which Sample for each piece of music you believe is the original 24-bit track. Presumably, since the 24-bit version is higher resolution than the 16-bit one, there could/should be improved transparency, ambiance, definition, smoother decay, etc... For example, if you think the 24-bit sample is A for Bossa, B for Vivaldi, and A for Goldberg; keep track of that and make note of how confident you feel about your selection.

4. Make your voice heard on my survey form!


I have 14 questions (15th optional) - it should not take long to fill out as most are multiple-choice tick boxes. As expected, I want to know whether you think A or B is the 24-bit track. I also want to know if you're guessing or confident - if you spent time listening, make it count by visiting the survey even if you don't think you can tell a difference. A 'negative' report is just as important as a 'positive' one. I also want to get a sense of your age, gender, tiny bit on musical and technical experience. Also, equipment used, and approximate cost of the audio gear. All data is anonymous and I have no access to your IP address (the web site will keep track of what country you're submitting the survey from - let's make this an international effort!).

I'm going to be busy with work and other responsibilities for the next while so this is the perfect time to gather some data. As I did with the MP3 test, I'll summarize the information and demonstrate the conclusions of this survey once it closes. I think about 2 months is adequate time for everyone who wants to get involved to have a good listen. Therefore, I will close the survey around June 20, 2014 - fill in the survey before that time! Note that IP filtering is ON so only 1 response from each IP please.

Finally, remember to relax, take your time, and have fun with this... Enjoy the music and see if one version "speaks" to you more than the other in terms of sonic quality. Feel free to share this test with friends / family / music lovers / audio reviewers / audio forums / enemies. Also, please try not to whip out the audio editor before doing the listening and completing the survey! Honesty is extremely important for an open "naturalistic" survey like this one. If you know which is which, please do not share it with others so as not to bias the results. Feel free to leave a comment down below if you run into any problems or need help. Thanks...

A few guys got to "beta-test" this little project and I want to thank 'Wombat' and 'Mnyb' especially for providing technical suggestions and detailed feedback on the Squeezebox Audiophile forum before this went "live"!

--------------------

I've been enjoying some Ladysmith Black Mambazo over the last few evenings at home. As usual, with some of the more recent CDs I wish the dynamic range were higher especially for multilayered vocal music like this. I'll be sure to update when I get stuff like the pre-ordered Geek Out in my hands over the next while or if I've got some burning thoughts to share :-).

Enjoy the music... Happy Easter.

-------------------------
UPDATE (April 24, 2014): ~1 week into the test...

Great to see the responses so far! Just with the FTP site, >120 uploads of the file already. I of course do not expect that many responses to the survey yet since there's plenty of time to listen, but I am impressed by the quality of responses so far and the variety of gear people are using. Clearly folks are not using "junk" gear to assess and the majority are spending time listing out for me what they use and subjective impressions. All will be revealed in time of course...

Keep the responses coming!

--------------------------
UPDATE (April 30, 2014):

>50 responses now. Need more for better statistical power; especially in analyzing each test piece. Expanded the advertisement to the Audio Circle "HiRez Music Circle".

May 2, 2014:
Formal invitation extended to WiredState and Steve Hoffman forum.
May 10, 2014:
Invitation to Pink Fish Media forum.
May 17, 2014:
Invitation to Head-Fi forum.