Ordered vector space: Difference between revisions

en>MikeRumex
m added link to lattice
en>Monkbot
 
An '''ABX test''' is a method of comparing two choices of sensory stimuli to identify detectable differences between them. A subject is presented with two known samples (sample '''A''', the first reference, and sample '''B''', the second reference) followed by one unknown sample '''X''' that is randomly selected from either A or B. If X cannot be identified reliably with a low [[p-value]] in a predetermined number of trials, then the [[null hypothesis]] cannot be rejected and it cannot be proven that there is a perceptible difference between A and B.
 
ABX tests can easily be performed as [[double-blind trial]]s, eliminating any possible unconscious influence from the researcher or the test supervisor. Because samples A and B are provided just prior to sample X, the difference does not have to be discerned from assumption based on long-term memory or past experience. Thus, the ABX test answers whether or not, under ideal circumstances, a perceptual difference can be found.
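
The trial procedure described above can be sketched as a small simulation. This is only an illustration: the function name <code>run_abx_session</code> and the <code>respond</code> callback are hypothetical, and the callback is handed the hidden identity of X solely so that it can stand in for playback plus a listener. In a real double-blind test nobody in the room would see that label.

```python
import random
from typing import Callable


def run_abx_session(n_trials: int,
                    respond: Callable[[str], str],
                    rng: random.Random) -> int:
    """Simulate n_trials ABX trials and return how many were answered correctly.

    `respond` models "play X, then ask the listener": it receives the hidden
    identity of X and returns the listener's claim, "A" or "B".
    """
    correct = 0
    for _ in range(n_trials):
        x = rng.choice("AB")        # X is drawn at random for every trial
        answer = respond(x)         # the listener's claim for this trial
        correct += answer == x      # True counts as 1
    return correct


if __name__ == "__main__":
    rng = random.Random(2014)
    guess_rng = random.Random(7)
    # A listener who hears the difference perfectly always names X correctly.
    print("perfect listener:", run_abx_session(16, lambda x: x, rng))
    # A listener who hears no difference can only guess.
    print("guessing listener:",
          run_abx_session(16, lambda x: guess_rng.choice("AB"), rng))
```

Keeping the random draw inside the session runner, rather than in the hands of the person administering the test, mirrors the way a microprocessor-controlled comparator keeps the assignment hidden from everyone present.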
 
ABX tests are commonly used in evaluations of digital [[audio data compression]] methods; sample A is typically an uncompressed sample, and sample B is a compressed version of A. Audible [[compression artifact]]s that indicate a shortcoming in the compression algorithm can be identified with subsequent testing. ABX tests can also be used to compare the different degrees of fidelity loss between two different audio formats at a given [[bitrate]].
 
ABX tests can be used to audition input, processing, and output components as well as cabling: virtually any audio product or prototype design.
 
==Hardware tests==
[[File:QSC ABX Comparators.jpg|thumb|right|Two [[QSC Audio Products|QSC]] ''ABX Comparators'' in a traveling rack]]
ABX test equipment utilizing relays to switch between two different hardware paths can help determine if there are perceptual differences in cables and components. Video, audio and digital transmission paths can be compared. If the switching is microprocessor controlled, double-blind tests are possible.
 
Loudspeaker level and line level audio comparisons could be performed on an ABX test device offered for sale as the ''ABX Comparator'' by [[QSC Audio Products]] from 1998 to 2004. Other hardware solutions have been fabricated privately by individuals or organizations for internal testing.
 
== Confidence ==
If only one ABX trial were performed, random guessing would have a 50% chance of choosing the correct answer, the same as flipping a coin. In order to make a statement having some degree of [[Confidence interval|confidence]], many trials must be performed. Increasing the number of trials makes the test more sensitive: at a given confidence level, a listener who can genuinely distinguish A from B is more likely to produce a statistically significant score. A 95% confidence level is commonly considered [[Statistical significance|statistically significant]].<ref>{{cite journal |url= http://www.aes.org/e-lib/browse.cfm?elib=3839 |work=AES Journal |volume=30 |issue=5 |year=1982 |author=David Clark |title=High-Resolution Subjective Testing Using a Double-Blind Comparator}}</ref> The company QSC, in the ABX Comparator user manual, recommended a minimum of ten listening trials in each round of tests.<ref name=QSCABX>QSC ABX Comparator user manual. (1998) p. 10</ref>
 
'''Results required for a 95% confidence level:'''<ref name="ABX Web Page">{{cite web |url=http://home.provide.net/~djcarlst/abx_bino.htm |author=David Carlstrom |title=Probability of Experimental Result Being the Same as Random Guesses |publisher=[http://home.provide.net/~djcarlst/abx.htm ABX Web Page] |accessdate=2011-12-14}}</ref><ref name="P-value">[[P-value]]</ref>
{| class="wikitable"
|-
! Number of trials
! 10
! 11
! 12
! 13
! 14
! 15
! 16
! 17
! 18
! 19
! 20
! 21
! 22
! 23
! 24
! 25
|-
| '''Minimum number correct'''
| 9
| 9
| 10
| 10
| 11
| 12
| 12
| 13
| 13
| 14
| 15
| 15
| 16
| 16
| 17
| 18
|-
|}
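
The thresholds in the table can be reproduced with a one-sided exact binomial test under the hypothesis that the listener is guessing (probability 0.5 per trial). The sketch below assumes that reading of "95% confidence", namely a tail probability of at most 0.05; the function names are illustrative and are not taken from the cited sources.

```python
from math import comb


def guess_p_value(correct: int, trials: int) -> float:
    """Probability of scoring at least `correct` out of `trials` by pure guessing."""
    tail = sum(comb(trials, k) for k in range(correct, trials + 1))
    return tail / 2 ** trials


def min_correct(trials: int, alpha: float = 0.05) -> int:
    """Smallest score whose guessing probability is at most `alpha`."""
    for correct in range(trials + 1):
        if guess_p_value(correct, trials) <= alpha:
            return correct
    raise ValueError("no score reaches the requested significance level")


if __name__ == "__main__":
    # Print one (trials, minimum correct) pair per column of the table.
    for n in range(10, 26):
        print(n, min_correct(n))
```

For 10 trials, 9 correct has a guessing probability of 11/1024 (about 0.011), whereas 8 correct has 56/1024 (about 0.055), which is why 9 is the minimum in the first column.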
 
QSC recommended that no more than 25 trials be performed, as listener fatigue can set in, making the test less sensitive (less likely to reveal one's actual ability to discern the difference between A and B).<ref name=QSCABX/> However, a more sensitive test can be obtained by [[Pooled variance|pooling]] the results from a number of such tests, using either separate individuals or the same listener tested in sessions separated by rest breaks. For a large number of total trials ''N'', a significant result (one with 95% confidence) can be claimed if the number of correct responses exceeds <math>N/2+\sqrt{N}</math>. Under pure guessing the number of correct responses has mean <math>N/2</math> and standard deviation <math>\sqrt{N}/2</math>, so this threshold lies two standard deviations above chance. Important decisions are normally based on a higher level of confidence, since at the 95% level an erroneous "significant result" would be expected in about one of 20 such tests simply by chance.
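
The large-''N'' rule can be checked against the exact binomial tail. The sketch below is a minimal illustration with hypothetical function names; it takes "exceeds" literally, so the required score is the first integer strictly above <math>N/2+\sqrt{N}</math>.

```python
from math import comb, floor, sqrt


def rule_of_thumb_min_correct(total_trials: int) -> int:
    """Smallest integer score strictly greater than N/2 + sqrt(N)."""
    return floor(total_trials / 2 + sqrt(total_trials)) + 1


def guess_p_value(correct: int, trials: int) -> float:
    """Probability of scoring at least `correct` out of `trials` by pure guessing."""
    tail = sum(comb(trials, k) for k in range(correct, trials + 1))
    return tail / 2 ** trials


if __name__ == "__main__":
    for n in (50, 100, 200, 400):
        k = rule_of_thumb_min_correct(n)
        print(f"N={n}: need {k} correct, "
              f"guessing probability {guess_p_value(k, n):.4f}")
```

With 100 pooled trials the rule asks for at least 61 correct responses. The exact tail probability at that score is below 0.05, so the rule is somewhat stricter than the 95% level it is quoted for.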
 
==Software tests==
The [[foobar2000]] and [[Amarok (audio)|Amarok]] audio players support software-based ABX testing, the latter using a third-party script. [http://sourceforge.net/p/avex/home/Home/ aveX] is open-source software, developed mainly for [[Linux]], which also provides test monitoring from a remote computer. [http://www.surrey.ac.uk/msr/people/chris_hummersone/index.htm#additional ABX patcher] is an ABX implementation for [[Max/MSP]]. More ABX software can be found at the archived [http://web.archive.org/web/20070813001013/http://www.pcabx.com/ PCABX website].
 
==Potential flaws==
The ABX test can establish whether A is identifiably different from B, but it must be performed correctly in order to produce a meaningful result. For example, all test results must be counted for the result to be valid. This includes repeated tests and previous failed tests, which might not be made public while the successful ones are. All tests performed should be summed, and the p-value calculated from the sum, not from an individual test.{{citation needed|date=April 2012}} Other problems might arise from the ABX equipment itself, such as a [[Tell (poker)|tell]] from the equipment or poor volume matching in the case of audio tests.<ref>{{cite web |title=ABX Testing |publisher=Boston Audio Society |year=1990 |url=http://bostonaudiosociety.org/bas_speaker/abx_testing.htm |accessdate=2012-06-12 |quote=The large relays in this box make a soft clunk that is different for the two sources and is audible in a quiet room; Meyer has identified X 10 out of 10 times without any signal! While the sound is quiet enough to be masked when any music is playing, testing hygiene dictates that the relay box be enclosed or otherwise muffled. Meyer handed out a sheet photocopied from the ABX manual which showed typical level-matching required for reliable detection of differences between sources with 1/3 octave frequency-response aberrations.}}</ref>
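
The effect of counting every test can be illustrated with a short calculation. The session scores below are invented for the example and the function name is hypothetical; the p-value is the one-sided probability of reaching the summed score by guessing.

```python
from math import comb


def pooled_p_value(sessions: list[tuple[int, int]]) -> float:
    """One-sided guessing probability for the summed (correct, trials) counts."""
    correct = sum(c for c, _ in sessions)
    trials = sum(t for _, t in sessions)
    tail = sum(comb(trials, k) for k in range(correct, trials + 1))
    return tail / 2 ** trials


if __name__ == "__main__":
    # Hypothetical record: one strong session and two unremarkable ones.
    sessions = [(9, 10), (4, 10), (5, 10)]
    print("best session alone:", pooled_p_value([(9, 10)]))
    print("all sessions pooled:", pooled_p_value(sessions))
```

Reported on its own, the 9-out-of-10 session has a guessing probability of about 0.011 and looks significant. Summed with the other two sessions the score is 18 out of 30, with a guessing probability of about 0.18, which is not significant at the 95% level.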
 
==Alternatives==
 
===Algorithmic Audio Compression Evaluation===
Since ABX testing requires human listeners to evaluate lossy audio codecs, it is time-consuming and costly. Cheaper algorithmic approaches have therefore been developed, e.g. [[PEAQ]], which reports its result as an [[Objective Difference Grade|Objective Difference Grade (ODG)]].
 
===MUSHRA===
In [[MUSHRA]], the listener is presented with the reference (labeled as such), a certain number of test samples, a hidden version of the reference and one or more anchors. A 0–100 rating scale makes it possible to rate very small differences.
 
===Discrimination testing===
Alternative general methods are used in [[discrimination testing]], such as paired comparison, duo–trio, and triangle testing. Of these, duo–trio and triangle testing are particularly close to ABX testing. Schematically:
;Duo–trio: AXY – one known and two unknowns (one equals A, the other equals B); the task is to decide which unknown matches the known: X = A (and Y = B), or Y = A (and X = B).
;Triangle: XXY – three unknowns (two are A and one is B, or one is A and two are B); the task is to pick the odd one out: Y is in position 1, 2, or 3.
 
In this context, ABX testing is also known as "duo–trio" in "balanced reference" mode – both knowns are presented as references, rather than one alone.<ref>{{cite book|last=Meilgaard |first=Morten |coauthors=Gail Vance Civille, B. Thomas Carr |title=Sensory evaluation techniques |publisher=CRC Press |year=1999 |edition=3 |pages=68–70 |isbn=0-8493-0276-5 |url=http://books.google.com/books?id=XX9xwk9G0EUC&pg=RA1-PA68 }}</ref>
 
==See also==
* [[Transparency (data compression)]]
* [[Psychophysics]]
* [[Psychoacoustics]]
 
==References==
{{Reflist}}
 
{{DEFAULTSORT:Abx Test}}
[[Category:Digital audio]]
[[Category:Statistical tests]]
[[Category:Psychophysics]]

Latest revision as of 22:10, 19 July 2014
