{{Multiple issues|primarysources = May 2012|
{{expert-subject|date=January 2012}}
}}
 
In [[statistics]], the '''maximal information coefficient (MIC)''' is a measure of the strength of the linear or non-linear association between two variables ''X'' and&nbsp;''Y''.
 
The MIC belongs to the maximal information-based nonparametric exploration (MINE) class of statistics.<ref>{{Cite doi|10.1126/science.1205438|noedit}}</ref> In a simulation study, MIC outperformed some selected functions;<ref name=MIC>Reshef et al. 2011</ref> however, concerns have been raised regarding reduced [[statistical power]] in detecting some associations in settings with low sample size.<ref>[http://www-stat.stanford.edu/~tibs/reshef/comment.pdf Comment on “Detecting Novel Associations in Large Data Sets” by Reshef et al., Science Dec. 16, 2011]</ref> It is claimed<ref name=MIC/> that MIC approximately satisfies a property called ''equitability'', which is illustrated by selected simulation studies.<ref name=MIC/> It was later proved that no non-trivial coefficient can exactly satisfy the ''equitability'' property as defined by Reshef et al.<ref name=MIC/><ref>[http://arxiv.org/abs/1301.7745v1 Equitability, mutual information, and the maximal information coefficient by Justin B. Kinney, Gurinder S. Atwal, arXiv Jan. 31, 2013]</ref> Some criticisms of MIC are addressed by Reshef et al. in further studies published on arXiv.<ref>[http://arxiv.org/abs/1301.6314v1 Equitability Analysis of the Maximal Information Coefficient, with Comparisons by David Reshef, Yakir Reshef, Michael Mitzenmacher, Pardis Sabeti, arXiv Jan. 27, 2013]</ref>
 
== Overview ==
 
The maximal information coefficient uses [[Data binning|binning]] as a means to apply [[Mutual Information|mutual information]] to continuous random variables. Binning has been used for some time as a way of applying mutual information to continuous distributions; what MIC contributes in addition is a methodology for selecting the number of bins and picking the maximum over many possible grids.
 
The rationale is that the bins for both variables should be chosen in such a way that the mutual information between the variables is maximal. That is achieved whenever <math>\mathrm{H}\left(X_b\right)=\mathrm{H}\left(Y_b\right)=\mathrm{H}\left(X_b,Y_b\right)</math>.<ref>The "b" subscripts have been used to emphasize that the mutual information is calculated using the bins</ref> Thus, when the mutual information is maximal over a binning of the data, we should expect the following two properties to hold, as far as the nature of the data allows. First, the bins would have roughly the same size, because the entropies <math>\mathrm{H}(X_b)</math> and <math>\mathrm{H}(Y_b)</math> are maximized by equal-sized binning. Second, each bin of ''X'' will roughly correspond to a bin in ''Y''.
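The quantities above can be made concrete with a small numerical sketch. The following is a minimal illustration (not the reference implementation) of computing the binned mutual information <math>\mathrm{I}(X_b;Y_b)=\mathrm{H}(X_b)+\mathrm{H}(Y_b)-\mathrm{H}(X_b,Y_b)</math> for a fixed grid; it uses equal-width bins via NumPy, whereas MIC itself searches over grid placements, and the function name is illustrative only.

<syntaxhighlight lang="python">
import numpy as np

def binned_mutual_information(x, y, n_x, n_y):
    """I(X_b; Y_b) = H(X_b) + H(Y_b) - H(X_b, Y_b) on an n_x-by-n_y grid."""
    counts, _, _ = np.histogram2d(x, y, bins=(n_x, n_y))  # equal-width bins (a simplification)
    p_xy = counts / counts.sum()        # joint distribution over grid cells
    p_x = p_xy.sum(axis=1)              # marginal over the X bins
    p_y = p_xy.sum(axis=0)              # marginal over the Y bins

    def entropy(p):
        p = p[p > 0]                    # ignore empty bins
        return -np.sum(p * np.log2(p))

    return entropy(p_x) + entropy(p_y) - entropy(p_xy.ravel())
</syntaxhighlight>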
 
Because the variables ''X'' and ''Y'' are real-valued, it is almost always possible to create exactly one bin for each (''x'',''y'') datapoint, and that would yield a very high value of the mutual information. To avoid forming this kind of trivial partitioning, the authors of the paper propose taking a number of bins <math>n_x</math> for ''X'' and <math>n_y</math> for ''Y'' whose product is relatively small compared with the size ''N'' of the data sample. Concretely, they propose:
 
<math>n_x\times n_y \leq \mathrm{N}^{0.6} </math>
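The grid resolutions admitted by this constraint can be enumerated directly. The helper below is a hypothetical illustration (its name and the choice of starting at two bins per axis are assumptions, not from the paper) that lists every pair <math>(n_x, n_y)</math> satisfying the inequality.

<syntaxhighlight lang="python">
def admissible_grids(n_samples, exponent=0.6):
    """List all (n_x, n_y) with n_x * n_y <= n_samples ** exponent."""
    limit = n_samples ** exponent
    grids = []
    n_x = 2
    while n_x * 2 <= limit:             # require at least 2 bins on each axis
        n_y = 2
        while n_x * n_y <= limit:
            grids.append((n_x, n_y))
            n_y += 1
        n_x += 1
    return grids

# For example, with N = 100 the limit is 100 ** 0.6 ≈ 15.8,
# so grids such as (2, 7) or (3, 5) are allowed but (4, 4) is not.
</syntaxhighlight>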
 
In some cases it is possible to achieve a good correspondence between <math>X_b</math> and <math>Y_b</math> with numbers as low as <math>n_x=2</math> and <math>n_y=2</math>, while in other cases the number of bins required may be higher. The maximum for <math>\mathrm{I}(X_b;Y_b)</math> is determined by H(X), which is in turn determined by the number of bins in each axis; therefore, the mutual information value depends on the number of bins selected for each variable. In order to compare mutual information values obtained with partitions of different sizes, the mutual information value is normalized by dividing by the maximum achievable value for the given partition size.
Entropy is maximized by uniform probability distributions, or in this case, by bins with the same number of elements. Also, joint entropy is minimized by having a one-to-one correspondence between bins. If we substitute such values in the formula
<math>I(X;Y)=H(X)+H(Y)-H(X,Y)</math>, we can see that the maximum value achievable by the MI for a given pair <math>n_x,n_y</math> of bin counts is <math>\log\min\left(n_x,n_y\right)</math>. Thus, this value is used as a normalizing divisor for each pair of bin counts.
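In code, the normalization step amounts to a single division; the sketch below reuses the hypothetical <code>binned_mutual_information</code> helper from the earlier sketch and uses base-2 logarithms throughout so that the mutual information and its upper bound are measured in the same units.

<syntaxhighlight lang="python">
import numpy as np

def normalized_mi(x, y, n_x, n_y):
    """Binned mutual information divided by its upper bound log2(min(n_x, n_y))."""
    mi = binned_mutual_information(x, y, n_x, n_y)   # from the sketch above
    return mi / np.log2(min(n_x, n_y))
</syntaxhighlight>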
 
Finally, the normalized maximal mutual information values for the different combinations of <math>n_x</math> and <math>n_y</math> are tabulated, and the maximum value in the table is selected as the value of the statistic.
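Combining the pieces above gives a simplified, MIC-style statistic. This sketch only evaluates equal-width grids, whereas the published algorithm also optimizes the placement of the grid lines at each resolution, so it should be read as an outline of the procedure rather than a faithful implementation.

<syntaxhighlight lang="python">
def mic_sketch(x, y, exponent=0.6):
    """Maximum of the normalized binned mutual information over all admissible grids."""
    best = 0.0
    for n_x, n_y in admissible_grids(len(x), exponent):
        best = max(best, normalized_mi(x, y, n_x, n_y))
    return best
</syntaxhighlight>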
 
==References==
{{Reflist}}
 
[[Category:Information theory]]
[[Category:Covariance and correlation]]
