Diff for "FAQ/LogitNPlusHalf" - CBU statistics Wiki
location: Diff for "FAQ/LogitNPlusHalf"
Differences between revisions 4 and 30 (spanning 26 versions)
Revision 4 as of 2006-07-12 16:45:37
Size: 901
Editor: devel06
Comment:
Revision 30 as of 2015-10-12 14:35:10
Size: 2837
Editor: PeterWatson
Comment:
Deletions are marked like this. Additions are marked like this.
Line 1: Line 1:
= Adding 0.5 to cells when calculating d' and the empirical logit transform = = Adding 0.5 to cells when calculating odds ratios, d' and the empirical logit transform =

== On the use of adding 0.5 to cells to define an odds ratio ==

 * D G Bonett and R M Price Jnr (2015) Varying coefficient meta-analysis methods for odds ratios and risk ratios. ''Psychological Methods'' '''20(3)''' 394-406. References recommending the use of adding 0.5 to frequencies to minimise bias when evaluating an odds ratio are given in this paper.

In particular Bonett and Price Jnr (2015, p.395) say "The addition of [1/2] to the cell frequencies in Equations 2 and 3 was suggested by Gart (1966), and it can be shown (see Agresti, 2002, p. 595) that the bias of ln(odds ratio) is minimized with the addition of [1/2] to each cell frequency".

 *Subbiah and Srinivasan (2008) note the practice of adding a constant to a zero cell to define an odds ratio. They go on to state "Usually a continuity correction, by addition of 0.5, is considered for studies with zero counts in the estimation of the log of the odds ratio or the log of the risk ratio."

 * M Subbiah and M R Srinivasan (2008) Classification of 2x2 sparse data sets with zero cells. ''Statistics & Probability Letters'' '''78(18)''' 3212-3215.
A pdf copy of this paper is available to read and print from [[http://www.sciencedirect.com/science/article/pii/S0167715208003003|here.]]
Line 10: Line 21:
[Last updated on 27 November, 2003]
== See also ==

[[FAQ/sigdet|Signal Detection Parameters from SPSS]] There [[attachment:SPSSsig.pdf | a different convention]] is followed of putting ''1/2N'' and ''1-1/2N'' in place of ''0%'' and ''100%''.

A discussion on the optimal quantity to add to 0% and 100% hit and false alarm rates to give finite dprime values [[http://stats.stackexchange.com/questions/134779/d-prime-with-100-hit-rate-probability-and-0-false-alarm-probability9 | is here.]]

There are details [[https://sites.google.com/a/mtu.edu/whynotaprime/ | here which suggests a nonparametric alternative]] to dprime called aprime which represents an average area under all possible ROC curves. If this link is broken here are the details copied from there on how to compute ROC diagnostics [[FAQ/sigdetinR | using in R]].
Line 13: Line 31:
[wiki:FAQ Return to Statistics FAQ page]
Line 15: Line 32:
[wiki:CbuStatistics Return to Statistics main page] These pages are maintained by [[mailto:ian.nimmo-smith@mrc-cbu.cam.ac.uk|Ian Nimmo-Smith]] and [[mailto:peter.watson@mrc-cbu.cam.ac.uk|Peter Watson]]
Line 17: Line 34:
[http://www.mrc-cbu.cam.ac.uk/ Return to CBU main page]

These pages are maintained by [mailto:ian.nimmo-smith@mrc-cbu.cam.ac.uk Ian Nimmo-Smith] and [mailto:peter.watson@mrc-cbu.cam.ac.uk Peter Watson]
~- Last updated on 27 July, 2006 -~

Adding 0.5 to cells when calculating odds ratios, d' and the empirical logit transform

On the use of adding 0.5 to cells to define an odds ratio

  • D G Bonett and R M Price Jnr (2015) Varying coefficient meta-analysis methods for odds ratios and risk ratios. Psychological Methods 20(3) 394-406. References recommending the use of adding 0.5 to frequencies to minimise bias when evaluating an odds ratio are given in this paper.

In particular Bonett and Price Jnr (2015, p.395) say "The addition of [1/2] to the cell frequencies in Equations 2 and 3 was suggested by Gart (1966), and it can be shown (see Agresti, 2002, p. 595) that the bias of ln(odds ratio) is minimized with the addition of [1/2] to each cell frequency".

  • Subbiah and Srinivasan (2008) note the practice of adding a constant to a zero cell to define an odds ratio. They go on to state "Usually a continuity correction, by addition of 0.5, is considered for studies with zero counts in the estimation of the log of the odds ratio or the log of the risk ratio."
  • M Subbiah and M R Srinivasan (2008) Classification of 2x2 sparse data sets with zero cells. Statistics & Probability Letters 78(18) 3212-3215.

A pdf copy of this paper is available to read and print from here.

On the use of the +0.5 smoothing for calculating the empirical logit transform see:

  • David R COX and E J SNELL (1989) Analysis of Binary Data , Chapman and Hall: London

On the use of the +0.5 smoothing for calculating the d' signal detection parameter see:

  • Michael J HAUTUS (1995) Corrections for extreme proportions and their biasing effects on estimated values in d'. Behavioral Research Methods, Instruments, & Computers, 27(1), 46-51.

See also

Signal Detection Parameters from SPSS There a different convention is followed of putting 1/2N and 1-1/2N in place of 0% and 100%.

A discussion on the optimal quantity to add to 0% and 100% hit and false alarm rates to give finite dprime values is here.

There are details here which suggests a nonparametric alternative to dprime called aprime which represents an average area under all possible ROC curves. If this link is broken here are the details copied from there on how to compute ROC diagnostics using in R.


These pages are maintained by Ian Nimmo-Smith and Peter Watson

Last updated on 27 July, 2006

None: FAQ/LogitNPlusHalf (last edited 2015-10-12 14:35:10 by PeterWatson)