<?xml version="1.0" encoding="utf-8"?><!DOCTYPE article  PUBLIC '-//OASIS//DTD DocBook XML V4.4//EN'  'http://www.docbook.org/xml/4.4/docbookx.dtd'><article><articleinfo><title>FAQ/Poisson</title><revhistory><revision><revnumber>29</revnumber><date>2019-11-11 12:23:04</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>28</revnumber><date>2013-03-08 10:17:58</date><authorinitials>localhost</authorinitials><revremark>converted to 1.6 markup</revremark></revision><revision><revnumber>27</revnumber><date>2008-07-24 11:42:20</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>26</revnumber><date>2008-07-24 11:42:01</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>25</revnumber><date>2008-07-24 11:40:07</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>24</revnumber><date>2008-07-24 11:39:43</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>23</revnumber><date>2008-07-24 11:29:58</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>22</revnumber><date>2008-07-24 11:29:32</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>21</revnumber><date>2008-07-24 11:26:12</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>20</revnumber><date>2008-07-24 11:25:03</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>19</revnumber><date>2008-07-24 11:13:18</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>18</revnumber><date>2008-07-24 09:26:05</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>17</revnumber><date>2008-07-24 09:25:06</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>16</revnumber><date>2008-07-24 09:23:43</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>15</revnumber><date>2008-07-23 14:37:03</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>14</revnumber><date>2008-07-23 14:36:39</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>13</revnumber><date>2008-07-23 13:17:26</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>12</revnumber><date>2008-07-23 13:15:00</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>11</revnumber><date>2008-07-23 13:13:53</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>10</revnumber><date>2008-07-23 13:13:12</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>9</revnumber><date>2008-07-23 13:11:41</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>8</revnumber><date>2008-07-23 13:11:07</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>7</revnumber><date>2008-07-23 13:10:21</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>6</revnumber><date>2008-07-23 13:08:04</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>5</revnumber><date>2008-07-23 13:06:48</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>4</revnumber><date>2008-07-23 11:58:16</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>3</revnumber><date>2008-07-23 11:57:46</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>2</revnumber><date>2008-07-23 11:55:06</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>1</revnumber><date>2008-07-23 11:54:09</date><authorinitials>PeterWatson</authorinitials></revision></revhistory></articleinfo><section><title>How do I compare observed numbers correct to those expected by chance in a multi-choice task?</title><para>Suppose a questionnaire has a set of k possible responses to N questions of which one of the k choices is the correct response. Assuming by chance each of the k responses to each question is equally likely the number expected by chance equals N/k. </para><para>Suppose we observe x $$\leq$$ N responses for a patient. We wish to see how likely x responses are given the patient responds at random to k choices on each of N questions. </para><para>To do this we can assume the number of correct responses, x, follows a Poisson distribution, Po(m) of general form: </para><para>$$P(X=x|\mbox{random responses}) = [m<superscript>x </superscript> ]/[x! ]e<superscript>-m </superscript> </para><para>where $$m$$ is the expected total number of correctly answered questions from the N questions. Since this equals N/k we can rewrite the above as: </para><para>$$P(X=x|\mbox{random responses}) = [(N/k)<superscript>x </superscript>] / [x!] e<superscript>-(N/k)</superscript> </para><para>one-tailed p-value = P(X \leq x) = (sum from 1 to x) P(X=x|\mbox{random responses}) = 0.5(two-tailed p-value) </para><para>For large N the Poisson distribution is approximately Normally distributed with mean and variance both equal to N/k so we can analogously obtain a one-tailed p-value as: </para><para>P(X $$\leq$$ x) = Probit( [x-(N/k)] divided by [Square Root{N/k}] ). </para><para><emphasis role="underline">Example</emphasis> </para><para>Suppose we have 14 questions each with 3 possible responses of which only one is correct and a patient gets a total score of 3 correct responses. We wish to determine how likely it is we would observe 3 or fewer responses given the patient has responded at random (1 sided p-value). </para><para>The expected number of correct responses assuming the patient is responding at random is 14/3=4.67. </para><para>Using the Poisson distribution </para><para>P(X $$\leq$$ 3)  </para><para>= P(X = 0) + P(X = 1) + P(X = 2) + P(X = 3) </para><itemizedlist><listitem override="none"><para>= (1 + 4.67 + [4.67<superscript>2 </superscript>]/[2] + [4.67<superscript>3 </superscript>]/[6])e<superscript>-4.67 </superscript> </para></listitem></itemizedlist><para>=  0.31. </para><para>We can think of the p-value as the sum of numbers correct which are <emphasis>less likely</emphasis> assuming random responses than the observed one. It turns out that P(X $$\geq$$ 6) also accounts for <emphasis>less likely</emphasis> occurrences than the observed 3 correctly answered questions, given random responses. So the <emphasis>exact</emphasis> two-sided p-value is the sum of the Poisson probabilities P(X $$\leq$$ 3) and P(X $$\geq$$ 6) which equals 0.64. </para><para>P(X $$\leq$$ 3) is approximately equal to P(X $$\geq$$ 6) because the Poisson distribution is symmetric about its expected value, N/k, the two-tailed p-value can also be computed as approximately equal to twice the one-tailed p-value = 0.31 x 2= 0.62. This is close to the p-value of 0.64 using the sum of the poisson probabilities above. </para><para>The approximate or exact p-values from a Poisson distribution both conclude that there is no evidence to suggest a score of 3 on a three-choice task of 14 questions differs from chance responses. </para><para>We can also evaluate probabilities of observing 3 correct responses due to chance using the <emphasis>Normal approximation</emphasis> to the Poisson distribution: </para><para>$$P(X \leq 3) = Probit (\frac{3-4.67}{\sqrt{4.67}}) = Probit(-0.772) = 0.22$$ with a two-sided p-value of 0.44. The two-tailed probability equals $$P(X \leq 3) + P(X \geq 6.33)$$ since the Normal distribution probabilities are, like those of the Poisson distribution, symmetric about the mean of 4.67. Of course we can't observed 6.33 correct responses but this is a continuous approximation to the discrete Poisson distribution - like joining lines between frequency bars on a histogram of the number of correct responses!  </para><para>As with the Poisson distribution we conclude there is no evidence to suggest getting 3 questions correct on a three-choice task of 14 questions differs from chance. The exact Poisson two-sided p-value and its Normal approximation may be computed using a <ulink url="https://lsr-wiki-02.mrc-cbu.cam.ac.uk/statswiki/FAQ/Poisson/statswiki/FAQ/Poisson?action=AttachFile&amp;do=get&amp;target=multichoice_exact.xls">spreadsheet.</ulink> </para><para>In practice for over 30 questions (N) the Poisson and Normal approximations should closely agree. For less than 30 questions the Poisson is preferable as it is a discrete distribution assuming, as in this example, only integer values can occur (ie that numbers of correctly answered questions are whole numbers). </para></section></article>