<?xml version="1.0" encoding="utf-8"?><!DOCTYPE article  PUBLIC '-//OASIS//DTD DocBook XML V4.4//EN'  'http://www.docbook.org/xml/4.4/docbookx.dtd'><article><articleinfo><title>FAQ/clogit</title><revhistory><revision><revnumber>9</revnumber><date>2014-03-07 09:54:51</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>8</revnumber><date>2014-02-13 10:15:11</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>7</revnumber><date>2013-03-08 10:17:19</date><authorinitials>localhost</authorinitials><revremark>converted to 1.6 markup</revremark></revision><revision><revnumber>6</revnumber><date>2012-11-13 09:55:09</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>5</revnumber><date>2012-11-13 09:54:58</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>4</revnumber><date>2012-08-16 15:25:41</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>3</revnumber><date>2010-05-14 12:11:00</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>2</revnumber><date>2010-05-14 12:04:54</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>1</revnumber><date>2010-05-14 12:03:47</date><authorinitials>PeterWatson</authorinitials></revision></revhistory></articleinfo><section><title>Conditional logistic regression using R</title><para>Taken from Agresti (2002). Fuller details are given <ulink url="http://www-stat.stanford.edu/~owen/courses/306a/Splusdiscrete2.pdf">here.</ulink> The below R code is used on the data below (first five case-controls are listed) </para><screen><![CDATA[> table.10.3
    pair MI diabetes
1      1  0        1
2      1  1        1
3      2  0        1
4      2  1        1
5      3  0        1]]></screen><para>R code: </para><screen><![CDATA[library(survival)
]]><![CDATA[
table.10.3<-data.frame(pair=rep(1:144,rep(2,144)), MI=rep(c(0,1),144),
diabetes=c(rep(c(1,1),9),rep(c(1,0),16),rep(c(0,1),37),rep(c(0,0),82))
)
]]><![CDATA[
# Can use coxph routine equiavelently to clogit
]]><![CDATA[
#fit.CLR<-coxph(Surv(rep(1,2*144),MI)~diabetes+strata(pair),method="exact",
#data=table.10.3)
]]><![CDATA[
library(survival)
fit.CLR<-clogit(MI~diabetes+strata(pair),method="exact", data=table.10.3)
summary(fit.CLR)]]></screen><para>This gives the output: </para><screen><![CDATA[Call:
clogit(MI ~ diabetes + strata(pair), method = "exact", data = table.10.3)
]]><![CDATA[
]]><![CDATA[
          coef exp(coef) se(coef)   z      p
diabetes 0.838      2.31    0.299 2.8 0.0051
]]><![CDATA[
Likelihood ratio test=8.55  on 1 df, p=0.00345  n= 288 ]]></screen><para>The clogit procedure (unlike the SPSS Cox model) can be used to stratify <emphasis>more</emphasis> than two matched observations ie three or more repeated counts on each of n subjects. See for example Whitaker at al. (2005) with accompanying clogit R code from Helen Whitaker's website <ulink url="http://statistics.open.ac.uk/sccs">here.</ulink> </para><para>Xu et al. (2011) suggest a graphical diagnostic tool to choose the best lag time from trigger (e.g. stress) to event (e.g. epileptic seizure) whilst Farrington and Hocine (2010) advocate checking for clustering of events and using only the first event in each cluster to remove bias resulting from correlated events. </para><para><emphasis role="underline">References</emphasis> </para><para>Agresti A (2002) Categorical Data Analysis. Second Edition. Wiley:New York. </para><para>Farrington CP and Hocine MN (2010) Within-individual dependence in self-controlled case series models for recurrent events. <emphasis>Applied Statistics</emphasis> <emphasis role="strong">59(3)</emphasis> 457-475. A copy is available <ulink url="http://onlinelibrary.wiley.com/doi/10.1111/j.1467-9876.2009.00703.x/pdf">here.</ulink> </para><para>Whitaker HJ, Farrington, CP, Spiessens B. and Musonda P. (2006) Tutorial in biostatistics:the self-controlled case series method. <emphasis>Statistics in Medicine</emphasis> <emphasis role="strong">25(1)</emphasis> 1768-1797. An earlier 2005 pdf draft of this paper is given <ulink url="https://lsr-wiki-02.mrc-cbu.cam.ac.uk/statswiki/FAQ/clogit/statswiki/FAQ/clogit?action=AttachFile&amp;do=get&amp;target=selfc.pdf">here.</ulink> </para><para><ulink url="https://lsr-wiki-02.mrc-cbu.cam.ac.uk/statswiki/FAQ/clogit/statswiki/FAQ/clogit?action=AttachFile&amp;do=get&amp;target=xu.pdf">Xu S, Zhang L, Nelson JC, Zeng C, Mullooly J, McClure D and Glanz J</ulink> (2011) <emphasis>Statistics in Medicine</emphasis> <emphasis role="strong">30</emphasis> 742-752. Plotting to find optimal time at risk for each trigger on an individual. </para></section></article>