<?xml version="1.0" encoding="utf-8"?><!DOCTYPE article  PUBLIC '-//OASIS//DTD DocBook XML V4.4//EN'  'http://www.docbook.org/xml/4.4/docbookx.dtd'><article><articleinfo><title>FAQ/ab-a</title><revhistory><revision><revnumber>30</revnumber><date>2018-03-12 17:22:22</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>29</revnumber><date>2013-03-08 10:17:25</date><authorinitials>localhost</authorinitials><revremark>converted to 1.6 markup</revremark></revision><revision><revnumber>28</revnumber><date>2011-02-04 11:28:11</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>27</revnumber><date>2011-02-04 11:27:06</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>26</revnumber><date>2011-02-04 11:25:06</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>25</revnumber><date>2011-02-04 11:24:42</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>24</revnumber><date>2011-02-04 11:24:11</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>23</revnumber><date>2011-02-04 11:22:41</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>22</revnumber><date>2011-02-04 11:13:01</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>21</revnumber><date>2011-02-04 10:40:53</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>20</revnumber><date>2011-02-04 10:40:07</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>19</revnumber><date>2011-02-04 10:38:51</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>18</revnumber><date>2011-02-04 10:30:12</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>17</revnumber><date>2011-02-04 10:25:19</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>16</revnumber><date>2011-02-04 10:23:53</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>15</revnumber><date>2011-02-04 09:59:10</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>14</revnumber><date>2011-02-04 09:57:05</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>13</revnumber><date>2011-02-04 09:55:28</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>12</revnumber><date>2011-01-27 12:37:02</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>11</revnumber><date>2011-01-27 12:36:17</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>10</revnumber><date>2011-01-27 12:35:24</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>9</revnumber><date>2011-01-27 12:34:11</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>8</revnumber><date>2011-01-27 12:33:24</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>7</revnumber><date>2011-01-27 12:33:03</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>6</revnumber><date>2011-01-27 12:31:48</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>5</revnumber><date>2011-01-27 12:27:28</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>4</revnumber><date>2011-01-27 12:26:58</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>3</revnumber><date>2011-01-27 12:25:18</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>2</revnumber><date>2011-01-27 12:23:49</date><authorinitials>PeterWatson</authorinitials></revision><revision><revnumber>1</revnumber><date>2011-01-27 12:22:34</date><authorinitials>PeterWatson</authorinitials></revision></revhistory></articleinfo><section><title>What is the relationship between regressions involving variables A and B to those involving B-A and A+B in predicting an outcome?</title><para>Suppose we have a response Y and two continuous predictors such as age of onset (a) and duration of hearing deficit (b-a) with b representing the individual's current age. Then there is an equivalence between the coefficients in this regression and the ones associated with the same response,y, being predicted using a and b as predictors.  </para><para>In particular if $$B_text{i}$$ represents the regression coefficient for variable i then in a regression using a and b-a as predictors: </para><para>Predicted y = $$B_text{a}$$a + $$B_text{b-a}$$(b-a) </para><para>= $$B_text{a}$$a + $$B_text{b-a}$$b - $$B_text{b-a}$$a </para><para>= $$(B_text{a}$$ - $$B_text{b-a}$$)a + $$B_text{b-a}$$b </para><para>So it follows that if $$B_text{i|i,j}$$ represents the regression coefficient of variable i in a regression with i and j as predictors being used to predict a response, y, that </para><para>$$B_text{a|a,b-a}$$ - $$B_text{b-a|a,b-a}$$ = $$B_text{a|a,b}$$ and </para><para>$$B_text{b-a|a,b-a}$$ = $$B_text{b|a,b}$$ </para><para>In other words subtracting the regression coefficients for a and b-a in a regression using a and b-a as predictor is equivalent to the regression coefficient for a in a regression with a and b as predictors and the regression coefficient for b-a with a and b-a as predictors is the same as the regression coefficient for b in a regression with a as the other predictor.  </para><para>It also follows that the standard errors of the regression coefficients for a and b respectively can be derived using the standard errors of the regression coefficients for a and b-a. </para><para>se($$B_text{a|a,b}$$) = se($$B_text{a|a,b-a}$$ - $$B_text{b-a|a,b-a}$$) = $$\sqrt{V(B_text{a|a,b-a}) \mbox{ + } V(B_text{b-a|a,b-a}) \mbox{ - } 2\mbox{Cov}(B_text{a|a,b-a},B_text{b-a|a,b-a})}$$ and </para><para>se($$B_text{b|a,b}$$) = $$B_text{b-a|a,b-a}$$ </para><para><emphasis role="underline">Example</emphasis> </para><para>For one study involving a response y and variables a and b-a we have regression coefficients (s.es) of 1.170 (0.446) for a and 1.023 (0.399) for b-a. </para><para>It follows in a regression involving a and b on the same response the regression (s.e.) of b equals that of b-a in the a, b-a regression, namely 1.023 (0.399).  </para><para>The regression coefficient for a equals 1.170 - 1.023 = 0.148. Given a covariance of 0.026 between the a and b-a regression coefficients </para><para>The se(a) in the regression involving a and b is computed using the s.es and covariance from the regression coefficients in the regression with a and b-a as predictors. </para><para>se(a) = $$\sqrt{0.446<superscript>2 </superscript> + 0.399<superscript>2 </superscript> - 2(0.026)}$$ = $$\sqrt{0.306}$$ = 0.553. </para><para>The zero-order correlations have the same t-values as the regression estimates used to obtain them and their zero-order correlations correspond to the signed square root of the change in R-squareds. </para><para><emphasis role="underline">Example showing the extraction of zero-order correlations from the above regressions</emphasis> </para><para>We can obtain the zero-order correlations of a and b with y from the regressions involving a and the a+b sum and b with the a+b sum by evaluating R-squareds and regression coefficient t-values associated with the 'a+b' sum regression term. <emphasis role="strong">These results confirm that the zero-order correlation of a (b) with y can be obtained indirectly from the b (a) scores and the sum of a and b</emphasis>. Examples below are for a randomly generated data set. </para><para>The zero-order correlation of b with y is the signed square root of the change in R-squared adding 'a+b' to a regression already containing 'a' predicting y = $$\sqrt{0.066-0.050}$$ = -0.12 where the R-squared of 0.066 corresponds to a regression on y using 'a' and 'a+b' as predictors of y and 0.050 is the R-squared of a regression with only 'a' used to predict y. The t-value for 'a+b' =0.34, p=0.75 which equals the p-value for the zero-order correlation of b with y of -0.12. </para><para>The zero-order correlation of a with y is the signed square root of the change in R-squared adding 'a+b' to a regression already containing 'b' predicting y = $$\sqrt{0.066-0.014}$$ = 0.22 where the R-squared of 0.066 corresponds to a regression on y using 'a' and 'a+b' as predictors of y and 0.014 is the R-squared of a regression with 'a' as the only predictor of y. The t-value for 'a+b' =-0.62, p=0.55 which equals the p-value for the zero-order correlation of a with y of 0.22. </para><para><emphasis role="underline">Relationships between a,b and a+b</emphasis> </para><para>It also follows Predicted y = $$B_text{a}$$a + $$B_text{a+b}$$(a+b) </para><para>= $$(B_text{a}$$ + $$B_text{a+b}$$)a + $$B_text{a+b}$$b and </para><para>Predicted y = $$B_text{b}$$b + $$B_text{a+b}$$(a+b) </para><para>= $$(B_text{b}$$ + $$B_text{a+b}$$)b + $$B_text{a+b}$$a </para><para>So it follows that knowing the relationship between the response with both a+b and a is enough to give the relationship between the response and b and the relationship between the response and both a+b and b is enough to give the relationship with a. </para><para>It is also true that <emphasis>unless a and b are highly correlated</emphasis> so a $$\approx$$ $$\pm$$b, </para><para>$$B_text{a}$$a + $$B_text{b}$$b $$\ne$$ $$B_text{a+b}$$(a+b) </para><para>because $$B_text{a}$$ and $$B_text{b}$$ will not in general be equal. <emphasis role="strong">This means you cannot obtain a zero-order correlation between y and the sum of a and b, indirectly, using the separate a and b scores</emphasis>.  </para><para>One can also interpret this as knowing the a+b sum does not tell you the numbers (a and b) that were added together to give it if these number were weighted unequally (so that $$B_text{a}$$ is not equal to $$B_text{b}$$). </para><para>If a and b are highly correlated then the relationships between y and a and y and b are nearly equal and the relationship between y and a+b will then be equal to the relationship between y and either a or b. </para><para><emphasis role="underline">Example</emphasis> </para><para>If b =-3a then for a Pearson correlation, r, r(a,y)=-r(b,y). r(a+b,y) = r(b,y)= - r(a,y) since the b values are higher in absolute value than those of a so the summation will have the same sign of relationship with y as the b values. </para></section></article>