We performed χ2 analyses to compare spouses and respondents on various categorical demographic and clinical characteristics. t Tests for dependent samples were performed to compare the 2 parties on continuous variables. We also used the Wilcoxon signed rank test to compare spouses' and respondents' reports of health status.22 We then evaluated the degree of concordance between spouse-rated health (of the respondent) and self-rated health using χ2 analysis and the weighted κ statistic23 via a linear set of weights (eg, 1.0, 0.75, 0.50, 0.25, and 0). The κ statistic examines the degree of agreement beyond what would occur by chance. A κ statistic of 0 indicates that the level of agreement is no more than would be expected by chance alone, while a κ statistic of 1.0 indicates perfect agreement. Next, we identified the unique association of the various covariates with spouse-rated health over and above their associations with self-rated health. We first ensured that all variables complied with the proportional odds assumption and then performed a series of proportional odds regression analyses with spouse-rated health as an outcome variable, each sociodemographic and clinical correlate as a potential predictor, and self-rated health as a control variable. We then performed logistic regression analyses comparing spouse-rated health with self-rated health as potential predictors of mortality. We repeated the same logistic regression analyses with adjustment for various demographic and clinical characteristics (eg, respondents' age, sex, education status, medical status, functional impairment, depression, cognitive status, physical activity, smoking, alcohol drinking, and cognitive functioning, as well as spouses' depression, cognitive status, and caregiving status). Finally, we calculated receiver operating characteristic curves24 to compare the predictive ability of spouse-rated health against self-rated health, as well as their combined predictive ability (ie, a sum of both health ratings).25- 26 An area under the curve of 1.0 represents perfect predictive ability, whereas an area under the curve of 0.5 represents worthless predictive ability. The statistical comparison between the 2 curves was performed using commercially available software (roccomp command in STATA; StataCorp LP, College Station, Texas) that relies on logistic models for estimating the curves.26