Measurement confounding affects the extent to which verbal IQ explains social gradients in mortality

Benjamin Chapman; Kevin Fiscella; Paul Duberstein; Ichiro Kawachi; Peter Muennig

doi:10.1136/jech-2013-203741

Article Text

PDF

Social factors and health

Measurement confounding affects the extent to which verbal IQ explains social gradients in mortality

Benjamin Chapman1,
Kevin Fiscella2,3,
Paul Duberstein1,2,
Ichiro Kawachi4,
Peter Muennig5

¹Department of Psychiatry, University of Rochester Medical Center, Rochester, New York, USA
²Department of Family Medicine, University of Rochester Medical Center, Center for Communication and Disparities Research, Rochester, New York, USA
³Department of Public Health Sciences, University of Rochester Medical Center, Rochester, New York, USA
⁴Department of Society, Human Development, and Health, Harvard University School of Public Health, Boston, Massachusetts, USA
⁵Department of Health Management and Policy, Columbia University, Mailman School of Public Health, New York, New York, USA

Correspondence to Dr Benjamin Chapman, Department of Psychiatry, University of Rochester Medical Center, 300 Crittenden, Rochester, NY 14620, USA; ben_chapman{at}urmc.rochester.edu

Abstract

Background IQ is thought to explain social gradients in mortality. IQ scores are based roughly equally on Verbal IQ (VIQ) and Performance IQ tests. VIQ tests, however, are suspected to confound true verbal ability with socioeconomic status (SES), raising the possibility that associations between SES and IQ scores might be overestimated. We examined, first, whether two of the most common types of VIQ tests exhibited differential item functioning (DIF) favouring persons of higher SES and/or majority race/ethnicity. Second, we assessed what impact, if any, this had on estimates of the extent to which VIQ explains social gradients in mortality.

Methods Data from the General Social Survey-National Death Index cohort, a US population representative dataset, was used. Item response theory models queried social-factor DIF on the Thorndike Verbal Intelligence Scale and Wechsler Adult Intelligence Scales, Revised Similarities test. Cox models examined mortality associations among SES and VIQ scores corrected and uncorrected for DIF.

Results When uncorrected for DIF, VIQ was correlated with income, education, occupational prestige and race, with correlation coefficients ranging between |0.12| and |0.43|. After correcting for DIF, correlations ranged from |0.06| to |0.16|. Uncorrected VIQ scores explained 11–40% of the Relative Index of Inequalities in mortality for social factors, while DIF-corrected scores explained 2–29%.

Conclusions Two of the common forms of VIQ tests appear confound verbal intelligence with SES. Since these tests appear in most IQ batteries, circumspection may be warranted in estimating the amount of social inequalities in mortality attributable to IQ.

Cognition
Mortality
Social Epidemiology

https://doi.org/10.1136/jech-2013-203741

Statistics from Altmetric.com

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

Socioeconomic status (SES) and cognitive ability are powerful predictors of health and longevity.1 In fact, cognitive ability, as measured by the IQ, has been hypothesised to account for much of the SES-related health gradient.2 Supporting this hypothesis, correlational studies suggest that those with stronger cognitive skills may be able to better understand medical instructions, navigate social bureaucracies, and avoid accidental death.3–8 IQ also putatively correlates with markers of SES in the 0.4–0.55 range.9

IQ scores are themselves composed of two types of tests. ‘Performance IQ’ tests assess non-verbal reasoning and analytic ability. Tests of ‘Verbal IQ’ (VIQ) reflect language-based reasoning ability and knowledge.10 ‘True’ VIQ is presumed to reflect factors, such as the speed and facility of language acquisition, and abstract and symbolic manipulation of language to solve problems and attain goals.10 These skills assume different forms depending on the socioeconomic environment in which language is learned and reinforced, however.11 ,12 Thus, language-based problem solving may vary markedly across socioeconomic strata.

In psychometric theory, VIQ ‘true scores’ refer to test scores putatively measuring the concept of ‘true VIQ.’13 Under common statistical assumptions, VIQ true scores can be separated from other factors potentially influencing performance on VIQ tests that fall outside the definition of verbal intelligence. Many such factors have been implicated in VIQ test performance, including higher education, middle class SES and majority culture.11 ,12 VIQ tests scores, therefore, run the risk of mixing or confounding verbal intelligence true scores with educationally or socially acquired knowledge, class-differential verbal styles, academic motivation, standardised testing experience and other residue of social position.

More technically, this measurement confounding arises from violations of collapsibility and exchangeability.14 In other words, the association between the latent trait of ‘verbal intelligence’ (unobserved) and scores on a test of VIQ (observed) may not be collapsible across SES strata. Moreover, persons may not be exchangeable on non-intelligence attributes that affect response to items on the test (ie, measures of social standing). This type of systematic measurement error is called Differential Item Functioning (DIF).15 If it is present to a significant degree on a VIQ test, persons of lower SES may achieve artificially deflated VIQ test scores, driving the apparent association between SES and VIQ upward.16 As a result, the extent to which VIQ scores explain social gradients in mortality may appear larger than it actually is.

We examined whether two common forms of VIQ test exhibited DIF related to socioeconomic indicators (education, income, occupational prestige) and race in a US national sample. Although race/ethnicity may be less associated with social class in European countries, it is often considered a dimension of social stratification in the USA. We then compared SES correlations with VIQ scores, adjusted and unadjusted for DIF. Finally, we examined how much of the association between SES factors and all-cause mortality could be explained by VIQ scores, with and without correction for DIF. Our goal was to test a central premise of current research—that cognitive ability explains social patterns in mortality.

Methods

Sample and design

We used data from the General Social Survey (GSS), an annual nationally representative sampling of US population social practices and attitudes. Conducted by the National Opinion Research Center at the University of Chicago, the GSS uses a multistage probability sampling of non-institutionalised adults age 18 years and over, with response rates from 70% to 82% in any given year,17 yielding demographically identical annual samples. The GSS records age, gender, race/ethnicity, respondent occupation, income and years of education on the basis of face-to-face interviews with subjects. The Gallup-Thorndike Test of Verbal Intelligence18 (hereafter Gallup-Thorndike) was administered during these interviews to one-third to one-half of the sample randomly selected during the years 1978, 1982, 1984, 1987–2000. We used data for 9381 persons with complete data for all variables of interest. Those lacking data (usually an SES indicator) were more likely female, younger, minority, and in worse self-rated health (p<0.001); the resulting analytic sample was still broadly similar to that of the USA in 2000.19 The second VIQ test, the Wechsler Adult Intelligence Scales—Revised Similarities Test (WAIS-R Similarities), was administered in 1996 and yielded an analytic sample of 2444, by design demographically comparable to the broader Gallup Thorndike sample.

Measures

Occupation was coded using the Socioeconomic Index (SEI), a continuous measure of occupational prestige, based on US Census information. Income was calibrated to 1990 US dollars. The Gallup-Thorndike18 originally comprised 20 items taken from the Institute for Educational Research (IER) Intelligence Scale CAVD.20 In the GSS, 10 of these items were administered in person by an interviewer. Tests of vocabulary are presumed to assess word familiarity, and also (1) concept formation (without which the correct definition cannot be given) and (2) the ability to deduce meaning of unfamiliar words based on known roots or syllables, using answer choices provided.21

The second VIQ test, the WAIS-R Similarities test,22 presents persons with successively more difficult questions about how two different things are alike. For instance, an easy question might be ‘how are a fly and a mouse related?’ A completely correct response, such as ‘they are both animals’, receives two points. A response that is correct but does not capture the similarity at the most abstract level (eg, ‘they both have eyes’) receives one point. The WAIS-R manual contains detailed guidelines for scoring responses as 0, 1, or 2.22

Vital status through 2008 was ascertained from the National Death Index. The validity of the National Death Index is typically high, with matching certainty arising from social security numbers and the additional identifiers in the GSS reaching 99.8%.23 Further details on the GSS-National Death Index matching are available.17

Analysis

Occupational prestige, incopme, and education were scaled by Relative Index of Inequalities (RII). The RII scores the person of highest standing on a social dimension as 0, and the lowest as 1.24 ,25 A 1 unit change in regression models, therefore, is interpretable as a relative risk, but a specific kind: the risk at the absolute top, relative to the absolute bottom, of a distribution.

Item Response Theory (IRT) analyses of the Gallup-Thorndike were conducted with the Rasch model26 and WAIS-R similarities analyses used the graded response model.27 The Rasch model is formally equivalent to a mixed-effect logistic regression treating test items as repeated measures within person,28 and estimating the probability of success or ‘difficulty parameter’ for each item independently of an examinee's standing on the latent trait (random effect). The graded response model is an extension for ordered responses analogous to the extension from a binary to ordered logistic model (we relaxed the proportional odds assumption). The online supplementary material provides technical details of these IRT models and DIF analysis.29

Briefly, we examined DIF related to race, SEI, household income and education, as well as age and gender (which are correlated with SES) using interaction terms28 in three increasingly stringent steps. In step 1, we screened social factor interaction terms separately for each item. In our second step, the model adjusted for all previously identified sources of DIF for a single item simultaneously. In the third stage, we adjusted for DIF factors across all items simultaneously. At each step, we retained those that were significant and met a DIF effect size threshold such that the item's difficulty was 30% easier at one end of a sociodemographic dimension than at the other, irrespective of VIQ true score. Latent trait scores for each test were then estimated from IRT models with and without this final set of DIF interaction terms.

We examined impact of DIF on SES-VIQ associations via Pearson correlations between each SES factor and VIQ scores unadjusted and adjusted for DIF. We computed the absolute difference (r_unadj−r_adj), as well as relative difference (r_unadj/r_adj) between DIF adjusted and unadjusted score correlations.30 We also estimated the association of DIF-corrected and uncorrected VIQ scores with mortality using Cox proportional hazards models with attained age as time scale and GSS baseline age as point of entry into the risk set,31 ,32 fitting three models for each SES factor. Each model included gender as a covariate with time-varying hazards, based on preliminary proportionality analysis. The first model estimated the SES factor's RII, or the HR for those at the most disadvantaged, versus advantaged end of the distribution. A second model then added VIQ scores unadjusted for DIF and computed the excess hazard explained by these scores as (HR_unadjusted−HR_adjusted)/(HR_unadjusted−1), with 95% CIs obtained via bootstrap (1000 replicates). A third model then controlled for DIF-adjusted VIQ scores, again computing the change in estimate.

Results

Table 1 shows the sample demographics for the Gallup-Thorndike sample (left) and the WAIS-R similarities subset of that sample (right). With respect to VIQ tests, assumptions underlying IRT models appeared to be satisfactory.33 For the Gallup-Thorndike, of the 40 interaction terms involving race, SEI, household income and education, 21remained statistically significant and met the effect size criteria by the end of the three-stage screening. For the WAIS-R similarities test, of the 80 possible interactions, 14 remained significant and met the effect size threshold at the end of the third stage. Social factor DIF favoured white race and higher education, occupation and income. Additional age-related and gender-related DIF was observed on both tests, although the pattern did not consistently favour one gender or younger versus older persons. Online Supplementary table S1 lists the sources of DIF by item for each test. Social DIF seemed more apparent on vocabulary (9 out of 10 items) than similarities (5 out of 8 items).29

View this table:

Table 1

Demographic composition of the analytic sample: 1978–2002 General Social Survey inked to the 2008 mortality via the National Death Index

Table 2 reports the correlations between SES factors and VIQ test scores corrected and uncorrected for DIF. SES correlations with Gallup-Thorndike scores unadjusted for DIF were 0.16–0.33 larger in absolute magnitude, and 2.8–4.4 larger in relative magnitude, than unadjusted scores. For the WAIS-R Similarities, absolute differences in SES correlations ranged from 0.06 to 0.24 and relative differences from 2.0 to 2.6. DIF-adjusted correlations between SES factors and VIQ indicators fell outside the 95% CI of correlations with non-adjusted scores for all SES indicators on both tests.

View this table:

Table 2

Pearson correlations between SES, race, and verbal IQ test scores: 1978–2002 general social survey linked to the 2008 mortality via the National Death Index

Table 3 shows the RII as a HR for mortality for each social factor. Minority race exhibited non-proportional hazards (diminishing risk over the lifecourse), so estimates are presented at age 50 years. Table 3 also shows the change in estimate observed when controlling for latent trait scores adjusted and unadjusted for DIF. VIQ scores unadjusted for DIF accounted for smaller, but non-zero portions of social inequalities in mortality. For the Gallup-Thorndike, the change in estimate arising from corrected scores fell outside of the CI of that for uncorrected scores across three of four social factors. For the WAIS-R similarities test, the same pattern arose, but with wider CIs. Table 4 shows the RII for the WAIS-R similarities and Gallup-Thorndike scores with and without SES-corrected DIF. DIF-corrected scores showed smaller RIIs, with no appreciable difference in the proportion explained by SES. The latter quantity evidenced a very wide CI encompassing 0 in all cases. Sensitivity analyses revealed linearity in the log hazard for all factors, no VIQ social factor interactions or proportionality violations, nearly identical results excluding deaths within the first year, and comparable results with 2-parameter IRT models.

View this table:

Table 3

Social inequalities in mortality explained by biased and unbiased VIQ test scores

View this table:

Table 4

VIQ Inequalities in Mortality Explained by SES Indicators

Discussion

Across two VIQ tests, we found DIF favouring persons of higher SES and/or majority race/ethnicity group. Correcting for this, DIF reduced correlations between VIQ scores and educational attainment, occupational status, income, as well VIQ differences between African–Americans and Caucasians. In turn, VIQ scores adjusted for DIF explained smaller amounts of social inequalities in mortality.

Some have argued that intelligence, rather than SES, is the fundamental cause of differentials in mortality.2 This assertion is supported by many findings that cognitive ability test scores are substantial confounders of SES mortality risk.34 Our findings suggest that DIF-corrected VIQ scores had slightly less association with SES and mortality than uncorrected ones, so a portion of the predictive power of VIQ may arise from indirect SES variance captured by VIQ test scores.

It is important to note, however, that small social differentials in VIQ still existed even when DIF was controlled. Accordingly, DIF-corrected VIQ scores continue to explain a modest portion of social gradients in mortality. This would suggest that VIQ is somehow involved in social inequalities in mortality, albeit to a smaller extent than has been presumed.

Environmental exposures35 and early malnutrition36 have documented effects on brain development and cognitive ability, and it is plausible, if not likely, that persons scoring lower on IQ tests, consequently, are challenged with respect to school performance, occupational advancement and earnings.37 Thus, our data indeed suggest a legitimate—and probably reciprocal—association between VIQ and SES. Given the importance of this issue for policy, the critical question is not whether there is a link, but exactly how much measurement inaccuracy inflates our current estimates.

Specifically, DIF observed here may be explained by numerous factors affecting IQ test performance that are associated with social disadvantage. These include achievement motivation,38 greater test performance anxiety and stress,39 ,40 fear that poor test scores will be used to perpetuate stereotypes about class and intelligence,41 ,42 lack of familiarity with test content among participants from lower SES and/or racial/ethnic minority subcultures,11 different norms for, or uncertainty in, approaching test problems,43 ,44 use of different dialects, distrust of examiners administering the test,45 ,46 less familiarity with testing,47 a lower reading level,48 ,49 and poorer test-taking skills.50 The difficulty of disentangling verbal intelligence from factors relating to culture and academic achievement has been reported for some time.10 Nevertheless, the fairness of IQ tests across SES is justified, in part, by reports that IQ tests with unknown degrees of SES DIF predict SES outcomes.37 Such justifications may require reconsideration if VIQ tests confound Verbal IQ and SES to a non-trivial degree.

Our results must be interpreted with a balanced understanding of strengths and limitations. First, while these considerations suggest that VIQ tests capture educational and other SES variance, an important parallel argument has been offered: years of education, perhaps the most common index of SES, might actually measure some form(s) of intelligence, because cognitive abilities are generally required to achieve higher levels of education.51 From this viewpoint, adjusting any type of VIQ score for education-related DIF corrects for an IQ proxy and, thus, is an overcorrection. However, since ‘years of education’ is not a multi-item test score, IRT analyses cannot examine the issue. One future solution may be to use multi-item tests of academic achievement as a measure of education amenable to traditional IRT approaches and, thus, potentially separable from various forms of IQ. Second, VIQ is just one of two components of general IQ scores. Tests of the other component, Performance IQ, have been suggested by many,52 ,53 but not all,11 ,12 to avoid mixing SES with cognitive ability measurement. In this regard, many in the cognitive epidemiology community have begun to focus on measures of Performance IQ, including tests of reaction time or processing speed, as the key cognitive abilities predictive of mortality.8

Third, we only examine two common tests of VIQ. Although we did not study other tests, these two tests correlate highly with other VIQ tests (ie, 0.7 to 0.8),10 ,54 and with general IQ scores.10 ,21 Thus, we suspect that other VIQ tests, and general IQ scores, may be susceptible to some extent to this phenomenon. However, these results may or may not generalise to non-cognitive psychological tests, such as personality measures, which may also be vulnerable to DIF and deserve study in their own right. It is also important to remember that SES is multidimensional, that some dimensions of SES might be more vulnerable to DIF than others, that different dimensions of SES may have differential associations with mortality, and these associations may vary at different points in the lifespan.

Although our analysis addresses these concerns for three common indicators of SES measured once, the use of other indicators would be helpful, such as the quality of education received or family social position. Longitudinal studies could examine the extent to which cognitive abilities at various points in the lifespan mediate prior SES-related health risks. Performance IQ, and/or tests based on theories of multiple intelligences,55 may contribute better to our understanding of the inter-relationships between class, intelligence and health.

Ultimately, most IQ batteries used in epidemiologic study include vocabulary and/or similarities in VIQ tests. Thus, the behaviour of these tests will be transmitted to general or composite IQ scores, upon which many conclusions are based. If the other tests in the battery do not evidence social-factor DIF, overestimation of IQ-SES associations will be more attenuated. However, the number of other tests in the battery exhibiting similar DIF will dictate the degree of overestimation, and this is an unknown. Our findings thus constitute a ‘proof of principle’ suggesting care in interpreting data on IQ and social gradients in mortality.

What is already known on this subject

General IQ scores are thought to partially explain social gradients in mortality.
General IQ scores are composed of Performance IQ, and Verbal IQ (VIQ) tests.
VIQ tests are suspected to confound true cognitive ability with socioeconomic status (SES).
Measurement error may lead to overestimates of the extent to IQ explains social inequalities in mortality.

What this study adds

Two common types of VIQ tests exhibit differential item functioning favouring persons of higher SES in a nationally representative US cohort.
High correlations between VIQ scores and SES are inflated due to differential item functioning.
Correction for differential item functioning reduces the explanatory role of IQ in social inequalities in mortality.

References

↵
1. Roberts BW,
2. Kuncel N,
3. Shiner RN,
4. et al
. The power of personality: a comparative analysis of the predictive validity of personality traits, SES, and IQ. Perspect Psychol Sci 2007;4:313–46.
OpenUrl
↵
1. Gottfredson LS
. Intelligence: is it the epidemiologists’ elusive “fundamental cause” of social class inequalities in health? J Pers Soc Psychol 2004;86:174–99.
OpenUrl CrossRef PubMed Web of Science
↵
1. Batty GD,
2. Gale CR,
3. Tynelius P,
4. et al
. IQ in early adulthood, socioeconomic position, and unintentional injury mortality by middle age: a cohort study of more than 1 million Swedish men. Am J Epidemiol 2009;169:606–15.
OpenUrl Abstract/FREE Full Text
↵
1. Singh-Manoux A,
2. Ferrie JE,
3. Lynch JW,
4. et al
. The role of cognitive ability (intelligence) in explaining the association between socioeconomic position and health: evidence from the Whitehall II prospective cohort study. Am J Epidemiol 2005;161:831–9.
OpenUrl Abstract/FREE Full Text
↵
1. Calvin CM,
2. Deary IJ,
3. Fenton C
et al. Intelligence in youth and all-cause-mortality: systematic review with meta-analysis. Int J Epidemiol 2011;40:626–44.
OpenUrl Abstract/FREE Full Text
↵
1. Batty GD,
2. Deary IJ,
3. Schoon I,
4. et al
. Mental ability across childhood in relation to risk factors for premature mortality in adult life: the 1970 British Cohort Study. J Epidemiol Community Health 2007;61:997–1003.
OpenUrl Abstract/FREE Full Text
↵
1. Batty GD,
2. Deary IJ,
3. Macintyre S
. Childhood IQ in relation to risk factors for premature mortality in middle-aged persons: the Aberdeen Children of the 1950s study. J Epidemiol Community Health 2007;61:241–7.
OpenUrl Abstract/FREE Full Text
↵
1. Deary IJ,
2. Weiss AW,
3. Batty GD
. Intelligence and personality as predictors of illness and death: How researchers in differential psychology and chronic disease epidemiology are collaborating to understand and address health inequalities. Psychol Sci Public Interests 2010;11:53–79.
OpenUrl FREE Full Text
↵
1. Neisser U,
2. Boodoo G,
3. Bouchard TJ,
4. et al
. Intelligence: knowns and unknowns. Am Psychol 1996;51:77–101.
OpenUrl CrossRef Web of Science
↵
ASK. Assessing adolescent and adult intelligence. 3rd edn. New York: John Wiley & Sons, 2006.
↵
1. Helms J
. Why is there no study of cultural equivalence in standardized cognitive ability testing? Am Psychol 1992;47:1083–101.
OpenUrl CrossRef Web of Science
↵
1. Helms J
. The triple quandry of race, culture, and social class in standardized cognitive ability testing. Contemporary Intellectual Assessment: Theories, Tests, and Issues. US: Guilford Press. 1998:517–32.
↵
1. Nunnaly JC,
2. Bernstein IH
. Psychometric Theory. 3rd edn. New York: McGraw-Hill, Inc., 1994.
↵
1. Greenland S,
2. Robins JM
. Identifiability, exchangeability and confounding revisited. Epidemiol Perspect Innov 2009;6:4.
OpenUrl CrossRef PubMed
↵
1. Millsap RE,
2. Everson HY
. Methodology review: statistical approaches for assessing measurement bias. Applied Psychol Meas 1993;17:297–334.
OpenUrl Abstract/FREE Full Text
↵
1. Rothman KJ,
2. Greenland S,
3. Lash TL
. Modern epidemiology. 3rd edn. Philadelphia, PA: Lippincott Williams & Wilkins, 2008.
↵
1. Muennig P,
2. Johnson G,
3. Kim J,
4. et al
. The general social survey-national death index: an innovative new dataset for the social sciences. BMC Res Notes 2011;4:385.
OpenUrl CrossRef PubMed
↵
1. Thorndike RL
. Two screening tests of verbal intelligence. J Appl Psychol 1942;26:128–35.
OpenUrl CrossRef Web of Science
↵
Bureau of the Census. Statistical Abstracts of the United States: 2001. Washington, DC: US Bureau of the Census, 2001.
↵
Research IoE. I.E.R. Graded test of word knowledge. New York City: Teachers College, Columbia, 1923.
↵
1. Jensen AR
. Vocabulary and general intelligence. Behav Brain Sci 2001;24:1109–10.
OpenUrl
↵
1. Wechsler D
. WAIS-R Manual: Wechsler adult intelligence scale—revised. San Antonio, TX: Psychological Corporation, 1981.
↵
1. Hermansen SW,
2. Leitzmann MF,
3. Schatzkin A
. The impact on national death index ascertainment of limiting submissions to social security administration death master file matches in epidemiologic studies of mortality. Am J Epidemiol 2009;169:901–8.
OpenUrl Abstract/FREE Full Text
↵
1. Sergeant JC,
2. Firth D
. Relative index of inequality: definition, estimation, and inference. Biostatistics 2006;7:213–24.
OpenUrl Abstract/FREE Full Text
↵
1. Mackenbach JP,
2. Kunst AE
. Measuring the magnitude of socio-economic inequalities in health: an overview of available measures illustrated with two examples from Europe. Soc Sci Med 1997;44:757–71.
OpenUrl CrossRef PubMed Web of Science
↵
1. Rasch G
. Probabilistic models for some intelligence and attainment tests. Copenhagen, Denmark: Danish Institute for Educational Research, 1960.
↵
1. van der Linden W,
2. Hambleton RK
. eds Handbook of modern item response theory. New York: Springer-Verlag, 1997.
↵
1. Rijmen F,
2. Tuerlinckx F,
3. De Boeck P,
4. et al
. A nonlinear mixed model framework for item response theory. Psychol Methods 2003;8:185–205.
OpenUrl CrossRef PubMed Web of Science
↵
1. Chapman BP,
2. Fiscella K,
3. Duberstein PR,
4. et al
. Item Response Theory Technical Supplement for “Measurement Confounding Affects the Extent to Which Verbal IQ Explains Social Gradients in Mortality”. Vol. 5. Rochester, NY: University of Rochester Medical Center, 2014.
↵
1. Greenland S
. Modeling and variable selection in epidemiologic analysis. Am J Public Health 1989;79:340–9.
OpenUrl CrossRef PubMed Web of Science
↵
1. Korn EL,
2. Graubard BI,
3. Midthune D
. Time-to-event analysis of longitudinal follow-up of a survey: choice of the time-scale. Am J Epidemiol 1997;145:72–80.
OpenUrl Abstract/FREE Full Text
↵
1. Thiebaut AC,
2. Benichou J
. Choice of time-scale in Cox's model analysis of epidemiologic cohort data: a simulation study. Stat Med 2004;23:3803–20.
OpenUrl CrossRef PubMed Web of Science
↵
1. Junker BW,
2. Sijtsma K
. Latent and manifest monotonicity in item response models. Appl Psychol Meas 2000;24:65–81.
OpenUrl CrossRef
↵
1. Batty GD,
2. Deary T
. Education and mortality: the role of intelligence. Lancet 2005;365:1765–66.
OpenUrl PubMed Web of Science
↵
1. McMichael AJ,
2. Baghurst PA,
3. Vimpani GV,
4. et al
. Tooth lead levels and IQ in school-age children: the Port Pirie Cohort Study. Am J Epidemiol 1994;140:489–99.
OpenUrl Abstract/FREE Full Text
↵
1. Yang S,
2. Platt RW,
3. Kramer MS
. Variation in child cognitive ability by week of gestation among healthy term births. Am J Epidemiol 2010;171:399–406.
OpenUrl Abstract/FREE Full Text
↵
1. Nisbett RE,
2. Aronson J,
3. Blair C,
4. et al
. Intellignece: New findings and theoretical developments. Am Psychol 2012;67:130–59.
OpenUrl CrossRef PubMed Web of Science
↵
1. Duckworth AL,
2. Quinn PD,
3. Lynam DR,
4. et al
. Role of test motivation in intelligence testing. Proc Natl Acad Sci USA 2011;108:7716–20.
OpenUrl Abstract/FREE Full Text
↵
1. Gass CS,
2. Curiel RE
. Test Anxiety in relation to measures of cognitive and intellectual functioning. Arch Clin Neuropsychol 2011;26:396–404.
OpenUrl Abstract/FREE Full Text
↵
1. Allison DE
. Test anxiety, stress, and intelligence-test performance. Can J Behav Sci 1970;2:26–37.
OpenUrl CrossRef
↵
1. Steele CM,
2. Aronson J
. Stereotype threat and the intellectual test performance of African Americans. J Pers Soc Psychol 1995;69:797–811.
OpenUrl CrossRef PubMed Web of Science
↵
1. Croizet JC,
2. Milet M
. Social class and test performance: from stereotype threat to symbolic violence and vice versa. In: Inzlicht M, Schmader T. eds Stereotype threat: theory, process, and application. New York: Oxford University Press, 2012:118–201.
↵
1. Olive H
. Relationship of divergent thinking to intelligence, social class, and achievement in high-school students. J Genet Psychol 1972;121:179–86.
OpenUrl PubMed
↵
1. Edwards TB
. Deliberative attitudes, IQ, and social class. J Exp Educ 1968;37:7–18.
OpenUrl
↵
1. Klein RE,
2. Freeman HE,
3. Spring B,
4. et al
. Cognitive test-performance and indigenous conceptions of intelligence. J Psychol 1976;93:273–9.
OpenUrl CrossRef Web of Science
↵
1. Samuel W,
2. Soto D,
3. Parks M,
4. et al
. Motivation, race, social-class, and IQ. J Educ Psychol 1976;68:273–85.
OpenUrl CrossRef Web of Science
↵
1. Morgan AA,
2. Marsiske M,
3. Whitfield KE
. Characterizing and explaining differences in cognitive test performance between african american and european american older adults. Exp Aging Res 2008;34:80–100.
OpenUrl CrossRef PubMed Web of Science
↵
1. Manly JJ,
2. Jacobs DM,
3. Touradji P,
4. et al
. Reading level attenuates differences in neuropsychological test performance between African American and White elders. J Int Neuropsychol Soc 2002;8:341–48.
OpenUrl CrossRef PubMed Web of Science
↵
1. Riley CS
. Relationship between reading ability and verbal intelligence test performance. Br J Educ Psychol 1966;36:117.
OpenUrl CrossRef
↵
1. Jones RN,
2. Gallo JJ
. Education and sex differences in the mini-mental state examination: effects of differential item functioning. J Gerontol B Psychol Sci Soc Sci 2002;57:P548–58.
OpenUrl Abstract/FREE Full Text
↵
1. Deary IJ,
2. Johnson W
. Intelligence and education: causal perceptions drive analytic processes and therefore conclusions. Int J Epidemiol 2010;39:1362–9.
OpenUrl Abstract/FREE Full Text
↵
1. Shuttleworth-Edwards AB,
2. Kemp RD,
3. Rust AL,
4. et al
. Cross-cultural effects on IQ test performance: A review and preliminary normative indications on WAIS-III test performance. J Clin Exp Neuropsychol 2004;26:903–20.
OpenUrl CrossRef PubMed Web of Science
↵
American Educational Research Association APA, National Council on Measurement in Education. Standards for Educational and Psychological Testing. Washington, DC: American Psychological Association, 1999.
↵
1. Hunt E
. Human Intelligence. New York: Cambridge University Press, 2011.
↵
1. Sternberg RJ,
2. Castejón J,
3. Prieto M,
4. et al
. Confirmatory factor analysis of the Sternberg Triarchic Abilities Test in three international samples: An empirical test of the triarchic theory of intelligence. Eur J Psychol Assess 2001;17:1.
OpenUrl CrossRef

Supplementary materials

Supplementary Data

This web only file has been produced by the BMJ Publishing Group from an electronic file supplied by the author(s) and has not been edited for content.

Files in this Data Supplement:

Data supplement 1 - Online supplement

Footnotes

Contributorship statement BC, KF, PD, IK, PM: contributed to the conception and design of the work, and interpretation of data; contributed to drafting the work and critically revised it for important intellectual content; final approval of version to be published. BC, PM: contributed to the acquisition and analysis of data.
Funding This work was supported by US National Institutes of Health grants RC2MD004768, R01AG044588, and K08AG031328.
Competing interests None.
Patient consent No.
Ethics approval Institutional Review Board for the General Social Survey.
Provenance and peer review Not commissioned; externally peer reviewed.
Data sharing statement The data is publicly available through the General Social Survey.

[1] ↵
Roberts BW,
Kuncel N,
Shiner RN,
et al
. The power of personality: a comparative analysis of the predictive validity of personality traits, SES, and IQ. Perspect Psychol Sci 2007;4:313–46.
OpenUrl

[2] Roberts BW,

[3] Kuncel N,

[4] Shiner RN,

[5] et al

[6] ↵
Gottfredson LS
. Intelligence: is it the epidemiologists’ elusive “fundamental cause” of social class inequalities in health? J Pers Soc Psychol 2004;86:174–99.
OpenUrl CrossRef PubMed Web of Science

[7] Gottfredson LS

[8] ↵
Batty GD,
Gale CR,
Tynelius P,
et al
. IQ in early adulthood, socioeconomic position, and unintentional injury mortality by middle age: a cohort study of more than 1 million Swedish men. Am J Epidemiol 2009;169:606–15.
OpenUrl Abstract/FREE Full Text

[9] Batty GD,

[10] Gale CR,

[11] Tynelius P,

[12] et al

[13] ↵
Singh-Manoux A,
Ferrie JE,
Lynch JW,
et al
. The role of cognitive ability (intelligence) in explaining the association between socioeconomic position and health: evidence from the Whitehall II prospective cohort study. Am J Epidemiol 2005;161:831–9.
OpenUrl Abstract/FREE Full Text

[14] Singh-Manoux A,

[15] Ferrie JE,

[16] Lynch JW,

[17] et al

[18] ↵
Calvin CM,
Deary IJ,
Fenton C
et al. Intelligence in youth and all-cause-mortality: systematic review with meta-analysis. Int J Epidemiol 2011;40:626–44.
OpenUrl Abstract/FREE Full Text

[19] Calvin CM,

[20] Deary IJ,

[21] Fenton C

[22] ↵
Batty GD,
Deary IJ,
Schoon I,
et al
. Mental ability across childhood in relation to risk factors for premature mortality in adult life: the 1970 British Cohort Study. J Epidemiol Community Health 2007;61:997–1003.
OpenUrl Abstract/FREE Full Text

[23] Batty GD,

[24] Deary IJ,

[25] Schoon I,

[26] et al

[27] ↵
Batty GD,
Deary IJ,
Macintyre S
. Childhood IQ in relation to risk factors for premature mortality in middle-aged persons: the Aberdeen Children of the 1950s study. J Epidemiol Community Health 2007;61:241–7.
OpenUrl Abstract/FREE Full Text

[28] Batty GD,

[29] Deary IJ,

[30] Macintyre S

[31] ↵
Deary IJ,
Weiss AW,
Batty GD
. Intelligence and personality as predictors of illness and death: How researchers in differential psychology and chronic disease epidemiology are collaborating to understand and address health inequalities. Psychol Sci Public Interests 2010;11:53–79.
OpenUrl FREE Full Text

[32] Deary IJ,

[33] Weiss AW,

[34] Batty GD

[35] ↵
Neisser U,
Boodoo G,
Bouchard TJ,
et al
. Intelligence: knowns and unknowns. Am Psychol 1996;51:77–101.
OpenUrl CrossRef Web of Science

[36] Neisser U,

[37] Boodoo G,

[38] Bouchard TJ,

[39] et al

[40] ↵
ASK. Assessing adolescent and adult intelligence. 3rd edn. New York: John Wiley & Sons, 2006.

[41] ↵
Helms J
. Why is there no study of cultural equivalence in standardized cognitive ability testing? Am Psychol 1992;47:1083–101.
OpenUrl CrossRef Web of Science

[42] Helms J

[43] ↵
Helms J
. The triple quandry of race, culture, and social class in standardized cognitive ability testing. Contemporary Intellectual Assessment: Theories, Tests, and Issues. US: Guilford Press. 1998:517–32.

[44] Helms J

[45] ↵
Nunnaly JC,
Bernstein IH
. Psychometric Theory. 3rd edn. New York: McGraw-Hill, Inc., 1994.

[46] Nunnaly JC,

[47] Bernstein IH

[48] ↵
Greenland S,
Robins JM
. Identifiability, exchangeability and confounding revisited. Epidemiol Perspect Innov 2009;6:4.
OpenUrl CrossRef PubMed

[49] Greenland S,

[50] Robins JM

[51] ↵
Millsap RE,
Everson HY
. Methodology review: statistical approaches for assessing measurement bias. Applied Psychol Meas 1993;17:297–334.
OpenUrl Abstract/FREE Full Text

[52] Millsap RE,

[53] Everson HY

[54] ↵
Rothman KJ,
Greenland S,
Lash TL
. Modern epidemiology. 3rd edn. Philadelphia, PA: Lippincott Williams & Wilkins, 2008.

[55] Rothman KJ,

[56] Greenland S,

[57] Lash TL

[58] ↵
Muennig P,
Johnson G,
Kim J,
et al
. The general social survey-national death index: an innovative new dataset for the social sciences. BMC Res Notes 2011;4:385.
OpenUrl CrossRef PubMed

[59] Muennig P,

[60] Johnson G,

[61] Kim J,

[62] et al

[63] ↵
Thorndike RL
. Two screening tests of verbal intelligence. J Appl Psychol 1942;26:128–35.
OpenUrl CrossRef Web of Science

[64] Thorndike RL

[65] ↵
Bureau of the Census. Statistical Abstracts of the United States: 2001. Washington, DC: US Bureau of the Census, 2001.

[66] ↵
Research IoE. I.E.R. Graded test of word knowledge. New York City: Teachers College, Columbia, 1923.

[67] ↵
Jensen AR
. Vocabulary and general intelligence. Behav Brain Sci 2001;24:1109–10.
OpenUrl

[68] Jensen AR

[69] ↵
Wechsler D
. WAIS-R Manual: Wechsler adult intelligence scale—revised. San Antonio, TX: Psychological Corporation, 1981.

[70] Wechsler D

[71] ↵
Hermansen SW,
Leitzmann MF,
Schatzkin A
. The impact on national death index ascertainment of limiting submissions to social security administration death master file matches in epidemiologic studies of mortality. Am J Epidemiol 2009;169:901–8.
OpenUrl Abstract/FREE Full Text

[72] Hermansen SW,

[73] Leitzmann MF,

[74] Schatzkin A

[75] ↵
Sergeant JC,
Firth D
. Relative index of inequality: definition, estimation, and inference. Biostatistics 2006;7:213–24.
OpenUrl Abstract/FREE Full Text

[76] Sergeant JC,

[77] Firth D

[78] ↵
Mackenbach JP,
Kunst AE
. Measuring the magnitude of socio-economic inequalities in health: an overview of available measures illustrated with two examples from Europe. Soc Sci Med 1997;44:757–71.
OpenUrl CrossRef PubMed Web of Science

[79] Mackenbach JP,

[80] Kunst AE

[81] ↵
Rasch G
. Probabilistic models for some intelligence and attainment tests. Copenhagen, Denmark: Danish Institute for Educational Research, 1960.

[82] Rasch G

[83] ↵
van der Linden W,
Hambleton RK
. eds Handbook of modern item response theory. New York: Springer-Verlag, 1997.

[84] van der Linden W,

[85] Hambleton RK

[86] ↵
Rijmen F,
Tuerlinckx F,
De Boeck P,
et al
. A nonlinear mixed model framework for item response theory. Psychol Methods 2003;8:185–205.
OpenUrl CrossRef PubMed Web of Science

[87] Rijmen F,

[88] Tuerlinckx F,

[89] De Boeck P,

[90] et al

[91] ↵
Chapman BP,
Fiscella K,
Duberstein PR,
et al
. Item Response Theory Technical Supplement for “Measurement Confounding Affects the Extent to Which Verbal IQ Explains Social Gradients in Mortality”. Vol. 5. Rochester, NY: University of Rochester Medical Center, 2014.

[92] Chapman BP,

[93] Fiscella K,

[94] Duberstein PR,

[95] et al

[96] ↵
Greenland S
. Modeling and variable selection in epidemiologic analysis. Am J Public Health 1989;79:340–9.
OpenUrl CrossRef PubMed Web of Science

[97] Greenland S

[98] ↵
Korn EL,
Graubard BI,
Midthune D
. Time-to-event analysis of longitudinal follow-up of a survey: choice of the time-scale. Am J Epidemiol 1997;145:72–80.
OpenUrl Abstract/FREE Full Text

[99] Korn EL,

[100] Graubard BI,

[101] Midthune D

[102] ↵
Thiebaut AC,
Benichou J
. Choice of time-scale in Cox's model analysis of epidemiologic cohort data: a simulation study. Stat Med 2004;23:3803–20.
OpenUrl CrossRef PubMed Web of Science

[103] Thiebaut AC,

[104] Benichou J

[105] ↵
Junker BW,
Sijtsma K
. Latent and manifest monotonicity in item response models. Appl Psychol Meas 2000;24:65–81.
OpenUrl CrossRef

[106] Junker BW,

[107] Sijtsma K

[108] ↵
Batty GD,
Deary T
. Education and mortality: the role of intelligence. Lancet 2005;365:1765–66.
OpenUrl PubMed Web of Science

[109] Batty GD,

[110] Deary T

[111] ↵
McMichael AJ,
Baghurst PA,
Vimpani GV,
et al
. Tooth lead levels and IQ in school-age children: the Port Pirie Cohort Study. Am J Epidemiol 1994;140:489–99.
OpenUrl Abstract/FREE Full Text

[112] McMichael AJ,

[113] Baghurst PA,

[114] Vimpani GV,

[115] et al

[116] ↵
Yang S,
Platt RW,
Kramer MS
. Variation in child cognitive ability by week of gestation among healthy term births. Am J Epidemiol 2010;171:399–406.
OpenUrl Abstract/FREE Full Text

[117] Yang S,

[118] Platt RW,

[119] Kramer MS

[120] ↵
Nisbett RE,
Aronson J,
Blair C,
et al
. Intellignece: New findings and theoretical developments. Am Psychol 2012;67:130–59.
OpenUrl CrossRef PubMed Web of Science

[121] Nisbett RE,

[122] Aronson J,

[123] Blair C,

[124] et al

[125] ↵
Duckworth AL,
Quinn PD,
Lynam DR,
et al
. Role of test motivation in intelligence testing. Proc Natl Acad Sci USA 2011;108:7716–20.
OpenUrl Abstract/FREE Full Text

[126] Duckworth AL,

[127] Quinn PD,

[128] Lynam DR,

[129] et al

[130] ↵
Gass CS,
Curiel RE
. Test Anxiety in relation to measures of cognitive and intellectual functioning. Arch Clin Neuropsychol 2011;26:396–404.
OpenUrl Abstract/FREE Full Text

[131] Gass CS,

[132] Curiel RE

[133] ↵
Allison DE
. Test anxiety, stress, and intelligence-test performance. Can J Behav Sci 1970;2:26–37.
OpenUrl CrossRef

[134] Allison DE

[135] ↵
Steele CM,
Aronson J
. Stereotype threat and the intellectual test performance of African Americans. J Pers Soc Psychol 1995;69:797–811.
OpenUrl CrossRef PubMed Web of Science

[136] Steele CM,

[137] Aronson J

[138] ↵
Croizet JC,
Milet M
. Social class and test performance: from stereotype threat to symbolic violence and vice versa. In: Inzlicht M, Schmader T. eds Stereotype threat: theory, process, and application. New York: Oxford University Press, 2012:118–201.

[139] Croizet JC,

[140] Milet M

[141] ↵
Olive H
. Relationship of divergent thinking to intelligence, social class, and achievement in high-school students. J Genet Psychol 1972;121:179–86.
OpenUrl PubMed

[142] Olive H

[143] ↵
Edwards TB
. Deliberative attitudes, IQ, and social class. J Exp Educ 1968;37:7–18.
OpenUrl

[144] Edwards TB

[145] ↵
Klein RE,
Freeman HE,
Spring B,
et al
. Cognitive test-performance and indigenous conceptions of intelligence. J Psychol 1976;93:273–9.
OpenUrl CrossRef Web of Science

[146] Klein RE,

[147] Freeman HE,

[148] Spring B,

[149] et al

[150] ↵
Samuel W,
Soto D,
Parks M,
et al
. Motivation, race, social-class, and IQ. J Educ Psychol 1976;68:273–85.
OpenUrl CrossRef Web of Science

[151] Samuel W,

[152] Soto D,

[153] Parks M,

[154] et al

[155] ↵
Morgan AA,
Marsiske M,
Whitfield KE
. Characterizing and explaining differences in cognitive test performance between african american and european american older adults. Exp Aging Res 2008;34:80–100.
OpenUrl CrossRef PubMed Web of Science

[156] Morgan AA,

[157] Marsiske M,

[158] Whitfield KE

[159] ↵
Manly JJ,
Jacobs DM,
Touradji P,
et al
. Reading level attenuates differences in neuropsychological test performance between African American and White elders. J Int Neuropsychol Soc 2002;8:341–48.
OpenUrl CrossRef PubMed Web of Science

[160] Manly JJ,

[161] Jacobs DM,

[162] Touradji P,

[163] et al

[164] ↵
Riley CS
. Relationship between reading ability and verbal intelligence test performance. Br J Educ Psychol 1966;36:117.
OpenUrl CrossRef

[165] Riley CS

[166] ↵
Jones RN,
Gallo JJ
. Education and sex differences in the mini-mental state examination: effects of differential item functioning. J Gerontol B Psychol Sci Soc Sci 2002;57:P548–58.
OpenUrl Abstract/FREE Full Text

[167] Jones RN,

[168] Gallo JJ

[169] ↵
Deary IJ,
Johnson W
. Intelligence and education: causal perceptions drive analytic processes and therefore conclusions. Int J Epidemiol 2010;39:1362–9.
OpenUrl Abstract/FREE Full Text

[170] Deary IJ,

[171] Johnson W

[172] ↵
Shuttleworth-Edwards AB,
Kemp RD,
Rust AL,
et al
. Cross-cultural effects on IQ test performance: A review and preliminary normative indications on WAIS-III test performance. J Clin Exp Neuropsychol 2004;26:903–20.
OpenUrl CrossRef PubMed Web of Science

[173] Shuttleworth-Edwards AB,

[174] Kemp RD,

[175] Rust AL,

[176] et al

[177] ↵
American Educational Research Association APA, National Council on Measurement in Education. Standards for Educational and Psychological Testing. Washington, DC: American Psychological Association, 1999.

[178] ↵
Hunt E
. Human Intelligence. New York: Cambridge University Press, 2011.

[179] Hunt E

[180] ↵
Sternberg RJ,
Castejón J,
Prieto M,
et al
. Confirmatory factor analysis of the Sternberg Triarchic Abilities Test in three international samples: An empirical test of the triarchic theory of intelligence. Eur J Psychol Assess 2001;17:1.
OpenUrl CrossRef

[181] Sternberg RJ,

[182] Castejón J,

[183] Prieto M,

[184] et al

Log in using your username and password

Main menu

Log in using your username and password

You are here

Abstract

Statistics from Altmetric.com

Request Permissions

Methods

Sample and design

Measures

Analysis

Results

Discussion

What is already known on this subject

What this study adds

References

Supplementary materials

Supplementary Data

Footnotes

Read the full text or download the PDF:

Log in using your username and password