TY - JOUR T1 - Principal component analysis of socioeconomic factors and their association with malaria and arbovirus risk in Tanzania: a sensitivity analysis JF - Journal of Epidemiology and Community Health JO - J Epidemiol Community Health SP - 1046 LP - 1051 DO - 10.1136/jech-2017-209119 VL - 71 IS - 11 AU - Esha Homenauth AU - Debora Kajeguka AU - Manisha A Kulkarni Y1 - 2017/11/01 UR - http://jech.bmj.com/content/71/11/1046.abstract N2 - Principal component analysis (PCA) is frequently adopted for creating socioeconomic proxies in order to investigate the independent effects of wealth on disease status. The guidelines and methods for the creation of these proxies are well described and validated. The Demographic and Health Survey, World Health Survey and the Living Standards Measurement Survey are examples of large data sets that use PCA to create wealth indices particularly in low and middle-income countries (LMIC), where quantifying wealth-disease associations is problematic due to the unavailability of reliable income and expenditure data. However, the application of this method to smaller survey data sets, especially in rural LMIC settings, is less rigorously studied.In this paper, we aimed to highlight some of these issues by investigating the association of derived wealth indices using PCA on risk of vector-borne disease infection in Tanzania focusing on malaria and key arboviruses (ie, dengue and chikungunya). We demonstrated that indices consisting of subsets of socioeconomic indicators provided the least methodologically flawed representations of household wealth compared with an index that combined all socioeconomic variables. These results suggest that the choice of the socioeconomic indicators included in a wealth proxy can influence the relative position of households in the overall wealth hierarchy, and subsequently the strength of disease associations. This can, therefore, influence future resource planning activities and should be considered among investigators who use a PCA-derived wealth index based on community-level survey data to influence programme or policy decisions in rural LMIC settings. ER -