Article Text


Drinking water turbidity and gastrointestinal illness in the elderly of Philadelphia


STUDY OBJECTIVE To investigate the association between drinking water quality and gastrointestinal illness in the elderly of Philadelphia.

DESIGN Within the general population, children and the elderly are at highest risk for gastrointestinal disease. This study investigates the potential association between daily fluctuations in drinking water turbidity and subsequent hospital admissions for gastrointestinal illness of elderly persons, controlling for time trends, seasonal patterns, and temperature using Poisson regression analysis.

SETTING AND PARTICIPANTS All residents of Philadelphia aged 65 and older in 1992–1993 were studied through their MEDICARE records.

MAIN RESULTS For Philadelphia's population aged 65 and older, we found water quality 9 to 11 days before the visit was associated with hospital admissions for gastrointestinal illness, with an interquartile range increase in turbidity being associated with a 9% increase (95% CI 5.3%, 12.7%). In the Belmont service area, there was also an association evident at a lag of 4 to 6 days (9.1% increase, 95% CI 5.2, 13.3). Both associations were stronger in those over 75 than in the population aged 65–74. This association occurred in a filtered water supply in compliance with US standards.

CONCLUSIONS Elderly residents of Philadelphia remain at risk of waterborne gastrointestinal illness under current water treatment practices. Hospitalisations represent a very small percentage of total morbidity.

Statistics from

It has long been known that drinking water contamination can be associated with epidemics of gastrointestinal (GI) disease.1 In response, three approaches were developed to avoid waterborne infectious diseases. Initially, emphasis was placed on developing a protected upland water source. In the early 1900s filtration and chemical disinfection with chlorine were introduced. Any one of these could reduce the mean death rate of typhoid from about 40–50 per 100 000 (depending on the water source) to about 13 per 100 000, and two or more were thought to virtually eliminate waterborne infectious disease epidemics.2-4

Recently it has been shown that some of what is seen as “endemic” GI illness is actually waterborne.5-8 Children and the immunocompromised population have been studied in particular6 9 as has the general population.5 7 8 However, GI illness is a major health concern among the elderly, and the case-fatality rate for the elderly is higher than for other groups.10 The elderly are a growing segment of the population in most western countries.

This study examined the association between daily measures of drinking water turbidity and hospital admissions for GI illness of elderly residents of Philadelphia over the period 1992–1993, controlling for time trends, seasonality and temperature. Philadelphia has a generally well run, filtered and disinfected water system that consistently met EPA regulations throughout the study period. The source water is not protected, and comes from rivers passing through the city.

Drinking water turbidity, a measure of the cloudiness of the water, is commonly used as a proxy measure for the risk of microbial contamination and the effectiveness of the treatment of public drinking water.11 Several studies have shown a correlation between turbidity levels and microbial contamination of raw and treated water,12-14 and a few documented waterborne disease outbreaks were associated with increased turbidity levels15 16 although others reported no “increased” turbidity levels.9 17

The study design used here is a Poisson time series, comparing daily counts of hospital admissions with daily exposure measures (with lags to account for latency/incubation period). These methods have been widely used to evaluate the association between air pollution levels and hospital admissions for lung disease.18-24 The design is attractive because the population serves as its own control. Thus the only potential confounders are those that vary from day to day in correlation with exposure. EPA recently reviewed the corpus of such studies, and determined that methodologically, they are adequate to support EPA regulations.25



Environmental data

Three water treatment plants serve Philadelphia. The Belmont Plant receives its water from the Schuylkill River, which passes through the city of Philadelphia; Belmont treats about 60 million gallons of water per day (mgd) and serves much of the city west of the Schuylkill River. The Queen Lane Plant also gets its water from the Schuylkill River; it treats about 100 mgd and serves the northwest portion of Philadelphia, until the Baxter service area. The Baxter Plant uses water from the Delaware River, which is Philadelphia's east border; it provides about 200 mgd and serves the eastern part of the city. The Baxter Plant also supplements the Queen Lane Plant (NW Philadelphia) service area.

The Philadelphia Water Department (PWD) serves about 1.2 million people. All three treatment plants provide filtration, pre-chlorination and a chloramine residual, conventional water treatment (flocculation and sedimentation), and, as necessary, chemical corrosion inhibitors. PWD measures turbidity for compliance with EPA regulations that stipulate sampling frequency, methods, and analysis. PWD consistently met all federal regulations during the study period and there was no evidence of any operational disturbances.

PWD provided daily turbidity data for each plant for the two year period 1992—93 for each of the water treatment plants. No other water quality parameter was available to us on a daily basis. Turbidity measurements were made every three hours at each of the three water treatment plants. These were averaged for each day in each plant and that plant specific measure was used for the analysis of admissions for GI illness of persons residing in the service area of each plant. An overall city average was computed by taking the arithmetic mean of daily measurements from the three plants, weighting each plant by the proportion of elderly admissions coming from its service area. The primary analysis was by plant and citywide analysis was only considered when plant level analyses indicated similar results in each plant service area.

Daily temperature readings (24 hour mean) were obtained from Philadelphia International Airport, which is the nearest National Atmospheric and Oceanographic Administration weather station.

Hospital visits

All hospitals treating Medicare patients must submit standardised reports to the Health Care Financing Administration for each admission. From these reports we extracted the date of admission, zip code of residence, and the International Classification of Diseases, 9th revision (ICD-9) code for the discharge diagnosis for all residents with a zip code of residence in an area serviced by one of the three Philadelphia water treatment plants.

We restricted our study to ICD-9 codes plausibly related to GI illness. These include a primary diagnosis of GI related illness (ICD-9 codes 001 to 009.9 and 558.9) or associated general symptoms such as electrolyte disorders (ICD-9 276), nausea and vomiting (ICD-9 787), and abdominal pain (ICD-9 789). Symptom codes and the general “catch-all” 558.9 categories were included because there is evidence of coding biases, especially attributing a non-infectious origin to GI illnesses among the elderly,10 and because there is chronic under-reporting of waterborne illness, even for mortality in a well known to be at risk population following a massive and highly publicised waterborne disease outbreak.26 From these data we tabulated daily counts of hospital admissions for GI related illness for each day from 1 January 1992 to 31 December 1993, for persons aged 65 years and older. Because of the different sources of water for the plants, the counts of daily visits were computed separately for each plant service area. In addition, an overall daily count was computed.


The unit of analysis in this study is the day, so potential confounders that must be controlled are those that vary over time, possibly coincidentally with water quality. There are seasonal patterns in daily counts of hospital admissions that are independent of water quality. While weather variables may explain some of these patterns, it is unlikely to account for them all. The specific long wavelength pattern for GI illness probably varies from city to city, but generally shows a winter peak.10 27

In this study, we regressed the daily counts of hospital admissions on turbidity levels, with control for seasonal variations and long term time trends, temperature, and day of the week effects. Separate analyses were done for each plant service area, and, if consistent patterns were seen, for the entire city.

Latencies for typical GI infections range from 1 to 14 days,28-30 so we considered turbidity levels between 1 and 14 days before the hospital visit in our analysis. Potential confounders in such a study design are limited to those that vary over time, such as seasonal patterns, temperature, and day of week effects.

Counts of daily admissions for GI illness are non-negative integers representing counts of rare events. Hence, Poisson regression analysis was used. To better control for potential non-linear dependencies on temperature, and for seasonal variation, we used the generalised additive model,31 which assumes that:

log[E(Y)] = β0 + S1(X1) + ... + Sp(Xp)

where Y denotes the daily admission counts, E(Y) the expected daily counts, the Xi are the covariates and the Siare smooth functions. The use of non-parametric smooth functions allows the dependence of hospital admissions on the covariates to be fit flexibly, as indicated by the data. The properties of such smoothers have been extensively discussed31 and used in environmental epidemiology.6 7 20 22 32 We have used Loess,33 a moving regression smoother. All smooth functions require a smoothing parameter that describes how smooth the function should be. A smooth function of time was used to remove seasonal variation from the data, and the smoothing parameter was chosen to eliminate serial correlation in the residuals. The dependence of admissions on temperature involved two choices. Firstly, we had to choose the lag between temperature and hospital admission, and then we had to choose the smoothing parameter to use. We made both choices by minimising Akaike's Information Criteria, a goodness of fit criteria that trades off improvement in model fit with a penalty for using more degrees of freedom. Day of the week was controlled for using dummy variables for each day, and turbidity levels were entered into the regression model linearly to be conservative. The choice of lag between turbidity and hospital admission was chosen, like the temperature variable, to be the one that minimised Akaike's Information Criteria.

All analyses were conducted separately for the service area of each water treatment plant. If similarities in responses were seen across the plant service areas, the daily counts were combined and regressed against the weighted average turbidity levels to gain more power. If significant associations were found, we also examined whether age was a risk modifier for the association by separately regressing admission counts in the age ranges of 65–74 and ⩾ 75.

Table 1 shows the distribution of the hospital admissions and environmental data. There were a total of 6021 hospital admissions from 1 January 1992 to 31 December 1993; 19.6% of patients resided in areas serviced by Belmont, 28.9% serviced by Queen Lane and 51.5% serviced by Baxter. Turbidity levels were in general very low (mean < 0.20 NTUs at all plants) and well below even the upcoming EPA standard.

Table 1

Distribution of daily environmental variables and daily hospital admissions for GI illness among the elderly of Philadelphia (1992–1993)

In these data, the Pearson's correlation coefficient between temperature and the weighted average turbidity level was low, at 0.14. The correlation between the seasonal pattern in GI hospital admissions and turbidity was similarly low, at −0.19. Figure 1 shows the residuals from the regression of all elderly hospital admissions in the city, showing that the smooth function of time did eliminate seasonal patterns. Figure 2 shows the partial autocorrelation function of the same residuals. No significant serial correlation remained. Similar results were seen for the plant service area specific analyses.

Figure 1

A plot is shown of the residuals from the regression of hospital admissions for gastrointestinal illness of all persons aged 65 and older in Philadelphia plotted against the day of admission. No seasonal pattern remains in the residuals, indicating the regression controlled adequately for season.

Figure 2

This shows the partial autocorrelation function of the residuals of the regression of hospital admissions for gastrointestinal illness of all persons aged 65 and older in Philadelphia. There is no significant correlation at any lag, indicating that serial correlation in residuals is not a problem in the analysis.

Table 2 shows the results of the analyses. An association was found between turbidity levels with a lag of 9 to 11 days and hospital admissions for GI illness in all three water plant service areas. Because of the similarity in lags among the three plants, a citywide analysis was performed. The mean turbidity level between 9 and 11 days before admission was a better predictor of risk than any single day's lag (for example, day 10). An interquartile range increase in exposure (0.035 NTU) was associated with a 9% increase in admissions (95% CI, 5.3, 12.7).

Table 2

Per cent increase in hospital admissions for GI illness among the elderly of Philadelphia associated with an interquartile range increase in turbidity, 1992–1993

In addition to this common association with water quality about one and a half weeks earlier, in the Belmont area there was also an association with the average turbidity 4 to 6 days before admission (9.1% increase for an interquartile range increase in turbidity, 95% CI 5.2, 13.3). The three day average (days 4–5–6) was a better predictor than turbidity five days earlier.

When the data were stratified by age, the association with turbidity 9 to 11 days before admissions was strongest in those > 75 (10.8%v 2.5%, table 2). Again, in the Belmont area, the lag of 4 to 6 days was primarily in the eldery—that is, those 75 and over.

To assure that the association between turbidity and GI hospital admissions was not a chance result of one positive association among 14 lags being considered, we examined the sign of the association at all other lags in all three plant service areas. Out of the 42 possible associations (14 in each service area), only seven were negative. The probability of this occurring given the null hypothesis is less than 0.0001.


In this study, we found an association between turbidity levels and hospital admissions for GI illness within the elderly population in Philadelphia over the period 1992–1993. In all three water treatment plant service areas, turbidity levels lagged 9 to 11 days were associated with hospital admissions (9% increase for an interquartile range increase in exposure). Residents of the Belmont service area also showed an increased risk (9.1 % increase) 4 to 6 days after exposure.

These findings are quite similar to those previously reported for paediatric emergency visits for GI illness in Philadelphia.6 That study reported emergency visits associated with turbidity lagged 10 days in the Belmont and Queen Lane service areas, and lagged nine days in the Baxter service area. Combining all service areas, an association was seen with a 10 day lag. In addition, an association in the Belmont service area with a four day lag was seen.

Overall, in this analysis the strongest association between drinking water quality and hospitalisations for GI illnesses in Philadelphia's elderly occurred in people 75 years old and older.

The lags observed are plausible for waterborne GI disease. Estimates of the actual incidence of waterborne disease suggest that the most common waterborne GI illnesses are viral and protozoan.34 35Incubation periods for these generally range from two days to two weeks.28-30

This study and similar ones5-9 show a significant level of GI disease, including hospitalisations, associated with consumption of drinking water in the US and Canada, even in well run, state of the art water systems with no evidence of operation disruptions. The generally very low turbidity levels in Philadelphia would meet EPA's upcoming Interim Enhanced Surface Water Treatment Rule. The costs of this morbidity are also likely to be significant; average hospital costs are now over $1000/day. Hospitalisations, of course, represent only a small percentage of total associated morbidity.


Turbidity is a measure of the presence of suspended particles in water that are capable of scattering light. Microorganisms are only a tiny portion of these particles (most is mud and silt), and pathogenic microbes are likely to be only a tiny fraction of the total microbial population. Turbidity, then, is a proxy for exposure. In the case of the Philadelphia water supply, the source water is highly contaminated, but it is treated and filtered. Hence, here turbidity may serve as a proxy for the ability of filtration to remove contaminants, including pathogens.11 This relation probably varies day to day, introducing error into our exposure estimate.

There is also measurement error associated with the turbidity measures. Firstly, there were reporting differences among Philadelphia's three water treatment plants. In addition, one study of variability in low level turbidity measurements found a wide but normally distributed variability; the instruments and calibration standard, which were the same in Philadelphia's three water plants, were the principal factors related to variability in that study.36 Random measurement error will decrease the estimated risk because, in a regression analysis, the variability in the independent variable is compared with the variability in the dependent variable. Fortunately, the large size of the data base in this study, especially the fact that each day's turbidity was the average of 24 measurements, increases overall accuracy.

Additional measurement error comes from the exposure links—date of exposure and zip code of residence. “Date” is imprecise because the transit time in the distribution system—that is, the time from water treatment plant to home tap—varies irregularly throughout the city. The date of exposure we used was the date the water left the treatment plant. Assignment of people to treatment plant by zip code is imprecise because people consume water from faucets other than their own residences, both within and outside the city. In addition, many parts of Philadelphia receive water that is blended from two or more of the water treatment plants.

In a univariate regression, misclassification of exposure biases effect estimates to the null. In multivariate regression, the effects are more complicated. However, the probability limit of the bias in the regression coefficient of the exposure of interest can be separated into two terms. One only involves the measurement error for the exposure variable, and again biases toward the null. The second term is proportional to the measurement error of the other covariates, and their correlation with exposure. This term can bias in either direction, depending on the signs of the correlations.37In this analysis, the exposure misclassification is relatively high and the correlation between turbidity and the covariates (season, day of week, and temperature) is moderately low. The net bias in the estimated effect of microbial contamination of drinking water, then, is likely to be toward an underestimate of effect.

key points
  • Disinfected drinking water in Western countries may still be a source of infectious gastrointestinal illness.

  • Some of what is now considered endemic illness may be waterborne.

  • Because of the large number of waterborne pathogens biological monitoring is rare, and proxies for contamination are used. Better proxies should be developed, and supplemented with routine monitoring of gastrointestinal illness.

Misclassification of disease is less intractable. The accuracy of discharge diagnoses were 90% when closely related diagnostic categories, such as the GI categories used here, were grouped.38 Furthermore, little evidence was seen for systematic missclassification of diagnoses into more financially rewarding diagnoses,39 and diagnostic accuracy was higher in large cities, and in later years.40

Random misclassification of the outcome variable in regression will not bias the estimated attributable risk for exposure, it will only increase the standard error of the estimated risks.41 To see this consider first a linear model for the response variable


Where X is a matrix of predictor variables, β a vector of regression coefficients, and Y the vector of daily counts. Suppose instead of Y, we measure Z on each day, where Z is the count of GI admissions including net missclassification. Then, using the usual formula for the estimated regression coefficients, we find that

β=(XtX)-1XtZ = (XtX)-1XtY + (XtX)-1Xte

Where Xt is the transpose of the X matrix, and e is Z-Y, the missclassification error. As up to normalisation Xte is the correlation between the exposure variables and the misdiagnosis error from day to day, we see that unless the net misdiagnosis on a daily basis was correlated with turbidity, there is no bias in the regression coefficient. In our case, we have fit a log-linear model


But for small relative risks, which is what we see in this study, we can expand exp(Xβ) as a power series and keep only the first term, recovering the situation above. Hence, such misclassification reduced the significance of the coefficients, but should not have biased the effect estimate.

This discussion assumes that these diagnosis errors are not correlated with exposure. While it is possible that there is a time trend of misclassification that might be correlated with a long term time trend in exposure, such long term time trends are filtered out in this analysis by the Loess smooth function of time. We cannot identify a likely scenario by which diagnostic patterns would change day to day in correlation with turbidity measurements.


Several recent studies have indicated that, even in systems meeting current federal and state standards, some of what has been understood an endemic GI illness in the US is actually waterborne infectious disease. All the systems studied met then current federal water standards and, with the exception of a very brief period during a massive outbreak of cryptosporidiosis in Milwaukee, all would meet current federal drinking water standards. Those studies have focused on the general population, children, and immunocompromised people. This analysis extends those findings to another sensitive group—the elderly.

Several studies have shown that GI illness is a “special burden” of the elderly. Hospitalisation and case-fatality rates for GI illness in the elderly are disproportionately high.10 27 Community surveys have confirmed that acute gastroenteritis is a common disease in nursing homes,42 including a study conducted in the Philadelphia area43; and in the non-institutionalised elderly population in Europe as well as in the US.44 A recent survey of the published literature to identify populations particularly sensitive to waterborne and foodborne enteric microorganisms found that they included the very young, the immunocompromised and the elderly.45

The legion of elderly is growing rapidly. The number of persons over 65 is estimated to double between 1980 and 2010, from about 25 to 50 million. And the number of people over 85 will likely triple in that time, from about 2.2 million to 7.5 million.

Thus, GI disease, both morbidity and mortality, has significant social and economic costs in the US, especially among the elderly. Given the slow aging of the entire US population and the disproportionate burden of GI disease in the elderly population, these costs can be expected to rise.


We thank the Philadelphia Water Department for providing the turbidity data.

This paper represents the views of the authors and does not represent the views of the US Environmental Protection Agency.

View Abstract


  • Funding: this study was supported by grant ES-0002 of the National Institute for Environmental Health Sciences.

  • Conflicts of interest: none.

Request permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.