Register for an account

X

Enter your name and email address below.

Your email address is used to log in and will not be shared or sold. Read our privacy policy.

X

Website access code

Enter your access code into the form field below.

If you are a Zinio, Nook, Kindle, Apple, or Google Play subscriber, you can enter your website access code to gain subscriber access. Your website access code is located in the upper right corner of the Table of Contents page of your digital edition.

Health

Diabetes stops at the state line!

Gene ExpressionBy Razib KhanDecember 7, 2010 8:29 AM

Newsletter

Sign up for our email newsletter for the latest science news

Visualization of data is great. And sometimes it tells us something...though we don't always know what. Slate has an interactive feature showing the rise of diabetes in America by county. Nothing too surprising.

stateline.png

But follow the gradient from El Paso to the Illinois-Missouri border. The differences are small across state lines, but the consistent differences along the borders really don't make. Are there state-level policies or regulations causing this? Or, are there state-level differences in measurement? This weird pattern shows up in other CDC data I've seen. Update: I think the mystery is solved in the comments:

Very interesting. I suspect the answer has to do with the manner in which the county estimates are produced. I went to the original data source, the CDC, and then to the relevant FAQ: http://apps.nccd.cdc.gov/DDT_STRS2/FAQ.aspx#countylevelestimates There they say that the diabetes prevalence estimates come from the "CDC's Behavioral Risk Factor Surveillance System (BRFSS) and data from the U.S. Census Bureau’s Population Estimates Program. The BRFSS is an ongoing, monthly, state-based telephone survey of the adult population. The survey provides state-specific information" So the CDC then uses a complicated statistical procedure ("indirect model-dependent estimates" using Bayesian techniques and multilevel Poisson regression models) to go from state to county prevalence estimates. My hunch is that the state level averages thereby affect the county estimates. The FAQ in fact says "State is included as a county-level covariate. " This is just a guess, but I think it is quite possibly the answer. (I should note that I looked very briefly at maps of unemployment by county and did not see the same pattern; county unemployment rates have to be estimated because there is not enough data at the county level in a survey of 60000 households, but perhaps the BLS does not use state covariates.)

2 Free Articles Left

Want it all? Get unlimited access when you subscribe.

Subscribe

Already a subscriber? Register or Log In

Want unlimited access?

Subscribe today and save 70%

Subscribe

Already a subscriber? Register or Log In