Diabetes stops at the state line!

Explore the rise of diabetes in America with insights from CDC data and state-level prevalence estimates. Discover surprising patterns!

Written byRazib Khan
| 2 min read
Google NewsGoogle News Preferred Source

Newsletter

Sign up for our email newsletter for the latest science news

Sign Up

Visualization of data is great. And sometimes it tells us something...though we don't always know what. Slate has an interactive feature showing the rise of diabetes in America by county. Nothing too surprising.

But follow the gradient from El Paso to the Illinois-Missouri border. The differences are small across state lines, but the consistent differences along the borders really don't make. Are there state-level policies or regulations causing this? Or, are there state-level differences in measurement? This weird pattern shows up in other CDC data I've seen. Update: I think the mystery is solved in the comments:

Very interesting. I suspect the answer has to do with the manner in which the county estimates are produced. I went to the original data source, the CDC, and then to the relevant FAQ: http://apps.nccd.cdc.gov/DDT_STRS2/FAQ.aspx#countylevelestimates There they say that the diabetes prevalence estimates come from the "CDC's Behavioral Risk Factor Surveillance System (BRFSS) and data from the U.S. Census Bureau’s Population Estimates Program. The BRFSS is an ongoing, monthly, state-based telephone survey of the adult population. The survey provides state-specific information" So the CDC then uses a complicated statistical procedure ("indirect model-dependent estimates" using Bayesian techniques and multilevel Poisson regression models) to go from state to county prevalence estimates. My hunch is that the state level averages thereby affect the county estimates. The FAQ in fact says "State is included as a county-level covariate. " This is just a guess, but I think it is quite possibly the answer. (I should note that I looked very briefly at maps of unemployment by county and did not see the same pattern; county unemployment rates have to be estimated because there is not enough data at the county level in a survey of 60000 households, but perhaps the BLS does not use state covariates.)

Meet the Author

Related Topics

Stay Curious

JoinOur List

Sign up for our weekly science updates

View our Privacy Policy

SubscribeTo The Magazine

Save up to 40% off the cover price when you subscribe to Discover magazine.

Subscribe