You learn from failure

Gene ExpressionBy Razib KhanApr 2, 2011 10:32 AM


Sign up for our email newsletter for the latest science news

In yesterday's post on African genetics I tried to work with a large set of populations, but narrowed SNPs down to ~40,000. Today I thought I'd go another route, focus on having a thicker market set, but with fewer populations. So I did a bunch of runs with 400,000 SNPs. Here's K = 8. Please note, I did some "trial" runs and pulled out people with obvious admixture which was recent or an outlier within their population. (e.g., Mozabites with a lot of Sub-Saharan African or San which obviously had European ancestry).

Notice that there are three non-Sub-Saharan modal components. South of the Sahara the European one is absent. But here's the weird thing. Below are MDS representations of genetic distance between the ancestral groups inferred above:

Now without Eurasians + North Africans:

All of these "ancestral" groups are abstractions. More plainly, they're fake but useful (physicists would say "toy models," economists "stylized facts"). But the Nilotic one seems kind of crazy here. It told the program to go look for 8 populations. It went and looked, and came back with some with a weird one. I guess that means I'll have to do cross-validation from now on, even though that slows everything down.

1 free article left
Want More? Get unlimited access for as low as $1.99/month

Already a subscriber?

Register or Log In

1 free articleSubscribe
Discover Magazine Logo
Want more?

Keep reading for as low as $1.99!


Already a subscriber?

Register or Log In

More From Discover
Recommendations From Our Store
Shop Now
Stay Curious
Our List

Sign up for our weekly science updates.

To The Magazine

Save up to 40% off the cover price when you subscribe to Discover magazine.

Copyright © 2023 Kalmbach Media Co.