Why not release data for phylogenetic papers?

Last month I noted that a paper offering speculative inferences about the phylogenetic origins of Australian Aborigines was weakened in the force of its conclusions because the authors didn't release the data to the public (more accurately, to peers). There are likely political reasons for this with Australian Aborigine data sets, so I don't begrudge them this (well, at least not too much; I'd probably accept the result more readily if I could test drive the data set, but I doubt they had any control over the fact that the data had to be private).

This is why, when a new paper with a novel phylogenetic inference comes out, I immediately control-f to see if the authors released their data. For genome-wide association studies on medical population panels I can somewhat understand the need for closed data (even though anonymization obviates much of this), but I don't see this rationale as relevant at all for phylogenetic data (if concerned, one can remove particular functional SNPs).

Yesterday I noticed that PLoS Genetics had published a paper on the genomics of Middle Eastern populations, "Genome-Wide Diversity in the Levant Reveals Recent Structuring by Culture." The results were moderately interesting (I'll review the paper in detail later), but bravo to the authors for putting their new data set online. The reason is simple: reading the paper, I wanted to see an explicit phylogenetic tree/graph (e.g., one produced with TreeMix) to go along with their figures. Now that I have their data I can do that tonight, time permitting.

One major aspect of science is reproducibility. Because of the capital outlays involved, replication is not always viable, and it often occurs in a haphazard fashion. But with phylogenetics done on a computer this is less of an issue. I have a desktop at home devoted 99% to running data sets, in part for my own interest, and in part because I want to check the robustness of some of the inferences I see in papers like the ones above.



Razib Khan
