The disappearance of the sequences appears to be the result of an editorial error.
A batch of early coronavirus data that went missing for a year has emerged from hiding. In June, an American scientist discovered that more than 200 genetic sequences from COVID-19 patient samples isolated in China early in the pandemic had puzzlingly been removed from an online database . With some digital sleuthing, Jesse Bloom , a virus expert at the Fred Hutchinson Cancer Center in Seattle, managed to track down 13 of the sequences on Google Cloud. When Bloom shared his experience in a report
31 July 2021, 4:15 pm·4-min readMedical workers in protective suits inspect equipment at a blood donation room of the Renmin Hospital of Wuhan University in Wuhan, the epicentre of the novel coronavirus outbreak, in Hubei province, China February 14, 2020. Picture taken February 14, 2020. cnsphoto via REUTERS ATTENTION EDITORS - THIS IMAGE WAS PROVIDED BY A THIRD PARTY. CHINA OUT.
A batch of early coronavirus data that went missing for a year has emerged from hiding.In June, an American scientist discovered that more than 200 genetic sequences from COVID-19 patient samples isolated in China early in the pandemic had puzzlingly been removed from an online database. With some digital sleuthing, Jesse Bloom, a virus expert at the Fred Hutchinson Cancer Center in Seattle, managed to track down 13 of the sequences on Google Cloud.
When Bloom shared his experience in a report posted online, he wrote that it “seems likely that the sequences were deleted to obscure their existence.”Sign up for The Morning newsletter from the New York TimesBut now an odd explanation has emerged, stemming from an editorial oversight by a scientific journal. And the sequences have been uploaded into a different database, overseen by the Chinese government. headtopics.com
The story began in early 2020, when researchers at Wuhan University investigated a new way to test for the deadly coronavirus sweeping the country. They sequenced a short stretch of genetic material from virus samples taken from 34 patients at a Wuhan hospital.
The researchers posted their findings online in March 2020. That month, they also uploaded the sequences to an online database called the Sequence Read Archive, which is maintained by the National Institutes of Health, and submitted a paper describing their results to a scientific journal called Small. The paper was published in June 2020.
Bloom became aware of the Wuhan sequences this spring while researching the origin of COVID-19. Reading a May 2020 review about early genetic sequences of coronaviruses, he came across a spreadsheet that noted their presence in the Sequence Read Archive.
But Bloom could not find them in the database. He emailed the Chinese scientists on June 6 to ask where the data went but did not get a response. On June 22, he posted his report, which was covered by The New York Times and other media outlets.Story continues headtopics.com
At the time, a spokesperson for the NIH said that the authors of the study had requested in June 2020 that the sequences be withdrawn from the database. The authors informed the agency that the sequences were being updated and would be added to a different database. (The authors did not respond to inquiries from the Times.)
But a year later, Bloom could not find the sequences on any database.On July 5, more than a year after the researchers withdrew the sequences from the Sequence Read Archive and two weeks after Bloom’s report was published online, the sequences were quietly uploaded to a database maintained by China National Center for Bioinformation by Ben Hu, a researcher at Wuhan University and a co-author of the Small paper.
On July 21, the disappearance of the sequences was brought up during a news conference in Beijing, where Chinese officials rejected claims that the pandemic started as a lab leak.According to a translation of the news conference by a journalist at the state-controlled Xinhua News Agency, the vice minister of China’s National Health Commission, Dr. Zeng Yixin, said that the trouble arose when editors at Small deleted a paragraph in which the scientists described the sequences in the Sequence Read Archive.
“Therefore, the researchers thought it was no longer necessary to store the data in the NCBI database,” Zeng said, referring to the Sequence Read Archive, which is run by the NIH.An editor at Small, which specializes in science at the micro and nano scale and is based in Germany, confirmed his account. “The data availability statement was mistakenly deleted,” the editor, Plamena Dogandzhiyski, wrote in an email. “We will issue a correction very shortly, which will clarify the error and include a link to the depository where the data is now hosted.” headtopics.com
The journal posted a formal correction to that effect on Thursday.It is not clear why the authors did not mention the journal’s error when they requested that the sequences be removed from the Sequence Read Archive, or why they told the NIH that the sequences were being updated. Nor is it clear why they waited a year to upload them to another database. Hu did not respond to an email asking for comment.
Bloom could not offer an explanation for the conflicting accounts, either. “I’m not in a position to adjudicate among them,” he said in an interview.On their own, these sequences can’t resolve the open questions about how the pandemic originated, whether through a contact with a wild animal, a leak from a lab or some other route.
In their initial reports, the Wuhan researchers wrote that they extracted genetic material from “samples from outpatients with suspected COVID-19 early in the epidemic.” But the entries in the Chinese database now indicate that they were taken from Renmin Hospital of Wuhan University on Jan. 30 — almost two months after the earliest reports of COVID-19 in China.
While the disappearance of the sequences appears to be the result of an editorial error, Bloom felt that it was still worthwhile looking for other sequences of coronaviruses that might be lurking online. “It definitely means we should keep looking,” he said.
© 2021 The New York Times Company Read more: Yahoo Singapore »
Covid-19: Healthcare system still under pressure as hospitalised patients, ICU cases continue to rise, says MOH
Millions under virus lockdown as China battles Delta outbreakBEIJING — Millions of people were confined to their homes in China on Monday (Aug 2) as the country tried to contain its largest coronavirus outbreak in months with mass testing and travel curbs. ToniRadjali Wow the flu is back for the winter? Shocking
S Korea's factory activity extends growth despite supply, virus worriesSouth Korea's factory activity grew for a 10th straight month in July, driven by a solid expansion in production and new orders, though ...
Chinese cities test millions as virus cases surgeChinese cities rolled out mass testing of millions of people and imposed fresh travel restrictions as health authorities battled Sunday to contain the country's most widespread coronavirus outbreak in months.
'They Have My Sister': As Uyghurs Speak Out, China Targets Their FamiliesShe was a gifted agricultural scientist educated at prestigious universities in Shanghai and Tokyo. She said she wanted to help farmers in poor areas, like her hometown in Xinjiang, in western China . But because of her uncle’s activism for China ’s oppressed Muslim Uyghurs, her family and friends said, the Chinese state made her a security target. At first they took away her father. Then they pressed her to return home from Japan. Last year, at age 30, Mihriay Erkin, the scientist, died in Xinjia
Olympics-Equestrian-Britain's Townend takes lead, Germany's Jung falls backBritain's world number one Oliver Townend on Sunday retained his lead in the equestrian eventing, keeping a clean sheet on the cross-country ...
SoftBank-backed Indian insurance startup Policybazaar files for US$810 million IPOBENGALURU -SoftBank Group-backed online insurance aggregator Policybazaar has filed for an initial public offering of up to 60.18 billion rupees ...