FEB 15, 2017 9:24 AM PST

Coders Aim to Archive Climate Data

On Saturday, coders around the country joined a hackathon organized by DataRefuge and the Environmental Data and Governance Initiative, racing to archive NASA’s climate science data before it disappears from public view. The fear is legitimate: the Trump administration has begun removing certain information from public access.

Coders gather to archive climate data. Photo: Wired

The 200 coders met in Doe Library on the UC Berkeley campus, and similar hackathons with the same mission took place in more than twenty cities. The groups developed an efficient system in which hackers were split into two roles: the taggers and the baggers. The taggers were responsible for finding and marking the specific sites and data sets that needed to be archived, while the baggers were in charge of writing the code to download all the data into the Internet Archive, a digital library. “The process involves developing web-crawler scripts to trawl the internet, finding federal data and patching it together into coherent data sets,” writes Wired. This task is more difficult than one might imagine because there is essentially no consistency in the way government data has been presented on public sites in the last thirty years.
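
To give a rough sense of what a tagger-style script might do, here is a minimal Python sketch that scans one page's HTML and lists links that look like data files. It is not the code used at the hackathon. The names `DataLinkParser` and `find_data_links`, the `example.gov` address, and the list of file extensions are all assumptions made for illustration.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin

# Assumption: a few file types that often hold data; real sites vary widely.
DATA_EXTENSIONS = (".csv", ".nc", ".zip", ".json")


class DataLinkParser(HTMLParser):
    """Collects links on a page that appear to point at data files."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        # Keep only links whose path ends in one of the assumed extensions.
        if href and href.lower().endswith(DATA_EXTENSIONS):
            # Resolve relative links against the page's own address.
            self.links.append(urljoin(self.base_url, href))


def find_data_links(html, base_url):
    """Return absolute URLs of likely data files linked from the page."""
    parser = DataLinkParser(base_url)
    parser.feed(html)
    return parser.links


if __name__ == "__main__":
    page = '<a href="/data/co2.csv">CO2</a> <a href="about.html">About</a>'
    for link in find_data_links(page, "https://example.gov/climate/"):
        print(link)
```

A real crawler would also have to fetch pages, follow links, and cope with the inconsistent layouts described above, which is where most of the difficulty lies.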

Nevertheless, when the hackathon ended, the coders had successfully downloaded 8,404 NASA and DOE webpages to the Internet Archive, essentially all of NASA's climate data. They also developed “backdoors” to download 25 gigabytes from 101 public datasets, and were expecting even more to come in as scripts on some of the larger datasets finished running, reports Wired.

But that’s not all the hackers accomplished. Expecting that federal data will keep disappearing, the programmers are developing software to track changes to websites, so that we will know what we are losing and when. Engineers call this version control. The losses have already begun: the Global Data Center's reports and one of NASA's atmospheric carbon dioxide (CO2) data sets have been removed from the web.
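
The idea of tracking changes to a page can be sketched in a few lines of Python. This is only an illustration, not the software the programmers are building: it compares a saved copy of a page's text with a newer copy and, if they differ, reports which lines were added or removed. The function names `fingerprint` and `detect_change` are hypothetical.

```python
import difflib
import hashlib


def fingerprint(text):
    """Return a SHA-256 digest that changes whenever the text changes."""
    return hashlib.sha256(text.encode("utf-8")).hexdigest()


def detect_change(old_text, new_text):
    """Return a unified diff as a list of lines, or [] if nothing changed."""
    # Comparing digests is a cheap first check; in practice only the digest
    # of the previous snapshot might be stored alongside the archived copy.
    if fingerprint(old_text) == fingerprint(new_text):
        return []
    return list(
        difflib.unified_diff(
            old_text.splitlines(),
            new_text.splitlines(),
            fromfile="previous",
            tofile="current",
            lineterm="",
        )
    )


if __name__ == "__main__":
    before = "Climate overview\nCO2 data set\nContact"
    after = "Climate overview\nContact"
    for line in detect_change(before, after):
        print(line)
```

Run on a schedule against saved snapshots, a check like this would show when a data set or paragraph vanished from a page, which is the kind of record the article describes.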

"Climate change data is just the tip of the iceberg," Eric Kansa, an anthropologist who manages archaeological data archiving for the nonprofit group Open Context, told Wired. "There are a huge number of other data sets being threatened [that are rich] with cultural, historical, sociological information."

Sources: Wired, Live Science

About the Author
  • Kathryn is a curious world-traveller interested in the intersection between nature, culture, history, and people. She has worked for environmental education non-profits and is a Spanish/English interpreter.