FEB 15, 2017 09:24 AM PST

Coders Aim to Archive Climate Data

On Saturday coders around the country joined a hackathon organized by DataRefuge and the Environmental Data and Governance Initiative, racing to archive NASA’s climate science data before the country wakes up one day and it has miraculously disappeared. This is a legitimate fear, as the Trump administration has begun its mission of nipping and tucking certain information from public access.

Coders gather to archive climate data. Photo: Wired

The 200 coders met in Doe Library on the UC Berkeley Campus, but there were similar hackathon communities with the same intended mission in over twenty cities. The groups developed an efficient system in which hackers were split into two roles: the taggers and the baggers. The taggers were responsible for finding and marking the specific sites and data sets that needed to be archived, while the baggers were in charge of writing the code to download all the data into the Internet Archive, a digital library. “The process involves developing web-crawler scripts to trawl the internet, finding federal data and patching it together into coherent data sets,” writes Wired. This task is more difficult than one might imagine because there is essentially no consistency in the way government data has been presented on public sites in the last thirty years.

Nevertheless, when the hackathon ended the coders had successfully downloaded 8,404 NASA and DOE webpages onto the Internet Archive— essentially all of NASA's climate data. They also developed “backdoors” to download 25 gigabytes from 101 public datasets, and were expecting even more to come in as scripts on some of the larger datasets finished running, reports Wired.

But that’s not all the hackers accomplished. Figuring that this disappearing information will continue to be an ongoing crisis, the programmers are developing software that will help track the changes in websites, so that we will be aware of what we are losing and when. Engineers call this version control. For instance, the Global Data Center's reports and one of NASA's atmospheric carbon dioxide (CO2) data sets has already been removed from the web.

"Climate change data is just the tip of the iceberg," Eric Kansa, an anthropologist who manages archaeological data archiving for the nonprofit group Open Context, told Wired. "There are a huge number of other data sets being threatened [that are rich] with cultural, historical, sociological information."

Sources: Wired, Live Science




 

About the Author
  • Kathryn is a curious world-traveller interested in the intersection between nature, culture, history, and people. She has worked for environmental education non-profits and is a Spanish/English interpreter.
You May Also Like
DEC 30, 2018
Videos
DEC 30, 2018
Portland is generating electricity from city water pipes
The video above talks about a new technology for generating electricity: environmental-friendly water pipes. Portland, Oregon partnered with Lucid Energy t...
JAN 04, 2019
Earth & The Environment
JAN 04, 2019
It's time to pull in the big data
Scientists from the Florida Museum of National History have banded together to urge other scientists to take advantage of open-access big data to solve lon...
JAN 07, 2019
Earth & The Environment
JAN 07, 2019
California's coastal biodiversity is under threat
The west coast of the United States is a hotspot for biodiversity. Sea otters, harbor seals, shorebirds, fish and shellfish populate California’s ico...
JAN 30, 2019
Plants & Animals
JAN 30, 2019
New Study Supports Idea That Sonar Causes Mass Stranding Events in Whales
The last several decades have seen an explosive uptick in the number of mass stranding events (MSEs) involving certain types of whales, prompting researche...
JAN 31, 2019
Earth & The Environment
JAN 31, 2019
The link between climate change and congenital heart defects
New research published in the Journal of the American Heart Association details a frightening reality: climate change may likely increase the number of bab...
FEB 04, 2019
Genetics & Genomics
FEB 04, 2019
A Rapid New Method to Help Modern Crops Resist Disease
Scientists can now find genes in wild plants that can make modern crops more resistant to disease, without impacting yield or using pesticides....
Loading Comments...