FEB 15, 2017 9:24 AM PST

Coders Aim to Archive Climate Data

On Saturday coders around the country joined a hackathon organized by DataRefuge and the Environmental Data and Governance Initiative, racing to archive NASA’s climate science data before the country wakes up one day and it has miraculously disappeared. This is a legitimate fear, as the Trump administration has begun its mission of nipping and tucking certain information from public access.

Coders gather to archive climate data. Photo: Wired

The 200 coders met in Doe Library on the UC Berkeley Campus, but there were similar hackathon communities with the same intended mission in over twenty cities. The groups developed an efficient system in which hackers were split into two roles: the taggers and the baggers. The taggers were responsible for finding and marking the specific sites and data sets that needed to be archived, while the baggers were in charge of writing the code to download all the data into the Internet Archive, a digital library. “The process involves developing web-crawler scripts to trawl the internet, finding federal data and patching it together into coherent data sets,” writes Wired. This task is more difficult than one might imagine because there is essentially no consistency in the way government data has been presented on public sites in the last thirty years.

Nevertheless, when the hackathon ended the coders had successfully downloaded 8,404 NASA and DOE webpages onto the Internet Archive— essentially all of NASA's climate data. They also developed “backdoors” to download 25 gigabytes from 101 public datasets, and were expecting even more to come in as scripts on some of the larger datasets finished running, reports Wired.

But that’s not all the hackers accomplished. Figuring that this disappearing information will continue to be an ongoing crisis, the programmers are developing software that will help track the changes in websites, so that we will be aware of what we are losing and when. Engineers call this version control. For instance, the Global Data Center's reports and one of NASA's atmospheric carbon dioxide (CO2) data sets has already been removed from the web.

"Climate change data is just the tip of the iceberg," Eric Kansa, an anthropologist who manages archaeological data archiving for the nonprofit group Open Context, told Wired. "There are a huge number of other data sets being threatened [that are rich] with cultural, historical, sociological information."

Sources: Wired, Live Science




 

About the Author
  • Kathryn is a curious world-traveller interested in the intersection between nature, culture, history, and people. She has worked for environmental education non-profits and is a Spanish/English interpreter.
You May Also Like
APR 20, 2020
Plants & Animals
APR 20, 2020
How Sand Cats Survive in the Harsh Desert Environment
The sand cat is a type of feline that spends nearly all its life in the desert. While they may not look that much differ ...
APR 30, 2020
Earth & The Environment
APR 30, 2020
New Study Reveals Amount of Microplastics on Seafloor
Microplastics—the often microscopic plastic particles resulting from the breakdown of large plastic items or mater ...
MAY 11, 2020
Plants & Animals
MAY 11, 2020
Ever Wonder How a Bee Ascends to the Rank of Queen?
Virtually every beehive sports its own queen bee, but there can be only one. Beneath her are hundreds or thousands of pe ...
MAY 14, 2020
Earth & The Environment
MAY 14, 2020
NASA's ICESat-2 Mission Reports Changes in Arctic Ice Thickness
Arctic sea ice is vital to Earth's climate system, and recent decades have seen troubling declines in sea ice due to ...
MAY 28, 2020
Chemistry & Physics
MAY 28, 2020
Smart sponge selectively absorbs oil
New research published in the journal Industrial Engineering and Chemical Research describes the development of a smart ...
JUN 26, 2020
Earth & The Environment
JUN 26, 2020
Another 2020 plague: locusts
2020 is determined to be a year to remember. On top of the global COVID-19 pandemic, the worst locust outbreak seen in o ...
Loading Comments...