oldWeather3 - U.S. Arctic logbooks

The Logbook of USGS Pioneer, as transcribed by oldWeather. (Video page).

Summary

oldWeather3, also known as “oldWeather Arctic”, or “oldWeather classic”, was a follow-on project to the original oldWeather. It took essentially the same approach, but used a different set of original documents - U.S. Government ships logbooks in the US National Archives, concentrating on voyages to the Arctic. (Sample image).

The project included 200,000 images, from only 18 different ships.

Costs and efficiency

The oldWeather3 website cost about £25,000 to set up (2018 £) - it was derived from the original oldweather site. The website running costs were borne by Zooniverse: These would have been substantial if the site had been run stand-alone, but were much reduced because many costs could be shared across Zooniverse’s whole stable of projects. In any event the running costs were not borne by the project, so they are ignored here.

The other major financial cost is project management: the project took probably about a fifth of one climate scientist’s time to support. Estimating their total cost (salary + overheads) at an arbitrary £70,000 per year gives a £85,000 management cost.

The 200,000 images each took around 10 minutes to transcribe, and each was done three times - that’s about 100,000 person-hours of work, about 62 person years at typical full-time rate (200 days/year, 8 hours/day). All this work was done by the project volunteers.

The amount of volunteer work put into the project is more than this, because the project has done much more than just transcribe the weather observations. A lot of effort has been put into history, community-building and project outreach, but this probably shouldn’t be counted as a transcription cost.


Date run 2012-2018
observations rescued 1,303,738 (so far, will increase to about 5.2 million)
Elapsed time (per ob.) 6 years (2.41 minutes - will reduce to about 0.6 minutes)
Financial cost (per ob.) £110,000 (£0.08 - will reduce to about £0.02)
Effort required (per ob.) 62 person years (4.6 person minutes - will reduce to about 1.1 person minutes)

How are these numbers estimated?.

We have not yet completed processing all the transcriptions into observations. When this is complete we will have perhaps 4 times as many observations from this project, and the costs per observation will reduce accordingly.

Lessons learned

  • In data rescue, some observations are more difficult than others. Older records are harder to read, more variable in format, more difficult to interpret. These records took more than 4 times as much effort to transcribe as simpler records from later periods.
  • Participation in this project was very different to that in original oldWeather only about 1/4 as many people participated at all (4,000 instead of 16,000) and the majority of the work in this case was done by a core group of 20 or so super-volunteers (in original oldWeather the work was much more distributed).
  • One problem with this project is that the barrier to participation was high (the minuimum contribution was one transcribed page, about 10 mins of work), so 19/20 potential participants gave up before finishing their first page.
  • In spite of these problems, the project was a success, and generated many of the same fringe benefits as original oldWeather. But it did take a long time.