Seeking participants for December hackathon!
iDigBio and Zooniverse’s Notes from Nature Project are pleased to invite you to participate in a hackathon to further enable public participation in online transcription of biodiversity specimen labels. The event will occur from December 16-20, 2013, at iDigBio in Gainesville, FL, though you may choose to participate in a subset of the days based upon the schedule. We are especially looking for participation from the most enthusiastic and committed citizen science transcribers! This is a great opportunity to have a direct influence on expanding this tool in the directions you would like to see it go.
The hackathon will produce new functionality and interoperability for Zooniverse’s Notes from Nature and similar transcription tools. There are four areas of development that will be progressively addressed throughout the week.
- Linking images registered to the iDigBio Cloud with transcription tools in order to alleviate storage issues. (Monday)
- Transcription QA/QC and the reconciliation of replicate transcriptions. (Remainder of week)
- Integration of OCR into the transcription workflow. (Remainder of week)
- New UI features and novel incentive approaches for public engagement. (Remainder of week)
There will be opportunities to narrow the focus in each category of activity in a teleconference tentatively scheduled for early in the week of November 25 (and also at the TDWG meeting and the iDigBio Summit, if you are attending either of those events).
If you are interested, please get in touch with Austin Mast (amast@bio.fsu.edu) by Wednesday, Nov 1. iDigBio has budgeted some funds to support travel costs.
With best regards,
Austin and Rob Guralnick (UC-Boulder), co-organizers
A new milestone!
The Notes from Nature team is proud to report reaching the new milestone of 300,000 transcriptions completed! This has been made possible by the generous and committed efforts of nearly 4,000 citizen scientists from around the globe. We look forward to continuing the project and sharing more biological collections with you in the near future. Thank you citizen scientists!
To continue growing and expanding, we are interested in your feedback. What excites you the most from Notes from Nature so far? How would you like to see it evolve? Leave a comment and let us know!
On the radio!
This morning I had the opportunity to join WTJU’s Robert Packard on the Soundboard program to talk about Notes from Nature. Click here to listen to the clip!
A Case for Conservation
Here’s an interesting article entitled “Vanishing act: Conservationists make the case for saving Albemarle County’s rare and threatened habitats” from the C-Ville Weekly, one of the local news sources in Charlottesville, VA. Have you found any specimen in Notes from Nature that come from habitats like the rock outcrop discussed in this article? Some of the specimen in the Mountain Lake Biological Station collection were even collected right in this area!
250,000 Transcriptions!
Since our launch several months ago, the Notes from Nature citizen science community has transcribed 250,000 specimen labels! This is an incredible achievement, and shows promise for where this project can go. We’re indebted to the citizen scientists out there who love this work and have taken it upon themselves to contribute to science in this way.
Some highlights:
- Over 3,500 citizen scientists from around the globe participating
- Over 8,800 plant specimens completed (completion requires at least three transcriptions to ensure quality through consensus)
- Over 16,000 insect specimens completed (same requirement as plants)
- Over 25 bird ledger pages completed – these are WAY more time intensive, and were only added days ago (same completion requirement as others)
We’ve learned a lot during this period, and are now in the process of figuring out where to go next, and how to involve bigger crowds of citizen scientists and more interesting collections from around the world. Our recent call for new collections has garnered interest from curators across the US and Europe, and we hope more will be in contact soon. It’s a very exciting time.
Thank you for all your support!
Viva la revolucion
Notes from Nature recently surpassed its 200,000th transcription! Given this milestone, it seems like a good opportunity for the Notes from Nature team to do two things: 1) We want to show a bit more where – geographically – we have filled in some data gaps; 2) We want to talk a bit more about the Bigger Picture. Where do these transcriptions go after they get done!? We have talked a lot about the scientific uses of these data, and individual projects, but there is a bigger mission and one the Museum world is grappling with right now — how to simultaneously live in an analog and digital world.
Before we talk more about the Big Push to digitize records and get them mobilized for the good of society, lets do something a bit more close to home. Below is snapshot of an intensity map which shows work done by transcribers state by state. We focus on the United States here simply because we have had good dropdown list for USA states and could therefore easily get this map made without too much muxing. We have gotten have gotten a lot of help from transcribers in other counties and you can see more about that in our previous post. You can explore the map in more detail: click here to see the map . We made this by simply tallying each record with a particular name of a state, and then linking those state names using a service provided by Google called Fusion Tables. California (with 64,346 transcriptions) and Florida (with 21,283) make up a lion share of the transcriptions, but there is a lot of effort in the Southeast and West as well. All things one might expect given the regional foci of CalBug and SERNEC. Surprising, North Dakota has 1,518 transcriptions completed and Minnesota 2,109! Go Upper Midwest!
All this work really does feed into a larger effort that is happening here in the United States and around the world to make museum data available for broad use. This isn’t just for scientists, but also for formal and informal science education and the broader public. Museum specimens are obviously of great value — they even tell us more than the who, what, where, when which serves as a basis for documenting trends in changes in distribution and seasonal and yearly timing events such as emergence from hibernation. Each specimen yields further secrets — whether it is DNA that can be extracted from the tissues, body size and relation to physiology, and so on. They also tell stories about landscapes and peoples in the past, and about our own histories. In this sense, natural history tie into the much larger picture of multiple cultures.
Up until recently, if you wanted to see this vast treasure trove of data, you had to get a special pass to enter the collections, and there under the watchful eyes of curators and collections managers, you could examine specimens. Museums have always been places where visitors are most welcome, but physically moving around specimens, and figuring out which collection had what remained a challenge. While access is critical, museum curators have to balance considerations related to the conservation of these precious objects.
In the last ten years, a revolution is unfolding and museums worldwide are digitizing their collections so that the contents can be discovered, searched, and used more effectively and by more people. This work is very challenging. Many folks involved in this endeavor have lamented that years of databasing and a lot of time and effort invested in building system to publish data and make them available… and still only 2-3% of the total number of records in museums (based on our best estimates) are digitally discoverable. We have to hope there is a way to make this whole process more efficient.
So at some point, CalBug and SERNEC will take the hard work done by transcribers and make those digital records available to everyone. You can see some of the progress that has already happened by checking out projects such as VertNet, GBIF, Map of Life and iDigBio. One of the goals of these projects is to bring together data from various sources in order to create a “one stop shop” for the discovery of biodiversity information.
In sum, the bigger story is that we are witnessing a revolution in how museums make their resources available. Thanks for taking part and viva la revolucion!
-Rob Guralnick
What happened to the transcription progress?
One of the questions we have been grappling with at Notes from Nature is how to add more specimen images to the application while still showing a clear path of overall transcription progress. On the one hand, we have many more specimen images lined up from both CalBug and SERNEC, and need to keep expanding the pool of interesting and scientifically important collections being transcribed. On the other hand, we don’t want Notes from Nature citizen science transcribers to become frustrated by a seemingly bottomless pool and confused by constantly increasing and decreasing progress bars. In attempting to address this challenge, we’re going to do some small tests. We’ve added some new specimen in recent days, and would like to hear what you think about these additions. Among the new additions, we have about 74,000 new bugs, including many bombardier beetles, dragonflies, and damselflies, as well as about 13,500 new plant specimen. Do you like that we’ve added these new specimen images? Were you worried by the drop in transcription percentages? Should we work to complete “missions” with smaller subsets before adding more content? Whatever the case, check out the new specimen on Notes from Nature!”
“What’s in bloom?”
Have you enjoyed contributing to scientific research by transcribing plant specimen labels in Notes from Nature? If you like this, you may also be interested in the UVA Mountain Lake Biological Station’s “What’s in bloom” volunteer, citizen science wildflower bloom monitoring project. You can find out details about it here: http://mlbs.org/whatsinbloom . This is another great opportunity to contribute to science, interact with researchers, and enjoy nature.
Tending Our Notes from Nature Garden
Sometimes in the shuffle of getting things done, we forget to explain the simplest things. For example, where do all these images come from? Are there more to do when these are done? What the heck is a CalBug or a SERNEC?
So lets answer some of these questions as best we can. As we mentioned in the “About” section of Notes from Nature, CalBug and SERNEC are both regional consortia of natural history collections — CalBug focused on western North American (predominately) insects and SERNEC on southeastern United States plant specimens.
Lets turn to the SERNEC records first. Right now the following herbaria (or single plant collection) are featured on the site: The R. K. Godfrey Herbarium at Florida State University, with 8,368 specimen images available and the Mountain Lake Biological Station Herbarium at the University of Virginia with 6,990 specimen images. Soon we plan to load a third collection of 13,511 images from the herbarium at the University of South Alabama. This represents a small proportion of the millions of specimens found in southeastern United States herbaria, so there is still a LOT of work to do here.
CalBug has about 230,000 images already taken,of which ~33,000 have been already made available via Notes from Nature, with another 28,000 to be added shortly. These mostly come from the Essig Entomology Museum at U.C. Berkeley but also from U.C. Riverside and the California Academy of Sciences. CalBug will also be adding more images in the future. The ones there now represent a select group of insect taxa including: bombardier beetles (genus = ‘Brachinus’ or genus = ‘Metrius’), cuckoo wasps (family = ‘Chrysididae’), odonates or dragon flies, (order = ‘Odonata’), skippers (family = ‘Hesperiidae’), and tiger beetles (genus = ‘Cicindela’ or genus = ‘Omus’ or genus =’Amblycheila’).