Welcome to another edition of the Plants of Virginia expeditions – Pollinator Plants of Virginia II. In this project, we have assembled a range of species from predominantly animal-pollinated plant families including the Sunflower, Mint, Tomato, Blueberry, Carrot, Coffee and Apple families, all of which provide food for humans, too. Pollinator populations and their overall health have declined in recent decades. While much current research is necessarily focused on the health of non-native, domesticated honey-bees and agricultural productivity, thousands of other invertebrate pollinators such as bumble-bees, small solitary bees, butterflies and moths are in need of help, too. In order for researchers to find these small creatures in the wild to monitor their population sizes or to test them for diseases, they must first locate the food plants that are preferred by each pollinator and wait for their research subjects to appear. Many native pollinator species will consume the pollen or nectar of very few plant species; this very choosy feeding behavior is called oligolecty. It also means that these species can die out if their food plants disappear. By transcribing these herbarium records, you help us develop very fine scale maps of the plants’ locations and flowering times, which can be used by pollinator researchers to find their quarry.
Andrea Weeks, Director, Ted R. Bradley Herbarium, George Mason University, Fairfax, Virginia
Half a million? Five hundred thousand? 500K?
Either way it’s an impressive amount of effort from an amazing group of volunteers. Yesterday NfN reached another incredible milestone: 500,000 transcriptions have now been completed since we launched our second version of the platform. Let’s break this down a bit. If the average transcription takes 3 minutes, then we have spent 25,000 hours unlocking these important biodiversity resources.
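For anyone who wants to check the arithmetic, here is a tiny sketch. The 3-minute average is the assumption stated above, not a measured value, and the variable names are only illustrative.

```python
# Back-of-the-envelope check of the volunteer hours quoted above.
transcriptions = 500_000      # milestone count from the post
minutes_each = 3              # assumed average time per transcription

total_minutes = transcriptions * minutes_each
total_hours = total_minutes / 60

print(f"{total_hours:,.0f} hours")  # 25,000 hours
```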
The NfN team is thrilled with the progress that we have been making and, as many of you know, all our data is slowly making its way to open access data portals like SERNEC and iDigBio, among others. These data are already being utilized by researchers, conservation organizations and policy makers. We are also very interested in the benefits that our volunteers get from being involved. For example, we often hear from volunteers who tell us how NfN gives them a much-anticipated break from the stresses of their work or how NfN has encouraged them to get involved with one of their local museums. We also hear of volunteers who have taken some of the knowledge they have learned through the expeditions and gotten outside to experience biodiversity in their local area.
It’s been a wonderful journey and we are looking forward to many future milestones!
— The NfN Team
Enormous thanks to all who participated in the experimental Coreopsis phenology project! I was blown away by your dedication through this new challenge, and I can’t wait to start analyzing the data for trends in flowering and fruiting times. I am also excited to investigate how citizen scientists’ phenological determinations compare to one another and to those of “experts” on the species; perhaps citizen science could be a transformative method for specimen-based studies of phenology!
If you participated in this expedition, we especially want to hear your thoughts on how to improve phenology-related tasks in the future. Please comment or chat with us about what you liked or didn’t like, what helped you most and how you would suggest improving training and the overall experience.
Be on the lookout for new phenology expeditions in the near future, which will likely feature a new segment of plant diversity. Thanks again!
Katelin D. Pearson, Curator, R. K. Godfrey Herbarium (FSU)
The following is an updated FAQ that includes the topics covered in our previous Notes from Nature FAQ post.
We are most thankful to our dedicated volunteers, who not only made great suggestions to improve and clarify some important issues, but have also completed over 490,000 transcriptions, the majority of which are on the herbarium interface. These transcriptions are now being added to the SERNEC project portal on a regular basis. After that they may also be picked up by “aggregators” such as GBIF and iDigBio.
Note that this FAQ only covers issues related to the Herbarium interface. While this FAQ will cover the majority of Herbarium expeditions we always recommend reading the tutorial and help text when starting a new expedition.
1.) Interpretation: In general, you should minimize interpretation of open-ended fields and enter information verbatim. This way, we can better achieve consensus when checking multiple records against one another (see below, on that process). However, some discretion would be nice. Here are examples:
Interpretation that you should make: Simple spacing and capitalization errors (e.g. “3miN. of oakland” should be “3 mi N. of Oakland”).
Interpretation you should leave to us: Don’t interpret abbreviations (e.g. “Convict Lk.”); we’ll sort that out.
2.) Non-English text: While we are currently focused on English language labels, on occasion you may encounter labels in other languages. Transcribe these exactly as written (do not translate to English). Match label content to transcription fields as best as you can. There is a helpful list of common accent marks later in this document.
3.) Spelling mistakes: Transcribe exactly as written, unless you have looked it up and are absolutely certain of a simple spelling mistake. In this case, you can enter the correct spelling. When you make a correction, please use the Done&Talk button to add a comment describing the change; it’s also recommended that you provide a reliable web citation for the change if it’s anything other than a spelling correction of a common word. You can include #error or another relevant hashtag in your comment to flag the type of correction you made.
4.) Problem records: If you come across a problem record that may need to be addressed by a Researcher, or member of the project team, like a faulty image or other problem record, you can flag the record by commenting on it with #error or another relevant hashtag.
5.) Capitalization: Sometimes information may be in all capital letters on the labels. Unless this is an abbreviation, you should capitalize only the first letter of every word in your transcription (e.g. “COASTAL PLAIN PROVINCE” should be transcribed as “Coastal Plain Province”).
6.) Multiple/conflicting information: Some labels may have more than one instance of a piece of information, such as:
- Scientific names: For Herbarium specimens, transcribe only the most recent name. This can be determined based on the date that appears on the “annotation label”. If you do not see a date then enter the name that appears on the primary label. The “determination label” or later added determination information should have everything spelled out; however, this is not always the case. If the first letter is the same it is safe to assume the same genus is being used. Here is an example. In the case of the linked image, “D.” abbreviates “Dryopteris”, so you would enter “Dryopteris intermedia”.
- Collectors: In some cases, collectors may be listed on different lines of the label with no punctuation separating them. In your transcription, separate the names with commas. Transcribe the collector names as shown on the label, including honorifics (Mrs., Dr.). It isn’t uncommon for museums to have individual ways of entering collectors’ names so it is always best to review the help text for this specific field.
- Collector numbers: e.g. “123 & 4567”. This could indicate that each collector gave the specimen a different number in the field. This is an uncommon practice and even when it happens it usually doesn’t go on the same label, but if you find one it should be entered exactly as is.
- Dates or date/day ranges: You should enter the earliest date listed only. Multiple dates are uncommon on herbarium labels so in most expeditions we choose to collect only one date. It is worth noting the convention in other collecting disciplines is to take a range of dates (e.g. insects in CalBug) but it isn’t for herbarium specimens.
- Locations: If a specimen is cultivated at one location from cuttings/seeds/rhizomes collected at a different location, enter the place where the specimen was cultivated in the Location field and enter the place where the seeds were collected in the Habitat and Description field.
7.) Missing information: When information in a Herbarium field is not given on the specimen label, you should leave the field blank (in the case of text entry fields) or select “Unknown” or “Not Shown” in the drop-down lists. If information on the specimen label has been verified to be missing from a Herbarium field dropdown list, please advise with a Talk post. There are two well-known caveats for this:
- Dade county (Florida, United States) appears to be missing from the County list, but it is present and should be transcribed as “Miami-Dade” (it was renamed in 1997).
- Ivory Coast appears to be missing from the Country list, but it is present and should be transcribed as its French name “Cote d’Ivoire”.
8.) Inconsistent collector names: You may see several variations of the same collector name (e.g. “R. Kral” or “R.Kral”, “RWG” or “R.W.Garrison”) on different labels. We are asking for the collector name(s) to be transcribed as written on the label. This is a somewhat complicated issue, since collector names might appear to be very similar but don’t always refer to the same person. It can take a lot of knowledge about the collector and where they deposited specimens to be able to make a definitive decision.
Interpretation that you should make: Simple spacing or capitalization errors (e.g. “R.kral” should be “R. Kral”)
Interpretation you should leave to us: Don’t interpret abbreviations; we’ll sort that out (e.g. “RWG” should remain “RWG”).
9.) Scientific name: Provide the most recent name, whether it is a species name (a two-word combination of the genus and what is called the “specific epithet” in botanical nomenclature) or a one-word name that is at a higher taxonomic rank (e.g., just the genus or family name). Names at higher taxonomic ranks than species are used when a more precise identification has not been made. The name should typically take the form of a genus name that begins with a capital letter (genus) and a specific epithet that begins with a lowercase letter. If any of the names are given in all capitals, such as “CYPERUS ODORATUS”, the name should be entered using the typical convention, “Cyperus odoratus” in this case.
Varieties and subspecies: Record the variety or subspecies, but omit the scientific authors’ names. So “Cyperus odoratus var. squarrosus (Britton) Jones, Wipff & Carter” should be transcribed as “Cyperus odoratus var. squarrosus”. “Echinodorus cordifolius (Linnaeus) Grisebach ssp. cordifolius” should be transcribed as “Echinodorus cordifolius ssp. cordifolius”.
Be sure to reference #6 above for information related to annotation labels.
10.) Special Characters: What should you type when there is a special character in a text string, such as a degree symbol or language-specific characters? You can do an online search for the symbol or copy and paste it from your word processor’s symbols menu. Some commonly encountered symbols are included at the end of this document.
11.) County: If the county is not stated on the label, please find the appropriate county using an online search or other tools highlighted below. However, if there are multiple potential counties for a locality and it can’t be determined which is correct, please choose the Unknown County option from the County dropdown for U.S. locations; otherwise leave County blank.
12.) Splitting Location and Habitat: Often location and habitat terms will be mixed together, even being interleaved in the same sentence. Some simple guidelines when splitting them apart into separate fields to try to ensure consensus:
- Most times, general/non-specific locales are Habitat, and specific ones are Location, as only very rarely is a species found in the one place the specimen was obtained from (examples: “along road” would be Habitat as it describes the environment the plant grows in, but “along Smith Road” would be location as it describes the specific road where this specimen was found. “Bank of Smith River” would be split into “Bank” Habitat and “Smith River” Location.) In general, there is no need to repeat information in the two fields.
- Don’t introduce punctuation if possible, instead use what is there; sentences need not end with terminal punctuation (i.e., a period or exclamation point) if there is nothing after. There may be occasions when leaving it out would change the meaning of the text, in those cases it’s OK to make an addition.
- Drop unnecessary dangling non-terminal punctuation as needed. For example, “Dry roadside, east of Smithville” would result in “Dry roadside” Habitat, dropping the dangling comma as it doesn’t terminate a sentence properly, but “Dry roadside. East of Smithville.” would keep the period to “Dry roadside.” Habitat as it does terminate the sentence.
- Capitalize new sentences (as in the example above) caused by the split.
- Data that goes into Habitat/Description:
- Added information in later labels: occasionally in a later determination the scientist will add information about the specimen, i.e., its condition or maturity; this should be included after the primary label’s data (this also applies to other fields as well though it is far less likely to find additional info for them)
- Floodplain describes a habitat. This often occurs with a river name, so for “Mississippi River floodplain”, include all of the text in the Habitat field, since in this case it wouldn’t be accurate to just have “Mississippi River” in the locality field.
- Power lines: although they may help narrow a location, they say more about the habitat in which the plant grows, as power line corridors are usually cleared of larger shrubs and trees.
- “n=” followed by a number; this is the number of chromosomes.
- Elevation/Altitude information should be entered into the Location field, if there isn’t a separate field for Elevation. Enter elevation verbatim in the units stated on the label.
- Data that goes into Location:
- Latitude and Longitude: Enter exactly as written. See special characters below for how to generate the degree symbol ° (or you can copy it right from here).
- Public Land Survey System: This is the T (township), R (range) and S (section) data used to establish location. For example, SW1/4 NW1/4 S13, T1SR20E refers to the southwest quarter of the northwest quarter of Section 13 of Township 1 South, Range 20 East. Quarter sections “1/4” should be written as 3 characters, not one (¼).
- Provinces: Geographic provinces (e.g., Coastal Plain, Piedmont) go into the Location field but administrative provinces of countries (e.g., Alberta in Canada) go in the State/Province field.
14.) Information to Omit/Skip: The following data should not be transcribed (unfortunately, for the sake of consensus, even if you want to). However if you do find something interesting, feel free to use Done&Talk to post a comment about it.
- Synonyms listed adjacent to the primary determination (example: for “Cyperus echinatus [=C. ovularis]” only transcribe “Cyperus echinatus”)
- Common names of species; as many species have multiple common names, some of which are only locally used.
- Information printed into the label/template and not added by the collector, unless it both isn’t present in the data the collector added, and would be transcribed if it was (for example, a “Plants of Florida” label title wouldn’t be transcribed as the data would already indicate Florida in the State field, but “Plants of Fort Smith” title should be entered as “Fort Smith” in Location if this wasn’t present elsewhere).
- Information already entered into one of the dropdown fields. For example, if the label indicates “collected in Smithville, Jones Co.”, then because county “Jones” will already be chosen in the dropdown, “Jones Co.” shouldn’t also be transcribed into the Location text as this would be redundant. However if it has “found in northern Jones Co.” then this should be transcribed verbatim into Location as well, as it is new information and would be meaningless if “Jones Co.” was removed.
- “Collected as part of a survey…” and similar “This specimen was examined as part of a study of…” entries, as it is part of a series of information that relates to annotations of the specimens and is not considered to be core information that we are trying to collect.
- “Sheet # of #” entries or other information indicating that this specimen is part of a set
- Hyphens that break a word across two lines. For example “speci-” at the end of a line and “men” at the beginning of the next line would be transcribed as “specimen” without the hyphen.
- Personal comments by the collector that do not relate to the specimen.
15.) “s.n.” as the collector number: this stands for the Latin sine numero, meaning “without number”. In this case you should enter “s.n.” in the Collector Number field.
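For volunteers who like to see rules written out mechanically, here is a minimal Python sketch of a few of the purely formal guidelines above: all-caps text (5 and 9), dropping author names from varieties and subspecies (9), spelling out quarter sections, and rejoining words hyphenated across lines (14). It is only an illustration. The function names are made up for this example, none of this is part of the Notes from Nature platform, and every rule still needs human judgment (abbreviations, genuine hyphens, unusual name formats).

```python
import string

# Rank markers that introduce a variety or subspecies epithet.
# Assumption: these three spellings cover the labels being illustrated.
RANK_MARKERS = {"var.", "ssp.", "subsp."}


def title_case_label(text):
    """All-caps label text -> first letter of every word capitalized."""
    # Leave mixed-case text alone; it was not written in all capitals.
    return string.capwords(text) if text.isupper() else text


def sentence_case_name(name):
    """All-caps scientific name -> 'Genus epithet' convention."""
    return name.capitalize() if name.isupper() else name


def strip_authors(name):
    """Keep genus, epithet and any var./ssp. epithet; drop author names.

    Assumes a species-level name whose first two words are the genus
    and the specific epithet.
    """
    tokens = name.split()
    kept = tokens[:2]
    for i in range(2, len(tokens) - 1):
        if tokens[i] in RANK_MARKERS:
            kept += [tokens[i], tokens[i + 1]]
            break
    return " ".join(kept)


def spell_out_quarters(text):
    """Write the single-character fraction as the 3 characters '1/4'."""
    return text.replace("¼", "1/4")


def join_hyphenated_lines(lines):
    """Join label lines, removing a hyphen that only breaks a word."""
    text = ""
    for line in lines:
        line = line.strip()
        if text.endswith("-"):
            text = text[:-1] + line   # "speci-" + "men" -> "specimen"
        elif text:
            text += " " + line
        else:
            text = line
    return text


print(title_case_label("COASTAL PLAIN PROVINCE"))
print(sentence_case_name("CYPERUS ODORATUS"))
print(strip_authors("Echinodorus cordifolius (Linnaeus) Grisebach ssp. cordifolius"))
print(spell_out_quarters("SW¼ NW¼ S13"))
print(join_hyphenated_lines(["Only one speci-", "men seen"]))
```

Note that the last helper would also remove a hyphen that really belongs to a compound word split at the end of a line, which is exactly the kind of case where a person reading the label does better than a script.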
Some Useful Tools (discovered or developed by Notes From Nature users)
Counties and Cities: Good tools for finding counties etc. are the lists on Wikipedia of municipalities in each state of the U.S.A. (there are also similar lists for other countries). For example, https://en.wikipedia.org/wiki/List_of_municipalities_in_Florida (via the linkbox you can also change the state).
Uncertain Localities: Geographic Names Information System, U.S. Geological Survey.
For locations outside the U.S.: Geonames.org http://www.geonames.org/
Mapping tool with topo quads: To find uncertain counties or localities http://mapper.acme.com
Collector Names: Harvard University Herbarium maintains a database of collectors (http://kiki.huh.harvard.edu/databases/botanist_index.html). Note that many collectors that are encountered may not be in this database.
Hard-to-read text: Use “Sheen”, the visual webpage filter, for some hard-to-read handwriting written in pencil. (Tip was from the War Diary Zooniverse project) https://chrome.google.com/webstore/detail/sheen/mopkplcglehjfbedbngcglkmajhflnjk?hl=en-GB
Special symbols: You should be able to find symbols in Word or by doing an online search and copying and pasting. Here are a few:
– degree symbol for coordinates: °
– plus minus: ±
– fractions: ⅛ ¼ ⅓ ⅜ ½ ⅝ ⅔ ¾ ⅞
– non-English symbols: Ä ä å Å ð ë ğ Ñ ñ õ Ö ö Ü ü Ž ž
Other symbols may be found on Penn State’s Symbol Codes: Accents, Symbols and Foreign Scripts page: http://sites.psu.edu/symbolcodes/codehtml/
ClipX: Freeware Windows clipboard enhancer that saves the last 1,024 items copied to the clipboard and allows them to be pasted through its icon in the system tray. Nothing short of a lifesaver for Ornithology but quite helpful in Herbarium too: http://clipx.en.softonic.com/
The Plant List: Search for scientific names of plants – http://www.theplantlist.org/
Integrated Taxonomic Information System (ITIS): Along with The Plant List, another recognized resource for plant scientific names (as well as animals, fungi, bacteria and more) http://www.itis.gov/
Dates: If all parts of the date are written with numerals and it’s unclear which part is the day and which is the month (for example, 2-4-91) https://en.wikipedia.org/wiki/Date_format_by_country identifies which date format (day-month-year or month-day-year) is commonly used in each country.
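To see why an all-numeric date is ambiguous, here is a small Python sketch that reads the same string both ways. The function name is hypothetical, and it relies on Python's own convention for two-digit years (91 is read as 1991), which is an assumption that can be wrong for older labels where “91” may mean 1891.

```python
from datetime import datetime


def possible_dates(label_date):
    """Return every distinct date a numeric label date could mean."""
    readings = set()
    # Try day-month-year first, then month-day-year.
    for fmt in ("%d-%m-%y", "%m-%d-%y"):
        try:
            parsed = datetime.strptime(label_date, fmt)
        except ValueError:
            continue  # this reading is impossible, e.g. month 25
        readings.add(parsed.date().isoformat())
    return sorted(readings)


print(possible_dates("2-4-91"))   # ['1991-02-04', '1991-04-02']
print(possible_dates("25-4-91"))  # ['1991-04-25']
```

When two readings come back, the country of collection is usually what settles the question, which is where the Wikipedia table linked above helps.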
Mr Kevvy has generated a very useful set of custom dictionaries. They can be found here:
These dictionaries are a wonderful resource. It should be noted that scientific names can have gender-based differences. You will see the specific epithet (commonly called the “species name”) spelled in masculine and feminine forms to match the gender of the genus. For example, albiflora is feminine and albiflorus is masculine. The Carolina-poppy is Argemone albiflora (not albiflorus). Both albiflora and albiflorus are correctly spelled, but in this case albiflorus should never be used with the genus Argemone.
In this expedition, we are partnering with The Ohio State University herbarium, covering global nitrogen-fixing diversity, especially Andean and Patagonian species, to help unlock biodiversity data in plants with nitrogen-fixing symbioses. This will help us understand the symbiosis from the genetic level to ecology. All the specimens you are helping to transcribe will also be used to generate genomic data, in order to help us further understand the underlying basis of this symbiosis. The label data are also important for helping us understand how the environment and geography have shaped this symbiosis. Your contributions will help us build one of the largest biodiversity projects yet attempted to understand the origin of this globally important plant trait.
We are excited to announce that a new paper about WeDigBio was published today in the journal BioScience. As a reminder WeDigBio stands for Worldwide Engagement for Digitizing Biocollections. It is a global event that focuses on digitizing of natural history museum specimens, which is something we care very deeply about. It’s an event we look forward to every year at Notes from Nature.
The following Press Release was generated by Kristin Friedrich at the Natural History Museum of Los Angeles County. Please note that the publication itself is Open Access, so anyone should be able to download and read it.
Worldwide Engagement for Digitizing Biocollections: WeDigBio
The future of digitizing museum collections
Los Angeles, CA, January 17, 2018 — In an effort to make biological collections more accessible for researchers and the public, many natural history museums are prioritizing the digitization of their collections. The digitization process involves making information about a specimen available on an accessible database — things like when and where it was collected, the species name, and sometimes a photo or 3D image of the specimen or object.
These collections are a record of biodiversity over time. They provide data that can be mined to investigate climate and ecological change, inform conservation efforts, understand population genetics and evolution, and inform education and policy decisions. But they can’t be used if people can’t access them. The more institutions digitize their extensive collections, the better.
“Adding digital data to analog specimens is a critical step in mobilizing museum collections for use in timely research, education and policy,” said Dr. Libby Ellwood, research fellow at the La Brea Tar Pits and Museum.
To that end, from October 22-25, 2015, 21 science institutions held the first global citizen-science event focused on the digitization of biodiversity specimens. The sites — including the Natural History Museum of Los Angeles County (NHMLA, which includes the La Brea Tar Pits and Museum), Florida State University, Smithsonian’s National Natural History Museum, the Field Museum, the Australian Museum, the Florida Museum of Natural History, and many others — hosted events in which members of the community came behind the scenes into museum collections to transcribe specimen labels and enter the information online on platforms like Notes from Nature, DigiVol, Smithsonian Transcription Center, Les Herbonautes, and Symbiota. Others participated remotely by logging onto these online platforms to transcribe labels and enter data. During this Worldwide Engagement for Digitizing Biocollections (WeDigBio.org), thousands of community scientists around the world completed over 50,000 digitization tasks.
Today, in their evaluation of the programs, researchers report in BioScience that participants stayed engaged long after the initial event was over. Since these online platforms can be accessed anytime from anywhere, this heightened engagement provided ongoing assistance to the massive task of collection digitization.
“WeDigBio provides museums and natural history collections the opportunity to engage with local communities and the online public while providing enriching and enjoyable experiences for participants,” said Ellwood.
While many museums around the world are currently pushing to digitize as much of their collections as possible, with over a billion specimens housed at museums around the world, the magnitude of this task presents a significant hurdle for museum staff. It is common for collections to house hundreds of thousands or even millions of specimens. Depending on the type of organism or object, it can be stored in a variety of ways — suspended in alcohol in a jar, lying flat in a drawer, or hanging in special climate-controlled storage. The digitization of 2-dimensional objects is the most straightforward, as they can be scanned with relative ease. But 3-D objects can be especially challenging. Labels that contain all the information about an item — some of which were penned over 100 years ago — are sometimes difficult to read, or in the case of wet specimens in jars, curled up inside a vial within a large container of, say, 100 crabs.
“To digitize these items, someone has to physically pick up, remove the labels and unfurl them, then read and transcribe the information inside, so automation or assembly line systems are nearly impossible to implement,” says Dr. Regina Wetzer, Associate Curator of the NHMLA’s Marine Biodiversity Center.
This is a perfect job for a broad, diverse community of enthusiastic people, also known as community or citizen scientists. Indeed, it is becoming increasingly clear to many institutions that the most feasible way to chip away at these enormous digitization projects is to involve the public. This is a symbiotic arrangement: museums receive much-needed assistance, and members of the community get rare behind-the-scenes access to these science and cultural institutions and enjoy the opportunity to learn more about science, nature, and culture.
“In NHMLA’s project, digitizing labels from about a thousand big crabs, we were really delighted at the level of enthusiasm and commitment that the participants contributed,” said Dean Pentcheff, an NHMLA Project Coordinator.
The next WeDigBio event is scheduled for October 2018.
“Since 2015, WeDigBio has grown and expanded to include new museum-based projects, participants in new countries, and even new transcription platforms,” said Ellwood. “We’re already looking forward to our October 2018 event and hope you’ll join us!”
About the Natural History Museum of Los Angeles County
The Natural History Museum of Los Angeles County has amassed one of the world’s most extensive and valuable collections of natural and cultural history—with more than 35 million objects, some as old as 4.5 billion years. The Natural History Family of Museums includes the NHMLA, the La Brea Tar Pits Museum (Hancock Park/Mid-Wilshire), and the William S. Hart Park and Museum (Newhall, California). The Family of Museums serves more than one million families and visitors annually, and is a national leader in research, exhibitions and education. Visit nhm.org.