[Taxacom] HHDB: hemihomonyms

Paul Kirk p.kirk at cabi.org
Mon Jan 18 04:12:06 CST 2010

If it's worth doing (All Genera Index) or needed 'people' should just do
it ...


-----Original Message-----
From: taxacom-bounces at mailman.nhm.ku.edu
[mailto:taxacom-bounces at mailman.nhm.ku.edu] On Behalf Of Roderic Page
Sent: 18 January 2010 10:02
To: Francisco Welter-Schultes
Cc: taxacom at mailman.nhm.ku.edu
Subject: Re: [Taxacom] HHDB: hemihomonyms

On 17 Jan 2010, at 23:20, Francisco Welter-Schultes wrote:
>> 2) BHL needs to code their titles (volumes) as botanical, zoological,

>> bacteriological, mixed, or not taxonomic.  This is not as big a task 
>> as it might seem as entire runs of journals are often exclusively 
>> botanical or zoological.
> As far as I can see, this would not be easy. Which specialists would 
> be paid for the indexing? I don't see any budget anywhere for that. We

> could do that in the AnimalBase group. But who would pay us?
> Alternative solution: metadata could be added to works by users. I 
> made likewise proposals concerning other metadata, but the BHL 
> associated libraries did not like that. They preferred a procedure by 
> which users would have an opportunity to make suggestions, which is 
> then forwarded to whom it concerns, so that the library which 
> digitized the work would modify the metadata.

The notion that users would "make suggestions" that the library may or
may not accept is doomed to fail. With all due respect to libraries,
their track record at handling metadata with the level of granularity
required by users (other than other libraries) is pretty poor. I have
spent more time than I care to remember dealing with poor quality
metadata provided by digital libraries (e.g.,

Given the magnitude of the task, and the fact that, fundamentally,
nobody cares as much about this stuff as taxonomists, it will have to be
the taxonomic community that steps up to this task. And the asking for
money thing is unlikely to get very far. If the first question is "who
will pay us?" then we're doomed from the start.

> Possibly they would agree in such a solution for user-created metadata

> in fields in which libraries are almost absolutely not skilled.
> Another problem in pre-1900 works is that OCR text recgnition is not 
> 100 % reliable. And if you observe an error rate of only 0.01 % in a 
> work
> - you
> can bet that these errors will concern exactly the taxonomic names.
> Try to search for Planorbis duryi in Wetherby 1879, Journal of the 
> Cincinnati Society of Natural History 2 (2), published in p. 99.
> http://library.cincymuseum.org/starweb/journals/servlet.starweb?path=j
> ournals/STARArchives.web
> English journal, good printing quality, but you will not find it, 
> incorrectly OCR'd.
>> While at it, the works might also be coded by discipline: entomology,

>> ornithology, malacology, etc.
> If you can explain a librarian the difference between malacology and 
> entomology... this will be a funny job...

Furthermore, this may have unintended consequences. For the terrestrial
biota it is not uncommon to have plant and animal names in the same
article, even if the journal is, say, zoological. In my efforts to
extract articles from BHL (http://biostor.org/ ) I've come across
numerous articles on insects that are full of the names of host plants.

>> 4) Finally, the BHL Advanced Search option will need to allow 
>> taxonomic searches to be restricted by Code, and whether unresolved 
>> names should be included or excluded.
> We would have a budget to implement such a function, AIT in Austria 
> could be encharged to do that. But I would predict that it would not 
> be perfect.
> Which of course, should not be a reason for not trying it out.

Deciding on which code a uBio name belongs to is non-trivial, but some
progress could be made using the "packageName" field (which records a
-- sometimes informal -- taxonomic group to which a name belongs). Of
course, uBio is incomplete, and if it lacks all hemihomonyms then
there's a good chance a name may be incorrectly restricted in scope.  
There are other sources which may help, such as linking BHL content to
names in nomenclators such as IPNI and ZooBank.

BHL content provides numerous challenges, but the notion that devolving
key tasks to librarians will help doesn't fill me with confidence.



> Francisco
> _______________________________________________
> Taxacom Mailing List
> Taxacom at mailman.nhm.ku.edu
> http://mailman.nhm.ku.edu/mailman/listinfo/taxacom
> The Taxacom archive going back to 1992 may be searched with either of 
> these methods:
> (1) http://taxacom.markmail.org
> Or (2) a Google search specified as:  site:mailman.nhm.ku.edu/ 
> pipermail/taxacom  your search terms here

Roderic Page
Professor of Taxonomy
Graham Kerr Building
University of Glasgow
Glasgow G12 8QQ, UK

Email: r.page at bio.gla.ac.uk
Tel: +44 141 330 4778
Fax: +44 141 330 2792
AIM: rodpage1962 at aim.com
Facebook: http://www.facebook.com/profile.php?id=1112517192
Twitter: http://twitter.com/rdmpage
Blog: http://iphylo.blogspot.com
Home page: http://taxonomy.zoology.gla.ac.uk/rod/rod.html


Taxacom Mailing List
Taxacom at mailman.nhm.ku.edu

The Taxacom archive going back to 1992 may be searched with either of
these methods:

(1) http://taxacom.markmail.org

Or (2) a Google search specified as:
site:mailman.nhm.ku.edu/pipermail/taxacom  your search terms here
The information contained in this e-mail and any files transmitted with it is confidential and is for the exclusive use of the intended recipient. If you are not the intended recipient please note that any distribution, copying or use of this communication or the information in it is prohibited. 

Whilst CAB International trading as CABI takes steps to prevent the transmission of viruses via e-mail, we cannot guarantee that any e-mail or attachment is free from computer viruses and you are strongly advised to undertake your own anti-virus precautions.

If you have received this communication in error, please notify us by e-mail at cabi at cabi.org or by telephone on +44 (0)1491 829199 and then delete the e-mail and any copies of it.

CABI is an International Organization recognised by the UK Government under Statutory Instrument 1982 No. 1071.


More information about the Taxacom mailing list