[Taxacom] Life and Literature Code Challenge

David Campbell pleuronaia at gmail.com
Thu Sep 1 16:58:51 CDT 2011


One thing that might improve the number of scientific names recognized
in BHL would be the capacity to recognize and use the index to a
publication, if it exists.  The OCR seems much better at spotting
scientific names in an index than in text.  A suitable algorithm would
prompt a closer search of the page cited by the index for the name in
question.

User-friendliness would be helped by the capacity to search within a
set of results.  For example, I would like to be able to search for
publications that have "Auricularia" and "Mollusca", as almost all of
the thousands of hits for Auricularia are something else.   Original
publications of both names are problematical, so subsequent usage is
particularly important in this case.

-- 
Dr. David Campbell
Collections Assistant
The Paleontological Research Institution
1259 Trumansburg Road
Ithaca NY 14850




More information about the Taxacom mailing list