[Taxacom] A new way to view taxonomic publications
stephen_thorpe at yahoo.co.nz
Fri Jun 21 23:12:17 CDT 2013
It's called having standards, Rod!!
From: Roderic Page <r.page at bio.gla.ac.uk>
To: Donat Agosti <agosti at amnh.org>
Cc: "<taxacom at mailman.nhm.ku.edu>" <taxacom at mailman.nhm.ku.edu>; David.King <David.King at open.ac.uk>
Sent: Saturday, 22 June 2013 3:57 PM
Subject: Re: [Taxacom] A new way to view taxonomic publications
Sent from my iPhone
On 22 Jun 2013, at 03:29, Donat Agosti <agosti at amnh.org> wrote:
> For my purpose I want to have a OCR accuracy rate between 99.9 and 99.99%
So this is the crux of the problem. You set a very high bar that BHL will struggle to meet in a lot of cases. This then sets limits on what you can achieve.
An alternative is to accept that things will be messier than that, and set your expectations appropriately. Plus we can think about ways to cope with messy text. It strikes me that there is a misplaced obsession with "clean" data that gets in the way of making progress. You want the world to be one way, but it's the other way.
Taxacom Mailing List
Taxacom at mailman.nhm.ku.edu
The Taxacom Archive back to 1992 may be searched with either of these methods:
(1) by visiting http://taxacom.markmail.org/
(2) a Google search specified as: site:mailman.nhm.ku.edu/pipermail/taxacom your search terms here
Celebrating 26 years of Taxacom in 2013.
More information about the Taxacom