[Taxacom] A new way to view taxonomic publications
agosti at amnh.org
Wed Jun 19 22:30:11 CDT 2013
Interesting development and great you go ahead with BIONAMES.
You mention that build the system based on NLM DTD, whilst it is actually taxpub JATS, that this a flavor of on the NLM DTD, a widely use DTD for publishing biomedical articles and especially used to archive the article that has been customized for taxonomy, and thus includes all the elements that are domain specific, such as scientific names or treatments. Citations, tables, figures are part of the DTD.
In the demo you skip one of the nice elements, the treatment element. And in my view this is a loss of quality, which you can easily see, when you start looking up one of the new descriptions, eg . Paedophryne dekot. If you look it up on the right hand in Biostor, you finally end up up at the article level and the treatment itself, which is the relevant information that I need, not the publication. Similar the pdf that you show in biostor has lost all the links that are in the original pdf.
I know, you are not interested in the treatment but names and publications. I as a taxonomist in the world you describe, a publication being a database, which happens when you convert it into XML, and even more so an XML that is domain specific, don't want to be stuck in the world or articles, just because we grew up with it. The goal is customized information, and that means treatment (in the context of a scientific publication). And with Zookeys at hand you have all to make this happen.
Finally, most of the functionality is already in the Zookeys article itself when you look at it in the html version instead of the pdf your link to (http://www.pensoft.net/journals/zookeys/article/1963/at-the-lower-size-limit-for-tetrapods-two-new-species-of-the-miniaturized-frog-genus-paedophryne-anura-microhylidae-) , and even more so in their species profiles http://ptp.pensoft.eu/external_details.php?type=1&query=Paedophryne . The interesting step then is to get this done for other journals and see, how it looks like. There, the production of clean OCR or text extraction from PDF, the semantic mark-up is not done, that is served on a silver platter by Pensoft, with a pretty overhead in the production of the article.
Then this needs to be compared to other similar sites, like Species-ID http://species-id.net/wiki/Paedophryne_dekot , that allows the crowd to edit and add content or Plazi, the treatment repository http://tinyurl.com/nku2rdd , which allows to get back to the treatment and not end up in the article.
So, the challenge has to be that Bionames does not loose granularity, and operates at the level we cite (not article but a page within an article, essentially linking to the treatment), and to show mechanisms to read in pdfs that have no XML, not text in fact, and make this fly, something you have demonstrated to find articles within the BHL body. This time just one level down.
From: taxacom-bounces at mailman.nhm.ku.edu [mailto:taxacom-bounces at mailman.nhm.ku.edu] On Behalf Of Roderic Page
Sent: Thursday, June 20, 2013 12:50 AM
To: taxacom at mailman.nhm.ku.edu com
Subject: [Taxacom] A new way to view taxonomic publications
I've developed a somewhat experimental viewer for articles in the journal ZooKeys which might be of interest. There is a blog post here http://iphylo.blogspot.co.uk/2013/06/a-new-way-to-view-taxonomic-publications.html
You can try a live example here: http://bionames.org/labs/zookeys-viewer/?doi=10.3897/zookeys.154.1963
This viewer is one of the motivations behind http://bionames.org I'm aiming for a platform where we can embed the taxonomic literature and have names and publications seamlessly linked together, enabling us to navigate through the primary taxonomic literature in a single place.
Professor of Taxonomy
Institute of Biodiversity, Animal Health and Comparative Medicine College of Medical, Veterinary and Life Sciences Graham Kerr Building University of Glasgow Glasgow G12 8QQ, UK
Email: r.page at bio.gla.ac.uk
Tel: +44 141 330 4778
Fax: +44 141 330 2792
Home page: http://taxonomy.zoology.gla.ac.uk/rod/rod.html
ORCID id: http://orcid.org/0000-0002-7101-9767
Taxacom Mailing List
Taxacom at mailman.nhm.ku.edu
The Taxacom Archive back to 1992 may be searched with either of these methods:
(1) by visiting http://taxacom.markmail.org
(2) a Google search specified as: site:mailman.nhm.ku.edu/pipermail/taxacom your search terms here
Celebrating 26 years of Taxacom in 2013.
More information about the Taxacom