Towards a scientific workflow featuring Natural Language Processing for the digitisation of natural history collections Meise Botanic Garden
We describe an effective approach to automated text digitisation with respect to natural history specimen labels. These labels contain much useful data about the specimen including its collector, country of origin, and collection date. Our approach to automatically extracting these data takes the form of a pipeline. Recommendations are made for the pipeline's component parts based on state-of-the-art technologies.
Optical Character ...
Optical Character ...