D(h)ante: a New Set of Tools for XIII Century Italian

May 18, 2016 | Software

In this work we build upon the linguistic annotation work of Mirko Tavoni of Dante’s corpus to develop a Part of Speech Tagger (PoS) of XIII century Italian language.

The objective of the work is twofold:

to provide the NLP community with a tool to perform automatic processing of ancient text and
to provide the literature community with more powerful tools for simplifying the annotation process and performing more advanced data analysis.

In D(h)ante we provide the following tools (the code is open source):