The POSTDATA Project participated in DH’s Annual Conference, which was held virtually due to the COVID-19 pandemic.
On behalf of our team, Javier de la Rosa presented “PoetryLab: An Open Source Toolkit for the Analysis of Spanish Poetry Corpora”. The paper introduces PoetryLab, an extensible open source toolkit for syllabification, scansion (extraction of stress patterns), enjambment detection (identifying syntactic units split across two lines), rhyme detection, and historical named entity recognition for Spanish poetry. The toolkit achieves state-of-the-art performance in the tasks for which reproducible alternatives exist. The paper also covers the life cycle of the tools and the toolkit, which follows professional development practices: thorough testing, continuous integration, automatic Docker containerization, and continuous deployment.
PoetryLab is accessible at http://postdata.uned.es/poetrylab/
Presentation: http://dx.doi.org/10.17613/rsd8-we57