Updated: Sep 18, 2020
Averell is a Python library and command line interface to download and to standardize corpora from ten multi-lingual poetry repositories, in different formats, into a single representation. It is able to download an annotated corpus and reconcile different TEI entities to provide a unified JSON output at the desired granularity. The data obtained in the JSON keys corresponds to some of the data properties of the POSTDATA-core and POSTDATA-structural ontologies.
The source code is fully available at Github: