Averell

  • Version: 1.1.0

  • Status: PUBLISHED

  • Updated: Sep 18, 2020

Averell is a Python library and command-line interface that downloads corpora from ten multilingual poetry repositories, published in different formats, and standardizes them into a single representation. It can download an annotated corpus and reconcile its different TEI entities to produce unified JSON output at the desired granularity. The data stored under the JSON keys correspond to some of the data properties of the POSTDATA-core and POSTDATA-structural ontologies.
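
To illustrate what "unified JSON output at the desired granularity" can mean in practice, here is a minimal sketch in plain Python. It does not use Averell's API, and the key names (`poem_title`, `author`, `stanzas`, `stanza_number`, `lines`, `line_number`, `line_text`) and the helper `at_granularity` are hypothetical, chosen only for illustration; they are not Averell's documented schema.

```python
import json

# Hypothetical poem record. The key names are illustrative assumptions,
# not Averell's documented output schema.
poem = {
    "poem_title": "Example poem",
    "author": "Example author",
    "stanzas": [
        {
            "stanza_number": 1,
            "lines": [
                {"line_number": 1, "line_text": "First line"},
                {"line_number": 2, "line_text": "Second line"},
            ],
        },
        {
            "stanza_number": 2,
            "lines": [{"line_number": 3, "line_text": "Third line"}],
        },
    ],
}


def at_granularity(poem, granularity):
    """Return a list of records at 'poem', 'stanza', or 'line' granularity.

    Hypothetical helper: each record repeats the poem-level metadata so
    that it can be read on its own.
    """
    meta = {"poem_title": poem["poem_title"], "author": poem["author"]}
    if granularity == "poem":
        return [poem]
    if granularity == "stanza":
        return [{**meta, **stanza} for stanza in poem["stanzas"]]
    if granularity == "line":
        return [
            {**meta, "stanza_number": stanza["stanza_number"], **line}
            for stanza in poem["stanzas"]
            for line in stanza["lines"]
        ]
    raise ValueError(f"unknown granularity: {granularity!r}")


if __name__ == "__main__":
    # Serialize the line-level view as JSON.
    print(json.dumps(at_granularity(poem, "line"), indent=2))
```

Under these assumptions, the same nested record yields one entry per poem, per stanza, or per line, with the poem-level metadata repeated on each entry.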

The source code is fully available on GitHub: