A library for processing plain text corpora to various corpus formats. ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ In most cases, each NLP tool uses its own idiosyncratic input format. This library helps you to convert a corpus very easy to the desired format. It is called Forpus, because you are **for**\ matting a cor\ **pus**, but this is also a genus of parrot in the family Psittacidae. This library supports **conversions** to * `JSON `_ * `Document-term matrix `_ * Graph * `GEXF `_ * `GML `_ * `GraphML `_ * `Pajek `_ * `SparseGraph6 `_ * `YAML `_ * David Blei's `LDA-C `_ * Thorsten Joachims' `SVMlight `_ See `Getting Started `_ for how to install Forpus.