This resource contains an openly available multilingual digitized version of thousands of documents describing natural languages of the world. The corpus is annotated with various meta, word, and text level attributes. More details about the data and annotations can be found in the reference given below.
There is also a password protected part of the corpus which can be found here.