Text Corpora


German Corpora

Mannheimer Corpora A very large, growing, online German corpus archive (778 million words in August 2000). A copyright-free portion of the archive (379 million words in August 2000) is freely searchable. Invited guests have access to the whole archive. Partially tagged.

Project Gutenberg (German texts)

German newspapers -- tagged corpus with syntactic structure annotated.

German News: subscribe by sending an e-mail request to germnews@vm.gmd.de. Today's news in German