Wordlists, ngrams ...


Wordlists

English

From the Centre for Translation Studies, University of Leeds
 Internet corpus (160 million tokens) wordlist, POS frequencies,  lemmas.
Reuters corpus (100 million tokens) wordlist, POS frequencies, lemmas

Mike Scott's page contains several English wordlists.

French Stop list from Jean.Veronis@lpl.univ-aix.fr

Stop lists and frequency lists for English, French and German. From Patrice Bonhomme.