Wordlists
English
From the Centre for Translation Studies, University of Leeds
Internet corpus (160 million tokens)
wordlist,
POS
frequencies,
lemmas.
Reuters corpus (100 million tokens)
wordlist,
POS
frequencies,
lemmas
Mike
Scott's page contains several English wordlists.
French
Stop list from Jean.Veronis@lpl.univ-aix.fr
Stop
lists and frequency lists for English, French and German.
From Patrice Bonhomme.