A linguistic investigation of the junk emails
Purpose of the project

The goal of this project is to investigate the linguistic features of junk emails and maybe to design a filter for junk emails based on linguistic information rather than on a "bag-of-words" approach.

People
Resources
Word clouds
Word cloud generated from the frequency list using the words which appear at least 5 times in the corpus.
Word cloud generated from the frequency list using the words which appear at least 5 times in the corpus after the stopwords were removed.
Both word clouds were produced using Wordle.
Papers
Other resources