I crawled 1514 unique pages, extracted all words from those paged and collected their frequencies after removing so-called stop words (e.g. “the”, “a”, “I”, etc) and lemmatization. This is what we are talking about:
It is a wordcloud generated for the 1000 most frequently found words
It is far very far from perfect (e.g. many words that can be found on each page of the forum such as "post" and date related words are at the top of list). Anyway, anything interesting standing out for you? I noticed:
- nisa is more frequently found than ideco
- stocks more than bonds
- buy more than sell