What do we talk about on the Forum?
Posted: Sat Nov 21, 2020 2:42 am
I was bored and ran my web crawler on retirejapan.com, limited to this forum. Sorry, Ben!
I crawled 1514 unique pages, extracted all words from those pages, and collected their frequencies after removing so-called stop words (e.g. “the”, “a”, “I”, etc.) and lemmatizing the rest. This is what we are talking about:
![Image](https://i.ibb.co/gjMDyVR/wordcloud.png)
It is a word cloud generated from the 1000 most frequent words.
It is very far from perfect (e.g. many words that appear on every page of the forum, such as "post" and date-related words, are at the top of the list). Anyway, does anything interesting stand out for you? I noticed:
- nisa is more frequently found than ideco
- stocks more than bonds
- buy more than sell
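
For anyone curious how the counting step works, here is a minimal sketch. It is not the actual crawler code: the stop-word list is a tiny made-up sample, `crude_lemma` is a rough stand-in for a real lemmatizer (it only strips a plural "s"), and the two sample "pages" are invented for illustration.

```python
import re
from collections import Counter

# Assumption: a tiny illustrative stop-word list; real lists are much longer.
STOP_WORDS = {"the", "a", "i", "my", "is", "and", "to", "of", "in", "than", "more", "or"}


def crude_lemma(word):
    """Rough stand-in for lemmatization: only strips a trailing plural "s"."""
    if len(word) > 3 and word.endswith("s") and not word.endswith("ss"):
        return word[:-1]
    return word


def word_frequencies(pages, top_n=1000):
    """Count words across all pages, skipping stop words, and keep the top_n."""
    counts = Counter()
    for text in pages:
        # Lowercase the text and keep runs of ASCII letters as words.
        words = re.findall(r"[a-z]+", text.lower())
        counts.update(crude_lemma(w) for w in words if w not in STOP_WORDS)
    return counts.most_common(top_n)


# Hypothetical sample text standing in for the crawled forum pages.
pages = [
    "I buy stocks in my NISA.",
    "NISA or iDeCo? I buy more stocks than bonds.",
]
top = word_frequencies(pages)
```

`top` is a list of `(word, count)` pairs sorted by count, which is the kind of input a word cloud generator takes. Words that appear on every page (like "post" or month names) would need to be added to the stop-word list to keep them out of the result.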