Friday, 31 January 2014

Text mining and making a wordcloud...

As my first experiment with R in terms of doing anything useful, I want to make a word cloud.

I found a page which helps - HERE.

I need to install various packages including 'tm' and 'wordcloud'.

I should probably read this reference:  Introduction to the tm (text mining) Package.

I had to work out an error message:

  • "input string 33 is invalid UTF-8"
I did this by going back to Word and saving the file as UTF-8. 

This is important - text must be in UTF-8 format!

It does seem to have worked but it's quite slow and the word "and" had come up a lot which doesn't seem quite right. 

It also seems to have kind of crashed which is annoying!







No comments:

Post a Comment