The Results Are In!
Probably the only thing that my $625 Data Mining I course through UCSD Extension was good for was the Discussion Board where fellow classmates offered their piece of mind about the class and valuable tips. One great lead was these poll results by KD Nuggets about the most used software tools in the world of data mining and big data.
The top 10 worth learning were:
- Rapid-I RapidMiner
- Weka / Pentaho
- StatSoft Statistica
- Rapid-I RapidAnalytics
- IBM SPSS Statistics
This survey backs up James Kobielus’s claim in his blog that “open-source communities are where much of the fresh action in data science is happening”, as many of the tools preferred by those in the survey are indeed open-source. That’s great news because I don’t have that much money.