| Authors: | Ingo Feinerer, Kurt Hornik, David Meyer |
| Title: | Text Mining Infrastructure in R |
| Reference: | Vol. 25, Issue 5, Mar 2008. Submitted 2007-09-05, accepted 2008-02-10. |
| Type: | Article |
| Abstract: | During the last decade text mining has become a widely used discipline utilizing statistical and machine learning methods. We present the tm package which provides a framework for text mining applications within R. We give a survey on text mining facilities in R and explain how typical application tasks can be carried out using our framework. We present techniques for count-based analysis methods, text clustering, text classification and string kernels. |
| Paper: | Text Mining Infrastructure in R (application/pdf, 685.3 KB) |
| Supplements: | tm_0.3.tar.gz: R source package (application/x-gzip, 569.6 KB) |
| | v25i05.R: R example code from the paper (text/plain, 25.8 KB) |
| Resources: | BibTeX, OAI |
