Speeding-up Text Classification In a GRID Computing Environment



The amount of texts available in digital form has dramatically increased, giving rise to the need of fast text classifiers. The tasks involved can be parallelized and distributed in a GRID environment. This paper reports a study conducted on Reuters-21578 corpus, using a SVM learning machine. The task of text categorization is distributed in several platforms. The results achieved are very promising for speeding-up text categorization tasks and are valid independently of the learning machine.


Text Classification, SVM, GRID Computing


Text mining; SVM;GRID

Related Project

GRID II - Global GRID for Data Mining with Soft Computing on Large Data Bases


ICMLA 2005, December 2005

Cited by

No citations found