Watan software
Watan
Added: October 07, 2013 | Visits: 307
The Arabic Corpus is composed of arabic texts for text categorization. The corpus Khaleej-2004 contains 5690 documents. It is divided to 4 topics (categories). The corpus Watan-2004 contains 20291 documents organized in 6 topics (categories).
Platforms: *nix
License: Freeware | Size: 13.76 MB | Download (32): Arabic Corpus Download |