Text Cat 1.0

Text Cat guesses the language of a given text. The class reads data files containing rankings of the characters most likely to appear in texts of each supported language.

The text being analyzed is converted to Unicode to be compared with the language character ranking data.

The class returns an array of languages sorted by ranking. Text Cat currently supports the following languages: Arabic, Belarusian, Chinese, Czech, Danish, Dutch, English, Esperanto, French, German, Greek, Hebrew, Italian, Japanese, Russian, and Spanish.
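
The ranking comparison described above can be illustrated with a minimal sketch. It is not the class's actual code: the format of the .lm files is not documented here, so the profiles below are built from short in-memory training strings (hypothetical stand-ins for the data files), and the "out-of-place" rank distance is an assumed scoring method. The names rank_profile, out_of_place, guess_language, TRAINING, and PROFILES are invented for this example.

```python
from collections import Counter


def rank_profile(text, limit=300):
    """Map each character to its frequency rank (0 = most frequent)."""
    counts = Counter(ch for ch in text.lower() if not ch.isspace())
    # Sort by descending count, then by character, so ties are deterministic.
    ordered = sorted(counts, key=lambda ch: (-counts[ch], ch))[:limit]
    return {ch: rank for rank, ch in enumerate(ordered)}


def out_of_place(sample, reference):
    """Sum of rank differences between a sample and a reference profile.

    Characters missing from the reference pay a fixed penalty equal to
    the size of the reference profile (an assumption of this sketch).
    """
    penalty = len(reference)
    return sum(
        abs(rank - reference[ch]) if ch in reference else penalty
        for ch, rank in sample.items()
    )


def guess_language(text, profiles):
    """Return language names sorted from best match to worst."""
    sample = rank_profile(text)
    return sorted(profiles, key=lambda lang: out_of_place(sample, profiles[lang]))


# Hypothetical stand-ins for the .lm data files: tiny training strings.
TRAINING = {
    "english": "the quick brown fox jumps over the lazy dog and then the cat",
    "russian": "съешь же ещё этих мягких французских булок да выпей чаю",
    "greek": "ξεσκεπάζω την ψυχοφθόρα βδελυγμία και το πρωί πίνω καφέ",
}
PROFILES = {lang: rank_profile(text) for lang, text in TRAINING.items()}

if __name__ == "__main__":
    # Prints all three language names, best match first.
    print(guess_language("where is the dog", PROFILES))
```

Python strings are already Unicode, so the conversion step mentioned earlier is implicit here. With profiles this small the ranking is only indicative; realistic profiles would be built from much larger samples.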

License type: GPL - GNU General Public License
Date added: 8 years, 5 months, 28 days ago | Last updated: 8 years, 5 months, 28 days ago

Listing Files

text-cat
  • chinese.lm: 1.3 KB, 06/05/2006 08:42:18
  • french.lm: 1.4 KB, 06/05/2006 08:42:18
  • belarus.lm: 2.0 KB, 06/05/2006 08:42:18
  • dutch.lm: 1.4 KB, 06/05/2006 08:42:18
  • russian.lm: 1.6 KB, 06/05/2006 08:42:18
  • japanese.lm: 1.4 KB, 06/05/2006 08:42:18
  • german.lm: 1.4 KB, 06/05/2006 08:42:18
  • spanish.lm: 1.4 KB, 06/05/2006 08:42:18
  • greek.lm: 1.6 KB, 06/05/2006 08:42:18
  • arabic.lm: 1.6 KB, 06/05/2006 08:42:18
  • italian.lm: 1.4 KB, 06/05/2006 08:42:18
  • esperanto.lm: 1.3 KB, 06/05/2006 08:42:18
  • english.lm: 1.4 KB, 06/05/2006 08:42:18
  • danish.lm: 1.4 KB, 06/05/2006 08:42:18
  • hebrew.lm: 1.7 KB, 06/05/2006 08:42:18
  • czech.lm: 1.3 KB, 06/05/2006 08:42:18