Location: PHPKode > scripts > ArWordTag > arwordtag/about.html
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
<html xmlns="http://www.w3.org/1999/xhtml">

<meta http-equiv="Content-Type" content="text/html;charset=utf-8" />
<link rel="stylesheet" type="text/css" href="style.css" media="all" />
<title>Tagging Arabic Text</title>


<div style="width: 80%; position:absolute; left:10%; top:0%; z-index:1">
<br />
<div class="tabArea" align="center">
  <a class="tab" href="example.php">Example</a>
  <a class="tab" href="about.html">About</a>

<div class="Paragraph">

<h2>Tagging Arabic Text</h2><br />

<div class="img-shadow">
<img src="images/thumb.jpg" alt="Khaled Al-Sham'aa" border="0" width="200" height="100" />

      <span class="FirstChar">T</span>his process of identifying names, places, dates, and other 
      noun words and phrases that establish the meaning of a body of text-is critical to software 
      systems that process large amounts of unstructured data coming from sources such as email, 
      document files, and the Web.<br /><br />

      Arabic words are classifies into three main classes,
      namely, verb, noun and particle. Verbs are sub classified into three subclasses (Past
      verbs, Present Verbs, etc.); nouns into forty six subclasses (e.g. Active participle,
      Passive participle, Exaggeration pattern, Adjectival noun, Adverbial noun, Infinitive
      noun, Common noun, Pronoun, Quantifier, etc.) and particles into twenty three subclasses
      (e.g. additional, resumption, Indefinite, Conditional, Conformational, Prohibition,
      Imperative, Optative, Reasonal, Dubious, etc.), and from these three main classes that
      the rest of the language is derived.<br /><br />

      The most important aspect of this system of describing Arabic is that all the subclasses
      of these three main classes inherit properties from the parent classes.<br />

        <span class="Quote">Arabic grammarians describe Arabic as being derived from three
        main categories: noun, verb and particle.</span>
      Arabic is very rich in categorising words, and contains classes for almost every form
      of word imaginable. For example, there are classes for nouns of instruments, nouns of
      place and time, nouns of activity and so on. If we tried to use all the subclasses
      described by Arabic grammarians, the size of the tagset would soon reach more than two
      or three hundred tags. For this reason, we have chosen only the main classes. But because
      of the way all the classes inherit from others, it would be quite simple to extend this
      tagset to include more subclasses.<br /><br />

<br />
          <script src="http://www.google-analytics.com/urchin.js" type="text/javascript">
          <script type="text/javascript">
          _uacct = "UA-1268287-1";
Return current item: ArWordTag