Arabic Root Based Stemmer
Arabic Root Based Stemmer
compression, etc.
stemmer will fail to get the right root because these
foreign words have no root. This algorithm is based on
affix removal beside knowledge from structural Stemming is considered by a number of authors
linguistics. The implementation and evaluation of this as word Standardization [12]. A number of writers
algorithm shows a noticeable improvement in the thought that stemming is useful for improving
accuracy relative to previous algorithms. retrieval performance because it reduces variants of
the same root word to common concept, besides
Keywords: Arabic, Stemming, Root, Negative Suffix, reducing the size of the indexing structure because the
Negative Prefix, Light Stemming, NLP. number of distinct index terms is reduced [3]. Other
writers are not satisfied with the concept of using
1. INTRODUCTION stemming in IR and Text mining [3]. Accordingly
many search engines do not adopt stemming [3].
The Arabic language is the fifth most widely spoken Several common types of stemming strategies are
language in the world. It belongs to the Semitic family; so discussed by Frakes: affix removal, table lookup,
it differs from the Indo-European languages successor variety, and n-grams [7]. Affix removal
morphologically, semantically, and syntactically. The strategy tries to eliminate the prefixes and suffixes.
Arabic alphabet contains twenty-eight letters, always The most important part in this strategy is suffix
written from right to left in cursive form. Diacritical removal, since most variants of terms are generated by
marks (harakat) (tashkiil ) appear either above or suffixes.
below the letters, and play an essential role in many cases
in distinguishing semantically and phonetically between In Arabic language as with other natural
two identical words with the same characters, but with languages the stemmer may face the problem of a
different diacritics. Diacritical marks are used in holy negative prefix, where the prefix which eliminated is
books, poems, and children s literature; newspapers, part of the word and not really a prefix. If a stemmer
journals and other books for adults are usually printed tries to strip the " " which is a well known prefix
without diacritics, which means that many strings are from the following examples, the output will be
ambiguous. Most native Arabic words are derived from definitely wrong, e.g. " " Allah, " " Germany,
verbal roots. Arabized words, on the other hand, mainly " " Brigades, " " Albania, etc. It also includes
nouns borrowed from other languages with a slight other prefix such as " " And which represents a
phonetic adjustment to suit the Arabic pronunciation, have frequently used conjunction, e.g., stripping off " "
no roots [8]. And from " " honesty leads to a wrong stem.
All Arabic words belong to three main categories: The negative prefix problem in Arabic language
noun, verb or particle. Around 64% of Arabic words are stemmer is not restricted to the " " and " " prefixes,
derived from triliteral verbs (three consonants), but there but it also includes other prefixes such as " "," ",
are also biliteral verbs (two consonants), quadriliteral "" ﻠ ـ, "" ﻓ ﺎ ﻟ ـ, etc. The Arabic light stemming in this case
verbs (four consonants), and pentaliteral verbs (five for the term " " و ا ﻟ ـ ﻲGovernor will be wrong, if the
consonants). Naturally these verbs represent the roots for prefix " " و ا ﻟ ـstrip off from the term. Similarly the
which stemming algorithms typically search. This stems of the words " " ﻛ ﺎ ﻟ ﺢglum, " " ا ﷲAllah, " "
stemming process excludes words derived from nouns and successful, if we strip from them the prefixes "" ﻛ ﺎ ﻟ ـ,
particles[9]. "" ﻟ ﻠ ـ, " " ﻓ ﺎ ﻟ ـrespectively. Similarly Arabic stemmers
A morpheme is the smallest meaningful lingual unit
face another problem of a negative suffix, where the
which has a semantic interpretation in the grammar of a
suffix which has been eliminated is part of the word
language. There is a difference between stem and a root, a
and not really a suffix. If a stemmer tries to strip off
1/7
The 2006 International Arab Conference on Information Technology (ACIT'2006)
the " " ا نwhich is a well known suffix from the following doubled, and it look like the w shape. Strong verb is a
examples, the output will be definitely wrong, e.g. "" ﻟ ﻌ ﻤ ﺎ ن triliteral rooted verb’s which does not have any of the
To Amman, " " Japan, etc. Table 5 in the Appendix above three weak letters.
illustrates a number of examples.
"Some words do not have roots. For example
Table lookup is the simplest strategy among the four; the Arabic equivalents of " " ﻧ ﺤ ﻦwe, "" ﺑ ﻌ ﺪ
it simply looks for the root of the term in the lookup table. after, " " ﺗ ﺤ ﺖunder and so on. If the stemmer
The performance of this strategy is highly affected by the comes across any of these words, it does
number of words (terms) and their root in the table, as the nothing. "
lookup tables gets larger the performance get higher too.
Large lookup tables might need a considerable storage "Sometimes a root letter is deleted during
space. Successor variety is not straightforward as the derivation. This is especially true of roots
others, and depends on algorithms which is based on that have duplicate letters (e.g. the last two
structural linguistics and attempts to determine morpheme letters are the same), e.g., "َ " د ُﺟ ٍ ﺞget dressed,
boundaries. N-grams stemming searches for digrams, "َ " د َﻟ ﱠ ﻞdandle, "َ " ﺧ َﻠ ﱠ ﻞsouse, "َ " ﻋ َﻠ ﱠ ﻞexplained,
trigrams or more term successive letters. This strategy is a "َ " ﻗ َﻠ ﱠ ﻞreduced, " " ﺑ َﻠ َ ﻞwet, etc. The stemmer
term of clustering procedure not a stemming procedure. can detect this, and return the letter that was
removed. - If a root contains a hamza, this
The above two problems (negative prefix & negative hamza could change form during derivation,
suffix) of Arabic stemmers leads to a wrong grammatical e.g., " " talk, " " stand up, etc. The
root, so the accuracy of IR & Text mining systems which stemmer detects this, and returns the original
rely on these stemmers will be deteriorated. form of this hamza."
The two main problems of stemming have been L. S. Larkey and M. E. Connell [11] conducted a
described by Chris D. Paice [12]. In the first place, pairs good study based on a modified version of Shereen
of etymologically related words sometimes differ sharply Khoja stemmer. The modified version includes a few
in meaning [12] for example; consider " " ﺳ ﻞask, "" ﺳ ﻠ ﺐ changes to enhance the accuracy of the stemmer.
stole, and " " ﺳ ﻼ مPeace. In the second place, the These changes are summarized as follows:
transformations involved in adding and removing suffixes
involve numerous irregularities and special cases [12]. If a root were not found, the normalized form
Stemming errors are of two kinds: understemming errors, would be returned, rather than returning the
in which words which refer to the same concept are not original unmodified word.
reduced to the same stem, and overstemming errors, in List of place names are considered
which words are converted to the same stem even though "unbreakable" words exempt from stemming.
they refer to distinct concepts. In designing a stemming
algorithm there is a trade-off between these two kinds of In addition to the Arabic stop word list
error. included in the Khoja stemmer, a script was
to remove stop phrases.
A light stemmer plays safe in order to avoid
overstemming errors, but consequently leaves many A light stemmer used to strip off definite
understemming errors. A heavy stemmer boldly removes articles ( ﻓ ﺎ ﻟ ـ, ﻛ ﺎ ﻟ ـ, ﺑ ﺎ ﻟ ـ, و ا ﻟ ـ, ا ﻟ ـ, and ) وfrom
all sorts of endings, some of which are decidedly unsafe, the beginnings of normalized words and
and therefore commits many overstemming errors [12]. strips 10 suffixes from the ends of words ( ا ت,
ا ن, , ي, ة, ه, , , , and ) و ن.
Shereen Khoja addressed the problems that might
face the Arabic stemmer [9]: Table 5 in the appendix shows that light
stemming leads to wrong results if it carried out
"If the root contains a weak letter (e.g. " " أalif, "" و unconditionally, so we record our reservation on the
waw or " " يyaa), the form of this letter may change last step. Larkey, and Connell’s stemmer seem to be
during derivation. To deal with this, the stemmer must better than its parent (Khoja stemmer).
check to see if the weak letter is in the correct form. " If
not, the stemmer produces the correct form of this weak Morphology is a branch of linguistics that is
letter, which then gives the correct form of the root. If any concerned with studying of the internal structure of
triliteral rooted verb’s one of the three root letters contains word forms. Semitic languages have a complex
either " " أalif hamza (a), " " وwaw (w) or " " يyaa (y) then morphology and so the Arabic language is a complex
that is defined as a weak verb, e.g. " " gave, "َ" و َﺟ َ ﺪ language for stemming. Arabic stemmers have to deal
found, "َ " و َﺿ َ ﻊput, "َ " و َﻗ َ ﻒstood, "َ " و َﻋ َ ﺪpromised, "َ" ﺑَ ﺎ ع with affixes (prefixes, infixes, and suffixes), in
bought, " " ﺟ ﺎ ءcame, " " ﻗَ ﺮَ أread. Also weak verbs includes a addition to diacritic marks (harakat), in order to get
triliteral rooted verb’s where the second letter is doubled the right root with its appropriate diacritic marks on it.
with a ّ shadda, e.g. "َ " ﺷ َ ﻤّ ﺮprepared. Shadda (Germination Furthermore Arabic stemmer has to deal with
mark (tashdeed)) is written above the consonant that is
2/7
The 2006 International Arab Conference on Information Technology (ACIT'2006)
Arabized words (foreign words) which have no root, and T(i) be any term
in this case have to be excluded from stemming.
Let LenT(i) be the length of each
This study uses morphological patterns to obtain the term
trilateral and quadriliteral roots. The algorithm used
simply tries to extract the root, in case there is a match Let n be a number of terms within a
between pattern infix and word’s infix. document
Shereen Khoja is a pioneer in this field, but Let chr(i) be the character position
unfortunately we failed to get her original work entitled within a term
"Stemming Arabic Text" with her colleague Roger Let LenP(j) be the length of the
Garside. Leah S. Larkey and Margaret E. Connell and pattern
others headed a team at University of Massachusetts,
Amherst to conduct a number of studies which depends Let Infixes_String be a string
on Khoja work. Their work [10] [11] represent an generated manually, consisting of
improvement to Khoja work. Although their work include the pattern, and the affix of that
improvements to Khoja but it does not solve the problems pattern, e.g., the stem "" ﻣ ﺴ ﺎ ﺑ ﺢ
of negative prefix and negative suffix which discussed swimming pools, match with the
before. Al-Kharashi, I.A. et. Al. [2] presents pattern based pattern of "ﻞ ", so the
stemming for Arabic language, also Taghva K. et. Al. [13] Infixes_String in this case is the
used the same approach which is different from Khoja, string "" ﻣ ﺎ, where " " مlie in the first
with an equivalent performance. Pattern based stemming position, and " " اlie in the third
does not use root dictionary. This approach based on position.
Let T_String be the corresponding
matching the word with a number of Arabic patterns to
extract the root. Chen A. et. Al. [4] conducted a study to
string of the word which
find Arabic roots using Machine Translation (MT) based
corresponds the string of the pattern
Infixes_String, i. e., to clarify the
stemmer. Although this study depends on Ajeeb machine
translation system, stopword removing, clustering, light
idea suppose we want to find the
stemming, and morphological analysis, but it does not
root of the stem " " ﻣ ﺴ ﺎ ﺑ ﺢswimming
presents a solution to the problems of negative prefix and
pools, the system has to check this
negative suffix. Kareem Darwish [5] shows how to extract
word with all 5 characters patterns,
a root from the word, by first removing the prefix and
one of these patterns is " ", so the
Infixes_String in this case is "" ﺗ ﻲ
suffix of the word to get a stem, then match a stem to a
and the T_String is "" ﻣ ﺐ, the
number of templates to get the root. In this study the
researcher did not mention how many templates used in
mismatch is obvious in this case,
comparisons, beside the absence of an algorithm.
when matching the stem with the
pattern " " ﻣ ﻔ ﺎ ﻋ ﻞthe Infixes_String &
Darwish, K. et. Al. [6] used an approach which is similar
T_String will be "" ﻣ ﺎ.
to his previous one[5], but with more details about the
prefixes, and suffixes being removed. Table 6 shows the
Table 1 shows how to get Infixes_String for each
patterns used within our algorithm.
of the patterns used.
3/7
The 2006 International Arab Conference on Information Technology (ACIT'2006)
2.3 Remove diacritics depending on a list of Table 2. Trace of the manual extraction of the
diacritics characters correct root.
Original Normalized T_String Root Status
3. If LenT(i) ≥ 5 then Word T(i) T(i) (Stem) (T(i))
Remove initial definite article ( ﻟ ﻞ، ) ا ل
Else if LenT(i) ≥ 6 then
ْﻋِ ﻠ ﻢ Right
ا ن Wrong
Remove initial definite article ( ﺑ ﺎ ل، ﻓ ﺎ ل، ) ﻛ ﺎ ل
End if
ْﺛَ َﻤ ﺮ Right
َﻋَ ﻠِ ﻢ Right
4. If LenT(i) > 4 and the final character of the T(i) ا ﻹ ﺳﺘ ﺮ ﺣﺎ م ا ﺳﺘ ﺮ ﺣﺎ م ا ﺳﺘﺎ رَ ﺣِ ﻢ Right
like " " ا ءthen َﺗَ ﺮَ ك Right
Replace final " " ا ءwith "" ي َرَ ﺷَ ﺪ Right
End if ّﻣ ﺪ ّﻣ ﺪ - َﻣَ ﺪَ د Right
5. Replace initial ( إ, ), ( ) أwith bare alif ( ) ا
ن ا ن َﻣَ ﺰ Wrong
ﺗ ﺴﺎﺋﻠ ﻮا ﺗ ﺴﺎﺋﻠ ﻮا ﺗﺎ وا َﺳَ ﺄَ ل Right
6. Replace initial ( ) آwith bare alif ( ) ا اﻟ ﻤ ﺪا ر س ﻣ ﺪا ر س ﻣﺎ َدَ رَ س Right
7. Replace final ( ) ةwith ( ) ه
ي َﻛَ ﺮُ م Right
ﺑﺎﻟ ﻤ ﻜﺘﺒ ﺔ َﻛَ ﺘَ ﺐ Right
8. Replace final ( ) ىwith ( ) ي اﻟ ﻄﺎﺋ ﺮ ﻃﺎأ ر ا ْﻃَ ﺄ ر Wrong
ِﺟ Wrong
9. For i 1 to n do ِﻣُ ﺤ Wrong
9.1 If LenT(i) = 3 then
9.1.1 If T(i) ends with germination mark (tashdeed) Table 3 Accuracy of root extraction for three Arabic
( ّ ) then Root(T(i)) = chr(1)& chr(2)& chr(2) text files
Else Root(T(i)) = T(i)
Number of Number of Roots
End if Number of Words not
incorrect extracted
End if words Analyzed
Roots correctly
9.2 If LenT(i) ≥ 4 then
9.2.1 For j 1 to number of patterns of length =
147 3 (2%) 16 (10.8%) 130 (87.2%)
LenT(i) do
244 7 (2.8%) 24 (9.8%) 215 (87.4%)
Figure 1
3. EVALUATION Statistics for root extraction
In order to test the accuracy of our algorithm, we selected Table 4 shows the precision, recall and the
a number of words randomly. Table 2 shows the manual harmonic mean (F-measure). Here we used the
trace of the execution of the above algorithm to extract the precision, recall and F-measure as shown in the
root of the selected terms. following formulas:
Table 3 shows the strength and weakness of the Correct …………. (1)
Precision
above algorithm, using a small data set containing 1,827 Correct Incorrect
words. The system failed to analyze 55 words, since their Correct
Recall …………. (2)
patterns are unknown. This failure mostly due to foreign Correct UnA nalyzed
(Arabized) words. The system accepts to analyze the rest
of the (1,772 words), but we found that accuracy of 2 Precision Recall …………. (3)
F
extracting the right roots is 91%. Precision Recall
4/7
The 2006 International Arab Conference on Information Technology (ACIT'2006)
Table 4 shows that the system obtains about 92% solve these problems within our next enhancement to
overall precision for the analyzed words, note that words this work.
that doe not match any of the verbal and noun patterns
have been ignored as illustrated in table 6 from the
REFERENCES
computations of the accuracy measures, because these
words are foreign words.
Table 4. Accuracy of root extraction for three Arabic [1] Aljlayl. M, Frieder. O. "On Arabic Search:
text files Improving the Retrieval Effectiveness via a
Light Stemming Approach", CIKM 02,
F-
Number of Precision (Accuracy November 4-9, 2002, McLean, Virginia,
Recall measure
words of Analyzed word) USA. Pages 340 -- 347. ACM 1-58113-492-
4/02/0011.
147 0.9771 0.8889 0.9309
244 0.9682 0.8987 0.9322 [2] Al-Kharashi, I.A., & Al-Sughaiyer, I.A.
579 0.9652 0.9411 0.9530 (2002e). "Pattern-based Arabic stemmer". In
857 0.9682 0.9531 0.9606
Proceedings of the 2nd Saudi Technical
Conference and Exhibition (STCEX2002),
1827 0.9697 0.9204 0.9442 Volume II (pp. 238-244), Riyadh, Saudi
Arabia.
4. CONCLUSIONS [3] Baeza-Yates, R., & Ribeiro-Neto, Modern
In order to increase the accuracy of the system, and to Information Retrieval. Addison Wesley,
reduce the probability of facing the problems of negative 1999.
suffix and negative prefix, the system shall not remove the [4] Chen A. and Gey Fredic. 2002. "Building an
prefixes (" " ﻓ ـ، " " ب، " " ﻟ ـ، " " و، " )" ﻓ ـand suffix (" "). arabic stemmer for information retrieval". In
Furthermore the system uses a conditional removing, Proceedings of the Eleventh Text REtrieval
e.g., in case the term length is six or more the system will Conference (TREC 2002), National Institute
remove the following prefixes ("" و ا ل، " " ﺑ ﺎ ل، "" ﻛ ﺎ ل، ")" ﻓ ﺎ ل of Standards and Technology, November.
otherwise when the term length is less than six the term [5] Darwish K. 2002. "Building a shallow Arabic
will be unchanged. Morphological Analyzer in one day", In
As mentioned in Thabet [14] root-based algorithm proceedings of the ACL-02 workshop on
increases word ambiguity, where many word variants Computational approaches to semitic
have different meaning, and this will affect the accuracy languages, Association for Computational
of IR, Text mining, etc systems which rely on root based Linguistics , July.
stemmers. Table 5 presents a number of ambiguous cases, [6] Darwish, K. and D. Oard. "CLIR Experiments
one of these is the term " ", this can be interpreted by at Maryland for TREC 2002: Evidence
the reader as parents, religion, and debt, since this word is Combination for Arabic-English Retrieval".
bare of diacritics, and it is in its own, not within a In TREC. 2002. Gaithersburg, MD.
statement. As we said the diacritics used to distinguish the
words semantically and phonetically. [7] Frakes W. B., Introduction to Information
Storage and Retrieval Systems, chapter 1,
Arabic stemmers can be used to enhance the pages 1--12. Prentice-Hall, 1992.
efficiency of a number of systems such as, Spell checkers,
Information retrieval systems, Text mining systems, Text [8] Kanaan, G.; Al-Shalabi, R.; AL-Kabi, M.N.;
Analysis systems, Compression systems , etc. Jaam, J.M.; Hasnah, A.; . 2004. "New
Approach for Extracting
This algorithm is incapable of extracting Arabic roots Quadriliteral/Quadrilateral Arabic Roots ”, In
of some imperative verbs (" ") that is made up of proceedings of 1st International Conference
one Arabic letter with the fact that its root being of three on Information & Communication
letters (trilateral verbs), e.g., " "ِ " ﻋ ـ, with the root of Technologies: from Theory to Applications,
"" وﻋ ِ ﻲ. In addition, the problem of defective roots (weak ICTTA’04, (Damascus, Syria, April 2004).
roots) is still not solved by this algorithm. Defective roots IEEE-France.
are roots that contain vowels ("" ي، "" و، " )" أwhich are
classified as irregular roots, since some vowels in these [9] Khoja S., Research Interests, Pacific
roots are altered to other vowels or removed in the University, 2043 College Way, Forest Grove,
derivational process [1], e.g., " " ر ﻣ ﺎand " " ر ﻣ ﻲthese two Oregon 97116,
words have the same meaning throw, and both of them http://zeus.cs.pacificu.edu/shereen/research.h
represent the same root. As a future research, we hope to tm, July 8, 2006.
5/7
The 2006 International Arab Conference on Information Technology (ACIT'2006)
Appendix A:
Table 5: The problem of negative prefixes and
negative suffixes
Full Removing Full Removing Full word Removing Full word Removing
word the suffix word the suffix the suffix the suffix
اﻟﺒ ﺮ ﻛﺎ ت اﻟﺒ ﺮ ك ا ﻷ ﻣﺎ ن ا ﻷم ﺑﺎﻟ ﻌ ﻮ ن ﺑﺎﻟ ﻊ ا ﻷم
ا ﻹﻧ ﺴﺎ ن ا ﻹﻧ ﺲ اﻟﺒﺎﻟ ﻮ ن اﻟﺒﺎ ل اﻟﺘﺎ م
اﻟﺜ ﻮ را ت اﻟﺜ ﻮ ر ا ﻷ وا ن ا ﻷ و ﺑ ﻄ ﻮ ن ﺑ ﻂ ﺗ ﺤ ﺲ
اﻟ ﺠ ﻤﺎ ﻋﺎ ت اﻟ ﺠ ﻤﺎ ع ا ﻷ و ﻃﺎ ن ا ﻷ و ط ﺑﻠ ﻮ ن ﺑ ﻞ ﺣ ﻦ
اﻟ ﺤ ﻤ ﻼ ت اﻟ ﺤ ﻤ ﻞ ﺑ ﺮ ﻛﺎ ن ﺑ ﺮ ك اﻟﺘ ﻌﺎ و ن اﻟﺘ ﻌﺎ اﻟ ﺪ
اﻟ ﺪ و را ت اﻟ ﺪ و ر ا ﻟ ﺠِ ﻨ ﺎ ن ا ﻟ ﺠِ ﻦ اﻟ ﺤ ﺴ ﻮ ن اﻟ ﺤ ﺲ اﻟ ﺬ
د و ر ي اﻟ ﺤﻨﺎ ن اﻟ ﺤ ﻦ ﺣﻨ ﻮ ن ﺣ ﻦ ﺳ ﺞ
اﻟ ﺬا ت اﻟ ﺬ ﺧﻠ ﺠﺎ ن ﺧﻠ ﺞ اﻟ ﺴﺘ ﻮ ن اﻟ ﺴ ﺖ ﺳ ﻚ
اﻟ ﺴﻠ ﻄﺎ ت اﻟ ﺴﻠ ﻂ اﻟ ﺮ ي ﺳﻜ ﻮ ن ﺳ ﻚ ﺳﻨ ﺖ
اﻟ ﺴﻨ ﻮا ت اﻟ ﺴﻨ ﻮ ﺻﺎﺑ ﻮ ن ﺻﺎ ب ﺳ ﻦ
اﻟ ﻀ ﻤﺎ ن اﻟ ﻀ ﻢ اﻟ ﻌ ﻲ ع
اﻟ ﺸ ﺮ ﻛﺎ ت اﻟ ﺸ ﺮ ك ﻋ ﺠ ﻤﺎ ن ﻋ ﺠﻢ ﻗ ﺮ و ن ﻗ ﺮ ﻗ ﻮا ن
ﻃﺒﻘﺎ ت ﻃﺒ ﻖ ﻋﻨ ﻮا ن ﻋﻨ ﻮ ﻛﺎﻧ ﻮ ن ﻛﺎ ن ﻛﺪ
اﻟﻘ ﻮا ت اﻟﻘ ﻮ ﻟﺒﻨﺎ ن ﻟﺒ ﻦ ﻣ ﺮه ل
ﻟ ﺠﺄ ت ﻟ ﺞ ﻟ ﻌ ﻤﺎ ن ﻟﻌﻢ اﻟ ﻤﻠ ﻲ ﻣ ﺖ
ﻟ ﺬ وا ت ﻟﺬ و ﻟﻠﺒﻨﺎ ن ﻟﻠﺒ ﻦ ﻣ ﺪﻟ ﻞ
ﻣ ﺮ ﺟﺎ ن ﻣ ﺮ ج ﻣ ﺴ ﻚ
ﻟﻨ ﺰ ﻻ ت ﻟﻨ ﺰ ل اﻟ ﻤ ﻌﻠ ﻖ
ﻣ ﺪا ﺧ ﻼ ت ﻣ ﺪا ﺧ ﻞ ﻣ ﻀﻤ ﻮ ن ﻣ ﻀﻢ ﻣ ﻊ
اﻟﻨﻘﺎ ﺷﺎ ت اﻟﻨﻘﺎ ش ﻣ ﺴﻜ ﻮ ن ﻣ ﺴ ﻚ
و ذ را ت وذ ر 企 伎 企 ﻣﻔﺘ ﻮ ن ﻣﻔ ﺖ
6/7
The 2006 International Arab Conference on Information Technology (ACIT'2006)
ﻤﻔﺘ ﻌ ل ﻤﻔ ﻌﻴ ل ﺒﻔ ﻌﺎ ل ﻟﻔ
Length 6 patterns
ﺒﺘﻔ ﻌﻴ ل ﻟﺘﻔ ﻌﻴ ل ﻓ ﻌ
Length 7 patterns
ﻤ ﺴﺘ
ﻟﻔ ﻌ
Length 8 patterns
7/7