Arabic verb pattern extraction

E. M. Saad, M. H. Awadalla, A. Alajmi

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Citations (Scopus)

Abstract

Arabic is a highly inflected language, and therefore the processes of stemming and root extracting represent a challenge to researches. A new method is presented for extracting Arabic text stem, and lemma. Stemming sometimes affects the semantic of a word, where as lemma preserve the meaning of a word. The approach is based on pattern extraction. It uses a special encoding based on dividing letters into original and non-original letters. Codes are automatically generated for each pattern and then match against input text to extract root, pattern, and lemma of a word. A comparison with other methods reveals a promising result with accuracy up to 96%.

Original languageEnglish
Title of host publication10th International Conference on Information Sciences, Signal Processing and their Applications, ISSPA 2010
Pages642-645
Number of pages4
DOIs
Publication statusPublished - 2010
Event10th International Conference on Information Sciences, Signal Processing and their Applications, ISSPA 2010 - Kuala Lumpur, Malaysia
Duration: May 10 2010May 13 2010

Other

Other10th International Conference on Information Sciences, Signal Processing and their Applications, ISSPA 2010
CountryMalaysia
CityKuala Lumpur
Period5/10/105/13/10

Keywords

  • Morphological analyzer
  • Natural language processing
  • Root extraction

ASJC Scopus subject areas

  • Computer Science Applications
  • Information Systems
  • Signal Processing

Fingerprint Dive into the research topics of 'Arabic verb pattern extraction'. Together they form a unique fingerprint.

  • Cite this

    Saad, E. M., Awadalla, M. H., & Alajmi, A. (2010). Arabic verb pattern extraction. In 10th International Conference on Information Sciences, Signal Processing and their Applications, ISSPA 2010 (pp. 642-645). [5605427] https://doi.org/10.1109/ISSPA.2010.5605427