Arabic verb pattern extraction

E. M. Saad, M. H. Awadalla, A. Alajmi

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Citations (Scopus)

Abstract

Arabic is a highly inflected language, and therefore the processes of stemming and root extracting represent a challenge to researches. A new method is presented for extracting Arabic text stem, and lemma. Stemming sometimes affects the semantic of a word, where as lemma preserve the meaning of a word. The approach is based on pattern extraction. It uses a special encoding based on dividing letters into original and non-original letters. Codes are automatically generated for each pattern and then match against input text to extract root, pattern, and lemma of a word. A comparison with other methods reveals a promising result with accuracy up to 96%.

Original languageEnglish
Title of host publication10th International Conference on Information Sciences, Signal Processing and their Applications, ISSPA 2010
Pages642-645
Number of pages4
DOIs
Publication statusPublished - 2010
Externally publishedYes
Event10th International Conference on Information Sciences, Signal Processing and their Applications, ISSPA 2010 - Kuala Lumpur, Malaysia
Duration: May 10 2010May 13 2010

Publication series

Name10th International Conference on Information Sciences, Signal Processing and their Applications, ISSPA 2010

Other

Other10th International Conference on Information Sciences, Signal Processing and their Applications, ISSPA 2010
Country/TerritoryMalaysia
CityKuala Lumpur
Period5/10/105/13/10

Keywords

  • Morphological analyzer
  • Natural language processing
  • Root extraction

ASJC Scopus subject areas

  • Computer Science Applications
  • Information Systems
  • Signal Processing

Fingerprint

Dive into the research topics of 'Arabic verb pattern extraction'. Together they form a unique fingerprint.

Cite this