Part of Speech (POS) tag sets reduction and analysis using rough set techniques

Mohamed Elhadi; Amjd Al-Tobi

doi:10.1007/978-3-642-10646-0_27

Part of Speech (POS) tag sets reduction and analysis using rough set techniques

Mohamed Elhadi^*, Amjd Al-Tobi

^*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

The motivation behind this work stems from an earlier work where text was transformed into strings of syntactical structures and used in similarity calculations using sequence algorithm on a string generated by a POS tagger. The performance of computations was greatly affected by the size of the string which in itself is the result of the type of tags used. Generated tags range from several (minimum of nine) general ones to many more (hundreds) detailed tags. Figuring out which tags and what combination of tags affect the realization of meanings, dependencies or relationships that exist in the text is an important issue. The resulting tag set reduction using rough sets and consequently string reduction has resulted in an improved efficiency in similarity calculations between documents while maintaining the same level of accuracy. Such finding was very encouraging.

Original language	English
Title of host publication	Rough Sets, Fuzzy Sets, Data Mining and Granular Computing - 12th International Conference, RSFDGrC 2009, Proceedings
Pages	223-230
Number of pages	8
DOIs	https://doi.org/10.1007/978-3-642-10646-0_27
Publication status	Published - 2009
Externally published	Yes
Event	12th International Conference on Rough Sets, Fuzzy Sets, Data Mining and Granular Computing, RSFDGrC 2009 - Delhi, India Duration: Dec 15 2009 → Dec 18 2009

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	5908 LNAI
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

Conference

Conference	12th International Conference on Rough Sets, Fuzzy Sets, Data Mining and Granular Computing, RSFDGrC 2009
Country/Territory	India
City	Delhi
Period	12/15/09 → 12/18/09

Keywords

Data reduction
POS tagging
Rough sets
Similarity calculations
String comparison

ASJC Scopus subject areas

Theoretical Computer Science
General Computer Science

Access to Document

10.1007/978-3-642-10646-0_27

Cite this

Elhadi, M., & Al-Tobi, A. (2009). Part of Speech (POS) tag sets reduction and analysis using rough set techniques. In Rough Sets, Fuzzy Sets, Data Mining and Granular Computing - 12th International Conference, RSFDGrC 2009, Proceedings (pp. 223-230). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 5908 LNAI). https://doi.org/10.1007/978-3-642-10646-0_27

Part of Speech (POS) tag sets reduction and analysis using rough set techniques. / Elhadi, Mohamed; Al-Tobi, Amjd.
Rough Sets, Fuzzy Sets, Data Mining and Granular Computing - 12th International Conference, RSFDGrC 2009, Proceedings. 2009. p. 223-230 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 5908 LNAI).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Elhadi, M & Al-Tobi, A 2009, Part of Speech (POS) tag sets reduction and analysis using rough set techniques. in Rough Sets, Fuzzy Sets, Data Mining and Granular Computing - 12th International Conference, RSFDGrC 2009, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 5908 LNAI, pp. 223-230, 12th International Conference on Rough Sets, Fuzzy Sets, Data Mining and Granular Computing, RSFDGrC 2009, Delhi, India, 12/15/09. https://doi.org/10.1007/978-3-642-10646-0_27

Elhadi M, Al-Tobi A. Part of Speech (POS) tag sets reduction and analysis using rough set techniques. In Rough Sets, Fuzzy Sets, Data Mining and Granular Computing - 12th International Conference, RSFDGrC 2009, Proceedings. 2009. p. 223-230. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-642-10646-0_27

Elhadi, Mohamed ; Al-Tobi, Amjd. / Part of Speech (POS) tag sets reduction and analysis using rough set techniques. Rough Sets, Fuzzy Sets, Data Mining and Granular Computing - 12th International Conference, RSFDGrC 2009, Proceedings. 2009. pp. 223-230 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

@inproceedings{26aeab4c0add4023b24b382874c6236a,

title = "Part of Speech (POS) tag sets reduction and analysis using rough set techniques",

abstract = "The motivation behind this work stems from an earlier work where text was transformed into strings of syntactical structures and used in similarity calculations using sequence algorithm on a string generated by a POS tagger. The performance of computations was greatly affected by the size of the string which in itself is the result of the type of tags used. Generated tags range from several (minimum of nine) general ones to many more (hundreds) detailed tags. Figuring out which tags and what combination of tags affect the realization of meanings, dependencies or relationships that exist in the text is an important issue. The resulting tag set reduction using rough sets and consequently string reduction has resulted in an improved efficiency in similarity calculations between documents while maintaining the same level of accuracy. Such finding was very encouraging.",

keywords = "Data reduction, POS tagging, Rough sets, Similarity calculations, String comparison",

author = "Mohamed Elhadi and Amjd Al-Tobi",

year = "2009",

doi = "10.1007/978-3-642-10646-0_27",

language = "English",

isbn = "3642106455",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

pages = "223--230",

booktitle = "Rough Sets, Fuzzy Sets, Data Mining and Granular Computing - 12th International Conference, RSFDGrC 2009, Proceedings",

note = "12th International Conference on Rough Sets, Fuzzy Sets, Data Mining and Granular Computing, RSFDGrC 2009 ; Conference date: 15-12-2009 Through 18-12-2009",

}

TY - GEN

T1 - Part of Speech (POS) tag sets reduction and analysis using rough set techniques

AU - Elhadi, Mohamed

AU - Al-Tobi, Amjd

PY - 2009

Y1 - 2009

N2 - The motivation behind this work stems from an earlier work where text was transformed into strings of syntactical structures and used in similarity calculations using sequence algorithm on a string generated by a POS tagger. The performance of computations was greatly affected by the size of the string which in itself is the result of the type of tags used. Generated tags range from several (minimum of nine) general ones to many more (hundreds) detailed tags. Figuring out which tags and what combination of tags affect the realization of meanings, dependencies or relationships that exist in the text is an important issue. The resulting tag set reduction using rough sets and consequently string reduction has resulted in an improved efficiency in similarity calculations between documents while maintaining the same level of accuracy. Such finding was very encouraging.

AB - The motivation behind this work stems from an earlier work where text was transformed into strings of syntactical structures and used in similarity calculations using sequence algorithm on a string generated by a POS tagger. The performance of computations was greatly affected by the size of the string which in itself is the result of the type of tags used. Generated tags range from several (minimum of nine) general ones to many more (hundreds) detailed tags. Figuring out which tags and what combination of tags affect the realization of meanings, dependencies or relationships that exist in the text is an important issue. The resulting tag set reduction using rough sets and consequently string reduction has resulted in an improved efficiency in similarity calculations between documents while maintaining the same level of accuracy. Such finding was very encouraging.

KW - Data reduction

KW - POS tagging

KW - Rough sets

KW - Similarity calculations

KW - String comparison

UR - http://www.scopus.com/inward/record.url?scp=76649087046&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=76649087046&partnerID=8YFLogxK

U2 - 10.1007/978-3-642-10646-0_27

DO - 10.1007/978-3-642-10646-0_27

M3 - Conference contribution

AN - SCOPUS:76649087046

SN - 3642106455

SN - 9783642106453

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 223

EP - 230

BT - Rough Sets, Fuzzy Sets, Data Mining and Granular Computing - 12th International Conference, RSFDGrC 2009, Proceedings

T2 - 12th International Conference on Rough Sets, Fuzzy Sets, Data Mining and Granular Computing, RSFDGrC 2009

Y2 - 15 December 2009 through 18 December 2009

ER -

Part of Speech (POS) tag sets reduction and analysis using rough set techniques

Abstract

Publication series

Conference

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this