Similarity score for information filtering thresholds in business processes

Jun Lai, Ben Son, Saqib Ali

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

The tremendous growth in the amount of information available poses some key challenges for information filtering and retrieval. Users not only expect high quality and relevant information, but also wish that the information be presented in an as efficient way as possible. The traditional filtering methods, however, only consider the relevant values of document. These conventional methods fail to consider the efficiency of documents retrieval. In this paper, we propose a new algorithm to calculate pn index called document similarity score based on elements of the document. Using the index, document profile will be derived. Any documents with the similarity score above a given threshold wilt be clustered. Using these pre-clusiered documents, information filtering and retrieval can be made more efficient. Experimental results clearly show our proposed method tremendously improves the efficiency of information filtering and retrieval. We also give an example application of our proposed method in business processes.

Original languageEnglish
Title of host publicationProceedings of INMIC 2004 - 8th International Multitopic Conference
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages743-748
Number of pages6
ISBN (Electronic)0780386809, 9780780386808
DOIs
Publication statusPublished - 2004
Event8th International Multitopic Conference, INMIC 2004 - Lahore, Pakistan
Duration: Dec 24 2004Dec 26 2004

Other

Other8th International Multitopic Conference, INMIC 2004
CountryPakistan
CityLahore
Period12/24/0412/26/04

Fingerprint

Information filtering
Information retrieval
Industry

Keywords

  • Business process
  • Clustering
  • Elements
  • Information filtering
  • Information retrieval
  • Search engine
  • Web crawlers
  • World Wide Web

ASJC Scopus subject areas

  • Engineering(all)
  • Computer Science(all)

Cite this

Lai, J., Son, B., & Ali, S. (2004). Similarity score for information filtering thresholds in business processes. In Proceedings of INMIC 2004 - 8th International Multitopic Conference (pp. 743-748). [1492988] Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/INMIC.2004.1492988

Similarity score for information filtering thresholds in business processes. / Lai, Jun; Son, Ben; Ali, Saqib.

Proceedings of INMIC 2004 - 8th International Multitopic Conference. Institute of Electrical and Electronics Engineers Inc., 2004. p. 743-748 1492988.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Lai, J, Son, B & Ali, S 2004, Similarity score for information filtering thresholds in business processes. in Proceedings of INMIC 2004 - 8th International Multitopic Conference., 1492988, Institute of Electrical and Electronics Engineers Inc., pp. 743-748, 8th International Multitopic Conference, INMIC 2004, Lahore, Pakistan, 12/24/04. https://doi.org/10.1109/INMIC.2004.1492988
Lai J, Son B, Ali S. Similarity score for information filtering thresholds in business processes. In Proceedings of INMIC 2004 - 8th International Multitopic Conference. Institute of Electrical and Electronics Engineers Inc. 2004. p. 743-748. 1492988 https://doi.org/10.1109/INMIC.2004.1492988
Lai, Jun ; Son, Ben ; Ali, Saqib. / Similarity score for information filtering thresholds in business processes. Proceedings of INMIC 2004 - 8th International Multitopic Conference. Institute of Electrical and Electronics Engineers Inc., 2004. pp. 743-748
@inproceedings{b3a4dd71b90f465fa364aa344db18c40,
title = "Similarity score for information filtering thresholds in business processes",
abstract = "The tremendous growth in the amount of information available poses some key challenges for information filtering and retrieval. Users not only expect high quality and relevant information, but also wish that the information be presented in an as efficient way as possible. The traditional filtering methods, however, only consider the relevant values of document. These conventional methods fail to consider the efficiency of documents retrieval. In this paper, we propose a new algorithm to calculate pn index called document similarity score based on elements of the document. Using the index, document profile will be derived. Any documents with the similarity score above a given threshold wilt be clustered. Using these pre-clusiered documents, information filtering and retrieval can be made more efficient. Experimental results clearly show our proposed method tremendously improves the efficiency of information filtering and retrieval. We also give an example application of our proposed method in business processes.",
keywords = "Business process, Clustering, Elements, Information filtering, Information retrieval, Search engine, Web crawlers, World Wide Web",
author = "Jun Lai and Ben Son and Saqib Ali",
year = "2004",
doi = "10.1109/INMIC.2004.1492988",
language = "English",
pages = "743--748",
booktitle = "Proceedings of INMIC 2004 - 8th International Multitopic Conference",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
address = "United States",

}

TY - GEN

T1 - Similarity score for information filtering thresholds in business processes

AU - Lai, Jun

AU - Son, Ben

AU - Ali, Saqib

PY - 2004

Y1 - 2004

N2 - The tremendous growth in the amount of information available poses some key challenges for information filtering and retrieval. Users not only expect high quality and relevant information, but also wish that the information be presented in an as efficient way as possible. The traditional filtering methods, however, only consider the relevant values of document. These conventional methods fail to consider the efficiency of documents retrieval. In this paper, we propose a new algorithm to calculate pn index called document similarity score based on elements of the document. Using the index, document profile will be derived. Any documents with the similarity score above a given threshold wilt be clustered. Using these pre-clusiered documents, information filtering and retrieval can be made more efficient. Experimental results clearly show our proposed method tremendously improves the efficiency of information filtering and retrieval. We also give an example application of our proposed method in business processes.

AB - The tremendous growth in the amount of information available poses some key challenges for information filtering and retrieval. Users not only expect high quality and relevant information, but also wish that the information be presented in an as efficient way as possible. The traditional filtering methods, however, only consider the relevant values of document. These conventional methods fail to consider the efficiency of documents retrieval. In this paper, we propose a new algorithm to calculate pn index called document similarity score based on elements of the document. Using the index, document profile will be derived. Any documents with the similarity score above a given threshold wilt be clustered. Using these pre-clusiered documents, information filtering and retrieval can be made more efficient. Experimental results clearly show our proposed method tremendously improves the efficiency of information filtering and retrieval. We also give an example application of our proposed method in business processes.

KW - Business process

KW - Clustering

KW - Elements

KW - Information filtering

KW - Information retrieval

KW - Search engine

KW - Web crawlers

KW - World Wide Web

UR - http://www.scopus.com/inward/record.url?scp=84935115300&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84935115300&partnerID=8YFLogxK

U2 - 10.1109/INMIC.2004.1492988

DO - 10.1109/INMIC.2004.1492988

M3 - Conference contribution

AN - SCOPUS:84935115300

SP - 743

EP - 748

BT - Proceedings of INMIC 2004 - 8th International Multitopic Conference

PB - Institute of Electrical and Electronics Engineers Inc.

ER -