Wednesday, 18 February 2026

A Proposed Multi-Domain Approach for Automatic Classification of Text Documents

Abdelrahman M. Arab, Ahmed M. Gadallah and Akram Salah, Cairo University, Egypt

ABSTRACT

Classification is an important technique used in information retrieval. Supervised classification suffers from certain limitations concerning the collection and labeling of the training dataset. When facing MultiDomain classification, multiple training datasets and classifiers are needed which is relatively difficult. In this paper an unsupervised classification system is proposed that can manage the Multi-Domain classification problem as well. It is a multi-domain system where each domain represented by an ontology. A document is mapped on each ontology based on the weights of the mutual tokens between them with the help of fuzzy sets, resulting in a mapping degree of the document with each domain. An experiment carried out showing satisfying classification results with an improvement in the evaluation results of the proposed system compared to Apache Lucene.

KEYWORDS

Information Retrieval, Ontology, Machine Learning, Document Classification, Fuzzy Sets 

Original Source URL: https://aircconline.com/ijsc/V8N1/8117ijsc01.pdf

https://airccse.org/journal/ijsc/current2017.html





No comments:

Post a Comment

Call for Papers! Welcome to AIS 2026!

12th International Conference on Artificial Intelligence and Soft Computing (AIS 2026) July 25 ~ 26, 2026, Toronto, Canada https://ais2026.o...