Friday, 16 June 2023

Current Issue - May 2023, Volume 14, Number 2


Texts Classification with the usage of Neural Network based on the Word2vec’s Words Representation 

D. V. Iatsenko, Southern Federal University, Russia

ABSTRACT

Application-oriented text processing often requires assigning a submitted text to one of several predetermined categories. Many approaches to this problem exist, including neural network algorithms. This article explores the use of neural networks to sort news articles by category. Two word vectorization approaches are used: the Bag of Words (BOW) model and the word2vec distributional semantic model. In this work, the BOW representation was fed to an FNN, whereas the word2vec representation was fed to a CNN. We measured the classification accuracy of both methods on ad text datasets. The experimental results show that the two models achieve comparable accuracy. However, the word2vec encoding used with the CNN produced results that were more relevant to the semantics of the texts. Moreover, the trained CNN built on word2vec produced a compact feature map at its last convolutional layer, which can be reused as a text representation; that is, the CNN can serve as a text encoder and as a basis for transfer learning.
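
To make the difference between the two input representations in the abstract concrete, here is a minimal sketch in plain Python. It is not the paper's code: the helper names (`build_vocab`, `bow_vector`, `embedding_matrix`), the sample sentences, and the two-dimensional `toy_embeddings` are all hypothetical stand-ins, and real word2vec vectors would come from a trained model. The sketch only illustrates that a BOW vector is a fixed-length count vector that discards word order, while a word-vector sequence keeps order and forms the kind of matrix a CNN can convolve over.

```python
from collections import Counter

def build_vocab(docs):
    """Map each distinct token to a column index, in first-seen order."""
    vocab = {}
    for doc in docs:
        for tok in doc.lower().split():
            vocab.setdefault(tok, len(vocab))
    return vocab

def bow_vector(doc, vocab):
    """Fixed-length count vector; word order is discarded."""
    counts = Counter(doc.lower().split())
    vec = [0] * len(vocab)
    for tok, n in counts.items():
        if tok in vocab:
            vec[vocab[tok]] = n
    return vec

def embedding_matrix(doc, embeddings, max_len, dim):
    """One row per token, in order; unknown tokens and padding are zero rows."""
    rows = [embeddings.get(tok, [0.0] * dim) for tok in doc.lower().split()]
    rows = rows[:max_len]
    rows += [[0.0] * dim for _ in range(max_len - len(rows))]
    return rows

# Hypothetical sample data, for illustration only.
docs = ["stocks rise as markets rally", "team wins final match"]
vocab = build_vocab(docs)

# Same words in a different order give the same BOW vector.
bow = bow_vector("markets rally as stocks rise", vocab)

# Toy 2-dimensional vectors standing in for trained word2vec embeddings.
toy_embeddings = {"stocks": [0.9, 0.1], "markets": [0.8, 0.2], "team": [0.1, 0.9]}
mat = embedding_matrix("stocks rise", toy_embeddings, max_len=4, dim=2)
```

In this sketch, `bow` would be the input to a fully connected network, while `mat` (a `max_len` by `dim` matrix) is the shape of input a convolutional network typically expects.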

KEYWORDS

Deep Learning, Text classification, Word2Vec, BOW, CNN

Original Source URL: https://aircconline.com/ijsc/V14N2/14223ijsc01.pdf

https://airccse.org/journal/ijsc/current2023.html





International Journal on Soft Computing (IJSC) ISSN: 2229 - 6735 [Online]; 2229 - 7103 [Print] https://airccse.org/journal/ijsc/ijsc.html