This work describes an automatic text classification method implemented in a software tool called NETHIC, which takes advantage of the inner capabilities of highly-scalable neural networks combined with the expressiveness of hierarchical taxonomies. As such, NETHIC succeeds in bringing about a mechanism for text classification that proves to be significantly effective as well as efficient. The tool had undergone an experimentation process against both a generic and a domain-specific corpus, outputting promising results. On the basis of this experimentation, NETHIC has been now further refined and extended by adding a document embedding mechanism, which has shown improvements in terms of performance on the individual networks and on the whole hierarchical model.

Lomasto, L., Di Florio, R., Ciapetti, A., Miscione, G., Ruggiero, G., Toti, D., An Automatic Text Classification Method Based on Hierarchical Taxonomies, Neural Networks and Document Embedding: The NETHIC Tool, Paper, in Lecture Notes in Business Information Processing, (grc, 03-05 May 2019), Springer, N/A 2020:<<LECTURE NOTES IN BUSINESS INFORMATION PROCESSING>>,378 57-77. 10.1007/978-3-030-40783-4_4 [http://hdl.handle.net/10807/163941]

An Automatic Text Classification Method Based on Hierarchical Taxonomies, Neural Networks and Document Embedding: The NETHIC Tool

Toti, Daniele
2020

Abstract

This work describes an automatic text classification method implemented in a software tool called NETHIC, which takes advantage of the inner capabilities of highly-scalable neural networks combined with the expressiveness of hierarchical taxonomies. As such, NETHIC succeeds in bringing about a mechanism for text classification that proves to be significantly effective as well as efficient. The tool had undergone an experimentation process against both a generic and a domain-specific corpus, outputting promising results. On the basis of this experimentation, NETHIC has been now further refined and extended by adding a document embedding mechanism, which has shown improvements in terms of performance on the individual networks and on the whole hierarchical model.
2020
Inglese
Lecture Notes in Business Information Processing
21st International Conference on Enterprise Information Systems, ICEIS 2019
grc
Paper
3-mag-2019
5-mag-2019
978-3-030-40782-7
Springer
Lomasto, L., Di Florio, R., Ciapetti, A., Miscione, G., Ruggiero, G., Toti, D., An Automatic Text Classification Method Based on Hierarchical Taxonomies, Neural Networks and Document Embedding: The NETHIC Tool, Paper, in Lecture Notes in Business Information Processing, (grc, 03-05 May 2019), Springer, N/A 2020:<<LECTURE NOTES IN BUSINESS INFORMATION PROCESSING>>,378 57-77. 10.1007/978-3-030-40783-4_4 [http://hdl.handle.net/10807/163941]
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10807/163941
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 2
  • ???jsp.display-item.citation.isi??? 0
social impact