MULTI-LABEL TEXT CLASSIFICATION VIA DOCUMENT ENHANCEMENT AND LABEL CORRELATIONS LEARNING

Authors

  • Chuzhen Li Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia (UKM), Bangi 43600, Malaysia.
  • Mohd Juzaiddin Ab Aziz Center for Software Technology and Management (SOFTAM), Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia (UKM), Bangi 43600, Selangor, Malaysia.
  • Mohd Ridzwan Yaakub Center for Artificial Intelligence Technology (CAIT), Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia (UKM), Bangi 43600, Selangor, Malaysia.

DOI:

https://doi.org/10.22452/mjcs.vol38no2.1

Keywords:

Multi-label text classification, Label correlation, Document representation

Abstract

Multi-label text classification (MLTC) has become increasingly popular because it is more broadly applicable and aligns more closely with the inherent properties and rules of real-world objects. Numerous approaches have been proposed to capture label correlations. However, most capture relationships between labels only implicitly and do not explicitly distinguish or define label similarity correlations and label pairing correlations, instead treating them as a single, unified label correlation. To address this, we propose an approach that distinguishes and explicitly defines label similarity correlations and label pairing correlations. The approach begins by acquiring text and label representations simultaneously. Next, the document representations are enhanced by concatenating them with their most similar document subsets. Finally, the label similarity correlations and pairing correlations are explicitly learned during label correlation learning. The approach outperforms previous competitive models, achieving micro-F1 scores of 75.3% and 89.6% on the AAPD and RCV1-V2 datasets, respectively.
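
For readers unfamiliar with the reported metric, the following is a minimal sketch of how micro-F1 is typically computed in a multi-label setting: true positives, false positives, and false negatives are pooled over every (document, label) decision before precision and recall are combined. The function name micro_f1 and the toy label sets are illustrative assumptions and are not taken from the paper or its evaluation code.

```python
from typing import List, Set


def micro_f1(true_labels: List[Set[str]], pred_labels: List[Set[str]]) -> float:
    """Micro-averaged F1 over all (document, label) decisions.

    Hypothetical helper for illustration; not the authors' evaluation script.
    """
    if len(true_labels) != len(pred_labels):
        raise ValueError("true_labels and pred_labels must have the same length")

    tp = fp = fn = 0
    for truth, pred in zip(true_labels, pred_labels):
        tp += len(truth & pred)   # labels predicted and correct
        fp += len(pred - truth)   # labels predicted but wrong
        fn += len(truth - pred)   # labels missed

    # F1 = 2*TP / (2*TP + FP + FN); defined as 0.0 when there are no counts.
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


# Toy example (made-up labels, three documents).
y_true = [{"cs.CL", "cs.LG"}, {"stat.ML"}, {"cs.CV"}]
y_pred = [{"cs.CL"}, {"stat.ML", "cs.LG"}, {"cs.CV"}]
score = micro_f1(y_true, y_pred)
print(f"micro-F1 = {score:.3f}")
```

Because counts are pooled before averaging, frequent labels weigh more heavily in micro-F1 than rare ones, which is worth keeping in mind when comparing scores across datasets with different label distributions.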

Published

2025-06-30

How to Cite

Li, C., Aziz, M. J. A., & Yaakub, M. R. (2025). MULTI-LABEL TEXT CLASSIFICATION VIA DOCUMENT ENHANCEMENT AND LABEL CORRELATIONS LEARNING. Malaysian Journal of Computer Science, 38(2), 131–151. https://doi.org/10.22452/mjcs.vol38no2.1