A mobileBERT-based preset class fusion text classification algorithm
23 May 2023
Xiaobing Wang, Meng Wang
Proceedings Volume 12645, International Conference on Computer, Artificial Intelligence, and Control Engineering (CAICE 2023); 126453X (2023) https://doi.org/10.1117/12.2681099
Event: International Conference on Computer, Artificial Intelligence, and Control Engineering (CAICE 2023), 2023, Hangzhou, China
Abstract
Encouraged by intelligent governance initiatives, grassroots governance staff are submitting increasing amounts of text data to governance systems. Misclassified work order data results in data loss, which wastes labor and material resources, so effective text classification techniques are needed to correct it. Because work order data are limited and data conditions are difficult, trained models fall short of industry needs. To address these difficulties, this paper presents a mobileBERT-based preset class fusion text classification algorithm (Preset Class Fusion BERT, PCFBERT). After templated preprocessing of address information, the label representations produced by the encoder are separated from the input data representations. They are used on the one hand to compute a contrastive loss that improves model performance, and on the other hand as classifier input to produce the final predictions. Traditional text classification algorithms cannot guarantee both high speed and high accuracy when faced with small samples of real data and complex data conditions; the proposed method addresses this problem. Templated preprocessing of address names reduces noise and makes the dataset information more reliable. Label fusion and contrastive learning based on mobileBERT improve the model's data representations. A triple mapping module fuses predefined label class information into the classifier and makes better use of global information to improve decision-making. Combining these strategies improves the robustness and performance of the architecture without increasing model size. The proposed method is evaluated on simulated data derived from desensitized work orders and compared with several popular text classification methods. It outperforms these methods in classifying work order text data, and it achieves similarly higher performance on two open Chinese multiclass text classification datasets.
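The abstract does not state the exact form of the contrastive loss. As a minimal sketch, assuming an InfoNCE-style objective over cosine similarities (an assumption, not something the abstract confirms), the idea of pulling each text representation toward the representation of its own label and away from the other labels might look like the following. The function names, the temperature value, and the toy vectors are all hypothetical and chosen only for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def label_contrastive_loss(text_vecs, label_vecs, targets, temperature=0.1):
    """Average InfoNCE-style loss between text and label representations.

    Assumed form (not taken from the paper): for each text vector, the
    label vector of its true class is the positive and every other label
    vector is a negative.
    """
    total = 0.0
    for vec, target in zip(text_vecs, targets):
        logits = [cosine(vec, lab) / temperature for lab in label_vecs]
        # Numerically stable log-sum-exp over all label similarities.
        peak = max(logits)
        log_denominator = peak + math.log(sum(math.exp(x - peak) for x in logits))
        total += log_denominator - logits[target]
    return total / len(text_vecs)


# Toy data: two label representations and two text representations.
label_vecs = [[1.0, 0.0], [0.0, 1.0]]
text_vecs = [[0.9, 0.1], [0.2, 0.8]]

aligned = label_contrastive_loss(text_vecs, label_vecs, [0, 1])
swapped = label_contrastive_loss(text_vecs, label_vecs, [1, 0])
print(aligned < swapped)
```

In this toy setup the loss is lower when each text is paired with the label it lies closest to, which is the behavior a label-aware contrastive term is meant to encourage. In a real model the vectors would come from the encoder rather than being fixed by hand.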
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Xiaobing Wang and Meng Wang "A mobileBERT-based preset class fusion text classification algorithm", Proc. SPIE 12645, International Conference on Computer, Artificial Intelligence, and Control Engineering (CAICE 2023), 126453X (23 May 2023); https://doi.org/10.1117/12.2681099
KEYWORDS
Statistical modeling
Education and training
Classification systems
Performance modeling
Data fusion
Associative arrays
Feature fusion