KEYWORDS: Education and training, Deep learning, Transformers, Image classification, Visual process modeling, Data modeling, Performance modeling, Image processing, Convolutional neural networks, Process modeling
Due to the close similarity in tire textures, it is challenging to predict the type of a tire with a single deep learning model. Therefore, an ensemble learning based tire classification method is proposed in this paper, which leverages the advantages local perception in Convolutional Neural Networks (CNNs) and global attention in Vision Transformers (ViTs). Initially, individual networks based on CNNs and ViTs are independently trained. Subsequently, the backbone networks of these models are frozen, and the features obtained from different models are concatenated. The concatenated features are then fed into a feed forward network for prediction. The experimental results demonstrate that the proposed ensemble model exhibits higher accuracy and stronger generalization capability in tire classification tasks. Compared to individual predictions with single model based on CNNs and ViTs, the accuracy is improved by more than 18.51% and 8.05%, respectively.
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.