File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Article: High-resolution cross-scale transformer: A deep learning model for bolt loosening detection based on monocular vision measurement

TitleHigh-resolution cross-scale transformer: A deep learning model for bolt loosening detection based on monocular vision measurement
Authors
KeywordsConnection loosening detection
High-resolution architecture
Monocular vision measurement
Vision transformer
Issue Date2024
Citation
Engineering Applications of Artificial Intelligence, 2024, v. 133, article no. 108574 How to Cite?
AbstractThe reliability of bolt connections significantly impacts the operational state and lifespan of industrial equipment. Vision-based noncontact methods exhibit high efficiency in bolt loosening detection. However, limited image features hinder measurement accuracy. To improve bolt loosening detection performance, this paper proposes a novel deep learning backbone, the high-resolution cross-scale transformer, to extract high precision keypoints for bolt three-dimensional model construction. Simultaneously, a monocular vision measurement model is established to get the bolt exposed length and evaluate the connection loosening state. The proposed backbone hybridizes the advantages of high-resolution architecture and transformer, realizing global information aggregation and fine-grained image details. A simplified module, dual-scale multi-head self-attention, is designed to reduce the computational redundancy caused by the implementation of high-resolution multi-branch architecture. In the experiment section, the high-resolution cross-scale transformer outperforms other keypoint detection baselines, achieving the top one performance with 91.6 average precision and 84.9 average recall. The monocular vision measurement model realizes a 0.053 mm error with a 0.028 mm standard deviation, satisfying the industrial implementation requirement. Additionally, the model is tested on different industrial situations and an additional outside dataset, indicating the model's robustness and actual environment adaptability.
Persistent Identifierhttp://hdl.handle.net/10722/350072
ISSN
2023 Impact Factor: 7.5
2023 SCImago Journal Rankings: 1.749

 

DC FieldValueLanguage
dc.contributor.authorWu, Tianyi-
dc.contributor.authorShang, Ke-
dc.contributor.authorDai, Wei-
dc.contributor.authorWang, Min-
dc.contributor.authorLiu, Rui-
dc.contributor.authorZhou, Junxian-
dc.contributor.authorLiu, Jun-
dc.date.accessioned2024-10-17T07:02:53Z-
dc.date.available2024-10-17T07:02:53Z-
dc.date.issued2024-
dc.identifier.citationEngineering Applications of Artificial Intelligence, 2024, v. 133, article no. 108574-
dc.identifier.issn0952-1976-
dc.identifier.urihttp://hdl.handle.net/10722/350072-
dc.description.abstractThe reliability of bolt connections significantly impacts the operational state and lifespan of industrial equipment. Vision-based noncontact methods exhibit high efficiency in bolt loosening detection. However, limited image features hinder measurement accuracy. To improve bolt loosening detection performance, this paper proposes a novel deep learning backbone, the high-resolution cross-scale transformer, to extract high precision keypoints for bolt three-dimensional model construction. Simultaneously, a monocular vision measurement model is established to get the bolt exposed length and evaluate the connection loosening state. The proposed backbone hybridizes the advantages of high-resolution architecture and transformer, realizing global information aggregation and fine-grained image details. A simplified module, dual-scale multi-head self-attention, is designed to reduce the computational redundancy caused by the implementation of high-resolution multi-branch architecture. In the experiment section, the high-resolution cross-scale transformer outperforms other keypoint detection baselines, achieving the top one performance with 91.6 average precision and 84.9 average recall. The monocular vision measurement model realizes a 0.053 mm error with a 0.028 mm standard deviation, satisfying the industrial implementation requirement. Additionally, the model is tested on different industrial situations and an additional outside dataset, indicating the model's robustness and actual environment adaptability.-
dc.languageeng-
dc.relation.ispartofEngineering Applications of Artificial Intelligence-
dc.subjectConnection loosening detection-
dc.subjectHigh-resolution architecture-
dc.subjectMonocular vision measurement-
dc.subjectVision transformer-
dc.titleHigh-resolution cross-scale transformer: A deep learning model for bolt loosening detection based on monocular vision measurement-
dc.typeArticle-
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.doi10.1016/j.engappai.2024.108574-
dc.identifier.scopuseid_2-s2.0-85192973687-
dc.identifier.volume133-
dc.identifier.spagearticle no. 108574-
dc.identifier.epagearticle no. 108574-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats