Improving the Steel Surface Defect Detection Algorithm of YOLOv5 Network

Qian Zhang

Authors

Qian Zhang School of Software, Beijing University of Aeronautics and Astronautics, Beijing 100191

Keywords:

Surface Defect Detection, YOLOv5, Attention Mechanism, K-Means++, Industrial Steel, Small Target Detection, Real-time Inspection, Computer Vision

Abstract

Automated surface defect detection in industrial steel production is critical for ensuring product quality, but remains a challenging task due to the prevalence of small, low-contrast defects and the multi-scale nature of these anomalies. Current detection systems often struggle with insufficient accuracy, particularly for small targets, and a lack of robustness across varying defect sizes. To address these limitations, this paper proposes a series of targeted improvements to the YOLOv5s algorithm. First, at the input stage, the K-Means++ algorithm is employed to recluster the dataset and generate optimized initial anchor boxes, which provides a better prior for the model to learn from and improves localization, especially for small defects. Second, an attention mechanism is integrated into the backbone network to enhance feature representation. This module enables the model to focus computational resources on more informative spatial regions and channel features associated with defects, effectively suppressing irrelevant background noise and amplifying subtle defect signatures. Comprehensive experiments were conducted on a dedicated industrial steel defect dataset. The results demonstrate that the improved algorithm achieves a mean Average Precision (mAP@0.5) of 83.3%, representing a significant 6.2% increase over the baseline YOLOv5s model. Crucially, this performance gain is achieved without sacrificing inference speed; the enhanced model maintains a real-time detection rate of 96.5 frames per second (fps) on a standard GPU. These findings confirm that the proposed enhanced YOLOv5s algorithm successfully balances high precision with real-time processing capabilities, making it a viable and effective solution for automated visual inspection in demanding industrial environments such as steel manufacturing.

References

Zeng, Yuan, et al. "Education investment, social security, and household financial market participation." Finance Research Letters 77 (2025): 107124.

Wang, Hao. "Joint Training of Propensity Model and Prediction Model via Targeted Learning for Recommendation on Data Missing Not at Random." AAAI 2025 Workshop on Artificial Intelligence with Causal Techniques. 2025.

Ding, C.; Wu, C. Self-Supervised Learning for Biomedical Signal Processing: A Systematic Review on ECG and PPG Signals. medRxiv 2024.

D. Restrepo, C. Wu, S.A. Cajas, L.F. Nakayama, L.A. Celi, D.M. López. Multimodal deep learning for low-resource settings: A vector embedding alignment approach for healthcare applications. (2024), 10.1101/2024.06.03.24308401

Xie, Minhui, and Shujian Chen. "InVis: Interactive Neural Visualization System for Human-Centered Data Interpretation." Authorea Preprints (2025).

Zhu, Bingxin. "RAID: Reliability Automation through Intelligent Detection in Large-Scale Ad Systems." (2025).

Zhang, Yuhan. "InfraMLForge: Developer Tooling for Rapid LLM Development and Scalable Deployment." (2025).

Hu, Xiao. "GenPlayAds: Procedural Playable 3D Ad Creation via Generative Model." (2025).

Qin, Haoshen, et al. "Optimizing deep learning models to combat amyotrophic lateral sclerosis (ALS) disease progression." Digital health 11 (2025): 20552076251349719.

Wang, Yang, and Zhejun Zhao. "Advancing Abstract Reasoning in Artificial General Intelligence with a Hybrid Multi-Component Architecture." 2024 4th International Symposium on Artificial Intelligence and Intelligent Manufacturing (AIIM). IEEE, 2024.

Fu, Lei, et al. "Adversarial Prompt Optimization in LLMs: HijackNet’s Approach to Robustness and Defense Evasion." 2025 4th International Symposium on Computer Applications and Information Technology (ISCAIT). IEEE, 2025.

Lei, Fu, et al. "Teacher-Student Framework for Short-Context Classification with Domain Adaptation and Data Augmentation." (2025).

Zheng, Haoran, et al. "FinGPT-Agent: An Advanced Framework for Multimodal Research Report Generation with Task-Adaptive Optimization and Hierarchical Attention." (2025).

Improving the Steel Surface Defect Detection Algorithm of YOLOv5 Network

Authors

Keywords:

Abstract

References

Downloads

Published

How to Cite

Issue

Section

License

Current Issue

Information