YOLO-MSD: a robust industrial surface defect detection model via multi-scale feature fusion

Ge, Yifei; Li, Zhuo; Meng, Lin

doi:10.1007/s10489-025-06739-0

YOLO-MSD: a robust industrial surface defect detection model via multi-scale feature fusion

Open access
Published: 03 July 2025

Volume 55, article number 840, (2025)
Cite this article

You have full access to this open access article

Download PDF

Applied Intelligence Aims and scope Submit manuscript

YOLO-MSD: a robust industrial surface defect detection model via multi-scale feature fusion

Download PDF

1939 Accesses
7 Citations
Explore all metrics

Abstract

Object detection is vital for automated surface defect inspection, yet most models suffer from bloated architectures and poor performance on multi‑class, multi‑scale tasks involving large‑size images, limiting their use on edge devices. We propose YOLO‑MSD, a lightweight surface defect detection model that integrates two key designs: (1) a novel four-scale backbone that effectively extracts small and multi-scale targets from large-size images by enhancing feature representation across different scale resolutions, and (2) a streamlined feature‑pyramid neck that boosts cross‑scale fusion while reducing parameters and computational cost. Extensive experiments on five public datasets verify the model’s effectiveness. On the PCB, HRIPCB and GC10‑DET datasets featuring high-resolution images, YOLO‑MSD achieves 96.67% mAP, 96.62% mAP and 69.09% mAP, respectively, while maintaining a low parameter count and computational complexity. It also outperforms most advanced models on two additional public datasets and achieves 20.82 FPS with a power consumption of 6.95 W on the PCB dataset when deployed on a Jetson Xavier NX edge device. These results demonstrate the accuracy, efficiency, and deployability of YOLO‑MSD for industrial surface‑defect detection.

YOLO-CEA: a real-time industrial defect detection method based on contextual enhancement and attention

Article 30 June 2023

Surface Defect Detection Algorithm Based on Feature-Enhanced YOLO

Article 17 October 2022

PCP-YOLO: an approach integrating non-deep feature enhancement module and polarized self-attention for small object detection of multiscale defects

Article 05 December 2024

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Object detection serves as a highly effective deep learning method, demonstrating exceptional success in surface defect detection [1,2,3], primarily attributed to its sophisticated classification and localization abilities. This method addresses several critical limitations of traditional anomaly detection approaches, such as subpar performance in handling multi-class and multi-scale object tasks, difficulty in processing large-size images, poor real-time capabilities, and overall inefficiency. By overcoming these pain points, object detection provides a more robust and efficient solution for identifying and localizing defects. The following part highlights the specific challenges and difficulties in defect detection, illustrating the areas where object detection models offer significant improvements.

Initially, computer vision-based classification method is employed for defect detection, primarily tasked with identifying images that contain defects [6, 7]. In cases where the image contains only a single type of defect, as illustrated in Fig. 1(a), the classification method is capable of effectively classifying the defects. Although the approach is able to recognize the type of defect present in the image, it is unable to accurately localize the defect within the picture. Whereas, when it contains multi-class and multi-scale defects as in Fig. 1(b), the classification method is just able to detect the presence of defects in the image but lacks the ability to classify or localize them accurately. In order to solve these problems, the object detection approach is developed, which is capable of realizing accurate classification and localization of multiple types of defects, as shown in Fig. 1(b). The application of object detection technology to defect detection greatly enhances both detection efficiency and accuracy, leading to remarkable advancements in the field of defect identification and detection. You only look once (YOLO) [8,9,10,11], Faster R-CNN [12], and SSD [13], as the most representative object detection algorithms, have achieved excellent performance in various fields such as industrial manufacturing, material inspection, weld inspection, and textile inspection. These achievements are primarily due to the incorporation of advanced technologies, including deep learning and computer vision, which enhance the capabilities of defect detection.

Furthermore, in modern industrial applications, high-resolution, large-size images are essential for detecting subtle surface defects and ensuring product quality. These images reveal fine imperfections more clearly, such as scratches and dents. However, their use also brings challenges, including increased data volume, higher storage and computational demands, and longer training times [14]. Traditional object detection models may find it difficult to handle such high-resolution images effectively, leading to a decrease in detection efficiency and an increase in false alarms.

In response to these challenges, enhancing detection accuracy without substantially increasing model complexity has become a key objective. A common approach to improve detection accuracy is to increase the model depth by stacking additional convolutional blocks. However, this approach also leads to a more complex and computationally intensive model [15]. To alleviate this, multi-scale feature fusion methods have been proposed [16], enabling the model to capture both fine and coarse features while reducing complexity. However, this method has some limitations in improving detection accuracy and cannot realize significant improvement. Therefore, it is crucial to trade off detection accuracy and model complexity in large-size image detection.

To address the issues of bloated structures and poor performance on multi-class, multi-scale surface defect detection in large-size images, this study proposes YOLO-MSD, a lightweight and effective model designed for industrial applications. The model features a four-scale backbone built with a novel Multi-Scale Convolution (MSC) module, which enhances feature extraction and fusion across different resolutions. In addition, we design a streamlined feature pyramid network (SFPN) to reduce the neck’s complexity while improving fusion efficiency. The anchor-free YOLO head is also simplified to lower computational cost and boost inference speed. Extensive evaluations conducted on five public datasets demonstrate that YOLO-MSD outperforms most advanced models. The model also runs on Jetson Xavier NX to confirm its suitability for edge deployment.

The main contributions are shown below.

This study proposes YOLO-MSD, a robust and lightweight object detection model tailored for industrial surface defect detection in large-size images, achieving higher detection accuracy and robustness against multi-scale, multi-class objects under complex backgrounds.
This work presents a novel MSC block and an SFPN to enhance feature extraction and fusion efficiency while reducing model complexity, enabling better performance with lower computational costs.
Extensive experiments conduct on five public datasets demonstrate that YOLO-MSD outperforms most existing state-of-the-art (SOTA) models in both detection accuracy and efficiency, validating its effectiveness and adaptability for industrial applications.

The remaining sections are organized as follows. Section 2 introduces the related work of multi-scale feature extraction structures, defect detection models, and their applications. Section 3 describes the MSD blocks and the MSD-based defect detection model YOLO-MSD. Section 4 performs extensive ablation and comparison experiments. Section 5 summarizes the article.

1 Related work

This section provides an overview of defect detection models and a brief exploration into the development of deep learning models for surface defect detection. Additionally, multi-scale feature extraction architectures are discussed. These are essential for capturing defects of varying size and complexity.

1.1 Defect detection models

Early defect detection relied on rule-based image processing techniques such as edge detection, threshold segmentation, and morphological operations [17]. These methods perform effectively with simple and predictable defects, but their accuracy significantly diminishes when faced with complex and irregular defects. With the improvement of computational power and the accumulation of large amounts of labelled data, convolutional neural network (CNN)-based deep learning methods are gradually replacing traditional methods in defect detection [18, 19]. Deep learning models are capable of automatically learning features in images and have the ability to handle defects in complex environments.

Classical CNN architectures such as VGG [20], ResNet [21] and DenseNet [22], which were used in the early days, have achieved some success in defect detection, but these models usually require a large amount of computational resources. In order to improve the detection efficiency and ensure real-time performance in industrial inspection. Lightweight networks such as MobileNet [23], EfficientNet [24] and Ghost [25] have been proposed and widely used to reduce computational overhead while maintaining detection accuracy. These methods mainly target the classification of defects.

When detecting multiple defects in complex environments, it is not only necessary to identify the class of defects but also to localize the defects accurately. Traditional CNN networks often have difficulty in such scenarios, which has driven the adoption of deep learning-based object detection models in industrial applications. Models such as Faster R-CNN, SSD, and YOLO have gained popularity for their ability to precisely classify and locate defects while maintaining high detection accuracy. With the continuous progress of deep learning technology, YOLO model, as a representative of one-step detection, has been optimized many times to show more and more powerful detection performance, such as YOLOX [26], YOLOv8 [27], YOLOv10 [28] and YOLO11 [11]. These improved versions have improved in terms of accuracy, speed, and model efficiency, making them even more effective for defect detection in complex environments.

To further improve defect detection accuracy, traditional CNN architectures usually extract complex features by stacking more convolutional groups at a single scale. This results in a bloated model structure and increased computational overhead. In addition, in practical applications, the defects in large-size images are of various sizes and types, and it is often difficult to fully capture such information by relying on single-scale features only. To address this challenge, multi-scale feature fusion techniques have emerged [16]. By fusing features at different scales, the model is able to capture both detailed and global information better, thus improving the accuracy of detection. In the next section, we delve into the development history of multi-scale feature fusion structures and their important role in enhancing defect detection performance.

1.2 Multi-scale feature fusion structures

Multi-scale feature fusion has been widely used to improve object detection in complex scenarios. By processing inputs at multiple scales and integrating the features, these methods enhance the model’s ability to detect objects of various sizes while balancing accuracy and computational cost [10, 29,30,31]. Such architectures significantly improve the model’s ability to detect objects of varying sizes, especially in high-resolution images. For YOLO models, the multi-scale feature fusion architecture is mainly implemented in the backbone and neck components. The backbone component typically consists of CNN models, while the neck component is composed of FPN structures.

For CNN-based models, ResNet first introduced the residual architecture, enabling direct feature reuse through shortcut connections between the original and convolved features [21]. Building on this concept, Iqbal et al. [32, 33] introduced a series of CNN structures for the automated detection of synovial fluid in human knee joints and the classification of endothelial cells derived from human-induced pluripotent stem cells. In the field of object detection, YOLOv3 incorporated a ResNet-like structure by introducing the Darknet53 backbone for effective multi-scale feature extraction [34]. This was further improved in YOLOv4, which proposed CSPDarknet53 to enhance feature fusion across two scales [35]. YOLOv5 [36], YOLOX [26], and YOLOv7 [37] continued refining this architecture to improve accuracy and efficiency. More recently, YOLOv8 [27], YOLOv10 [28], and YOLOv11 [11] have focused on lightweight designs while maintaining high performance across various detection tasks. However, these models perform poorly in large-size images containing multiple-scale objects. To address this problem we plan to use more dimensions to extract features.

For the neck structure, YOLOv3 first introduced FPN to fuse features at three scales [34]. YOLOv4 enhanced this by adding PANet, incorporating both top-down and bottom-up paths for better low-level feature integration [35]. YOLOv5 combined FPN and PANet for more effective multi-scale fusion, while YOLOX further optimized PANet [26]. BiFPN introduced learnable weights to adaptively balance features across scales, improving fusion performance [38]. However, these improvements increase network complexity and computational cost. In this work, we aim to design a novel lightweight neck that achieves efficient feature fusion with small overhead.

2 Methodology

This section overviews our proposed surface defect detection model YOLO-MSD. Section 3.1 describes the architecture of MSC blocks. Section 3.2 introduces the MSC blocks-based YOLO-MSD structure.

2.1 MSC blocks

In order to enhance feature extraction capabilities and recognition accuracy, traditional backbones often deepen the network and stack convolutional modules. Although this approach is effective, it significantly increases computational complexity and overhead. To address this issue, we propose a novel MSC block in this paper. The MSC block trades off computational overhead and detection accuracy by utilizing multi-scale parallel computation and feature fusion to improve feature extraction capabilities. Additionally, the MSC block reduces computational overhead by employing a few convolutional operations.

Figure 2 shows the architecture of the MSC blocks. For the MSC41 block, the primary function is to split the input features into four dimensions and then perform feature extraction and fusion across these four dimensions. In detail, the first CBS (contains of a convolution layer, a Batch Normalization layer and a SiLU activation function) operation aims to extract the features of the input image and boost the number of channels. Then the output is split into four scales. The first dimension uses a CBS group with $1\times 1$ kernel size convolution to vary the number of channels. The other dimensions achieve down sampling and channels changes by a CBS group with a convolution kernel of $1\times 1$ and a stride length of 2. The number of channels in each dimension is one-quarter of the input channels. As a result, we acquire four scales with different input sizes, the first dimension has the largest input size and the last dimension gains the smallest input size (that is a quarter of the first dimension). In addition, the MaxPooling operations integrate concatenate operations achieving features fusion, which transmits the large-scale information to small scales and enriches the features of the small scales.

Specifically, the computation across the four scales proceeds continuously until the target size is achieved, eliminating the redundant operations typically found in traditional feature fusion strategies, which involve separating, fusing, and then separating again. This approach preserves the original features of each scale while seamlessly merging them with the features of the next scale, thus enhancing the model’s overall feature extraction capability.

The MSC41 block is able to formulate as follows,

$$\begin{aligned} {\left\{ \begin{array}{ll} Y^{C1}_{M41}= P(f^{n/4}_{C31}(f^{n/4}_{C11}(X)) \\ Y^{C2}_{M41}= f^{n/4}_{C31}(f^{n/4}_{C31}(f^{n/4}_{C12}(X))\oplus P(Y^{C1}_{M41})) \\ Y^{C3}_{M41}= f^{n/4}_{C31}(f^{n/4}_{C31}(f^{n/4}_{C12}(f^{n/4}_{C12}(X)))\oplus P(Y^{C2}_{M41})) \\ \begin{aligned} Y^{C4}_{M41} & = f^{n/4}_{C31}(f^{n/4}_{C31}(f^{n/4}_{C12}(f^{n/4}_{C12}(f^{n/4}_{C12}(X))))\\ & \oplus P(Y^{C3}_{M41})) \end{aligned} \end{array}\right. } \end{aligned}$$

(1)

where $Y^{C_i}_{M41}$ ($i = 1, 2, 3, 4$) represents the output feature at the i-th scale of the MSC41 block. Here, $C_i$ denotes the i-th dimension. The input X is processed by a CBS operation with n output channels. Each $f^{c}_{C_{ks}}(\cdot )$ represents a CBS group with kernel size $k \times k$, c output channels, and stride s. The operator $P(\cdot )$ denotes a $3 \times 3$ MaxPooling operation used to downsample features before fusion. $\oplus$ indicates the concatenation operation. This formulation enables hierarchical feature extraction and fusion across four scales, where each output $Y^{C_i}_{M41}$ is built upon and enriched by information from the previous scales.

In contrast to the MSC41 block, the MSC42 block reduces the input processing while preserving the feature extraction across the four dimensions initially defined in the MSC41 block. The MSC42 blocks are expressed below,

$$\begin{aligned} {\left\{ \begin{array}{ll} Y^{C1}_{M421} = P(f^{n/2}_{C31}(f^{n/2}_{C31}(Y^{C1}_{M41})) \\ Y^{C2}_{M421} = f^{n/2}_{C31}(f^{n/2}_{C31}(f^{n/2}_{C12}(Y^{C2}_{M41}))\oplus P(Y^{C1}_{M42})) \\ Y^{C3}_{M421} = f^{n/2}_{C31}(f^{n/2}_{C31}(f^{n/2}_{C12}(Y^{C3}_{M41}))\oplus P(Y^{C2}_{M42})) \\ Y^{C4}_{M421} = f^{n/2}_{C31}(f^{n/2}_{C31}(f^{n/2}_{C12}(Y^{C4}_{M41}))\oplus P(Y^{C3}_{M42})) \end{array}\right. } \end{aligned}$$

(2)

$$\begin{aligned} {\left\{ \begin{array}{ll} Y^{C1}_{M422} = P(f^{n}_{C31}(f^{n}_{C31}(Y^{C1}_{M421})) \\ Y^{C2}_{M422} = f^{n}_{C31}(f^{n}_{C31}(f^{n}_{C12}(Y^{C2}_{M421}))\oplus P(Y^{C1}_{M422})) \\ Y^{C3}_{M422} = f^{n}_{C31}(f^{n}_{C31}(f^{n}_{C12}(Y^{C3}_{M421}))\oplus P(Y^{C2}_{M422})) \\ Y^{C4}_{M422} = f^{n}_{C31}(f^{n}_{C31}(f^{n}_{C12}(Y^{C4}_{M421}))\oplus P(Y^{C3}_{M422})) \end{array}\right. } \end{aligned}$$

(3)

where $Y^{Ci}_{M421} (i=1, 2, 3, 4)$ presents the i-th scale output of the first MSC42 block. $Y^{Ci}_{M422} (i=1, 2, 3, 4)$ indicates the i-th dimension output of the second MSC42 block. In particular, when the fourth channel of the second MSC42 block satisfies the output requirements, eliminating the need for further convolution.

The MSC3 and MSC2 blocks follow similar principles but operate on three and two dimensions, respectively. The expression of MSC3 and MSC2 blocks is as follows,

$$\begin{aligned} {\left\{ \begin{array}{ll} Y^{C1}_{M3} = P(f^{2n}_{C31}(f^{2n}_{C31}(Y^{C1}_{M422})) \\ Y^{C2}_{M3} = f^{2n}_{C31}(f^{2n}_{C31}(f^{2n}_{C12}(Y^{C2}_{M422}))\oplus P(Y^{C1}_{M3})) \\ Y^{C3}_{M3} = f^{2n}_{C31}(f^{2n}_{C31}(f^{2n}_{C12}(Y^{C3}_{M422}))\oplus P(Y^{C2}_{M3})) \end{array}\right. } \end{aligned}$$

(4)

$$\begin{aligned} {\left\{ \begin{array}{ll} Y^{C1}_{M2} = P(f^{4n}_{C31}(f^{4n}_{C31}(Y^{C1}_{M3})) \\ Y^{C2}_{M2} = f^{4n}_{C31}(f^{4n}_{C31}(f^{4n}_{C12}(Y^{C2}_{M3}))\oplus P(Y^{C1}_{M2})) \end{array}\right. } \end{aligned}$$

(5)

where, $Y^{Ci}_{M3} (i=1, 2, 3)$ represents the i-th dimension output of the MSC3 block. $Y^{Ci}_{M2} (i=1, 2)$ is the i-th scale output of the MSC2 block.

2.2 Overview of YOLO-MSD

To overcome the challenges of industrial surface defect detection in large-size images and trade off the recognition accuracy and model complex of YOLO, we propose a novel defect detection network YOLO-MSD. Figure 3 shows the framework of YOLO-MSD, which mainly consists of Backbone, Neck and Head. The detailed information about YOLO-MSD is provided below.

2.2.1 MSC blocks-based backbone

The main task of Backbone is to extract the deep features from input images. Hence, improving the feature-extracting capability of Backbone is primary. This study proposes a novel MSCNet for the Backbone, enabling multi-scale feature extraction and fusion. The left of Fig. 3 displays the structure of the Backbone. This study employs some novel MSC blocks to extract and fuse features from four scales. The MSC41 block is used to split the input into four dimensions and exchange information on these dimensions. Other MSC blocks aim to acquire deep features of the input image from multiple scales. Furthermore, the last three outputs of each scale are concatenated separately to generate three outputs with different resolutions and the three outputs are transmitted to the Neck for further feature fusion. This means that each output of the Backbone contains feature information about four dimensions, ensuring the YOLO-MSD adequately extracts features from the input image. The three outputs of the Backbone are expressed below,

$$\begin{aligned} \left\{ \begin{array}{l} Y_{B1} = f^{2n}_{C31}(Y^{C1}_{M3}\oplus Y^{C2}_{M422}\oplus Y^{C3}_{M421}\oplus Y^{C4}_{M41}) \\ Y_{B2} = f^{4n}_{C31}(Y^{C1}_{M2}\oplus Y^{C2}_{M3}\oplus Y^{C3}_{M422}\oplus Y^{C4}_{M421}) \\ Y_{B3} = f^{8n}_{C31}(Y_{CC}\oplus Y^{C2}_{M2}\oplus Y^{C3}_{M3}\oplus Y^{C4}_{M422}) \end{array}\right. \end{aligned}$$

(6)

$$\begin{aligned} Y_{CC} = f^{8n}_{C31}(f^{8n}_{C32}(Y^{C1}_{M2}) \end{aligned}$$

(7)

where $Y_{B1}$, $Y_{B2}$ and $Y_{B3}$ are the large, middle and small resolutions of the Backbone outputs, respectively. $Y_{CC}$ donates the output of CC block in Fig. 3.

2.2.2 SFPN-based neck

Current YOLO necks always suffer from a bloated architecture, bringing a huge computational burden and a complex structure. To address the problem, this research presents a slight SFPN framework for feature fusion. The middle of Fig. 3 shows the detailed construction of SFPN. For the last scale, due to the same scale output of the Backbone has already fully fused the four-scale features, we only use a CBS convolution group in the Neck part for further feature fusion. For the middle layer, to achieve feature fusion from three scales, we employ a combination of downsampling, CBS operations, and upsampling on the three neck inputs. Specifically, the first input undergoes downsampling, the last undergoes upsampling, and the middle input utilizes a CBS operation. The resulting features are then concatenated and fed into a final CBS operation for fusion. Moreover, for the first scale, two upsampling operations are utilized for the last input and the middle input, combining with the first input for feature fusion across three dimensions. The process of the Neck is represented by the following expression,

$$\begin{aligned} {\left\{ \begin{array}{ll} \begin{aligned} Y_{N1} & = f^{m}_{C11}(f^{m}_{C11}(Y_{B1})\oplus f^{m}_{up}(Y_{N2})\\ & \oplus f^{m}_{up}(f^{2m}_{up}(Y_{B3}))) \\ \end{aligned}\\ \begin{aligned} Y_{N2} & = f^{2m}_{C11}(f^{2m}_{C12}(Y_{B1})\oplus f^{2m}_{C11}(Y_{B2})\\ & \oplus f^{2m}_{up}(Y_{B3})) \\ \end{aligned} \\ Y_{N3} = f^{4m}_{C11}(Y_{B3}) \end{array}\right. } \end{aligned}$$

(8)

where the Neck outputs $Y_{N1}$, $Y_{N2}$, and $Y_{N3}$ represent large, medium, and small resolutions, respectively. $f^{m}_{up}$ denotes the upsampling process (including a CBS convolution group and an upsampling operation) with m output channels.

Compared with current YOLO necks, our proposal uses fewer convolution and pooling operations, which reduces the computational burden of Neck. In addition, we utilize upsampling operations to transmit features of the last scale to the other two scales. Through this approach, the SFPN framework achieves effective feature fusion while maintaining low computational complexity.

2.2.3 Anchor-free head

To achieve fast inference speed and reduce computational complexity, this research uses an anchor-free YOLO head and removes some convolution operations. Figure 4 displays the architecture of the Head. In detail, we utilize a CBS operation to integrate the input channels. Then, the input is divided into two segments: the first segment is responsible for classifying the defects, while the second segment determines the existence of these defects and localizes them. Moreover, the outputs are concatenated for one dimension head output. As a result, the three scales of the YOLO head enable YOLO-MSD to classify and localize defects of various sizes across different dimensions. The YOLO head is expressed as follows,

$$\begin{aligned} \begin{aligned} Y_{Hi}&= f^{nc}_{C}(f^{m}_{C11}(f^{m}_{C11}(Y_{Ni})))\\&\oplus f^{4}_{C}(f^{m}_{C11}(f^{m}_{C11}(Y_{Ni}))) \\&\oplus f^{1}_{C}(f^{m}_{C11}(f^{m}_{C11}(Y_{Ni}))) \end{aligned} \end{aligned}$$

(9)

where, $Y_{Hi} (i=1, 2, 3)$ presents the i-th output of the YOLO Head. $f^{nc}_{C}$ is the convolution operation with a $1\times 1$ kernel size, nc (number of classes) output channels and a stride of 1. $f^{4}_{C}$ is used to predict the bounding box of defects with four coordinates. $f^{1}_{C}$ aims to identify whether a defect exists.

We provide a notation list of the mathematical symbols, as shown in Table 1.

Table 1 Notation list of the mathematical symbols

YOLO-MSD: a robust industrial surface defect detection model via multi-scale feature fusion

Abstract

Similar content being viewed by others

YOLO-CEA: a real-time industrial defect detection method based on contextual enhancement and attention

Surface Defect Detection Algorithm Based on Feature-Enhanced YOLO

PCP-YOLO: an approach integrating non-deep feature enhancement module and polarized self-attention for small object detection of multiscale defects

Explore related subjects

1 Related work

1.1 Defect detection models

1.2 Multi-scale feature fusion structures

2 Methodology

2.1 MSC blocks

2.2 Overview of YOLO-MSD

2.2.1 MSC blocks-based backbone

2.2.2 SFPN-based neck

2.2.3 Anchor-free head

3 Evaluation

3.1 Experiment configuration

3.1.1 YOLO-MSD family

3.1.2 Evaluation metrics

3.1.3 Datasets

3.1.4 Implementation details

3.2 Experiments

3.2.1 Ablation experiments on the PCB dataset

Effectiveness evaluation of SFPN framework

Effectiveness evaluation of MSCNet architecture

3.2.2 Comparison experiments on the PCB dataset

3.2.3 Comparison experiments on the HRIPCB dataset

3.2.4 Comparison experiments on the GC10-DET dataset

3.2.5 Comparison experiments on the CRACK and NEU-DET datasets

3.2.6 Deployment on low-performance devices

3.3 Discussion

4 Conclusion

Data Availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflicts of Interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Profiles