Download
Download
Download
P. Padmapoorani1 , S. Senthilkumar1
KSR College of Engineering, Department of Civil Engineering, Namakkal, Tamilnadu , 637215, India
1
ABSTRACT
Cracks in concrete structures are caused due to contraction and expansion irregularities, from potential damages
caused in the buildings. These irregularities and damages are assessed by the engineers manually or through
identification and prediction models with machine learning techniques to evaluate the impact and significance
of the structural health in buildings. This research aims at applying machine learning based on VGG16-
Net model for the detection of cracks in concrete structures. The proposed model is of CNN (convolutional
neural network) + VGG neural network based architecture. The study uses the gradient boosting algorithm for
image segmentation. The datasets are obtained from “Kaggle” resource and the library used is ‘Hugging Face
Transformers’. To evaluate the developed models’ performance metrics such as “accuracy, precision, recall and
f1-score” are used. The ‘accuracy’ score obtained is compared against the ‘ViT’ (Google transformer) accuracy
rate, for comparison. The proposed model achieved 98% validation accuracy rate with 0.3% loss. Thus the
developed research contributes an innovative and a novel ML model that predicts and identifies the cracks in
concrete structures with less loss and higher accuracy with CNN architecture than ViT (vision transformer)
models. Current study also provides more input upon CNN being more accurate than ViT models for future
researchers for comparative analyses.
Keywords: VGG16-Net; concrete cracks; concrete non-cracks; convolutional neural network; structural health;
concrete structures; ViT; vision transformers.
1. INTRODUCTION
In the recent years the researches on the structural health of concrete has increased rapidly. The study of
infrastructures on the dams, buildings, bridges and roads have been done by the investigators for the health,
impact, mishandlings and other structural observations [1]. The expansion and contraction of the concrete
structures in the buildings have been observed as the primary reason to study the health of the concrete structures.
To identify these damages, cracks and breaks in the concrete structures the investigators have been adopting
and leaning towards machine learning models in the recent years than observing through direct and physical
investigation that requires more time, budget, labour and human intervention.
The machine learning based models in the concrete structural health analysis has been focused by
researchers to study the variations in the expansions, vibrations, features (frequency and spatial), contractions
and dampness [2]. Automatic detection of the concrete structures’ cracks and how it impacts the health of the
concrete structures have been recently focused with the “vision transformers” (ViT) as the core focus.
The studies by authors [3] and [4] focused on pavement cracks through ViT models where they insisted
that the Convolutional Neural-Networks (CNNs) as the base model network for identifying the cracks in
pavements which is evaluated and presumed to be the better and higher performing models in image segmenting
and classifying.
The CNN models in identifying the cracks in the images of pavements and concrete structures have been
identified as better model. However, study by [5] insisted that deep learning-based CNN models in detecting the
cracks with the aerial support of unmanned vehicles (UAV) as a combination as dual process is faster, robust and
reliable than detecting cracks in the images of random inputs. Digital images in the automated crack detection
models as inputs than videos in real-time have been attempted more to evaluate the detection models. The
measurement of the cracks in the concrete structures are calculated by considering various features, like width,
length, thickness, thinness, mass and depth of the crack [6]. Though the manual crack detection is measurable
DOI: https://doi.org/10.1590/1517-7076-RMAT-2023-0010
PADMAPOORANI, P.; SENTHILKUMAR, S., revista Matéria, v.28, n.1, 2023
it is not always a reliable outcome since there is a huge possibility of error and miscalculations through human
interventions. Henceforth, automatic detection of cracks in concrete structures and pavements through deep
learning, where advanced techniques like transformers are recently identified as ‘highly’ performing models
than the CNN models with better accuracy [7]. The models traditionally adopted NN models are ResNet50 [8],
AlexNet [9], VGG16-Net [10], LeNet [11], GoogleNet [12,13], MobileNets [14] and DenseNet models [15].
Initially vision transformers (ViT) were applied only in the natural language processing (NLP) by
replacing the RNN (recurrent neural-network) models with long-short-term memory (LSTM) approach. Later
they were utilized in question answering, language translations and text classifications [16] applications too. ViT
models have been tremendously dominating in SOTA (State-Of-The-Art) performance and efficiency with the
NLP datasets, than other categories. Similarly the vision transformers are also popularly used for the a dvantages
over CNN like its cost, processing time and speed. However there exists a debate that in certain areas, CNN
models are more efficient with accuracy than the transformers. The ViT thus could be assumed as, a major
contributor in the computer vision domains that is adopted for different applications like image segmentation,
video understanding, object detection and image detection in machine learning.
In this research the author will develop a traditional CNN model to detect cracks in the concrete structures
to examine its health with the main purpose as: “applying machine learning on concrete structures to monitor
and examine the health of concretes”. Simultaneously the research will also analyse and examine the objectives:
• To examine the accuracy achieved by a conventional model in detecting the cracks in concrete structures;
• To examine the accuracy achieved by a contemporary model in detecting the cracks in concrete structures;
• To compare the outcomes to weigh the most appropriate model based on cost, time, resource, accuracy rate
through metric evaluation.
2. LITERATURE REVIEW
The studies on the convolutional networks in detecting cracks in concrete structures and monitoring the concrete
structural health are primarily focused. Models that use CNN architecture in deep learning and vision based
(i.e. digital image segmentation and vision transformers) are studied as secondary studies as literary resources.
Contrarily, several authors [1–3,29,30] focused and examined exclusively upon the vision based crack
detection models. They claimed that though the traditional models are better in accuracy than the vision
transformer models that averagely produce outcomes that are of 95% accurate, the transformer models are rapid,
robust, incurs lesser costs and lesser time for computing and processing. Hence, the recent researchers have been
adopting the vision transformers as crack detection models that produce lesser accuracy than traditional models
in micro cracks, macro cracks, complex and closely-spaced crack detection in buildings, pavements and asphalt
blocks that are of concrete structures. To monitor the health in the concrete structures the authors have been
developing the prediction and identification models that also reduce the manual labour and errors that occur due
to human miscalculations, thus the automated machine learning based models are being adopted by eradicating
human intervention in the crack detection. Also, to make the models highly efficient and accurate the reliable
models like CNN, ANN, ResNet and Vision Transformers (ViT) have been adopted by the researchers more,
based on their necessity and research objectives.
2.3. Transformers
GEHRI et al. [6] used digital image correlation technique to identify and detect cracks in the images through
automated detection model where the model achieved higher accuracy in detecting even the complex cracks
than micro and macro cracks in the concrete structures. The authors used kinetic measurement technique and
achieved higher accuracy. However, the kinetic measurement method had biased outcomes in closely-spaced
cracks. Crack detection models that have bias in complex crack detection will result with irregular outcomes
and inconsistent results that could affect the research. Henceforth the authors [3] proposed a model later on
with ‘slope surface’ crack detection from images using deep learning. The authors found that vision transformer
model achieved 94% accuracy whereas the other models (LeNet, AlexNet, InceptionA, MobileNet, ResNet
and InceptionE) achieved higher accuracy as 99% each respectively. So, in the year 2021, authors [31] proposed
a vision transformer model (SOTA) through visual interpretation technique that uses CNN architecture.
The authors used CNN architecture model and ViT model with Chinese and German asphalt as samples.
The accuracy obtained by the ViT model with German samples was 99% and in the Chinese samples was
91%. The authors thus proved that ViT model achieves higher accuracy too with lesser cost and time for
processing the data.
Similarly, the authors [4] also proposed visual interpretation through deep learning and authors [7]
proposed a deep learning model for slope surface-based crack detection. Both studies compared CNN and
ViT architectures and found that CNN proved to be higher in accuracy score than ViT model. However, they
also insisted that, cost, time, labour and resources were also high in the CNN model along with its stacked-up
layers. Henceforth they concluded and claimed that ViT models are better in crack detection for small budget
researches with better accuracy and CNN models are costlier but highly efficient with accurate outcomes. YU
et al. [32] examined the damages in building structures using the deep learning CNN model and authors [33]
examined vision-based crack detection in concrete structures using the hybrid CNN model approach. Both studies
concluded that CNN model is accurate in crack detection. Later, the study by [34] utilized the optimization
algorithm (improved bird-swarm algorithm: IBSA) in CNN model and found it to be significantly accurate
and precise with minimal loss than earlier approaches. Similarly in another study authors [35] examined the
concrete structures with CNN model with enhanced chicken-swarm algorithm (ECSA) and found that, CNN
models achieve higher accuracy and performance in detecting cracks. Thus it is comprehensible that, utilizing
an optimization algorithm with CNN model certainly increases the accuracy than a hybrid or a normal CNN
model architecture.
Hence to prove these researches and how the samples might differ in the ViT model crack detection and
CNN crack detection models, the current study aims at comparing the outcomes of the CNN and ViT models
based on accuracy, computing cost and processing time.
3. PROPOSED METHOD
The proposed method for the developed research includes different processing phases in this research. The flow
of the research (refer to Figure 1) is:
The model developed includes stages where the concrete structural image is inputted to the model. Then
the image is later on pre-processed, classified as cracks and non-cracks and the outcomes/results are segregated
and the final output obtained is stored and the models are compared for the accuracy as the metric evaluation.
next a linear layer of 256 batch size of output featuring 2. The final layer is the Softmax as the seventh layer
(refer to Figure 3).
The transformer model is applied on datasets where the images of the concrete structures are focused
and detected for the cracks and non-cracks in the images. The detected images are then classified into separate
folders with classes “cracks” and “non-cracks”.
ea x
( ) a (1)
Cy 1 e a y
Where:
δ = Softmax, ν– = Input vector, C = total classes in multi-class classifier, e a x = input vector’s standard exponential
function and e a y = output vector’s standard exponential function.
through the layers, it is then passed on to the dropout layer and fully connected layer. Finally, the Softmax layer
is connected and the outcome is processed and segregated post classification as “cracks” and “non-cracks”.
mn bn
xn+1 = xn − __
√ ν̂ n
By using the AOA-algorithm the researcher uses the model to identify and classify the concrete structures
and classify them as cracked and non-cracked structures.
4.2. Datasets
The images of the cracked and non-cracked cement structures as the datasets are acquired from the resource
“Kaggle” by [37]. The total datasets accumulate to images of 20 thousand and more concrete structures from
buildings, bridges and pavements. Among which the research has used the 70% (14000:14000 for cracks and
non-cracks as image classification) for training and the rest 30% datasets (6000:6000 for both cracks and non-
cracks image classification) for testing and validation. Thus, the split-ratio for the obtained dataset is categorized
as 70:20:10 for training, testing and validation respectively.
The VGG16-Net model is trained with 70% datasets and the loss is evaluated for the model where the
cross-entropy is used. The loss functions are:
Inference:
The evaluation loss (refer to Figure 4) was observed as high (0.47) at the initial epoch training. Later around the
80th epoch the loss reduced from 0.47 to 0.35. However, the loss fluctuated from 125th epoch till 275th epoch and
remained reducing. The loss attained the value 0.34 at 575th epoch and remained same until 600th.
The training loss (refer to Figure 5) of the model is evaluated and the loss is observed initially at the 47th
epoch at 0.59. The loss reduced from 0.58 to 0.38 at 75th epoch. Later on, the loss fluctuated and at epoch 380th
the loss reduced at 0.36. Finally at the epoch 600th the loss value reduced to 0.356.
4.4.1. F1-score
The F1-score for the model is calculated through:
2 × (Precision × Recall)
F1-Score = (2)
Precision + Recall
From Figure 6 the highest F1-Score has been observed as 0.985 at the 600th epoch.
PADMAPOORANI, P.; SENTHILKUMAR, S., revista Matéria, v.28, n.1, 2023
4.4.2. Recall
The Recall for the model is calculated through:
TruPos
Recall = (3)
TruPos + FalNeg
Where: TruPos denotes true-positives, TruFal denotes true-negatives, FalNeg denotes false-negatives and Fal-
Pos denotes false-positives.
From Figure 7 the highest recall has been observed as 0.991 from 360th to 600th epoch.
4.4.3. Precision
The Precision for the model is calculated through:
TruPos
Precision = (4)
TruPos + FalPos
From Figure 8 the highest precision has been observed as 0.98 at the 600th epoch.
4.4.4. Accuracy
The Accuracy for the model is calculated through:
TruPos + TruNeg
Accuracy = (5)
TruPos + FalPos + TruNeg + FalNeg
From Figure 9 the highest accuracy has been observed as 0.98 at the 600th epoch.
The final training and testing of the model was done with 1261 samples for training and 239 for valida-
tion with 32 batch sizes and 165 samples for testing with 16 batch sizes. The epoch size was 10.
The loss from Table 1 could be observed where at the 10th epoch the accuracy increased as the losses
decreased for both training and validation.
TIME/STEP
TRAINING- TRAINING- VALIDATION- VALIDATION-
(IN SECOND AND
LOSS ACCURACY LOSS ACCURACY
MILLISECONDS)
1 26s 675ms 0.5381 0.7697 0.5768 0.6875
2 30s 767ms 0.3971 0.8039 0.4241 0.7500
3 23s 571ms 0.3396 0.8446 0.3104 0.8616
4 24s 605ms 0.2256 0.8999 0.3926 0.8616
5 22s 562ms 0.2266 0.9056 0.2000 0.9152
6 22s 561ms 0.1340 0.9504 0.1427 0.9330
7 22s 571ms 0.1099 0.9536 0.2263 0.8973
8 22s 562ms 0.1422 0.9479 0.1233 0.9598
9 22s 563ms 0.0975 0.9642 0.0845 0.9598
10 22s 560ms 0.0747 0.9715 0.1093 0.9598
PADMAPOORANI, P.; SENTHILKUMAR, S., revista Matéria, v.28, n.1, 2023
Crack Crack
Crack Crack
Non-crack Crack
Crack Crack
Non-crack Non-crack
Non-crack Non-crack
Non-crack Non-crack
Non-crack Non-crack
PADMAPOORANI, P.; SENTHILKUMAR, S., revista Matéria, v.28, n.1, 2023
The validation accuracy as the metric for evaluating the performance of the models ‘ViT’ and ‘VGG16-
Net’ is measured (refer to Table 2). The ViT model achieved 96.6% and the VGG16-Net model achieved 95.98%
validation accuracy.
The training of the VGG16-Net model was carried out by increasing the epochs from 10 to 30 with
16 batch size and learning rate 0.1.
From Table 5 it can be observed that the loss has been reduced from 0.65 to 0.35 from epoch 1 to epoch 30.
Figure 10 illustrates the comparison of the models ViT (vision transformer) and VGG16-Net (CNN).
From the observation it is understood that, the outcomes obtained from the VGG-16 model for identifying and
detecting the cracked and non-cracked concrete structures are better than the transformer model. The evaluation
metrics shows that accuracy, recall and f1-scores are higher in VGG16+CNN model than ViT model; whereas
the precision of ViT model is somewhat similar to the VGG16 model. This finding is similar to conclusions of
the studies by authors [3,6,12,13] where CNN models are better in accuracy than ViT models.
From Figure 11 it is apparent that the developed VGG16 model acquired more accuracy than existing
convolutional neural networking models like AlexNet, GoogleNet, InceptionNet and ResNet.
5. FINDINGS
The findings from the evaluated outcomes are presented in Table 6.
• The scores of the VGG16-Net model are: precision at 98.0%, accuracy at 98.0%, f1-score at 98.50% and
recall at 99.10%.
• The scores of the ViT model are: precision at 97.30%, accuracy at 92.72%, recall at 92.50% and f1-score at
94.87%.
• The validation accuracy of the ViT model (96.6%) and the VGG16-Net model (95.98%) was found to
be 96%.
Hence it is found that when the computing cost, scheduling time, processing time, constructing stacked-up
layers and other resources are considered ViT crack detection model is better however when the accuracy,
precision, recall and f1-score are considered the CNN crack detection model is better.
Limitations: The study aims at comparing the CNN based VGG16 detection model against the ViT
model and thus other architectures are neglected. Since the study utilizes the resources from kaggle, the UAV
(unmanned aerial-vehicles) based researches are not studied. Thus, the research exclusively focuses on the pur-
pose and limits the focus on vision transformer and VGG16 models alone.
7. REFERENCES
[1] YUAN, G., LI, J., MENG, X., et al., “CurSeg: a pavement crack detector based on a deep hierarchical
feature learning segmentation framework”, IET Intelligent Transport Systems, v. 16, n. 6, pp. 782–799,
2022. doi: http://dx.doi.org/10.1049/itr2.12173.
[2] PARAMANANDHAM, N., KOPPAD, D., ANBALAGAN, S., “Vision based crack detection in concrete
structures using cutting-edge deep learning techniques”, TS. Traitement du Signal, v. 39, n. 2, pp. 485–492,
2022. doi: http://dx.doi.org/10.18280/ts.390210.
[3] CHEN, Y., GU, X., LIU, Z., et al., “A fast inference vision transformer for automatic pavement image
classification and its visual interpretation method”, Remote Sensing, v. 14, n. 8, pp. 1877, 2022. doi: http://
dx.doi.org/10.3390/rs14081877.
[4] XU, Z., GUAN, H., KANG, J., et al., “Pavement crack detection from CCD images with a locally enhanced
transformer network”, International Journal of Applied Earth Observation and Geoinformation, v. 110,
pp. 102825, 2022. doi: http://dx.doi.org/10.1016/j.jag.2022.102825.
[5] SILVA, W.R.L.D., LUCENA, D.S.D., “Concrete cracks detection using deep learning image classification”,
Proceedings, v. 2, n. 489, pp. 1–6, 2018.
[6] GEHRI, N., MATA-FALCON, J., KAUFMANN, W., “Automated crack detection and measurement based
on digital image correlation”, Construction & Building Materials, v. 256, pp. 119383, 2020. doi: http://
dx.doi.org/10.1016/j.conbuildmat.2020.119383.
[7] YANG, Y., MEI, G., “Deep transfer learning approach for identifying slope surface cracks”, Applied
Sciences, v. 11, n. 23, pp. 11193, 2021. doi: http://dx.doi.org/10.3390/app112311193.
[8] HE, K., ZHANG, X., REN, S., et al., “Deep residual learning for image recognition”, IEEE Proceedings
on Computer Vision and Pattern Recognition (CVPR), v. 2021, pp. 770–778. 2016.
[9] KRIZHEVSKY, A., SUTSKEVER, I., HINTON, G.E., “ImageNet classification with deep convolu-
tional neural networks”, Communications of the ACM, v. 60, n. 6, pp. 84–90, 2017. doi: http://dx.doi.
org/10.1145/3065386.
[10] SIMONYAN, K., ZISSERMAN, A., “Very deep convolutional networks for large-scale image recogni-
tion”, In: Proceedings of the 3rd International Conference of Learning Representations (ICLR), pp. 1–12,
2015.
[11] LECHUN, Y., BOTTOU, L., BENGIO, Y., et al., “Gradient - based Learning Applied to Document
Recognition”, IEEE Proceedings, v. 86, pp. 2278–2323, 1998.
[12] SZEGEDY, C., LIU, W., JIA, Y., et al., “Going deeper with convolutions”, IEEE Proceedings of Computer
Vision and Pattern Recognition (CPVR), v. 2015, pp. 1–9. 2015.
[13] SZEGEDY, C., VANHOUCKE, V., LOFFE, S., et al., “Rethinking the inception architecture for computer
vision”, IEEE Proceedings: Computer Vision and Pattern Recognition (CVPR), v. 2016, pp. 2818–2826.
2016.
[14] HOWARD, A.G., ZHU, M., CHEN, B., et al., “MobileNets: efficient convolutional neural
networks for mobile vision applications, 2017, https://www.semanticscholar.org/reader/3647d6d0f-
151dc05626449ee09cc7bce55be497e, accessed in August, 2020.
[15] HUANG, G., LIU, Z., VAN DER MAATEN, L., et al., “Densely connected convolutional networks”,
IEEE Proceedings on Computer Vision and Pattern Recognition (CVPR), v. 2017, pp. 2261–2269. 2017.
[16] FANG, W., LUO, H., XU, S., et al., “Automated text classification of near-misses from safety reports:
an improved deep learning approach”, Advanced Engineering Informatics, v. 44, pp. 101060, 2020. doi:
http://dx.doi.org/10.1016/j.aei.2020.101060.
[17] ISLAM, M.M.M., KIM, J.-M., “Vision-based autonomous crack detection of concrete structures using a
fully convolutional encoder–decoder network”, Sensors, v. 19, n. 19, pp. 4251, 2019. doi: http://dx.doi.
org/10.3390/s19194251. PubMed PMID: 31574963.
[18] KIM, J.J., KIM, A.-R., LEE, S.-W., “Artificial neural network-based automated crack detection and analysis
for the inspection of concrete structures”, Applied Sciences, v. 10, n. 22, pp. 8105, 2020. doi: http://dx.doi.
org/10.3390/app10228105.
PADMAPOORANI, P.; SENTHILKUMAR, S., revista Matéria, v.28, n.1, 2023
[19] ARBAOUI, A., OUAHABI, A., JACQUES, S., et al., “Concrete cracks detection and monitoring using
deep learning-based multiresolution analysis”, Electronics, v. 10, n. 15, pp. 1772, 2021. doi: http://dx.doi.
org/10.3390/electronics10151772.
[20] DORAFSHAN, S., THOMAS, R.J., MAGUIRE, M., “SDNET2018: an annotated image dataset for
non-contact concrete crack detection using deep convolutional neural networks”, Data in Brief, v. 21,
pp. 1664–1668, 2018. doi: http://dx.doi.org/10.1016/j.dib.2018.11.015. PubMed PMID: 30505897.
[21] ADAM, E.E.B., SATHESH, A., “Construction of accurate crack identification on concrete structure using
hybrid deep learning approach”, Journal of Innovative Image Processing, v. 3, n. 2, pp. 85–99, 2021. doi:
http://dx.doi.org/10.36548/jiip.2021.2.002.
[22] PARK, M.J., KIM, J., JEONG, S., et al., “Machine learning-based concrete crack depth prediction using
thermal images taken under daylight conditions”, Remote Sensing, v. 14, n. 9, pp. 2151, 2022. doi: http://
dx.doi.org/10.3390/rs14092151.
[23] ALI, L., ALNAJJAR, F., JASSMI, H.A., et al., “Performance evaluation of deep CNN - based crack
detection and localization techniques for concrete structures”, Sensors, v. 21, n. 5, pp. 1688, 2021. doi:
http://dx.doi.org/10.3390/s21051688.
[24] COCA, L.-G., CUSMULIUC, C.G., IFTENE, A., “Automatic tarmac crack identification appli-
cation”, Procedia Computer Science, v. 192, pp. 478–486, 2021. doi: http://dx.doi.org/10.1016/j.
procs.2021.08.049.
[25] HALLEE, M.J., NAPOLITANO, R.K., REINHART, W.F., et al., “Crack detection in images of masonry
using CNNs”, Sensors, v. 21, n. 14, pp. 4929, 2021. doi: http://dx.doi.org/10.3390/s21144929. PubMed
PMID: 34300668.
[26] REHUMAAN, S.F.K., “Modelling of cracked concrete and identification of design parameters using static
non-linear analysis”, Civil Engineering and Architecture, v. 10, n. 2, pp. 584–599, 2022. doi: http://dx.doi.
org/10.13189/cea.2022.100216.
[27] SU, C., WANG, W., “Concrete cracks detection using convolutional neural network based on transfer
learning”, Mathematical Problems in Engineering, v. 2020, pp. 1–10, 2020.
[28] ZHANG, H., LIU, C., HO, J., et al., “Crack detection based on convnext and normalization”, Journal
of Physics: Conference Series, v. 2289, n. 1, pp. 012022, 2022. doi: http://dx.doi.org/10.1088/1742-
6596/2289/1/012022.
[29] REN, Y., HUANG, J., HONG, Z., et al., “Image-based concrete crack detection in tunnels using deep
fully convolutional networks”, Construction & Building Materials, v. 234, pp. 117367–117378, 2020. doi:
http://dx.doi.org/10.1016/j.conbuildmat.2019.117367.
[30] SATHYA, K., SANGAVI, D., SRIDHARSHINI, P., et al., “Improved image based super resolution and
concrete crack prediction using pre-trained deep learning models”, Journal of Soft Computing in Civil
Engineering, v. 4, n. 3, pp. 40–51, 2020.
[31] DOSOVITSKIY, A., BEYER, L., KOLESNIKOV, A., et al., “An image worth 16*16 words: transformers
for image recognition at scale”, ICLR Conference, v. 2021, pp. 1–22, 2021.
[32] YU, Y., WANG, C., GU, X., et al., “A novel deep learning-based method for damage identification of
smart building structures”, Structural Health Monitoring, v. 18, n. 1, pp. 143–163, 2019. doi: http://dx.doi.
org/10.1177/1475921718804132.
[33] YU, Y., SAMALI, B., RASHIDI, M., et al., “Vision-based concrete crack detection using a hybrid
framework considering noise effect”, Journal of Building Engineering, v. 61, pp. 105246, 2022. doi:
http://dx.doi.org/10.1016/j.jobe.2022.105246.
[34] YU, Y., LIANG, S., SAMALI, B., et al., “Torsional capacity evaluation of RC beams using an improved
bird swarm algorithm optimised 2D convolutional neural network”, Engineering Structures, v. 273,
pp. 115066, 2022. doi: http://dx.doi.org/10.1016/j.engstruct.2022.115066.
[35] YU, Y., RASHIDI, M., SAMALI, B., et al., “Crack detection of concrete structures using deep convolu-
tional neural networks optimized by enhanced chicken swarm algorithm”, Structural Health Monitoring,
v. 21, n. 5, pp. 2244–2263, 2022. doi: http://dx.doi.org/10.1177/14759217211053546.
[36] VASWANI, A., SHAZEER, N., PARMAR, N., et al., “Attention is all you need”, In: Proceedings of
the 31st International Conference on Neural Information Processing Systems, v. 30, pp. 5998–6008,
2017.
[37] SRIKAR, B., “Concrete crack images for classification”, 2019, https://www.kaggle.com/datasets/
thesighsrikar/concrete-crack-images-for-classification, accessed in August, 2020.
PADMAPOORANI, P.; SENTHILKUMAR, S., revista Matéria, v.28, n.1, 2023
[38] SHARMA, N., DHIR, R., RANI, R., “Crack detection in concretes using transfer learning”, Advances in
Mathematics: Scientific Journal, v. 9, n. 6, pp. 3895–3906, 2020.
[39] RAJADURAI, R.-S., KANG, S.-T., “Automated vision-based crack detection on concrete surfaces using
deep learning”, Applied Sciences, v. 11, pp. 5229, 2021. doi: http://dx.doi.org/10.3390/app11115229.
[40] MENG, X., “Concrete crack detection algorithm based on deep residual neural networks”, Scientific
Programming, v. 2021, pp. 3137083, 2021. doi: http://dx.doi.org/10.1155/2021/3137083.