Towards Accurate Classification of Miniature Images
Towards Accurate Classification of Miniature Images
Towards Accurate Classification of Miniature Images
org/%C3%B6nceki-kongre-ki-taplari
Latin America 5th International Conference on Scientific Researches
March 17-19, 2023 - Medellin
ABSTRACT
Miniatures are small images drawn on manuscripts to describe and depict the subject of the
manuscripts in a visual manner. In this way, they strengthened the narration of the manuscripts
and supported a better grasp of the narration. They can be considered as historical documents
as they depict many contemporary subjects and events such as victories, dynastic histories,
palace celebrations or patron travels from the period in which they were made. Over the
centuries, many Ottoman masters have trained and produced valuable works on various and
rich subjects. Most of these works have survived to the present day.
It seems possible to extract meaningful and useful information from miniature arts via the image
processing and analysis methods that benefit from artificial intelligence and computational
abilities. This study aimed to automatically classify miniature works, in other words, to identify
the masters of the works from images with deep learning. In this context, a deep learning
algorithm was developed and trained to learn to identify the masters of the works from images.
The algorithm predicted the craftsmen of the miniatures with very high accuracy. Namely, the
algorithm has achieved 0.9722 categorical accuracy, 0.9706 Precision, 0.9167 Recall and
0.9968 AUC (area under the curve) scores. This study has shown that useful information can
be easily and successfully uncovered by processing and analyzing these historical arts.
1. INTRODUCTION
Miniatures are small-sized images intended to visually describe and depict the content subject
in manuscripts. Miniature paintings are made to provide a better understanding of what is told
in the texts or to strengthen the narration.
They generally depict portraits, lives of sultans, festivals, historical events, life style, nature
and city views, literary works, religious subjects, traditions and customs, women and men,
and creatures such as animals and plants [1].
Miniature art was used by the Ottomans from the 14th to the early 18th centuries, and by the
Safavid Empire in Iran in the 16th and 17th centuries [2]. Miniatures depict many
contemporary subjects from the period in which they were made. For example, great victories,
dynastic history, palace celebrations or patron travels. In this respect, miniatures are historical
documents that carry the events to the present, and therefore they can be used as evidence in
the study of social and political history.
During the Ottoman period, many miniature artists were trained for centuries. These artists
have produced very valuable works on various and rich subjects. Sometimes they depict court
entertainment and activities, sometimes wars, nature, daily life or social events. Today, a
significant portion of the images of these works have been widely accessible through various
researches or sources.
On the other hand, the new methods and techniques in software and artificial intelligence have
made it possible to process and analyze images, produce meaningful information, make
various inferences, and obtain many other useful results. Character recognition from images
[3], text reading [4], transcription [5], and translation [6], object and pattern recognition [7],
image classification [8], etc., many jobs can be performed quickly, easily and automatically. It
is very possible for miniature works to benefit from these methods and techniques. Thus,
many works that would be very difficult, time-consuming or almost impossible to achieve
without these methods and techniques can be performed effortlessly.
For example, automatic detection of the craftsmen of the miniatures from the images can be
very useful in terms of information retrieval and automatic data processing. Such a system
will pave the way not only for the artist's identity information, but also for obtaining many
more information about the work by making use of visuals.
In this study, the craftsmen of the Ottoman miniatures were tried to be identified
automatically from the images. For this purpose, a deep learning algorithm has been designed
and tailored to achieve the best performance. The algorithm was trained and extensively
tested with the images of miniature works of four different craftsmen. The algorithm achieved
very appealing success in classification. Thus, this study showed that the artists of the
miniatures can be easily identified from miniature images with very high accuracy.
2. EXPERIMENT
Miniatures contain various differences depending on factors such as the period in which they
were made, the subject to be depicted, or the style of the artist depicting them. For example,
artists have created works using different textures, motifs, techniques and a naturalist and/or
realistic styles [9]. These differences highlight the intrinsic and distinctive features that
distinguish the works from others.
In this study, it is aimed to identify the artists of the miniatures by exploiting these distinct
features. For this purpose, a convolutional neural network (CNN) with several different kind
of layers was designed and fine-tuned to achieve the best performance in identifying the
masters of the miniatures. The architecture of the model is visualized in the Figure 1.
2.1. Dataset 0
The visualization of the training (categorical accuracy and loss) is given in Figure 1. The
training of the network was carried on for a maximum of 1000 epochs. Batch size was 16. The
learning rate was initialed with 1E-3 and halved if there was no improvement in the
performance of the network for 100 epochs. Training was terminated when the network failed
to progress in learning for 500 epochs. A validation test was performed with 1/10 of the
training data at the end of each epoch to determine whether the network was over-fitting or
learning efficiently.
Levni Rumuzi Seyyid Lokman
3. RESULTS
The sub figures 3(a) and 3(b) show the categorical accuracy and losses, respectively, for the
training and validation. In the graphs, the blue and orange lines visualize the training and
validation, respectively. As the network could no longer continue to learn, the training was
terminated around 900th epochs. According to the graphs, the network displayed a very
successful education graph. As the training continued, the losses and classification
performances approached 0 and 1, respectively, and remained fairly stable at these levels. The
validation curve was quite fluctuating at the beginning of the training, while it became stable
in the later stages and flat at the end. As a result, the network appears to have learned
successfully. The closeness of training and validation scores also proves the learning success
of the network.
epoch epoch
(a) (b)
Figure 3. Training Performances of Algorithm. (a) Categorical Accuracy, (b) Categorical Loss.
Table 2. Classification Performance of the Model in Categorical Accuracy, Precision, Recall and
Area Under Curve (AUC)
Categorical
Accuracy Precision Recall AUC
0.9722 0.9706 0.9167 0.9968
The effects of applying (or vice versa) Batch Normalization and Max Pooling techniques on
the performance of the algorithm were also investigated. The results are given in Table 3. As
can be seen, the classification performance of the algorithm is significantly reduced when
these techniques are not applied individually or together. Especially when Max Pooling
technique is not applied, the number of parameters of the network increased from thousands
to millions and accordingly the need for memory and computational power has increased
exponentially.
Table 3. Classification Performance of the Model with and without Batch Normalization and
Max Pooling.
BatchNorm. MaxPooling Accuracy Params#
Yes Yes 0.9722 1,556
Yes No 0.7778 16.78M
No Yes 0.7500 1,428
No No 0.7778 8.39M
To further examine the excellence in classification of the algorithm, the Receiver Operating
Characteristic (ROC) and Precision-Recall curves are also given in Figure 5. Both graphs
clearly demonstrate the excellence of the algorithm in classification. All values are extremely
close to 1.
(a) (b)
Figure 5. (a) Receiver Operating Characteristic (ROC) and (b) Precision-Recall Curves for
Classification.
4. CONCLUSION
This study tried to classify the miniature works of the Ottoman period with deep learning
techniques. In this context, the focus is on identifying the craftsmen of the miniature works
from their images. A deep network specially developed for classification was trained from
sample miniature images, and asked to predict the craftsmen of the miniatures that it had not
seen before. In the experiments, the algorithm achieved a very high and promising
classification accuracy. The algorithm showed 0.9722 categorical accuracy, 0.9706 Precision,
0.9167 Recall and 0.9968 area under the curve (AUC) values. This study showed that these
miniatures can easily and automatically be classified with high accuracy.
As a result, this study has revealed that useful information can easily and successfully be
uncovered from these valuable historical arts by image processing and analysis techniques of
artificial intelligence.
REFERENCES
[1] Akdeniz Sanat, 6, 11, 2013.
[2] Asl, M. M., A Comparative Analysis of Factors Influencing the Evolution of Miniature