Medical Image Retrieval Based On Latent Semantic Indexing
Medical Image Retrieval Based On Latent Semantic Indexing
Medical Image Retrieval Based On Latent Semantic Indexing
II.
A. Color histogram
This paper adopts one dimension color histogram based on
HSV color space. We can shift RGB color space into HSV
color space [2] to obtain h [0,360] , s [ 0,1] , v [0,1] .
Red is the main color of gastroscopic image [3] while
yellow and green are the color of gastric cancer cell. In the
converted HSV space, component h concentrates on [0, 100]
and [300,360], and s and v components distribution are
relatively more homogeneous. According to above-mentioned
characteristics, we quantize the h component into 16 ranks
nonuniformly, and quantize the s and v components into 4
ranks uniformly.
I.
INTRODUCTION
c( k,c) ( I ) =
i
Pr [ p2 I c
p1I ci
p2 I
(1)
| p1 p 2 = k ]
LOW-LEVEL FEATURES
c(k,c) ( I ) = p1 I ci , p 2 I c j | p1 p 2 = k
i
561
(2)
c(ik,c) j ( I )
, where
8khci ( I )
.
The
denominator
is
the total number
hci ( I ) = n Pr [ p I Ci ]
pI
1
log M
fij
fij
F log F .
j =1
A. Normalization [6]
The purpose of normalization is to let each component of
feature vector get the same importance.
m
Figure 1.
m
Ak
Uk
k
k
k
VkT
B. Term weighting
In text retrieval field, researchers always use term
weighting technology to set the index item different
significance so as to improve the performance of retrieval
system. Apply this technology on to image retrieval field,
suppose there are M images I1 , I 2 ," , I M , we extract K feature
items w1 , w2 ," , wK . In the image I j , the feature item wi s
D. Similarly metric
We use cosine distance to measure the distance between
query images semantic vector q and the images semantic
562
IV.
aTj q
(3)
aj q
10
15
20
25
30
35
40
45
Precision
75.99%
82.88%
83.55%
82.00%
83.33%
83.99%
83.32%
83.77%
83.11%
Average-r
11.14
9.20
7.85
7.50
7.32
7.31
7.38
7.27
7.31
Average-p
0.810
0.884
0.886
0.895
0.899
0.900
0.898
0.900
0.899
50
60
80
100
120
140
160
200
256
82.66%
83.10%
83.11%
83.11%
83.11%
83.11%
83.11%
83.11%
83.11%
7.37
7.39
7.45
7.47
7.46
7.46
7.46
7.46
7.46
0.896
0.896
0.894
0.894
0.894
0.894
0.894
0.894
0.894
TABLE II.
Precision
58.90%
83.11%
75.77%
73.78%
70.89%
68.23%
67.11%
67.11%
68.89%
45
Average-r
11.24
6.40
7.14
7.55
7.92
8.12
8.51
8.63
8.41
Average-p
0.741
0.919
0.880
0.859
0.837
0.821
0.809
0.802
0.808
50
60
80
100
120
140
160
200
256
68.45%
68.00%
67.34%
67.77%
67.12%
67.12%
67.12%
67.12%
67.12%
8.59
8.48
8.48
8.59
8.67
8.72
8.69
8.69
8.69
0.805
0.805
0.804
0.794
0.793
0.793
0.795
0.795
0.795
563
20
0
14
0
Dimension k
10
0
60
45
35
25
15
Precision
col or hi st ogr am
1
0. 8
0. 6
0. 4
0. 2
0
Color histogram
Precision
Raw data
Normalize
Normalize
Weighted
Normalize, Weighted,
SVD(k=30)
Raw data
Normalize
Normalize
Weighted
Normalize, Weighted,
SVD(k=10)
56.23%
70.21%
83.11%
83.99%
56.67%
47.78%
67.12%
83.11%
Average-r
12.86
8.01
7.46
7.31
9.74
12.34
8.69
6.40
Average-p
0.689
0.849
0.894
0.900
0.726
0.635
0.795
0.919
Pr eci si on
col or hi st ogr am
1
0. 8
0. 6
0. 4
0. 2
0
Color autocorrelogram
VI.
Raw dat a
CONCLUSIONS
REFERENCES
[1]
[2]
[3]
[4]
[5]
[6]
564
Mustafa O, Ediz P. A color image segmentation approach for contentbased image retrieval. Pattern Recognition, 2007.40(4):1318-1325
Naoto K, Yasuo M. Database retrieval for similar images using ICA and
PCA bases. Engineering Applications of Artificial Intelligence,
2005.18(6):705-717
Fang YCZ, Bang TM, and Chuan KS. Endoscope diagnosis and
differential diagnosis map. LiaoNing Science and Technology
Publishing House,2003.7
Adam W, Peter Y. Content-based image retrieval using joint
correlograms. Multimedia Tools and Applications, 2007.34(2):239-248
Zhao R, Grosky W I. Negotiating the semantic gap: From feature maps
to semantic landscapes. Pattern Recognition, 2002, 35:593-600
Tai XY, Bei YE. Introduction to information retrieval technology.
BeiJingScience Press,2006