ScoreFusion Eg08 Final Lowres
ScoreFusion Eg08 Final Lowres
ScoreFusion Eg08 Final Lowres
19
I. Pratikakis and T. Theoharis (Editors)
Abstract
In this work, we introduce a score fusion scheme to improve the 3D object retrieval performance. The state of
the art in 3D object retrieval shows that no single descriptor is capable of providing fine grain discrimination required by prospective 3D search engines. The proposed fusion algorithm linearly combines similarity information
originating from multiple shape descriptors and learns their optimal combination of weights by minimizing the
empirical ranking risk criterion. The algorithm is based on the statistical ranking framework [CLV07], for which
consistency and fast rate of convergence of empirical ranking risk minimizers have been established. We report the
results of ontology-driven and relevance feedback searches on a large 3D object database, the Princeton Shape
Benchmark. Experiments show that, under query formulations with user intervention, the proposed score fusion
scheme boosts the performance of the 3D retrieval machine significantly.
Categories and Subject Descriptors (according to ACM CCS): H.3.3 [Information Search and Retrieval]: Retrieval
Models I.5.1 [Models]: Statistical
1. Introduction
Next generation search engines will enable query formulations, other than text, relying on visual information encoded
in terms of images and shapes. The 3D search technology,
in particular, targets specialized application domains ranging from computer aided design to molecular data analysis. In this search modality, the user picks a query from a
catalogue of 3D objects and requests from the retrieval machine to return a set of "similar" database objects in decreasing relevance. 3D object retrieval hinges on shape matching,
that is, determining the extent to which two shapes resemble each other. Shape matching is commonly done by reducing the characteristics of the shapes to vectors or graph-like
data structures, called shape descriptors [BKS 05, TV04,
IJL 05], and then, by evaluating the similarity degrees between the descriptor pairs. We call the similarity degree between two descriptors as the matching score between two
shapes. In the retrieval mode, the matching scores between
a query and each of the database objects are sorted. The retrieval machine then displays database objects in descending
order of the scores. Effective retrieval means that the objects
displayed in the upper part of the list better match the query
object than the rest of the list.
Ongoing research in 3D object retrieval shows that no
submitted to Eurographics Workshop on 3D Object Retrieval (2008)
imizers have been established. Third, our algorithm operates on scores, and not on descriptors themselves. This adds
greater generality and flexibility to our algorithm for a broad
spectrum of retrieval applications.
The paper is organized as follows. In Section 2, we introduce the score fusion problem and give a solution based
on support vector machines (SVM) [HTF01]. We also explain the use of our score fusion algorithm in two different
protocols, bimodal and two-round searches, which can be
viewed as particular instances of ontology-driven search and
relevance feedback respectively. In Section 3, we give an
overview of the density-based framework [ASYS07a] that
we use for shape description. In Section 4, we experiment
on PSB [SMKF04] and show the degree by which we can
boost the retrieval performance of density-based shape descriptors using the proposed score fusion algorithm. In the
final Section 5, we conclude and discuss further research directions.
2. Score Fusion by Ranking Risk Minimization
2.1. The Score Fusion Problem
Consider the problem of ranking two generic database
shapes x and x0 based on their relevance to a query shape q.
Suppose that we have access to K similarity values sk and s0k
for each of the pairs (x, q) and (x0 , q) respectively. These K
similarity measures can be obtained from different descriptor sets and/or by using different metrics operating on the
same set of descriptors. In our context, a similarity value
sk , simk (x, q) arises from a certain shape descriptor and reflects some, possibly different, geometrical and/or topological commonality between the database shape x and the query
shape q. An ideal similarity measure should score higher for
similar shape pairs as compared to less similar ones. In retrieval problems, a shape x in the database that is more similar to the query q is expected to be ranked higher than any
other intrinsically less similar shape x0 . These similarity values/scores can be written more compactly in the vector form
as s = [s1 , . . . , sK ] RK . Our objective is to build a scalarvalued final scoring function of the form (x, q) = hw, si,
where w = [w1 , . . . , wK ] RK is a vector, whose components
form the weights of the corresponding scores sk . The scoring
function should assign a higher score to the more relevant
shape, i.e., it should satisfy the following property:
(x, q) > (x0 , q)
(x, q) < (x0 , q)
if y y0 > 0,
if y y0 < 0.
(2)
m<n
1
ERR(w; q),
|C| qC
C
Average per-query weight vector. The weight vector w
for a given shape concept is computed as the average
q corresponding to the
of the per-query weight vectors w
training shapes within that class, that is,
C =
w
1
w q .
|C| qC
DBI
CRSP
RTS
NN (%)
66.5
67.9
67.4
DCG (%)
66.3
66.8
65.0
PSB Set A
61.628.1
71.826.5
74.925.2
PSB Set B
60.628.1
62.628.4
62.527.7
well for certain concepts. This might be due to heuristicsbased learning of per-concept weight vectors, but, we think
that the following arguments better explain the situation:
For some concepts, the linear similarity model might not
be flexible enough to maintain good classification accuracy in the score difference domain. When instances from
queries belonging to a certain concept are pooled together,
the discrimination problem in the score difference domain
might become more complex than what can be solved using a simple linear decision boundary. However, if the linear similarity model were totally unacceptable, we would
not expect a good performance on the training set either.
In fact, in only 4 out of 161 concepts in PSB Set A, the
AVE-W fusion has worsened the performance by not more
than 2.3% DCG points with respect to the baseline SUM
rule. In PSB Set B, on the other hand, 61 concepts (again
out of 161) have suffered from an average performance
loss of 8.5% DCG points.
In Table 3, we provide the DCG scores when we use the
basic SUM rule instead of learning-based score fusion
(AVE-W or PCMIN-W) for negatively affected concepts
(i.e., those concepts for which learning-based score fusion has worsened the DCG performance). The right most
columns give the number of positively affected concepts.
We deduce that the linear similarity model is adequate for
the training set and generalizes well on the previously unseen instances of 100 concepts in the test set.
4.2. Performance in Two-round Search
In the two-round query formulation, the benefits of the proposed score fusion scheme become much more evident. To
submitted to Eurographics Workshop on 3D Object Retrieval (2008)
evaluate the performance in this search protocol, we have reserved the PSB Set A as the database shapes and PSB Set B
as the query shapes. The first round results have been obtained by the basic SUM rule (i.e., RTS).
In Figure 5, we display the DCG performance of the online sub-protocol as a function of the number of marked
items M from 4 to 32. In this figure, the line at the bottom
stands for the DCG of the first round (i.e., the performance
of the SUM rule, DCG = 62%). The line at the top stands
for the DCG when all database models are marked as either relevant or non-relevant, serving as an empirical ideal,
i.e., the maximum achievable DCG on this data set using the
presented score fusion algorithm and the running set of description schemes (DCG = 79%). Based on these results,
we make the following comments:
As the number of marked items M increases, we observe a
steep increase in the DCG performance, compatible with
theoretical fast rates of convergence proven in [CLV07].
The DCG profile converges smoothly to the empirical
ideal as the user marks more and more items in the first
round.
To give certain performance figures, for M = 8, DCG obtained after fusing the scores becomes 68%, giving a 6%
improvement compared to the baseline. The 70% DCG
barrier is reached after M = 12 marked items.
In Figure 6, we display the DCG performance of the offline sub-protocol as a function of the number of displayed
items M again 4 to 32. We emphasize that, in this mode,
M refers to the number of displayed items and the user interaction needed is limited to mark just one shape, the first
relevant one after the first round. Accordingly, here, M is not
related to the convergence of the algorithm. Increasing M
does not cost anything in terms of user interaction. After this
clarification, we have the following comments:
At M = 4, score fusion boosts the retrieval performance
by 4% and the DCG profile keeps a slow but constant
increase as the number of displayed items M in the first
round is increased.
In a typical retrieval scenario, displaying M = 32 items
has no cost. These results tell us that we can obtain DCG
improvements by 5% with respect to the baseline. Noting that the performances of top 3D shape descriptors differ only by a couple of percentage points, this 5% gain can
be considered as significant and comes virtually at no cost
at the querying process. The only bottleneck is the off-line
processing of the database shapes to learn the weight vectors, which may eventually be used in the second round.
With on-line score fusion, we can obtain significant improvements as the user is asked to mark more and more
items. In special applications where the user voluntarily
marks the demanded number of items, the on-line scheme
is preferable. The off-line scheme, on the other hand, comes
at no cost at query time and still yields satisfactory improvesubmitted to Eurographics Workshop on 3D Object Retrieval (2008)
kernel methods [HTF01] to learn a non-linear scoring function. Direct minimization of per-concept risks, optimization
of DCG-based criteria and kernelization of the score fusion
algorithm will constitute our future research directions in
this field.
References
[Akg07] A KGL C. B.: Density-based Shape Descriptors
and Similarity Learning for 3D Object Retrieval. PhD
thesis, Tlcom ParisTech and Bogazii University, 2007.
http://www.tsi.enst.fr/akgul/pdfs/thesis-cba-web.pdf.
[ASYS07a] A KGL C. B., S ANKUR B., Y EMEZ Y.,
S CHMITT F.:
Density-based 3D shape descriptors.
EURASIP Journal on Advances in Signal
Processing 2007 (2007), Article ID 32503, 16 pages.
doi:10.1155/2007/32503.
[LMT05] L EIFMAN G., M EIR R., TAL A.: Semanticoriented 3D shape retrieval using relevance feedback. The
Visual Computer 21, 8-10 (September 2005), 865875.
[PPT07]
P. PAPADAKIS I. P RATIKAKIS S. P., T HEO T.: Efficient 3D shape matching and retrieval using
a concrete radialized spherical projection representation.
Pattern Recognition 40, 9 (2007), 24372452.
An immediate perspective for further research is to extend this general score fusion scheme to other type of shape
descriptors, notably to 2D image-based ones [BKS 05]. Furthermore, we may obtain performance improvements using
HARIS