0% found this document useful (0 votes)

45 views6 pages

Sparse Concept Coding For Visual Analysis

The document proposes a new matrix factorization method called Sparse Concept Coding (SCC) for image representation and analysis. SCC provides a sparse representation that can capture the intrinsic geometric structure of image space. SCC is a two-step approach that first learns basis vectors by exploiting the manifold structure of images, and then learns a sparse representation for each image with respect to the learned basis using LASSO. Experimental results on image clustering show SCC provides a better semantic representation compared to other methods.

Uploaded by

galaxystar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

45 views6 pages

Sparse Concept Coding For Visual Analysis

Uploaded by

galaxystar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

Sparse Concept Coding for Visual Analysis

Deng Cai
Hujun Bao
Xiaofei He
The State Key Lab of CAD&CG, College of Computer Science, Zhejiang University
388 Yu Hang Tang Rd., Hangzhou, Zhejiang, China 310058
{dengcai, bao, xiaofeihe}@cad.zju.edu.cn

Abstract

product can well approximate X. Each column vector of U

can be regarded as a basis vector corresponding to a certain semantic concept and each column vector of A is the
representation of an image in this concept space.
One of the most well know matrix factorization methods is Latent Semantic Analysis (LSA) [5], which is fundamentally based on Singular Value Decomposition. LSA is
optimal in the sense of reconstruction error and thus optimal for data representation when Euclidean structure is
concerned. Another popular matrix factorization method is
Non-negative Matrix Factorization (NMF) [10], which requires the factorization matrices (both U and A) are nonnegative. The non-negative constraints only allow additive combinations among different basis vectors and it is
believed that NMF can learn a parts-based representation
[10]. The effectiveness of NMF has been demonstrated in
many image analysis tasks [1, 12] . Inspired by biological
visual systems, people has been arguing sparse features of
data points are useful for learning [18, 22, 24]. Sparse Coding (SC) [11, 16] is a recently popular matrix factorization
method which requires the representation matrix A to be
sparse. The sparseness of A indicates that each image will
only relate to several concepts (with non-zero coefficients to
the corresponding basis vectors). All these popular matrix
factorization methods only consider the Euclidean structure
of the image space.
Recent studies have shown that human generated image
data is probably sampled from a submanifold of the ambient
Euclidean space [2, 19, 21]. In fact, the image data cannot
possibly fill up the high dimensional Euclidean space uniformly. Therefore, the intrinsic manifold structure needs to
be considered while performing the matrix factorization.
Motivated from recent progresses in matrix factorization (sparse coding) and manifold learning, in this paper we
propose a novel matrix factorization method, called Sparse
Concept Coding (SCC), for image representation. SCC is a
a two-step approach including basis learning and sparse representation learning. In the first step, SCC learns the basis
by exploiting the intrinsic geometric structure of the data.
By applying the spectral analysis on the nearest neighbor

We consider the problem of image representation for visual analysis. When representing images as vectors, the
feature space is of very high dimensionality, which makes it
difficult for applying statistical techniques for visual analysis. To tackle this problem, matrix factorization techniques,
such as Singular Vector Decomposition (SVD) and Nonnegative Matrix Factorization (NMF), received an increasing amount of interest in recent years. Matrix factorization
is an unsupervised learning technique, which finds a basis
set capturing high-level semantics in the data and learns
coordinates in terms of the basis set. However, the representations obtained by them are highly dense and can not
capture the intrinsic geometric structure in the data. In this
paper, we propose a novel method, called Sparse Concept
Coding (SCC), for image representation and analysis. Inspired from the recent developments on manifold learning
and sparse coding, SCC provides a sparse representation
which can capture the intrinsic geometric structure of the
image space. Extensive experimental results on image clustering have shown that the proposed approach provides a
better representation with respect to the semantic structure.

1. Introduction
Image representation is the fundamental problem in image processing. Researchers have long sought effective
and efficient representations of images. For a give image
database, we may have thousands of distinct features. However, the degree of the freedom of each image could be far
less. Instead of the original feature space, one might hope
to find a hidden semantic concept space to represent the
images. The dimensionality of this concept space will be
much smaller than the feature space. To achieve this goal,
matrix factorization based approaches have attracted considerable attention in the last decades [1, 5, 10].
Given an image data matrix X Rmn , each column of
X corresponding to an image, the matrix factorization methods find two matrices U Rmk and A Rkn whose
2905

that U Rmk and A = VT Rkn is the optimal solution of the optimization problem (1) under the constraint
rank(UA) k [8].
Another popular matrix factorization algorithm is Nonnegative Matrix Factorization (NMF) [10], which focuses
on the analysis of data matrices whose elements are nonnegative. NMF adds the non-negative constraint on both U
and A in the optimization problem (1). The non-negative
constraints on U and A only allow additive combinations
among different basis vectors. For this reason, it is believed
that NMF can learn a parts-based representation [10].
The representation matrix A learned in the above two
methods is usually dense. Since each basis vector (column
vector of U) can be regarded as a concept, the denseness
of A indicates that each images is a combination of all the
concepts. This is contrary to our common knowledge since
most of the images only include several semantic concepts.
Sparse Coding (SC) [11, 16] is a recently popular matrix
factorization method trying to solve this issue. Sparse coding adds the sparse constraint on A, more specifically, on
each column of A, in the optimization problem (1). In this
way, SC can learn a sparse representation. SC has several
advantages for data representation. First, it yields sparse
representations such that each data point is represented as
a linear combination of a small number of basis vectors.
Thus, the data points can be interpreted in a more elegant
way. Second, sparse representations naturally make for an
indexing scheme that would allow quick retrieval. Third,
the sparse representation can be overcomplete, which offers
a wide range of generating elements. Potentially, the wide
range allows more flexibility in signal representation and
more effectiveness at tasks like signal extraction and data
compression. Finally, there is considerable evidence that biological vision adopts sparse representations in early visual
areas [15]. The sparse coding approach is fundamentally
different from those sparse subspace learning methods, e.g.,
SparsePCA [25] and SparseLDA [14]. Instead of learning
a sparse A, the sparse subspace learning methods [14, 25]
learn a sparse U. The low dimensional representation matrix learned by these sparse subspace learning methods is
still dense.

graph, SCC encodes the semantic structure in the basis vectors. In the second step, SCC uses the LASSO [9] to learn
a sparse representation with respect to the learned basis for
each image. As a result, SCC can have more discriminating
power than the traditional matrix factorization approaches
which only consider the Euclidean structure of the data.
The rest of the paper is organized as follows: in Section
2, we provide a brief review of matrix factorization. Our
Sparse Concept Coding method is introduced in Section 3.
The experimental results are presented in Section 4. Finally,
we provide the concluding remarks in Section 5.

2. Background
Given a data set with high dimensionality, matrix factorization is a common approach to compress the data
by finding a set of basis vectors and the representation
with respect to the basis for each data point. Let X =
[x1 , , xn ] Rmn be the data matrix, matrix factorization can be mathematically defined as finding two matrices
U Rmk and A Rkn whose product can best approximate X:
X UA.
Each column of U can be regarded as a basis vector which
captures the higher-level features in the data and each column of A is the k-dimensional representation of the original
inputs with respect to the new basis. From this sense, matrix factorization can also be regarded as a dimensionality
reduction method since it reduces the dimension from m to
k.
A common way to measure the approximation is by
Frobenius norm of a matrix . Thus, the matrix factorization can be defined as the optimization problem as follows:
min X UA2
U,A

(1)

Various matrix factorization algorithms add different constraints on the above optimization problem based on different goals.
Latent Semantic Analysis (LSA) is one of the most popular matrix factorization algorithms for image analysis [5]. It
adds rank constraint (rank(UA) k) on the optimization
problem in Eq. (1). LSA is fundamentally based on Singular Value Decomposition (SVD) [8]. Suppose the rank of X
is r, the SVD decomposition of X is as follows:

3. Sparse Concept Coding

All the three matrix factorization methods discussed
above only consider the Euclidean structure of the image
space. However, recent studies have shown that human
generated image data is probably sampled from a submanifold of the ambient Euclidean space [2, 19, 21]. In fact,
the human generated image data cannot possibly fill up
the high dimensional Euclidean space uniformly. Therefore, the intrinsic manifold structure needs to be considered
while learning the basis.
Combining with the idea behind the sparse coding ap-

X = UVT ,
where = diag(1 , , r ) and 1 2 r >
0 are the singular values of X, U = [u1 , , ur ] and ui s are
called left singular vectors, V = [v1 , , vr ] and vi s are
called right singular vectors. We have UT U = VT V = I.
LSI uses the first k columns of U and V. It can be proven
2906

Let Y = [y1 , , yk ], yi s are the eigenvectors of the

above generalized eigen-problem with respect to the smallest eigenvalue. Each row of Y is the flat embedding for
each data point.
Considering the manifold structure, our SCC tries to
learn the basis U which can best fit Y, which can be
achieved by solving the optimization problem as follows:

proach, we believe that a good matrix factorization should

consider the following two aspects:
The learned basis should capture the intrinsic geometric structure of the data;
And the representation under the new basis should be
sparse.

min Y XT U2 + U2

Considering the above two aspects, we propose a novel

matrix factorization method called Sparse Concept Coding
(SCC) in this section. SCC is a two-step approach for matrix factorization. The first step is basis (concept) learning.
By exploring the intrinsic geometric structure of the data,
SCC can learn a basis which is optimal for concept expression. The second step is representation learning. SCC uses
LASSO [9]which is a L1-regularized regression model, to
learn a sparse representation for each image.
The goal of SCC is trying to solve the following optimization problem
min X UA2
U,A

where the term U2 is to regularize the model to avoid

over-fitting [9] and is the regularization parameter. In
statistics, this model is called Ridge Regression [9].
By taking the derivative of Eq. (5) with respect to U and
setting it to zero, we can obtain the optimal solution:

1
XY
U = XXT + I

(6)

where I is the identity matrix.

If the image data are of very high dimensionality, it is
computational expensive to inverse the matrix XXT + I.
In reality, we can use some iterative algorithms to directly
solve the regression problem in Eq. (5) (e.g., LSQR [17]).
Let s denote the average number of non-zero entries in
each image (each column of X), s m. Compute U by
solving Eq. (5) with LSQR method needs O(kns) time
[17]. Considering W is a p-nearest neighbor graph, we need
O(n2 s + n2 p) time to compute it. And we need O(knp)
time to compute the smallest k eigenvectors of the eigenproblem (4) [20]. Considering k n, our SCC algorithm
needs O(n2 s + n2 p) to compute the basis U.

(2)

where the basis U should capture the intrinsic geometric

structure of X and each column vector of A should be
sparse. In the next subsection, we begin with a discussion
on how to learn the basis U.

3.1. Basis Learning

Recently, various researchers [2, 19, 21] have considered
the case when the data is drawn from sampling a probability distribution that has support on or near to a submanifold of the ambient space. In order to detect the underlying manifold structure, many manifold learning algorithms
have been proposed [2, 19, 21]. There algorithms construct
a nearest neighbor graph to model the local geometric structure and perform spectral analysis on the graph weight matrix. This way, these manifold learning algorithms can unfold the data manifold and provide the flat embedding
for the data points.
Consider a graph with N vertices where each vertex corresponds to an image in the data set. The edge weight matrix W is usually defined as follows:

1, if xi Np (xj ) or xj Np (xs )
(3)
Wij =
0, otherwise.

3.2. Sparse Representation Learning

After we obtain the basis U, the representation A can
be computed column by column independently through the
following minimization problem.
min xi Uai 2 + |ai |
ai

(7)

where xi and ai are the i-th columns of X and A, respectively. |ai | denote the L1-norm of ai and this L1-norm regularization term is added to ensure the sparseness of ai . In
statistics, the above L1-regularized regression problem is
called LASSO [9].
The optimization problem in Eq. (7) has the following
equivalent formulation:

where Np (xi ) denotes the set of p nearest neighbors of xi .

Define a diagonal matrix D whose entries are
column (or
row, since W is symmetric) sums of W, Dii = j Wij , we
can compute the graph Lapalcian L = D W [4].
The flat embedding for the data points which unfold
the data manifold can be found by solving the following
generalized eigen-problem [2]:
Ly = Dy

(5)

min xi Uai 2

s.t.

|ai |

(8)

The Least Angel Regression (LARs) algorithm [6] can be

used to solve the optimization problem in Eq. (8). Instead
of setting the parameter , LARs provides another choice

(4)
2907

information metric MI(C, C ) is defined as follows:

to control the sparseness of ai by specifying the cardinality

(the number of non-zero entries) of ai . LARs can compute
the entire solution path (the solutions with all the possible
cardinality on ai ) of problem (8). in O(k 3 + mk 2 ).
Overall, the time complexity of our SCC algorithm is

MI(C, C ) =

Traditional sparse coding approaches consider the factorization on Euclidean space, whereas SCC learns the
basis by exploiting the intrinsic geometric of the data.
The basis learned by SCC can have more discriminating power.

NMI(C, C ) =

MI(C, C )
max(H(C), H(C ))

where H(C) and H(C ) are the entropies of C and C , respectively. It is easy to check that NMI(C, C ) ranges from
0 to 1. NMI=1 if the two sets of clusters are identical, and
NMI=0 if the two sets are independent.

Traditional sparse coding approaches iteratively find

the U and V, which is very time consuming. Our SCC
approach only needs to solve a sparse eigen-problem
and two regression problems, which is very efficient.

4.2. Compared Algorithms

In our experiments, we compared five matrix factorization algorithms and a baseline method as follows:

4. Experimental Results
Previous studies show that matrix factorization and
sparse coding approaches are very powerful on clustering
[23]. It can achieve similar or better performance than most
of the state-of-the-art clustering algorithms. In this section,
we also evaluate our SCC algorithm on clustering problems.
The COIL20 image library1 is used in our experiment.
COIL20 contains 3232 gray scale images of 20 objects
viewed from varying angles and each object has 72 images.

Baseline method which simply performs clustering in

the original feature space.
Latent Semantic Analysis [5] which is based on Singular Value Decomposition (SVD).
Nonnegative Matrix Factorization (NMF) [10].
Sparse Coding (SC) approach [16]. We use an efficient
space coding algorithm which is introduced in [11].

4.1. Evaluation Metric

Laplacian Sparse Coding (LapSC) approach [7], which

is an manifold (geometric) extension of traditional SC
by using a graph regularizer.

The clustering result is evaluated by comparing the obtained label of each sample with the label provided by the
data set. Two metrics, the accuracy (AC) and the normalized mutual information metric (NMI) are used to measure
the clustering performance [3]. Given a data point xi , let ri
and si be the obtained cluster label and the label provided
by the corpus, respectively. The AC is defined as follows:
i=1

p(ci , cj )
p(ci ) p(cj )

where p(ci ) and p(cj ) are the probabilities that a sample

arbitrarily selected from the data set belongs to the clusters
ci and cj , respectively, and p(ci , cj ) is the joint probability
that the arbitrarily selected sample belongs to the clusters ci
as well as cj at the same time. In our experiments, we use
the normalized mutual information NMI as follows:

Comparing with the traditional sparse coding approaches

[11, 16], SCC has its own advantages as follows:

AC =

p(ci , cj ) log2

ci C,cj C

O(n2 s + n2 p + k 3 + mk 2 ).

Space Concept Coding (SCC), which is the new approach proposed in this paper.
After the matrix factorization, we have a low dimensional
representation of each image. The clustering is then performed in this low dimensional space.

(si , map(ri ))
N

4.3. Clustering Results

where N is the total number of samples and (x, y) is the

delta function that equals 1 if x = y and equals 0 otherwise,
and map(ri ) is the permutation mapping function that maps
each cluster label ri to the equivalent label from the data
corpus. The best mapping can be found by using the KuhnMunkres algorithm [13].
Let C denote the set of clusters obtained from the ground
truth and C obtained from our algorithm. Their mutual

We use k-means as the clustering algorithm. k-means

can be performed on the original feature space (baseline)
or the concept space (matrix factorization methods). In
order to randomize the experiments, we evaluate the clustering performance with different number of clusters (k =
4, 6, , 18, 20). For each given cluster number k (except
20), 20 tests were conducted on different randomly chosen
classes, and the average performance as well as the standard deviation was computed over these 20 tests. In each

1 http://www1.cs.columbia.edu/CAVE/software/softlib/coil-20.php

2908

Table 1. Clustering performance

Accuracy (%)
SVD
NMF
SC
83.115.0 81.014.2 78.515.4
75.512.2 74.310.1 73.011.2
70.49.3 69.38.6 68.210.3
70.87.2 69.47.6 72.69.0
64.34.6 69.06.3 67.07.5
67.36.2 67.65.6 66.55.3
64.14.9 66.06.0 65.14.2
62.34.3 62.83.7 61.65.4
64.3
60.5
64.4
69.1
68.9
68.5

70
65
60
55

SCC
LapSC
SC
NMF
SVD
Baseline

50
45
40

5 20

100
150
200
# of basis vectors

250

LapSC
79.616.7
76.311.5
72.09.1
76.87.4
72.25.2
72.76.4
71.83.9
69.43.5
71.1
73.5

SCC
90.513.9
92.46.1
87.511.0
86.17.4
83.07.1
81.34.4
80.04.9
77.43.6
83.4
84.6

Baseline
74.618.3
73.211.4
71.86.8
75.06.2
73.15.6
73.34.2
74.63.1
73.72.6
73.4
73.6

Normalized Mutual Information (%)

SVD
NMF
SC
LapSC
74.418.2 71.818.4 70.818.0 73.319.1
73.112.1 71.911.6 72.011.7 76.611.0
72.88.3 71.07.4 70.649.0
76.37.1
75.15.2 73.95.7 76.566.0
81.25.4
72.54.6 73.35.5 73.865.5
79.54.6
74.94.9 73.84.6 74.383.5
80.34.4
74.52.7 73.44.2 74.823.1
80.52.6
73.92.5 72.42.4 73.122.3
79.42.2
74.5
72.5
73.62
79.8
74.0
72.7
73.3
78.5

70
75
70
65
60
55
50
45

SCC
87.617.4
90.27.8
88.78.3
89.34.2
87.65.2
87.53.4
87.22.8
86.31.9
88.3
88.1

95
80

SCC
LapSC
SC
NMF
SVD
Baseline

10 30 50 68 100
150
200
# of basis vectors

50
SCC
LapSC
SC
NMF
SVD
Baseline

250

10
15
Cardinality

Mutual information (%)

Accuracy (%)

4
6
8
10
12
14
16
18
20
Avg

Baseline
83.015.2
74.510.3
68.65.7
69.68.0
65.06.8
64.04.9
64.04.9
62.74.7
63.7
68.3

Accuracy (%)

85
80
75
70
65

SCC
LapSC
SC
NMF
SVD
Baseline

60
55
50

10
15
Cardinality

Figure 1. The performance of five matrix factorization methods vs.

the number of basis vectors

Figure 2. The performance of SCC vs. the cardinality of the low

dimensional representation.

test, k-means algorithm was applied 10 times with different

random starting points and the best result in terms of the
objective function of k-means was recorded.
For matrix factorization methods, how to determine the
number of basis vectors is still an open problem. In this experiment, we report the performances of all the matrix factorization methods with the number of basis vectors equal
to the number of clusters. There are two parameters in the
basis learning phase of SCC: the number of nearest neighbors p and the ridge regularization parameter . These two
parameters were empirically set as 5 and 0.1, respectively.
In the sparse representation learning phase of SCC, the parameter is the cardinality (the number of non-zero entries)
of each low dimensional vector. We empirically let the cardinality equal to half of the number of the basis vectors.
Table (1) shows the clustering results of the five compared methods. In all the cases on both two data sets, our
SCC method significantly outperforms its competitors. By
simply using k-means on the low dimensional sparse representation, SCC achieves very impressive clustering performance. LapSC is the second best method. By incorporating
a graph regularizer, LapSC gains significant improvement
over the traditional sparse coding approach (SC). When the
number of basis vectors is fixed as the number of clusters,
the SVD method fails to gain significant improvement over
the baseline approach.
Figure (1) shows how the performances of the five matrix
factorization methods vary with the number of learned ba-

sis vectors. We simply reported the clustering results using

the entire data set. As we can see, the performances of all
the four methods drastically decrease as the number of basis vectors decreases (smaller than the number of classes).
This result is expected since for a data set containing c class,
there is at least c concepts. Thus, the concept space is
with at least c dimensions. It is interesting to see that SCC
reaches its highest performance with exactly c basis vectors, which might suggests that SCC can really capture the
concept structure of the image space. As the number of basis vectors increases, the performance of SCC slightly decreases. In the entire scope (c 250), the performances of
all the five methods are relatively stable and our SCC significantly outperforms the other methods.
Figure (2) shows how the performance of SCC varies
with the cardinality parameter. We have 20 basis vectors
and each image is represented as a 20-dimensional vector
in the SCC concept space. The cardinality denotes the
number of non-zero entries of each 20-dimensional vector.
As we can see, the performance of SCC is very stable with
respect to the cardinality as long as the cardinality is not
too small. The smallest value for cardinality which maintains the high performance is 5. In other words, despite
the high dimensionality of the original feature space (1024),
each image can be represented as a 20-dimensional sparse
vector in the SCC concept space. And each sparse vector
has only 5 non-zero entries. This significant property differentiates SCC from SVD and NMF. The low dimensional
2909

representations found by SVD and NMF are dense, which

suggests that every image contains all the concepts. This is
contrary to our common knowledge since most of the images only contain several concepts.

[8] G. H. Golub and C. F. V. Loan. Matrix computations. Johns

Hopkins University Press, 3rd edition, 1996.
[9] T. Hastie, R. Tibshirani, and J. Friedman. The Elements of
Statistical Learning: Data Mining, Inference, and Prediction. New York: Springer-Verlag, 2001.
[10] D. D. Lee and H. S. Seung. Learning the parts of objects
by non-negative matrix factorization. Nature, 401:788791,
1999.
[11] H. Lee, A. Battle, R. Raina, , and A. Y. Ng. Efficient sparse
coding algorithms. In Advances in Neural Information Processing Systems 19. MIT Press, Cambridge, MA, 2006.
[12] S. Z. Li, X. Hou, H. Zhang, and Q. Cheng. Learning spatially localized, parts-based representation. In 2001 IEEE
Computer Society Conference on Computer Vision and Pattern Recognition (CVPR01), pages 207212, 2001.
[13] L. Lovasz and M. Plummer. Matching Theory. Akad
emiai
Kiad
o, North Holland, Budapest, 1986.
[14] B. Moghaddam, Y. Weiss, and S. Avidan. Generalized spectral bounds for sparse LDA. In ICML 06: Proceedings of the
23rd international conference on Machine learning, pages
641648, 2006.
[15] B. A. Olshausen and D. J. Field. Emergence of simple-cell
receptive field properties by learning a sparse code for natural images. Nature, 381(6583):607609, 1996.
[16] B. A. Olshausen and D. J. Field. Sparse coding with an overcomplete basis set: A strategy employed by v1. Vision Research, 37:33113325, 1997.
[17] C. C. Paige and M. A. Saunders. LSQR: An algorithm for
sparse linear equations and sparse least squares. ACM Transactions on Mathematical Software, 8(1):4371, March 1982.
[18] R. Raina, A. Battle, H. Lee, B. Packer, and A. Y. Ng. Selftaught learning: Transfer learning from unlabeled data. In
Proc. 2007 Int. Conf. Machine Learning (ICML07), 2007.
[19] S. Roweis and L. Saul. Nonlinear dimensionality reduction by locally linear embedding. Science, 290(5500):2323
2326, 2000.
[20] G. W. Stewart. Matrix Algorithms Volume II: Eigensystems.
SIAM, 2001.
[21] J. Tenenbaum, V. de Silva, and J. Langford. A global geometric framework for nonlinear dimensionality reduction.
Science, 290(5500):23192323, 2000.
[22] J. Wang, J. Yang, K. Yu, F. Lv, T. Huang, and Y. Gong.
Learning Locality-constrained Linear Coding for Image
Classification. In IEEE Conference on Computer Vision and
Pattern Recognition (CVPR10), 2010.
[23] W. Xu, X. Liu, and Y. Gong. Document clustering based on
non-negative matrix factorization. In Proc. 2003 Int. Conf.
on Research and Development in Information Retrieval (SIGIR03), pages 267273, Toronto, Canada, Aug. 2003.
[24] J. Yang, K. Yu, Y. Gong, and T. Huang. Linear spatial
pyramid matching using sparse coding for image classification. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, pages 17941801,
2009.
[25] H. Zou, T. Hastie, and R. Tibshirani. Sparse principle component analysis. JCGS, 15(2):262286, 2006.

5. Conclusions
In this paper, we have presented a novel matrix factorization method for image representation called Sparse Concept Coding (SCC). SCC is a two-step approach including
a basis learning stage and a sparse representation learning
stage. In the first stage, SCC learns the basis by exploiting
the intrinsic geometric structure of the data. By applying
the spectral analysis on the nearest neighbor graph, SCC
encodes the semantic structure in the basis vectors. In the
second stage, SCC uses the LASSO to learn a sparse representation with respect to the learned basis for each image.
As a result, SCC can have more discriminating power than
the traditional matrix factorization approaches (SVD, NMF
and SC) which only consider the Euclidean structure of the
data. Experimental results on image clustering show that
SCC provides better representation in the sense of semantic
structure.

Acknowledgments
This work was supported in part by National Natural
Science Foundation of China under Grants 60905001 and
60875044, National Basic Research Program of China (973
Program) under Grant 2011CB302206 and Fundamental
Research Funds for the Central Universities.

References
[1] S. Agarwal, A. Awan, and D. Roth. Learning to detect objects in images via a sparse, part-based representation. IEEE
Transactions on Pattern Analysis and Machine Intelligence,
pages 14751490, 2004.
[2] M. Belkin and P. Niyogi. Laplacian eigenmaps and spectral
techniques for embedding and clustering. In Advances in
Neural Information Processing Systems 14, pages 585591.
2001.
[3] D. Cai, X. He, and J. Han. Document clustering using locality preserving indexing. IEEE Transactions on Knowledge
and Data Engineering, 17(12):16241637, December 2005.
[4] F. R. K. Chung. Spectral Graph Theory, volume 92 of Regional Conference Series in Mathematics. AMS, 1997.
[5] S. C. Deerwester, S. T. Dumais, T. K. Landauer, G. W. Furnas, and R. A. harshman. Indexing by latent semantic analysis. Journal of the American Society of Information Science,
41(6):391407, 1990.
[6] B. Efron, T. Hastie, I. Johnstone, and R. Tibshirani. Least
angle regression. Annals of Statistics, 32(2):407499, 2004.
[7] S. Gao, I. Tsang, L. Chia, and P. Zhao. Local features are not
lonelyLaplacian sparse coding for image classification. In
IEEE Conference on Computer Vision and Pattern Recognition (CVPR10), pages 35553561, 2010.

2910

3G3 Lab Report Radhika Gupta
No ratings yet
3G3 Lab Report Radhika Gupta
6 pages
Advanced Machine Learning: (COMP 5328)
No ratings yet
Advanced Machine Learning: (COMP 5328)
46 pages
Graph Regularized Non-Negative Matrix Factorization For Data Representation
No ratings yet
Graph Regularized Non-Negative Matrix Factorization For Data Representation
14 pages
NIPS 2006 Efficient Sparse Coding Algorithms Paper
No ratings yet
NIPS 2006 Efficient Sparse Coding Algorithms Paper
8 pages
Sparse Representations For Facial Expressions Recognition Via Optimization
No ratings yet
Sparse Representations For Facial Expressions Recognition Via Optimization
8 pages
1.1 Background: September 28, 2015 16:10 Sparse Coding and Its Applications in Computer Vision - 9in X 6in b2310
No ratings yet
1.1 Background: September 28, 2015 16:10 Sparse Coding and Its Applications in Computer Vision - 9in X 6in b2310
6 pages
Graph Regularized Non-Negative Matrix Factorization For Data Representation
No ratings yet
Graph Regularized Non-Negative Matrix Factorization For Data Representation
17 pages
This Work Is Supported in Part by The Sensip Center and The Consortium Award No. S0322
No ratings yet
This Work Is Supported in Part by The Sensip Center and The Consortium Award No. S0322
5 pages
Fast Inference in Sparse Coding Algorithms With Applications To Object Recognition
No ratings yet
Fast Inference in Sparse Coding Algorithms With Applications To Object Recognition
9 pages
Robust Principal Component Analysis?
No ratings yet
Robust Principal Component Analysis?
39 pages
Robust Principal Component Analysis
No ratings yet
Robust Principal Component Analysis
39 pages
REF-19-Deep Networks For Image Super-Resolution With Sparse Prior
No ratings yet
REF-19-Deep Networks For Image Super-Resolution With Sparse Prior
10 pages
25th International Conference On Computer Theory and Applications, ICCTA 2015, 24-26 October 2015, Alexandria, Egypt
100% (1)
25th International Conference On Computer Theory and Applications, ICCTA 2015, 24-26 October 2015, Alexandria, Egypt
196 pages
Non-Negative Matrix Factorization, A New Tool For Feature Extraction: Theory and Applications
No ratings yet
Non-Negative Matrix Factorization, A New Tool For Feature Extraction: Theory and Applications
8 pages
Statistical Modeling of Texture Sketch: Abstract. Recent Results On Sparse Coding and Independent Component
No ratings yet
Statistical Modeling of Texture Sketch: Abstract. Recent Results On Sparse Coding and Independent Component
15 pages
Non-Negative Matrix Factorization: Marshall Tappen 6.899
No ratings yet
Non-Negative Matrix Factorization: Marshall Tappen 6.899
26 pages
Exact Decomposition of Joint Low Rankness and Local Smoothness Plus Sparse Matrices PDF
No ratings yet
Exact Decomposition of Joint Low Rankness and Local Smoothness Plus Sparse Matrices PDF
16 pages
DL Unit-5
No ratings yet
DL Unit-5
34 pages
Technical Report: Sparse Hierachical Extrapolated Parametric Methods For Cortical Data Analysis
No ratings yet
Technical Report: Sparse Hierachical Extrapolated Parametric Methods For Cortical Data Analysis
13 pages
Singular Value Decomposition (SVD) in Matrix Factorization
No ratings yet
Singular Value Decomposition (SVD) in Matrix Factorization
7 pages
Q1 - VLAD - Aggregating Local Descriptors Into A Compact Image Representation
No ratings yet
Q1 - VLAD - Aggregating Local Descriptors Into A Compact Image Representation
8 pages
03 Face Detection
No ratings yet
03 Face Detection
7 pages
Aggregating Local Descriptors Into A Compact Image Representation
No ratings yet
Aggregating Local Descriptors Into A Compact Image Representation
8 pages
Kai Labusch, Erhardt Barth and Thomas Martinetz - Sparse Coding Neural Gas: Learning of Overcomplete Data Representations
No ratings yet
Kai Labusch, Erhardt Barth and Thomas Martinetz - Sparse Coding Neural Gas: Learning of Overcomplete Data Representations
21 pages
DL Unit - 5
No ratings yet
DL Unit - 5
14 pages
Ensi3 PRML s6 Encoders
No ratings yet
Ensi3 PRML s6 Encoders
19 pages
Asymmetric Distances For Binary Embeddings: Albert Gordo, Florent Perronnin, Yunchao Gong, Svetlana Lazebnik
No ratings yet
Asymmetric Distances For Binary Embeddings: Albert Gordo, Florent Perronnin, Yunchao Gong, Svetlana Lazebnik
15 pages
Fast and Robust Archetypal Analysis For Representation Learning
No ratings yet
Fast and Robust Archetypal Analysis For Representation Learning
8 pages
Linear Algebra: Submitted by Ahmad Saeed Submitted To Sir Muzzam Ali BITM-F18-022
No ratings yet
Linear Algebra: Submitted by Ahmad Saeed Submitted To Sir Muzzam Ali BITM-F18-022
5 pages
Algorithms For Non-Negative Matrix Factorization
No ratings yet
Algorithms For Non-Negative Matrix Factorization
8 pages
Rigamonti cvpr2011
No ratings yet
Rigamonti cvpr2011
8 pages
Algorithms For Non-Negative Matrix Factorization
No ratings yet
Algorithms For Non-Negative Matrix Factorization
7 pages
Image Inpainting Using Non-Negative Matrix Factorization
No ratings yet
Image Inpainting Using Non-Negative Matrix Factorization
6 pages
Trzcinski PAMI Preprint
No ratings yet
Trzcinski PAMI Preprint
14 pages
Non-Negative Matrix Factorization
No ratings yet
Non-Negative Matrix Factorization
21 pages
Factor Graphs For Robot Perception
100% (1)
Factor Graphs For Robot Perception
144 pages
Regularized Compression of A Noisy Blurred Image
No ratings yet
Regularized Compression of A Noisy Blurred Image
13 pages
Kim 2013
No ratings yet
Kim 2013
35 pages
Discriminative Learned Dictionaries For Local Image Analysis
No ratings yet
Discriminative Learned Dictionaries For Local Image Analysis
8 pages
1861 Algorithms For Non Negative Matrix Factorization
No ratings yet
1861 Algorithms For Non Negative Matrix Factorization
7 pages
A11 Find
No ratings yet
A11 Find
37 pages
Sparse Representation For Image Similarity Assessment Using SIFT Algorithm
No ratings yet
Sparse Representation For Image Similarity Assessment Using SIFT Algorithm
8 pages
Recklessly Approximate Sparse Coding
No ratings yet
Recklessly Approximate Sparse Coding
35 pages
Machine Learning - Advanced Concepts
From Everand
Machine Learning - Advanced Concepts
Derrick Mwiti
No ratings yet
Lecture10 LowDim MaYi
No ratings yet
Lecture10 LowDim MaYi
105 pages
Principal Component Analysis
No ratings yet
Principal Component Analysis
3 pages
Ieee 2015 Matlab Projects 141226005942 Conversion Gate02
No ratings yet
Ieee 2015 Matlab Projects 141226005942 Conversion Gate02
8 pages
Rough Entropy-Based Fused Granular Features in 2-D Locality Preserving Projections For High-Dimensional Vision Sensor Data
No ratings yet
Rough Entropy-Based Fused Granular Features in 2-D Locality Preserving Projections For High-Dimensional Vision Sensor Data
10 pages
BCI Case Law
No ratings yet
BCI Case Law
4 pages
Greco ND
No ratings yet
Greco ND
18 pages
Contextual Image Classification: Understanding Visual Data for Effective Classification
From Everand
Contextual Image Classification: Understanding Visual Data for Effective Classification
Fouad Sabry
No ratings yet
Ð 2DÞ PCA: Two-Directional Two-Dimensional PCA For Efficient Face Representation and Recognition
No ratings yet
Ð 2DÞ PCA: Two-Directional Two-Dimensional PCA For Efficient Face Representation and Recognition
8 pages
Dense Disparity Maps From Sparse Disparity Measurements
No ratings yet
Dense Disparity Maps From Sparse Disparity Measurements
8 pages
Compressive Sensing of Sparse Tensors: Shmuel Friedland, Qun Li, Member, IEEE, and Dan Schonfeld, Fellow, IEEE
No ratings yet
Compressive Sensing of Sparse Tensors: Shmuel Friedland, Qun Li, Member, IEEE, and Dan Schonfeld, Fellow, IEEE
63 pages
Trainable COSFIRE Filters For Keypoint Detection and Pattern Recognition
No ratings yet
Trainable COSFIRE Filters For Keypoint Detection and Pattern Recognition
15 pages
Graph Dual Regularization Non-Negative Matrix Factorization For Co-Clustering
No ratings yet
Graph Dual Regularization Non-Negative Matrix Factorization For Co-Clustering
14 pages
K Nearest Neighbor Algorithm: Fundamentals and Applications
From Everand
K Nearest Neighbor Algorithm: Fundamentals and Applications
Fouad Sabry
No ratings yet
Kernel Methods: Fundamentals and Applications
From Everand
Kernel Methods: Fundamentals and Applications
Fouad Sabry
No ratings yet
Procedural Surface: Exploring Texture Generation and Analysis in Computer Vision
From Everand
Procedural Surface: Exploring Texture Generation and Analysis in Computer Vision
Fouad Sabry
No ratings yet
Mesh Generation: Advances and Applications in Computer Vision Mesh Generation
From Everand
Mesh Generation: Advances and Applications in Computer Vision Mesh Generation
Fouad Sabry
No ratings yet
Regularized Least-Squares Classification
No ratings yet
Regularized Least-Squares Classification
24 pages
6902 An Applied Algorithmic Foundation For Hierarchical Clustering
No ratings yet
6902 An Applied Algorithmic Foundation For Hierarchical Clustering
10 pages
Probability Theory Sigma Field and Measures
No ratings yet
Probability Theory Sigma Field and Measures
11 pages
Boltzmann Exploration Done Right
No ratings yet
Boltzmann Exploration Done Right
15 pages
Indian Statistical Instituite: Students' Brochure
No ratings yet
Indian Statistical Instituite: Students' Brochure
35 pages
Roweis Gaussianidentities
No ratings yet
Roweis Gaussianidentities
4 pages
E-14 Afr
No ratings yet
E-14 Afr
5 pages
Financial Audit
No ratings yet
Financial Audit
3 pages
National University of Sciences & Technology: Form Th-2
No ratings yet
National University of Sciences & Technology: Form Th-2
1 page
National University of Sciences & Technology: Master'S Thesis Work Final Oral Exam
No ratings yet
National University of Sciences & Technology: Master'S Thesis Work Final Oral Exam
1 page
UOL Appointment Application Form
67% (3)
UOL Appointment Application Form
3 pages
Graduatng Students Exit Survey
No ratings yet
Graduatng Students Exit Survey
1 page
Lipids: TEST(s) Normal UNIT(s)
No ratings yet
Lipids: TEST(s) Normal UNIT(s)
1 page
Particle Swarm Optimization of Neural Network Architectures and Weights
No ratings yet
Particle Swarm Optimization of Neural Network Architectures and Weights
4 pages
Word2vector Paper PDF
No ratings yet
Word2vector Paper PDF
9 pages
Cross-Scenario Unknown-Aware Face Anti-Spoofing With Evidential Semantic Consistency Learning
No ratings yet
Cross-Scenario Unknown-Aware Face Anti-Spoofing With Evidential Semantic Consistency Learning
16 pages
Physics-Guided Convolutional Neural Network (PhyCNN) For Data-Driven
No ratings yet
Physics-Guided Convolutional Neural Network (PhyCNN) For Data-Driven
13 pages
深度学习框架下基于网络图像的细粒度识别研究孙泽人
No ratings yet
深度学习框架下基于网络图像的细粒度识别研究孙泽人
112 pages
A1.1 Outline 1
No ratings yet
A1.1 Outline 1
76 pages
Drepmrec: A Dual Representation Learning Framework For Multimodal Recommendation
No ratings yet
Drepmrec: A Dual Representation Learning Framework For Multimodal Recommendation
9 pages
Generalized Focal Loss Towards Efficient Representation Learning For Dense Object Detection
No ratings yet
Generalized Focal Loss Towards Efficient Representation Learning For Dense Object Detection
15 pages
Wang Et Al. - 2021 - A Novel SRTSR Model For Cross-Resolution Person Re
No ratings yet
Wang Et Al. - 2021 - A Novel SRTSR Model For Cross-Resolution Person Re
9 pages
Learning To Generate Reviews and Discovering Sentiment
No ratings yet
Learning To Generate Reviews and Discovering Sentiment
9 pages
Weednet: Dense Semantic Weed Classification Using Multispectral Images and MAV For Smart Farming
No ratings yet
Weednet: Dense Semantic Weed Classification Using Multispectral Images and MAV For Smart Farming
8 pages
Introduction To Deep Learning
No ratings yet
Introduction To Deep Learning
79 pages
Machine Learning and Deep Learning: Janiesch, Christian Zschech, Patrick Heinrich, Kai
No ratings yet
Machine Learning and Deep Learning: Janiesch, Christian Zschech, Patrick Heinrich, Kai
12 pages
Deep Unsupervised Learning
No ratings yet
Deep Unsupervised Learning
90 pages
TP-GNN - A Graph Neural Network Framework For Tier Partitioning in Monolithic 3D ICs
No ratings yet
TP-GNN - A Graph Neural Network Framework For Tier Partitioning in Monolithic 3D ICs
6 pages
Multi-Task Deep Neural Networks For Natural Language Understanding
No ratings yet
Multi-Task Deep Neural Networks For Natural Language Understanding
10 pages
A New Structural Relation Extraction Framework For SAR Occluded Target Recognition
No ratings yet
A New Structural Relation Extraction Framework For SAR Occluded Target Recognition
15 pages
ALBEF
No ratings yet
ALBEF
16 pages
2 - Self-Supervised Learning For Anomaly Detection and Localization
No ratings yet
2 - Self-Supervised Learning For Anomaly Detection and Localization
28 pages
Research Paper in VLM For Multilabel Classification
No ratings yet
Research Paper in VLM For Multilabel Classification
15 pages
Lecun 20080517 Deep Learning
No ratings yet
Lecun 20080517 Deep Learning
74 pages
MTP: Advancing Remote Sensing Foundation Model Via Multi-Task Pretraining
No ratings yet
MTP: Advancing Remote Sensing Foundation Model Via Multi-Task Pretraining
21 pages
Continual Nuclei Segmentation Via Prototype-Wise Relation Distillation and Cont
No ratings yet
Continual Nuclei Segmentation Via Prototype-Wise Relation Distillation and Cont
11 pages
1 s2.0 S0893608024004775 Main
No ratings yet
1 s2.0 S0893608024004775 Main
14 pages
AAAI24 Tutorial OOD in Time Series Slides 02182024
No ratings yet
AAAI24 Tutorial OOD in Time Series Slides 02182024
108 pages
Harvesting Efficient On-Demand Order Pooling From Skilled Couriers - Enhancing Graph Representation Learning For Refining Real-Time Many-to-One Assignments
No ratings yet
Harvesting Efficient On-Demand Order Pooling From Skilled Couriers - Enhancing Graph Representation Learning For Refining Real-Time Many-to-One Assignments
12 pages
Violence Detection Paper 1
No ratings yet
Violence Detection Paper 1
6 pages
Sim CLR
No ratings yet
Sim CLR
11 pages
Wang 20 K
No ratings yet
Wang 20 K
11 pages
Robust and Data-Efficient Generalization of Self-Supervised Machine Learning For Diagnostic Imaging
No ratings yet
Robust and Data-Efficient Generalization of Self-Supervised Machine Learning For Diagnostic Imaging
30 pages
HAST-IDS: Learning Hierarchical Spatial-Temporal Features Using Deep Neural Networks To Improve Intrusion Detection
No ratings yet
HAST-IDS: Learning Hierarchical Spatial-Temporal Features Using Deep Neural Networks To Improve Intrusion Detection
15 pages
CCA Loss
No ratings yet
CCA Loss
5 pages

Sparse Concept Coding For Visual Analysis

Uploaded by

Sparse Concept Coding For Visual Analysis

Uploaded by

Sparse Concept Coding for Visual Analysis

product can well approximate X. Each column vector of U

3. Sparse Concept Coding

Let Y = [y1 , , yk ], yi s are the eigenvectors of the

proach, we believe that a good matrix factorization should

min Y XT U2 + U2

Considering the above two aspects, we propose a novel

where the term U2 is to regularize the model to avoid

where I is the identity matrix.

where the basis U should capture the intrinsic geometric

3.1. Basis Learning

3.2. Sparse Representation Learning

where Np (xi ) denotes the set of p nearest neighbors of xi .

min xi Uai 2

The Least Angel Regression (LARs) algorithm [6] can be

information metric MI(C, C ) is defined as follows:

to control the sparseness of ai by specifying the cardinality

Traditional sparse coding approaches iteratively find

4.2. Compared Algorithms

Baseline method which simply performs clustering in

4.1. Evaluation Metric

Laplacian Sparse Coding (LapSC) approach [7], which

where p(ci ) and p(cj ) are the probabilities that a sample

Comparing with the traditional sparse coding approaches

p(ci , cj ) log2

4.3. Clustering Results

where N is the total number of samples and (x, y) is the

We use k-means as the clustering algorithm. k-means

Table 1. Clustering performance

Normalized Mutual Information (%)

Mutual information (%)

Mutual information (%)

Figure 1. The performance of five matrix factorization methods vs.

Figure 2. The performance of SCC vs. the cardinality of the low

test, k-means algorithm was applied 10 times with different

sis vectors. We simply reported the clustering results using

representations found by SVD and NMF are dense, which

[8] G. H. Golub and C. F. V. Loan. Matrix computations. Johns

You might also like