Sinha Adelson ICCV93
Sinha Adelson ICCV93
Sinha Adelson ICCV93
1. Introduction
Figure 1. A typical real world scene. Changes in surface
reflectance or orientation lead to variations in image
luminance that may have identical profiles locally.
2. Previous work:
The predominant paradigm for analysing images in terms
of their illumination and reflectance components is Retinex
[16, 17, 12, 19, 5], motivated by Land's lightness
constancy experiments with 'Mondrian' stimuli (the term
'lightness' denotes a perceptual estimate of true reflectance).
Mondrian patterns consist of patches of differently colored
paper pasted on a planar background. The illumination
across the patterns is constrained to vary smoothly. In such
a setup, discontinuities in image luminance coincide
exclusively with reflectance discontinuities in the scene.
Retinex exploits this characteristic to recover the lightness
distribution from a given image. The image is differentiated
and then thresholded to get rid of slow intensity variations
due to illumination. Subsequent reconstitution of the
image through integration leads to the recovery of the
underlying reflectance distribution upto an unknown offset.
While Retinex performs well in the Mondrian domain,
its assumption of a smoothly varying illumination gradient
limits its usefulness in a 3-D world containing painted
polyhedra. The sharp intensity transitions exhibited by
156
(a)
(b)
Figure 2. Retinex treats both the patterns shown above
identically; as far as it is concerned, the pattern on the left
is just another flat Mondrian with three differently colored
patches. The more natural interpretation of pattern (a)
would be as a uniform albedo 3-D cube illuminated in a
particular fashion.
Adelson and Pentland [2] proposed a cost based
'workshop metaphor' for interpreting images of simple
painted polyhedra. A set of specialists deal with 3-D shape,
illumination, and reflectance, and seek a least cost model
for the image. The model was able to deal with some
simple images but did not readily generalize to other
situations.
In the rest of the paper, we describe a new approach that
seems better suited to interpreting images of objects
belonging to the world of painted polyhedra.
3. A new direction:
In the present investigation, our domain consists of
painted polyhedral/origami objects such as those shown in
figure 3. We assume an absence of inter- and intra-object
occlusions and cast shadows. The surfaces shall be assumed
to be qualitatively matte without necessarily being
precisely Lambertian. The scene shall be assumed to be
illuminated with a diffuse ambient and a single distant light
source (see [20] for justification).
S2
S1
a
(a)
b
a
(b)
Zig-zag
Stairs and Stripes
Figure 3. Two sample objects from our domain of interest.
Notice that the figure on the right represents an object
whose physical realization is impossible.
157
Reflectance edges
Illumination edges
Reflectance edg es
158
(b)
(a)
Line
drawing
Input
(a)
3-D shape
recovery
3-D shape
3-D shape
Shading
consistency
check
Light
source
(b)
3.3
(b)
As stated earlier, our aim here is to interpret 2-D linedrawings extracted from the input gray-level patterns in
terms of their perceptually/physically likely 3-D structures.
The difficulty of this task, well documented by sevral
reserachers [14, 22], arises from its highly underconstrained
nature; any planar line-drawing is geometrically consistent
with infinitely many 3-D structures, as shown in figure 11.
159
After
Before
l3
l2
160
1
n
S1
S2
161
n1
1
2
n2
n3
(a)
(a)
(c)
(c)
(c)
(b)
(d)
(d)
(d)
(a)
(b)
(b)
162
Input image
Reflectance
image
Illumination
image
3-D structure
Input image
Reflectance
image
Illumination
image
3-D structure
References:
[1] Adelson, E. H. 1993. Perceptual organization and the
judgment of brightness. Science, Vol. 262: 2042-2044.
[2] Adelson, E. H., and Pentland, A. 1990. The Perception of
Shading and Reflectance, MIT Media Laboratory
Technical Report # 140.
[3] Barrow, H. G., and Tenenbaum, J. M., 1978. Recovering
Intrinsic Scene Characteristics from Images. In Hanson,
A. R., and Riseman, E. M., Computer Vision Systems, 326 Academic Press.
[4] Barrow, H. G., Tenenbaum, J. M. 1981. Interpreting linedrawings as three-dimensional surfaces. Artificial
Intelligence, Vol. 17, Nos. 1-3, pp. 75-116.
[5] Blake, A. 1985. Boundary conditions for lightness
computation in Mondrian world. Computer Vision,
Graphics and Image Processing, 32: 314-327.
[6] Cormen, T., Lieserson, C., Rivest, R., 1991.
Fundamentals of Algorithms, McGraw Hill: New York.
[7] Dines, L. L. 1918-19. Systems of linear inequalities.
Annals of Mathematics, 2nd series, vol. 20, pp. 191199.
[8] Duda, R. O. & Hart, P. E. 1973. Pattern Classification and
Scene Analysis. Wiley: New York.
[9] Fischler, M. A., Leclerc, Y. G. 1992. Recovering 3-D wire
frames from line drawings. Proceedings of the Image
Understanding Workshop.
[10] Fourier, J. J.-B. Oeuvres II, Paris, 1890.
[11] Gilchrist, A., Delman, S., and Jacobsen, A. 1983. The
Classification and Integration of Edges as Critical to the
163