10618 • The Journal of Neuroscience, August 1, 2012 • 32(31):10618 –10626
Behavioral/Systems/Cognitive
A Fast and Simple Population Code for Orientation in
Primate V1
Philipp Berens,1,2,3,4 Alexander S. Ecker,1,2,3,4 R. James Cotton,4 Wei Ji Ma,4 Matthias Bethge,1,2,3
and Andreas S. Tolias2,4,5,6
Werner Reichardt Centre for Integrative Neuroscience and Institute of Theoretical Physics, University of Tübingen, 72076 Tübingen, Germany, 2Bernstein
Center for Computational Neuroscience Tübingen and 3Max Planck Institute for Biological Cybernetics, 72076 Tübingen, Germany, 4Department of
Neuroscience, Baylor College of Medicine, and 5Michael E. DeBakey Veterans Affairs Medical Center, Houston, Texas 77030, and 6Department of
Computational and Applied Mathematics, Rice University, Houston, Texas 77005
1
Orientation tuning has been a classic model for understanding single-neuron computation in the neocortex. However, little is known
about how orientation can be read out from the activity of neural populations, in particular in alert animals. Our study is a first step
toward that goal. We recorded from up to 20 well isolated single neurons in the primary visual cortex of alert macaques simultaneously
and applied a simple, neurally plausible decoder to read out the population code. We focus on two questions: First, what are the time
course and the timescale at which orientation can be read out from the population response? Second, how complex does the decoding
mechanism in a downstream neuron have to be to reliably discriminate between visual stimuli with different orientations? We show that
the neural ensembles in primary visual cortex of awake macaques represent orientation in a way that facilitates a fast and simple readout
mechanism: With an average latency of 30 – 80 ms, the population code can be read out instantaneously with a short integration time of
only tens of milliseconds, and neither stimulus contrast nor correlations need to be taken into account to compute the optimal synaptic
weight pattern. Our study shows that—similar to the case of single-neuron computation—the representation of orientation in the spike
patterns of neural populations can serve as an exemplary case for understanding the computations performed by neural ensembles
underlying visual processing during behavior.
Introduction
How do populations of neurons represent information in their
joint firing patterns and how can this information be read out by
downstream neurons (Averbeck et al., 2006)? In the motor system, for instance, numerous studies have applied decoding methods to obtain a detailed understanding of the population code for
movement direction (Georgopoulos et al., 1986; Hatsopoulos
and Donoghue, 2009). In the visual system, however, the situation is different: Ever since Hubel and Wiesel discovered that
neurons in the primary visual cortex (V1) of cats and monkeys
are tuned to the orientation of a visual stimulus (Hubel and WiReceived March 19, 2012; revised May 3, 2012; accepted May 24, 2012.
Author contributions: P.B. and A.S.T. designed research; P.B., A.S.E., and A.S.T. performed research; R.J.C.,
W.J.M., and M.B. contributed unpublished reagents/analytic tools; P.B. analyzed data; P.B., A.S.E., and A.S.T. wrote
the paper.
This work was partially supported by the German National Academic Foundation (P.B.); by the German Ministry
of Education, Science, Research and Technology through the Bernstein Award (FKZ 01GQ0601) (M.B.) and the
Bernstein Centre for Computational Neuroscience (FKZ 01GQ1002); the German Excellency Initiative through the
Centre for Integrative Neuroscience Tübingen (EXC307); the Max Planck Society; National Eye Institute–National
Institutes of Health Grant R01 EY018847 (A.S.T.); The Arnold and Mable Beckman Foundation Young Investigator
Award (A.S.T.); and The McKnight Endowment Fund for Neuroscience Scholar Award (A.S.T.). We thank M. Subramaniyan, T. Shinn, and D. Murray for technical assistance.
This article is freely available online through the J Neurosci Open Choice option.
Correspondence should be addressed to either of the following: Philipp Berens, Werner Reichardt Centre for
Integrative Neuroscience, Ottfried-Müller-Straße 25, 72076 Tübingen, Germany, E-mail: philipp@bethgelab.org; or
Andreas S. Tolias, Baylor College of Medicine, Department of Neuroscience, One Baylor Plaza, S553, Houston, TX
77030, E-mail: astolias@bcm.edu.
DOI:10.1523/JNEUROSCI.1335-12.2012
Copyright © 2012 the authors 0270-6474/12/3210618-09$15.00/0
esel, 1959, 1968), orientation selectivity of single neurons and the
underlying mechanisms have been studied extensively (De Valois
et al., 1982; Ferster and Miller, 2000; Ringach et al., 2002). Despite
this intense attention, little is known about how populations
of V1 neurons represent orientation, in particular in alert primates. Only recently, decoding methods have been applied to
population recordings from primate V1 to infer optimal spatial and temporal pooling rules for stimulus detection (Chen
et al., 2006, 2008) and to investigate the importance of correlations for decoding from the spiking responses of V1 population in anesthetized monkeys (Graf et al., 2011). In addition,
a similar approach has been used to investigate the representation of dynamic stimulus sequences in the anesthetized cat
(Benucci et al., 2009).
Here, we study the population code for orientation in V1 by
simultaneously recording spiking activity from populations of up
to 20 well isolated V1 neurons in alert macaque monkeys, while
the animals are viewing static gratings with different orientations
and contrasts. We apply a simple neurally plausible decoder to
discriminate between two orientations based on the population
response patterns in short time windows. We focus on two questions: First, what are the time course and the timescale at which
orientation can be read out from the population response? Second, how complex does the decoding mechanism in a downstream neuron have to be to reliably discriminate between visual
stimuli with different orientations? The answers to these questions place constraints on the perceptual performance the popu-
Berens et al. • Population Code for Orientation in Primate V1
lation code can support and constrain the neural machinery
necessary to perform further processing downstream.
We find that the population code for orientation in the primary visual cortex of alert macaques is remarkably fast and simple: The decoder can discriminate between two orientations
already after 30 – 80 ms, using the spike count in a time window as
small as 30 ms, and its performance is best during the initial
transient phase of the trial. Surprisingly, a fixed decoder can perform well throughout the whole trial and even across different
contrasts, leading to a largely contrast-invariant population code.
Finally, we show that—at the population size studied here—the
decoder does not have to take correlations between neurons into
account, substantially simplifying the process of learning the optimal readout weights.
Materials and Methods
Electrophysiological recordings. Recordings were made from two adult
male rhesus monkeys (Macaca mulatta) using chronically implanted arrays of 12 and 24 custom-build tetrodes (monkeys D and H, respectively). The general methods, including implant design and surgical
procedures, have been described previously (Tolias et al., 2007; Ecker et
al., 2010). All procedures were conducted in accordance with the guidelines of the local authorities (Regierungspräsidium) and the European
Community (EUVD 6/609/EEC) for the care and use of laboratory animals (monkey D) and the Baylor College of Medicine Institutional Animal Care and Use Committee-approved protocols and the NIH Guide for
the Care and Use of Laboratory Animals (monkey H).
Spike sorting was done by fitting a mixture of Gaussians model to the
detected and aligned spikes after reducing dimensionality by principal
component analysis. The mixture model obtained from the spike sorting
algorithm allows assessing single-unit isolation quantitatively. Details of
spike sorting and quantification of single-unit isolation have been described previously (Tolias et al., 2007). We used only very well isolated
cells for which the sum of estimated false positives and misses did not
exceed 5% of their total number of spikes. We used only neurons whose
average firing rate during stimulation across orientation and contrast
exceeded 0.1 Hz.
For most of the analysis in the paper, we used a dataset of 17 sessions
(monkey D, 12; monkey H, 5) in which static gratings were shown (see
below for details on visual stimulation). According to our isolation and
firing rate criteria, those sessions contained data from 6 to 20 neurons
each (average population size, 10). For the control analysis of Figure 6, we
used an additional dataset with 27 sessions (monkey D, 14; monkey H,
13) in which drifting gratings were shown. This dataset contained 5–19
neurons per session (average population size, 10.7). These datasets have
also been analyzed in the study by Ecker et al. (2010).
Behavioral paradigm and visual stimulation. For the main dataset, static
sine wave gratings were displayed on CRT monitors. Trials were initiated
by a sound and the appearance of a colored fixation target (⬃0.15°) at the
center of the screen. Three hundred milliseconds after the monkey acquired fixation, the stimulus was presented for 500 ms. The monkey had
to maintain fixation for another 300 ms until the fixation target disappeared. Upon successful completion of a trial, it was rewarded by a drop
of juice. The animals were implanted with a scleral search coil and required to fixate within a radius of 0.5–1°.
The sine wave gratings always covered and extended beyond the receptive fields of all neurons recorded from the array (typical size, 4.5°; spatial
frequency, 4 cycles/°; fixed phase across trials). They were presented at
eight different orientations and two different contrasts in each session (1,
2, 3, 10, 20, 30, 50, and 100% Michelson contrast were used). For each
condition, we collected between 10 and 85 trials per stimulus condition.
In four sessions, only seven of eight orientations were shown.
We labeled contrast levels up to 3% as “low,” between 10 and 30% as
“medium,” and ⬎50% as “high.” According to this scheme, 8 datasets
were recorded at low contrast, 16 datasets at medium contrast, and 10
datasets at high contrast, with each session resulting in two datasets with
two different contrasts (see above).
J. Neurosci., August 1, 2012 • 32(31):10618 –10626 • 10619
For the dataset used in the control analysis of Figure 6, drifting sine
wave gratings were used (contrast, 100%; drift speed, 3.4 cycles/s). All
other stimulation procedures were identical to the main dataset.
Preprocessing. We binned the spike trains at a resolution of 10 ms.
Binned spike trains were convolved with a boxcar filter, typically 50 ms
wide (except when otherwise noted). Times were aligned to the end of the
boxcar window (not the center) such that the bin with the label “100 ms”
contains the number of spikes fired by the neuron between 50 and 100 ms
after stimulus onset.
Decoding analysis. We probe the population representation of orientation using a classification task. We used regularized logistic regression
(Bishop, 2007) to decode the stimulus orientation from the population
response X[t] ⫽ (X1[t], . . . , XN[t])T on a single trial, where Xj[t] is the
number of spikes fired by neuron j in the tth time bin and N is the number
of neurons in the population. The task of the decoder was to decide
whether this population response vector occurred in a trial with stimulus
orientation 1 or 2. Such a classification task is commonly used in psychophysical studies in humans and monkeys (Vázquez et al., 2000) and
has also proven useful to theoretically assess the quality of different population coding schemes (Berens et al., 2011).
Logistic regression is a generalized linear model for two-class classification, where the posterior over the class is a log-linear function of the
population response (Bishop, 2007) as follows:
冉 冊
P 共 ⫽ 1兲 ⫽ w TX 关 t 兴 .
(1)
Here, (a) ⫽ 1/(1 ⫹ exp(⫺a)) is the logistic function and w is a vector of
weights. We trained the logistic regression model with L2 regularization
using the glmnet toolbox (Friedman et al., 2010) in MATLAB (Mathworks). The instantaneous decoder is a decoder that was trained separately for each time bin t and each combination of stimulus orientations.
We deviated from this scheme to test certain properties of the population
code as described below.
We performed cross-validation using the repeated random subsample
technique (80% training data, 20% test data, balanced between both
classes, 100 repetitions) for the whole regularization path (100 regularization parameters logarithmically spaced between 10 ⫺10 and 10). We
apply the “1 SE” rule (Friedman et al., 2010) and report the decoding
error/percentage correct of the most strongly regularized decoder within
1 SE of the decoder with the best decoding performance. This is a conservative estimate of the true decoding performance.
In several cases, we made modifications to the decoder to test whether
the readout quality would be affected by those changes. (1) The cumulative decoder was trained and tested on the spike count across the whole
trial. (2) The constant decoder was trained on the spike count across
the whole trial (like to the cumulative one) and tested on the spike
count for each specific time bin individually (in contrast to the cumulative one). (3) The decoding weights assuming that the spike
counts of the population follow an independent Poisson distribution
were computed as follows:
wi ⫽
f i共 兲 ⫺ f i共 ⫹ ⌬ 兲
0.5 共 f i 共 兲 ⫹ f i 共 ⫹ ⌬ 兲兲
(2)
(for derivation, see Ma, 2010). (4) Contrast invariance was tested in two
ways: First, by training the decoder on the combined trials of both high
and low contrast and testing it on either one of them (contrastindependent decoder). For training, we randomly selected 50% of the
trials from each contrast to ensure that the contrast-independent decoder had the same amount of information available for learning as the
specialized one. Second, by training a decoder on the high-contrast trials
and evaluating its performance on the low-contrast trials of the same
population and vice versa (cross-contrast decoder). (5) The weight vector of the linear logistic regression decoder is influenced by the average
spike count and the covariance matrix of the neurons. To test for the
importance of covariations in the firing rates for the decoding performance, we trained the logistic regression model on trial-shuffled data.
The performance of the decoder was then evaluated on the original da-
10620 • J. Neurosci., August 1, 2012 • 32(31):10618 –10626
Berens et al. • Population Code for Orientation in Primate V1
taset (Latham and Nirenberg, 2005). In addition, we tested whether the
variances and covariances of the firing rates contained extra information
not contained in the spike counts by incorporating the quadratic features
冑XiXj as additional features into the logistic regression decoder as one
specific example of a nonlinear readout.
Population model. We use a standard population model consisting of
200 neurons with homogenous orientation tuning functions modeled by
cosine-like tuning functions (Berens et al., 2011). The model population
has two subpopulations, differing in the width of their tuning curve and
the semisaturation contrast of their contrast response functions. Contrast response functions were modeled as:
f共c兲 ⫽
c
n
50
cn
,
⫹ cn
(3)
where n is a constant determining the steepness of the curve. The semisaturation contrast c50 was set to 5% for subpopulation 1 and to 50% for
subpopulation 2. Thus, at low contrast, only the broadly tuned subpopulation was activated, while at high contrasts ⬎50% of both subpopulations were activated. For illustration, the steepness parameter n was set to
20, a value higher than what is commonly observed experimentally
(⬃1–5) (Albrecht and Hamilton, 1982). This was done to achieve a fast
transition between contrasts in which only the broad subpopulation was
activated and contrasts in which both subpopulations were activated. We
calculated the minimum discrimination error for two gratings 10° apart
(Berens et al., 2011). We used a short time window (50 ms) and set the
peak firing rate of the neurons to 50 Hz, the baseline rate to 5 Hz.
Other statistical methods. For an estimate of the decoding performance
achieved by single neurons, we computed:
d⬘i ⫽
i共 1兲 ⫺ i共 2兲
冑共 i2共 1兲
⫹ i2 共 2 兲兲 / 2
,
(4)
where i() and i2 共 兲 are the average spike count and the variance of
neuron i in response to stimulus and the dependence on t has been
omitted for clarity. From d= we compute the classification error (Duda et
al., 2000) by 1 ⫺ ⌽(d⬘/2), where ⌽(x) is the cumulative normal distribution function. The population d⬘ was computed by inverting this formula
and applying it to the discrimination error of the population decoder.
The Fano factor was computed as
F ⫽
2i 关 t 兴
i关 t 兴
(5)
for each neuron and time window. Time bins with no spikes were omitted from further analysis.
Latency was computed as the time at which the decoding performance
reached 75% of its maximal value.
Relative performance was computed as the ratio of the performance
gains over chance level, (Pc( A) ⫺ 0.5)/(Pc( B) ⫺ 0.5), where Pc( A) is the
performance of the decoder (in fraction correct) in condition A. Therefore, a decoder that achieves 55% correct in condition A and 60% in
condition B has a relative performance of 50%.
The two-way ANOVA for testing the effect of (1) the difference of
the preferred orientation of a neuron to the decision boundary and
(2) the difference in the orientation of the stimulus, ⌬, was computed
on the average weights between 50 and 250 ms after stimulus onset. We
computed the average weight per session for a given combination of the
two factors and treated the different sessions as replicates (N ⫽ 34).
Noise correlations for pairs of neurons were measured by the correlation coefficient
r sc ⫽
Cov 关 x, y 兩 兴
冑Var 关 x 兩 兴Var关 y 兩 兴
(6)
of the spike counts x, y of the two neurons. Spike counts were computed
in the whole trial of 500 ms duration.
If not otherwise stated, all reported average values are medians with
SEs. The SE of the median was computed using the MATLAB function
bootstrp with 1000 bootstrap samples.
Figure 1. A, Illustration of the decoder as a linear–nonlinear model neuron. It adds the
spikes of all presynaptic neurons in a small time window and multiplies them with a synaptic
weight. The result is summed and passed through a nonlinearity, and a threshold is applied. This
indicates which of two gratings was presented. B, Time course of the decoding performance for
an example session with 20 neurons recorded at 10 and 100% contrast (green and blue lines,
respectively) averaged over ⌬. C, Average time course for all 17 sessions sorted by contrast
(low contrast: 1–3%, red; medium contrast: 5–30%, green; high contrast: 50 –100%, blue)
averaged over ⌬. D, Neurometric functions for the example session in B with decoding performance as a function of ⌬. The color code is as in B. E, Average neurometric functions for all
17 sessions sorted by contrast. The color code is as in D.
Results
Our goal was to study how orientation is represented in the joint
activity patterns of neural ensembles in primary visual cortex of
awake primates. To this end, we recorded simultaneously from
populations of up to 20 well isolated V1 neurons in two awake
macaques using chronically implanted tetrode arrays (Tolias et
al., 2007; Ecker et al., 2010). The animals were viewing static
gratings with different orientations and contrasts (17 sessions,
6 –20 neurons; two contrasts per session; for details, see Materials
and Methods). We took a decoding approach to study how fast
and on what timescale orientation could be read out from the
neural activity and how this readout depended on contrast or
correlations. We used logistic regression (Bishop, 2007) to discriminate between two gratings of different orientation based on
the instantaneous population response patterns in short time
windows of 50 ms (see Materials and Methods), which may be
biophysically plausible for dendritic integration (London and
Häusser, 2005). Reminiscent of a simplified linear–nonlinear
neuron model (Schwartz et al., 2006), the logistic regression decoder adds the weighted spike counts of its inputs, applies a sigmoidal nonlinearity, and decides whether the output is larger or
smaller than a threshold to indicate whether the population
pattern is a response to one or the other of the two orientations
Berens et al. • Population Code for Orientation in Primate V1
(Fig. 1 A; see Materials and Methods, Eq. 1). We trained a specific
decoder on each time bin and studied how the classification performance of this decoder evolved as a function of time and
changed with the difference between the orientations of the gratings and their contrast (Fig. 1).
We found that the readout of orientation is most accurate
during the transient phase of the neural response. The decoding
performance rose sharply after trial onset with a latency of 77.5 ⫾
4.1 ms (75% of the peak performance; Figs. 1 B, C, 2 A). This
means that the decoder achieved 75% of the peak performance
using the spikes in the time bin between 30 and 80 ms after
stimulus onset. On average, the peak performance was reached
after 120.0 ⫾ 19.0 ms (Fig. 2 B). After this, the performance typically decayed until the end of the trial (Fig. 1 B, C): In 30 of 34
datasets, the decoder achieved the best discrimination performance during the initial transient phase of the neural response
(90 –130 ms). During this time window, the relative performance
compared with the sustained response (300 – 400 ms) was
125.9 ⫾ 9.2% (Fig. 2C). An additional peak in decoding performance during the off-response at the end of each trial was present
in 3 of 17 sessions (data not shown).
Both the difference between the orientations of the gratings
(⌬) and the contrast had an effect on the latency of the population readout and its overall performance: The latency decreased
with increasing ⌬ (Fig. 2 D; two-way ANOVA on latencies with
factors ⌬ and contrast; main effect of ⌬, p ⫽ 0.021) and with
increasing contrasts (Fig. 2 A; Spearman’s rank correlation:
⫺0.63, p ⬍ 0.001). Similarly, the decoder achieved higher performance at higher contrasts, both on average (Fig. 1C,E) and in 15
of 17 sessions individually (Fig. 2 E, binomial test, p ⬍ 0.001), and
classification performance was highest for orthogonal gratings
(Fig. 1 D, E).
We next changed the size of the time window used to count
the input spikes for the decoder to study the timescale necessary
to reliably discriminate two orientations based on the population
response. In addition, we asked whether a decoder that can count
spikes over hundreds of milliseconds outperforms one that can
count spikes only in a small time window of tens of milliseconds.
We found that an instantaneous readout based on the spikes in a
small time window of tens of milliseconds is sufficient to achieve
good decoding performance. Specifically, the relative performance was hardly changed if the duration of the integration window was shortened to 30 ms or extended to 70 ms (Fig. 3A;
relative performance, 92 and 105%, respectively). Using an integration window of only 10 ms, the decoder still achieved a relative
performance of 67% on average. We explicitly tested whether the
readout could be improved by accumulating spikes over longer
periods of time (cumulative decoder; see Materials and Methods).
We found that the instantaneous decoder with an integration window of 50 ms achieved a relative performance of 78.7 ⫾ 2.1% compared with the cumulative decoder across the whole trial (Fig. 3B,
dark). For comparison, an instantaneous readout of single neurons
achieved only 34.6 ⫾ 0.8% relative performance compared with the
cumulative one (Fig. 3B, light).
We next wondered to what extent single-neuron properties
such as firing rate or response variability determine the time
course of the population readout accuracy. The firing rates of the
neurons in our sample showed a transient increase right after
stimulus onset, after which the firing rates decayed until the end
of the trial (Fig. 4 A; average firing rate before stimulus onset,
4.02 Hz), in line with earlier reports (Müller et al., 2001). The
response variability as measured by the Fano factor dropped after
stimulus onset before returning back to its baseline value (Fig.
J. Neurosci., August 1, 2012 • 32(31):10618 –10626 • 10621
A
B
C
D
E
Figure 2. A, Latency as a function of contrast. There are 34 data points in total, 2 for each of
the 17 sessions. The dashed line indicates the median latency. B, Distribution of the time to peak
performance. The dashed line indicates the median time to peak performance. C, Distribution of
the relative performance of the transient period (80 –130 ms) versus the sustained period
(300 – 400 ms). The dashed line indicates the median relative performance. D, Mean latency as
a function of the difference in orientation, ⌬. E, Peak performance as a function of contrast.
The solid lines connect the two data points belonging to the same session. The black dots and
lines indicate sessions in which the decoding performance increased with contrast, and the gray
dots and lines sessions in which the performance decreased with contrast. The dashed line
indicates median peak performance.
A
B
Figure 3. A, Relative peak performance using the integration window size denoted on the
x-axis compared with the standard integration window size of 50 ms. The peak performance is
computed for each length of the integration window separately and was defined as the performance in the time window that gave the best performance with that integration time. B,
Distribution of the relative peak performance of the instantaneous decoder compared with the
cumulative decoder for populations (dark) and single neurons (light).
4 B; average Fano factor before stimulus onset, 1.04), akin to what
has been observed in V1 and other brain areas before (Müller et
al., 2001; Churchland et al., 2010). Interestingly, contrast had
slightly different effects on the firing rates and the response variability: During the transient phase of the response, the firing rates
hardly differed between the medium and high contrast condition,
10622 • J. Neurosci., August 1, 2012 • 32(31):10618 –10626
Berens et al. • Population Code for Orientation in Primate V1
while the Fano factor showed a pronounced decrease from meA
B
dium to high contrast (Fig. 4 A, B). The time course of the singleunit discriminability reflects both the firing rate and response
variability modulations (Fig. 4C): It peaks during the transient
phase of the response (Müller et al., 2001) and closely resembles
the time course of the population readout accuracy (Fig. 4C,D)
(see also Fig. 1C). Thus, the time course of the accuracy of the population representation seems largely determined by the properties of
its constituent neurons (see also below for an analysis of the importance of correlations).
After having established that an instantaneous decoder can
provide a fast and reliable readout of the population response, we
C
D
turned to study the weights of the population decoder. We investigated how they evolve during the trial or change with contrast
and whether they depend on noise correlations between neurons. We found that, on average, the readout weights did not
change over the time of the trial (Fig. 5 A, B). They depended
on the distance of the preferred orientation of a neuron to the
“decision boundary” in the middle between the two discriminated orientations (Fig. 5 B, C; for statistical analysis, see below), with substantial variability around this mean (Fig. 5D).
This variability is likely partially due to the small populations
sampled as well as true tuning curve and firing rate variability
Figure 4. A, Average firing rate change after stimulus onset compared with baseline (200 to
0 ms before stimulus onset) for all single neurons sorted by contrast (low, medium, and high
in the neural population. Interestingly, the average weight
contrast in red, green, and blue, respectively) averaged over all orientations. Seventy-seven
profile did not depend on contrast (Fig. 5C), but was affected
neurons were recorded at low contrast, 180 neurons at medium contrast, and 119 neurons at
by the orientation difference between the two stimuli (Fig.
high contrast. The firing rate profile was smoothed using a Gaussian kernel with SD of ⬃30 ms.
5E): If the stimulus orientations were closer together, neurons
B, Average Fano factor change after stimulus onset computed and smoothed as in A. C, Average
with preferred orientations closer to the decision boundary
single-unit discriminability d⬘ change averaged across all combinations of ⌬. D, Average
were weighed more strongly than if they were farther apart
population discriminability d⬘ change averaged across all combinations of ⌬.
(two-way ANOVA, main effect of ⌬, n.s.; main effect of the
difference of the preferred orientation
relative to the decision boundary, p ⬍
A
10 ⫺10; interaction, p ⬍ 10 ⫺10; see Materials and Methods). For ⌬ ⫽ 22.5°,
the weight profile closely resembled that
of an optimal decoder reading out a
B
C
D
E
F
population with spike counts following
an independent Poison distribution
(Fig. 5F ).
As the average weights do not depend
on time, we expect a constant decoder
trained on the spike count in the whole
trial to perform as well as the instantaneous decoder trained on each time bin
separately. We found that this was indeed
the case, in particular during the sustained
phase of the trial (Fig. 6 A, B; relative peak Figure 5. A, Average absolute weight as a function of time for the three contrast groups (low, medium, and high contrast in red,
performance, 92.3 ⫾ 1.3%). Note that this green, and blue, respectively), renormalized to a maximum of 1 for each contrast separately. B, Average weight as a function of
decoder is trained on the average spike time and the distance of the preferred orientation of a neuron to the decision boundary in the middle of the two orientations that
were discriminated. The average is taken across neurons. Weights for medium contrast are shown. The dark color indicates
count during a trial just as the cumulative negative weights, and the bright color, positive weights. C, Average weight profile as a function of the distance of the preferred
decoder, but tested on the spike counts in orientation of a neuron to the decision boundary for the three contrast groups (low, medium, and high contrast in red, green, and
each individual time bin. To rule out that blue, respectively), renormalized to a maximum of 1 for each contrast separately. D, As in C, but with variability (all contrasts
the success of the constant decoder is due collapsed). Error bars show 1 SD. E, Average weight as a function of the distance between the two gratings that were discriminated
to the constant phase of the stimulus, we and the distance of the preferred orientation of a neuron to the decision boundary. The dark color indicates negative weights, and
performed the same analysis in an addi- the bright color, positive weights. F, The normalized weights of the logistic regression decoder (black) compared with the optimal
tional dataset in which moving gratings weights assuming that the spike counts of the population follow an independent Poisson distribution (gray) for ⌬ ⫽ 22.5°.
were used for stimulation (see Materials
and Methods). We found that the readout performance of the
found in macaque V1 in all layers except layer 4C (Ringach et al.,
constant decoder remained close to the bin-based decoder (Fig.
2002).
6C; relative peak performance, 91.2 ⫾ 1.8%). Also, the drop in
In addition, we expect the population readout to be contrast
peak performance was similar for both static and moving gratings
invariant to some degree, as the average weight profile also does
(Fig. 6 D). Therefore, the population code seems largely phase
not depend on contrast (Fig. 5C). In this case, the weights learned
invariant, consistent with the large fraction of complex cells
at one contrast could be used to decode orientation at another. To
Berens et al. • Population Code for Orientation in Primate V1
J. Neurosci., August 1, 2012 • 32(31):10618 –10626 • 10623
compared it with the contrast-specific decoder trained and tested on the same contrast (Fig. 7C,D). We first analyzed the
relative decoding performance conditioned on whether the decoder was
trained on a contrast belonging to the low,
medium, or high contrast group (averaging across all test contrasts). If the decoder
was trained on medium or high contrast,
the relative performance of the crosscontrast decoder compared with the specialized decoder was high (84.0 and 84.5%
Figure 6. A, Relative peak performance of the constant decoder compared with the bin-based one. The dashed line indicates at medium and high contrast, respecmedian relative performance. B, Time course of the relative performance of the constant decoder compared with the bin-based tively). When trained on low contrast, the
one. C, As in B for an additional dataset of sessions in which drifting gratings were shown. Despite the continually changing phase, relative performance was significantly
the relative performance of the constant decoder remains high compared with the bin-based one. D, Comparison of the peak
lower (Fig. 7C; 52.1%; Kruskal–Wallis
performance using a bin-based or constant decoder for both static (red) and moving (blue) gratings.
test, p ⫽ 0.0028). Next, we analyzed the
relative performance depending on which
contrast level the decoder was tested on (averaging across all
A
B
training contrasts). We found that the relative performance did
not change depending on the contrast level the decoder was tested
on (Fig. 7D; 84.8, 74.1, and 77.7%, respectively; Kruskal–Wallis
test, p ⫽ 0.25). These results indicate that a contrast-invariant
decoder can indeed achieve almost as good performance as a
contrast-specialized one. Why is the relative performance of the
decoder significantly impaired when trained on low contrast? At
low contrasts, not all necessary information is present to determine the decoding weights for all neurons accurately. This can
occur, for example, if a neuron does not fire at all, because the
C
D
contrast is far below its semisaturation contrast. If the weights of
the decoder are determined in a way such that the correct weights
are assigned to each neuron at high enough contrast, contrastinvariant orientation readout is possible.
Finally, noise correlations, shared trial-to-trial fluctuations in
the population response, are believed to affect the population
code (Averbeck et al., 2006). In the anesthetized macaque, they
are typically found to be on the order of ⬃0.1– 0.2 (Kohn and
Smith, 2005; Smith and Kohn, 2008). Here, they contribute
⬃10% to the total information about the orientation of stimulus
contained in pairs of neurons (Montani et al., 2007). FurtherFigure 7. A, Average neurometric functions sorted by the contrast the decoder was tested on
more, ignoring these noise correlations found in anesthetized
(low, medium, and high contrast in red, green, and blue, respectively) showing decoding peranimals in a population readout can lead to a significant loss in
formance as a function of ⌬, for both a decoder trained and tested on the same contrast (solid)
decoding performance (Graf et al., 2011).
and a decoder trained on data from both contrasts recorded during a single session and tested
on one of them (dashed). B, Distribution of the relative peak performance comparing a decoder
In contrast, we found that the decoding weights in the awake
trained on both contrasts and tested on one of them to one trained and tested on the same
macaque can be learned without knowledge of the correlation
contrast. The dashed line indicates median relative performance. C, Average relative peak perstructure of the population. In our data, noise correlations were
formance comparing a decoder trained on one contrast and tested on the other (“cross-contrast
small (Ecker et al., 2010) to begin with (0.005 ⫾ 0.002, mean ⫾
decoder”) to one trained and tested on the same contrast (“same-contrast decoder”) sorted by
SEM; Fig. 8 A; average across pairs from all tetrodes), but the
the contrast level trained on. Error bars denote SEs of the median. D, As in C, but sorted by the
average correlation level was slightly higher at low contrasts comcontrast level tested on.
pared with high ones (one-way ANOVA, p ⫽ 0.0044; Fig. 8 A,
inset). We studied whether the decoder would have to be aware of
test this property of the code, we first trained the decoder on the
these noise correlations for adjusting its readout weights. We
data acquired at the two contrasts from one session and evaluated
found that a decoder trained on trial-shuffled data (and tested on
it at one of them (contrast-independent decoder; see Materials
the original data; see Materials and Methods) performed as well
and Methods). For this and all following analyses, we returned to
as a decoder with access to the correlation structure of the poputhe static grating data. The contrast-independent decoder perlation response (Fig. 8 B; relative performance, 100 ⫾ 0.2%), informed almost as well as the contrast-specific decoder (Fig. 7 A, B;
dependent of the contrast level (Fig. 8 B, inset; one-way ANOVA,
94.6 ⫾ 1.1% relative performance). This means that the reprep ⫽ 0.74). This confirms our analysis above that the weights of the
sentations of orientation at the two contrast levels do not interlogistic regression decoder closely resemble the weights derived
fere with each other—in this case, we would have expected
from assuming that the population has an independent Poisson
strongly reduced classification performance. We next used a
spike count distribution (Fig. 5F ).
cross-contrast decoder that was trained on one contrast and
We also tested whether allowing the decoder to explicitly take
tested on the other contrast presented during that session. We
correlations into account improves decoding performance. To
10624 • J. Neurosci., August 1, 2012 • 32(31):10618 –10626
this end, we added quadratic features to the logistic regression
decoder, resulting in a nonlinear readout. This is different from
our analysis above, in which the decoder was linear but once
trained on the original data and once on the trial-shuffled data.
Since we had only few trials for many sessions, this resulted in a
slight drop in performance overall (relative performance quadratic vs linear decoder, 96.6%), as has been reported for the
anesthetized macaque (Graf et al., 2011). Thus, quadratic features—and thus more complex nonlinear decoders—are unlikely to improve classification performance substantially.
Discussion
Orientation tuning in single neurons has been a classic model for
understanding single-neuron computation. In the same way, the
representation of orientation in the spike patterns of neural populations can serve as an exemplary case for the understanding of
the computations performed by neural ensembles underlying visual processing during behavior. Our
study is a first step toward that goal. We
A
showed that the neural ensembles in primary visual cortex of awake macaques
represent orientation in a way that facilitates a fast and simple readout mechanism: With an average latency of 30 – 80
ms, the population code can be read out
instantaneously with a short integration
time of only tens of milliseconds and neither stimulus contrast nor correlations
need to be taken into account to compute
the synaptic weight pattern.
Berens et al. • Population Code for Orientation in Primate V1
A
B
Figure 8. A, Distribution of the noise correlations between pairs of cells. The dashed line
indicates the mean level of correlations. Inset, Average correlation level as a function of contrast
level (median ⫾ SE of the median). B, Distribution of the relative peak performance comparing
a decoder trained on shuffled data and tested on the real data with one trained and tested on
the real data. The dashed line indicates median relative performance. Inset, Average relative
performance as a function of contrast level (median ⫾ SE of the median).
B
C
Temporal aspects of the
Figure 9. A, Illustration of the model with two subpopulations. The broad subpopulation has a semisaturation contrast of 5%
population readout
and the narrow subpopulation of 50%. At low contrast (⬍50%), only the broad subpopulation is activated (top). At high contrasts
The accuracy of the population readout (⬎50%), both populations are active (bottom). B, Performance of a decoder discriminating between two gratings 10° apart at low
was highest during the transient phase of and high contrast using a contrast-specialized decoder (trained and tested on the same contrast; solid), a contrast-independent
the trial and followed the temporal mod- decoder (trained on both contrasts, tested on one; dashed), and a cross-contrast decoder (trained on one contrast, tested on the
ulation of the firing rate and variability of other; dotted). The lower contrast was 10% in all cases, while the higher contrast varied between 15 and 100%. C, Relative
performance of the contrast-independent and cross-contrast decoders compared with the specialized decoder.
the single neurons. Thereby, our findings
extend earlier work by Müller et al. (2001)
to neural populations. Interestingly, the weights of the populamary visual cortex exhibits a high degree of contrast invariance;
tion readout remain largely constant over the course of the trial,
that is, a readout neuron does not have to adjust its synaptic
even if the phase of the grating is continually changing. This
weights to a specific contrast. In contrast, a recent study reports
finding may seem surprising given the complex temporal tuning
that the population code is less contrast invariant in the anesthedynamics previously reported for V1 neurons (Ringach et al.,
tized monkey [Graf et al., (2011), their supplemental material]. It
1997). However, Ringach et al. used briefly flashed gratings to
is possible that the differences in the correlation structure of the
map the tuning functions measuring the “impulse response ” of
population activity between the anesthetized and awake state (see
the neurons. In contrast, we measured the “step response ” by
below) are responsible for the different degrees of contrast invaripresenting the stimulus continuously for several hundred milliance, as the magnitude of correlations in the anesthetized state
seconds possibly averaging over tuning dynamics reported in the
strongly depends on contrast (Kohn and Smith, 2005).
study by Ringach et al. Using briefly flashed gratings, Benucci et
One might assume that contrast invariance of the population
al. (2009) studied the population representation of orientation
code follows from the well known contrast invariance of single
sequences in V1 of anesthetized cats using a decoding approach.
neurons (Sclar and Freeman, 1982) in a straightforward manner:
The study focused on the transitions between stimuli of different
because single neurons show contrast-invariant orientation tunorientation and found that a simple instantaneous readout can
ing (i.e., the orientation tuning curve and the contrast response
perform well despite the interactions between the representations
function factorize), the population activity will be contrast inof successive stimuli. This suggests that our finding—a constant
variant as well. However, since neurons in V1 have differing tuninstantaneous decoder is sufficient to read out the population
ing and contrast response functions (Albrecht and Hamilton,
representation—may extend even to complex stimulus
1982; Ringach et al., 2002), this does not have to be the case. To
sequences.
illustrate this point, consider a model with two subpopulations,
one broadly tuned and one narrowly tuned, in which the narHow contrast-invariant is the population code?
rowly tuned neurons have a higher semisaturation contrast than
The computation of invariant representations is difficult (Rust
the broadly tuned neurons (Ganmor et al., 2009). The activity
and Stocker, 2010). We found that the population code in priprofile of such a population is not contrast invariant, despite
Berens et al. • Population Code for Orientation in Primate V1
contrast invariance at the single-neuron level (see Materials and
Methods) (Fig. 9A). For this model, the contrast-independent
decoder trained on data collected at both contrasts performs well
(Fig. 9 B, C), while the relative performance of a cross-contrast
decoder is only ⬃50% for both high and low contrasts. When
trained at low contrast, the decoder has no way to assign the
proper weights to the narrowly tuned subpopulation as these
neurons are not activated by the stimuli, similar to what we find
in our data (Fig. 7C). When trained at high contrast, the decoder
assigns higher weights to the narrowly tuned subpopulation than
to the broadly tuned neurons as they are more informative about
the stimulus. If this decoder is used at low contrast, the narrowly
tuned neurons only add noise to the decoding process, leading to
lower performance.
In our data, the population code achieves ⬃80% relative performance for the cross-contrast decoder (Fig. 7C,D); this is considerably more than could be achieved in the model, but direct
comparisons between model and data are difficult as the performance of the cross-contrast decoder depends on the exact setting
of the model parameters. The simple model illustrates, however,
that a high degree of contrast invariance is not a trivial consequence of contrast invariance at the single-neuron level. In cats,
semisaturation contrast and tuning width are independent
(Busse et al., 2009), such that the resulting population code is
indeed contrast invariant. Our study is the first to show that the
population code in V1 in alert macaques shares this property. It
will be interesting to explore how invariant the orientation representation is to changes in other parameters (e.g., spatial frequency or speed).
Are correlations important for decoding?
The magnitude and functional consequences of noise correlations have been subject to intense debate in recent years, both in
the theoretical and the experimental literature (Averbeck et al.,
2006; Smith and Kohn, 2008; Ecker et al., 2010, 2011; Cohen and
Kohn, 2011). Extending our previous work (Ecker et al., 2010),
we report here that the noise correlations measured in alert monkeys (⬃0.01) do not have to be taken into account when discriminating between different orientations, for both high and low
contrasts. In the anesthetized monkey, ignoring correlations
leads to significantly impaired readout performance even at a
population size similar to ours (Graf et al., 2011). The reason for
this discrepancy likely is that correlated variability is higher under
anesthesia (⬃0.16) (Smith and Kohn, 2008) than in the awake
state (⬃0.01) (Ecker et al., 2010), despite similar firing rates (average geometric mean firing rate, 3.4 vs 5 spikes/s, respectively)
[see also Greenberg et al. (2008) and Renart et al. (2010) for direct
comparisons]. This effect is possibly a result of ongoing brain
state modulations that occur under anesthesia (Kohn et al., 2009;
Ecker et al., 2010; Renart et al., 2010).
Our finding, however, is subject to some limitations: First, in
populations of several hundred neurons, even very small correlations may become important for reading out the population
code, in particular as they might imply stimulus specific higherorder correlation structures (Schneidman et al., 2006; Macke et
al., 2011). Second, the strength of noise correlations may vary as
a function of cortical layer (Hansen and Dragoi, 2011) or correlations in certain pairs of neurons may be stronger than the average correlations in the populations studied here (Ecker et al.,
2010; Ko et al., 2011). In this case, a detailed characterization of
the precise shape of the correlation structure is needed to determine the impact of noise correlations on neural coding. Third, we
evaluated the performance of the population code in a fairly easy
J. Neurosci., August 1, 2012 • 32(31):10618 –10626 • 10625
task, when discriminating between coarsely spaced orientations.
If the task is more difficult (e.g., when discriminating between
finely spaced orientations), correlations may become important
(Samonds and Bonds, 2004).
Implications for models of cortical computation
We have shown that the readout weights are independent of contrast and that potential improvements in decoding accuracy
through quadratic features are small at best. These results provide
evidence that the population variability in V1 is Poisson-like (i.e.,
it belongs to the exponential family with linear sufficient statistics) (Ma et al., 2006; Ma, 2010). This type of variability is a
crucial assumption made in theoretical studies of probabilistic
population codes. In such codes, uncertainty about the stimulus
is represented in the neural activity in addition to information
about the stimulus itself. Poisson-like variability is particularly
suitable to facilitate near-optimal Bayesian computation in tasks
such as cue combination (Ma et al., 2006), decision making (Beck
et al., 2008), and visual search (Ma et al., 2011). However, experimental evidence that cortical population activity is Poisson-like
has been lacking so far. It would be interesting to work out the
details of how uncertainty is represented in a real V1 population
and how it is used and transformed during perceptual decision
making.
Conclusions
This study demonstrates that decoding techniques are a useful
tool for investigating the properties of neural population codes,
in particular for testing invariance properties (for a similar approach in V4/IT, see Rust and Dicarlo, 2010). As a next step, it will
be important to compare neural decoding performance to psychophysical performance. This will allow judging how similar the
behavioral readout is to the simple readout mechanism suggested
by our results. Testing the contrast invariance of perceptual decisions might be an interesting starting point.
Notes
Supplemental material for this article is available at http://bethgelab.org/
datasets/v1gratings. This webpage contains all neurophysiological data used
in this article and example Matlab code. This material has not been peer
reviewed.
References
Albrecht DG, Hamilton DB (1982) Striate cortex of monkey and cat: contrast response function. J Neurophysiol 48:217–237.
Averbeck BB, Latham PE, Pouget A (2006) Neural correlations, population
coding and computation. Nat Rev Neurosci 7:358 –366.
Beck JM, Ma WJ, Kiani R, Hanks T, Churchland AK, Roitman J, Shadlen MN,
Latham PE, Pouget A (2008) Probabilistic population codes for Bayesian decision making. Neuron 60:1142–1152.
Benucci A, Ringach DL, Carandini M (2009) Coding of stimulus sequences
by population responses in visual cortex. Nat Neurosci 12:1317–1324.
Berens P, Ecker AS, Gerwinn S, Tolias AS, Bethge M (2011) Reassessing
optimal neural population codes with neurometric functions. Proc Natl
Acad Sci U S A 108:4423– 4428.
Bishop CM (2007) Pattern recognition and machine learning. Berlin:
Springer.
Busse L, Wade AR, Carandini M (2009) Representation of concurrent stimuli by population activity in visual cortex. Neuron 64:931–942.
Chen Y, Geisler WS, Seidemann E (2006) Optimal decoding of correlated
neural population responses in the primate visual cortex. Nat Neurosci
9:1412–1420.
Chen Y, Geisler WS, Seidemann E (2008) Optimal temporal decoding of
neural population responses in a reaction-time visual detection task.
J Neurophysiol 99:1366 –1379.
Churchland MM, Yu BM, Cunningham JP, Sugrue LP, Cohen MR, Corrado
GS, Newsome WT, Clark AM, Hosseini P, Scott BB, Bradley DC, Smith
10626 • J. Neurosci., August 1, 2012 • 32(31):10618 –10626
MA, Kohn A, Movshon JA, Armstrong KM, Moore T, Chang SW, Snyder
LH, Lisberger SG, Priebe NJ, et al. (2010) Stimulus onset quenches
neural variability: a widespread cortical phenomenon. Nat Neurosci
13:369 –378.
Cohen MR, Kohn A (2011) Measuring and interpreting neuronal correlations. Nat Neurosci 14:811– 819.
De Valois RL, Yund EW, Hepler N (1982) The orientation and direction
selectivity of cells in macaque visual cortex. Vision Res 22:531–544.
Duda RO, Hart PE, Stork DG (2000) Pattern classification: pattern classification, Part 1 2. Auflage. New York: Wiley.
Ecker AS, Berens P, Keliris GA, Bethge M, Logothetis NK, Tolias AS
(2010) Decorrelated neuronal firing in cortical microcircuits. Science
327:584 –587.
Ecker AS, Berens P, Tolias AS, Bethge M (2011) The effect of noise correlations in populations of diversely tuned neurons. J Neurosci
31:14272–14283.
Ferster D, Miller KD (2000) Neural mechanisms of orientation selectivity in
the visual cortex. Annu Rev Neurosci 23:441– 471.
Friedman J, Hastie T, Tibshirani R (2010) Regularization paths for generalized linear models via coordinate descent. J Stat Softw 33:1–22.
Ganmor E, Okun M, Lampl I (2009) The V1 population gains normalization. Neuron 64:778 –780.
Georgopoulos AP, Schwartz AB, Kettner RE (1986) Neuronal population
coding of movement direction. Science 233:1416 –1419.
Graf AB, Kohn A, Jazayeri M, Movshon JA (2011) Decoding the activity of
neuronal populations in macaque primary visual cortex. Nat Neurosci
14:239 –245.
Greenberg DS, Houweling AR, Kerr JN (2008) Population imaging of ongoing neuronal activity in the visual cortex of awake rats. Nat Neurosci
11:749 –751.
Hansen BJ, Dragoi V (2011) Correlated variability in laminar cortical circuits. Soc Neurosci Abstr 37:175.03.
Hatsopoulos NG, Donoghue JP (2009) The science of neural interface systems. Annu Rev Neurosci 32:249 –266.
Hubel DH, Wiesel TN (1959) Receptive fields of single neurones in the cat’s
striate cortex. J Physiol 148:574 –591.
Hubel DH, Wiesel TN (1968) Receptive fields and functional architecture of
monkey striate cortex. J Physiol 195:215–243.
Ko H, Hofer SB, Pichler B, Buchanan KA, Sjöström PJ, Mrsic-Flogel TD
(2011) Functional specificity of local synaptic connections in neocortical
networks. Nature 473:87–91.
Kohn A, Smith MA (2005) Stimulus dependence of neuronal correlation in
primary visual cortex of the macaque. J Neurosci 25:3661–3673.
Kohn A, Zandvakili A, Smith MA (2009) Correlations and brain states:
from electrophysiology to functional imaging. Curr Opin Neurobiol
19:434 – 438.
Berens et al. • Population Code for Orientation in Primate V1
Latham PE, Nirenberg S (2005) Synergy, redundancy, and independence in
population codes, revisited. J Neurosci 25:5195–5206.
London M, Häusser M (2005) Dendritic computation. Annu Rev Neurosci
28:503–532.
Ma WJ (2010) Signal detection theory, uncertainty, and Poisson-like population codes. Vision Res 50:2308 –2319.
Ma WJ, Beck JM, Latham PE, Pouget A (2006) Bayesian inference with
probabilistic population codes. Nat Neurosci 9:1432–1438.
Ma WJ, Navalpakkam V, Beck JM, Berg R, Pouget A (2011) Behavior and
neural basis of near-optimal visual search. Nat Neurosci 14:783–790.
Macke JH, Opper M, Bethge M (2011) Common input explains higherorder correlations and entropy in a simple model of neural population
activity. Phys Rev Lett 106:208102.
Montani F, Kohn A, Smith MA, Schultz SR (2007) The role of correlations
in direction and contrast coding in the primary visual cortex. J Neurosci
27:2338 –2348.
Müller JR, Metha AB, Krauskopf J, Lennie P (2001) Information conveyed
by onset transients in responses of striate cortical neurons. J Neurosci
21:6978 – 6990.
Renart A, de la Rocha J, Bartho P, Hollender L, Parga N, Reyes A, Harris KD
(2010) The asynchronous state in cortical circuits. Science 327:587–590.
Ringach DL, Hawken MJ, Shapley R (1997) Dynamics of orientation tuning
in macaque primary visual cortex. Nature 387:281–284.
Ringach DL, Shapley RM, Hawken MJ (2002) Orientation selectivity in macaque V1: diversity and laminar dependence. J Neurosci 22:5639 –5651.
Rust NC, Dicarlo JJ (2010) Selectivity and tolerance (“invariance”) both
increase as visual information propagates from cortical area V4 to IT.
J Neurosci 30:12978 –12995.
Rust NC, Stocker AA (2010) Ambiguity and invariance: two fundamental
challenges for visual processing. Curr Opin Neurobiol 20:382–388.
Samonds JM, Bonds AB (2004) From another angle: differences in cortical
coding between fine and coarse discrimination of orientation. J Neurophysiol 91:1193–1202.
Schneidman E, Berry MJ 2nd, Segev R, Bialek W (2006) Weak pairwise correlations imply strongly correlated network states in a neural population.
Nature 440:1007–1012.
Schwartz O, Pillow JW, Rust NC, Simoncelli EP (2006) Spike-triggered neural characterization. J Vis 6:484 –507.
Sclar G, Freeman RD (1982) Orientation selectivity in the cat’s striate cortex
is invariant with stimulus contrast. Exp Brain Res 46:457– 461.
Smith MA, Kohn A (2008) Spatial and temporal scales of neuronal correlation in primary visual cortex. J Neurosci 28:12591–12603.
Tolias AS, Ecker AS, Siapas AG, Hoenselaar A, Keliris GA, Logothetis NK
(2007) Recording chronically from the same neurons in awake, behaving
primates. J Neurophysiol 98:3780 –3790.
Vázquez P, Cano M, Acuña C (2000) Discrimination of line orientation in
humans and monkeys. J Neurophysiol 83:2639 –2648.
All in-text references underlined in blue are linked to publications on ResearchGate, letting you access and read them immediately.