Frame Sparse
Frame Sparse
Frame Sparse
SPARSE REPRESENTATIONS
SPARSE REPRESENTATION — ATSI 2
PRESENTATION
▸ Mail: matthieu.kowalski@universite-paris-saclay.fr
REMINDER
SPARSE REPRESENTATION — ATSI 4
FOURIER
‣ From time representation to frequency
- Measure the similute (correlation, angle) between pure (complex) sine and a signal
- Fourier analysis computes the correlation between the signal s(t) and a pure (complex) sine of frequency ν:
Let w(t) be a real smooth window localized around t = 0. Let the time-frequency atoms
The stft transform of a signal x(t) computes the correlation between x(t) and the time-frequency atoms φτ,ν(t) = w(t − τ)e i2πνt:
+∞
∫−∞
X(τ, ν) = ⟨x(t), φτ,ν(t)⟩ = x(t)w(t − τ)e −i2πνtdt
‣ It is a time-frequency transform
∫−∞ ∫−∞
−i2πντ
‣ It is invertible: x(t) = X(τ, ν)w(t − τ)e dτdν
EXAMPLE: GLOCKENSPIEL
SPARSE REPRESENTATION — ATSI 11
Let ψ(t) be an admissible "mother" wavelet, and its the dilated and translated version
a ( a )
1 t−b
ψa,b(t) = ψ
( a )
+∞
1 t−b
a ∫−∞
Cx(a, b) = ⟨x(t), ψa,b(t)⟩ = x(t)ψ dt
‣ It is a times-scale transform
( )
+∞ +∞
1 t − b da db
‣ It is invertible: x(t) = c ∫ ∫
X(a, b)ψ
a a 2
ψ −∞ −∞
TIME-FREQUENCY VS TIME-SCALE
▸ STFT tilling of the time-frequency plane ▸ Wavelet tilling of the time-frequency plane
SPARSE REPRESENTATION — ATSI 13
FRAME
SPARSE REPRESENTATION — ATSI 15
EXAMPLE
{0 t ≠ k
1 t=k
▸ Let the Kronecker basis Δ = {δ0(t), …, δN−1(t) } , δk (t) =
1
Let the Fourier basis ℰ = {ϵ0(t), …, ϵN−1(t)}, ϵk(t) = e i2π Nk t
▸ N
FRAME: DEFINITION
▸ A frame is a system of discrete representation where inversion is stable
▸ Let a dictionary 𝒟 = {φn(t), φn(t) ∈ L (ℝ)} (φn(t) being called a atom of 𝒟). 𝒟 is a frame of
2
2 2
L (ℝ) iff it exists two constant A, B > 0 such that for all f ∈ L (ℝ)
+∞
2
2 2
∑
A∥f∥ ≤ ⟨f(t), φn(t)⟩ ≤ B∥f∥
n=−∞
Φ* : L 2(ℝ) → ℓ 2(ℤ)
f(t) ↦ Φ*f = {⟨f(t), φn(t)⟩}
n∈ℤ
The coefficients {⟨f(t), φn(t)⟩} are called the analysis coefficients of f(t)
n∈ℤ
▸ Synthesis operator Φ
Φ : ℓ 2(ℤ) → L 2(ℝ)
+∞
∑
α ↦ Φα = αnφn(t)
n=−∞
∑
ℛ = ΦΦ* : ⟨f(t), φn(t)⟩φn(t)
n=−∞
+∞
∑
‣ We have f(t) = ℛ( f ) = ⟨f(t), φn (t)⟩φn(t) iff the frame is a Parseval frame
n=−∞
SPARSE REPRESENTATION — ATSI 19
𝒟 = {φ̃n(t) = ℛ (φn)}
˜ −1
∑ ∑
‣ We have f(t) = ⟨f, φ̃ n(t)⟩φn (t) = ⟨f, φn(t)⟩ φ̃n (t)
n=−∞ n=−∞
1
‣ If 𝒟 is a tight-frame, then φ̃n(t) = φn(t)
A
SPARSE REPRESENTATION — ATSI 20
FRAME: EXAMPLES
▸ An orthogonal basis is a Parseval-Frame
‣ Moreover, we have
T−1 T−1
i2π Tν t
∑∑
x[t] = X[τ, ν]w[t − τ]e
τ=0 ν=0
[ t0 ]
L ν
i2π ν L t
φτ,ν[t] = w t − τ e 0
‣ Usual choices are ν0 = 1 (FFT of size L) or ν0 = 2 (FFT of size 2L), and t0 = 2 (overlap of 50 %
between 2 windows) or t0 = 4 (overlap of 75 % between 2 windows)
‣ The dual frame is still a Gabor frame, with the dual window w̃(t)
23
SPARSE SYNTHESIS
SPARSE REPRESENTATION — ATSI 24
TN
▸ Let Φ ∈ ℂ the matrix associated to the dictionary (the synthesis operator). The k-th column of
Φ is then the atom φk
SPARSE SYNTHESIS
▸ Back to the example: x(t) = δk(t) + ϵn(t)
∑
Ie: find the synthesis coefficients α such that x[t] = αnφn[t] such that most of αn = 0
▸
n=0
SPARSE SYNTHESIS
▸ Goal: how can we synthesize x with the fewest possible coefficients ?
min∥α∥0 s.t. x = Φα
‣ No sparsity
SPARSE REPRESENTATION — ATSI 28
∑
Replace the ℓ0 norm by the ℓ1 norm: ∥α∥1 = | αn |
▸
n=0
‣ When this solution is the same as the true ℓ0 problem ? (Wait few classes)
SPARSE REPRESENTATION — ATSI 29
- Repeat
Moreover, we have
(k)
lim ∥r ∥ = 0
k→+∞
SPARSE REPRESENTATION — ATSI 31
‣ It converges asymptotically
‣ Algorithm:
- Repeat
SPARSE DENOISING
SPARSE REPRESENTATION — ATSI 35
T
▸ Let y ∈ ℝ be a noisy observation of x:
y=x+n
y = Φα + n
‣ One can use MP, OMP, or the convex relaxation by replacing the ℓ0 norm by the ℓ1 norm
1 2
min ∥y − Φα∥2 + λ∥α∥1
2
SPARSE REPRESENTATION — ATSI 37
LASSO
1 2
min ∥y − Φα∥2 + λ∥α∥1
2
‣ It is a convex, non smooth problem
1
min ∥z − α∥22 + λ∥α∥1
2
‣ It is the so-called proximal operator of λ∥ ⋅ ∥1
‣ Solution: soft-thresholding
+
( | zk | )
λ
αk = 𝒮λ(zk) = zk 1 − with (x)+ = max(x,0)
SPARSE REPRESENTATION — ATSI 39
LASSO: ISTA
1 2
min ∥y − Φα∥2 + λ∥α∥1 = ℱ(α)
2
‣ Let L = ∥Φ∥ 2
( )
(t+1) (t) 1 (t)
α = 𝒮λ/L α + Φ*(y − Φα )
L
(0) 2
(t) L ∥α − α*∥
‣ ℱ(α ) − ℱ(α*) ≤
2 t
‣ Fast version: FISTA (just use a relaxation step)
SPARSE REPRESENTATION — ATSI 40
CONCLUSION
▸ Stable discretization of continuous transform: frame theory
▸ Sparse denoising
▸ Algorithms: Greedy (MP, OMP), Convex optimization (LASSO and proximal descent)
▸ Questions: matthieu.kowalski@universite-paris-saclay.fr