Functional Data Analysis With PACE: Kehui Chen
Functional Data Analysis With PACE: Kehui Chen
Functional Data Analysis With PACE: Kehui Chen
Kehui Chen
Department of Statistics,
University of California, Davis
JSM, 2012
Outline
Overview of PACE
Development of PACE
Supported by various NSF grants.
Coordinated by Hans-Georg Muller and Jane-Ling Wang.
PACE 1.0 was written by Fang Yao in 2005, and subsequent
covariance, and k =
problematic.
PACE works for both dense and sparse data.
For sparse case, best linear unbiased prediction; for dense data, it
is asymptotically
equivalent to the numerical approximation of
R
ik = T (Xi (t) (t))k (t)dt.
Yao et al. (2005), Hall et al. (2006), Li and Hsing (2010), Cai
1 , t2 ) = a 0 , where
Covariance function is given by G(t
n
Covariance Estimation
G(t,t)+2
G(s,t)
E(Y|X) = +
X c (t) (t)dt
E(Y|X) = +
X c (t) (t)dt
E(Y|X) = +
X c (t) (t)dt +
E(Y|X) = +
X c (t) (t)dt
E(Y|X) = +
X c (t) (t)dt +
Absorbance
4.5
3.5
2.5
2
850
900
950
Spectrum Channel
1000
1050
1000
800
600
400
200
200
400
600
800
850
900
950
x
1000
1050
10
Residual
10
10
20
30
Fitted
40
50
60
10
5
3
2
0
1
0
1
2
1050
1000
1050
10
1000
950
950
900
900
15
850
900
950
1000
1050
850
RR
850
Residual
1
0
1
2
3
4
5
10
15
20
25
30
Fitted
35
40
45
50
55
Quantiles
50
45
Predicted Quantiles
40
35
30
25
20
15
true
median
0.1 th
0.9 th
10
10
15
20
25
30
Fat Content
35
40
45
50
Velocity on I-880
Velocity (mph)
10:25:26
14:15:41
70
70
60
60
50
50
40
40
30
30
20
20
10
10
21
22
23
24
25
26
27
21
22
23
Velocity (mph)
16:33:50
25
26
27
26
27
12:29:56
70
70
60
60
50
50
40
40
30
30
20
20
10
21
24
10
22
23
24
25
Postmile
26
27
21
22
23
24
25
Postmile
for Y conditional on X
for Y conditional on X
For Gaussian process: E(Y|X) and cov(Y|X)
for Y conditional on X
For Gaussian process: E(Y|X) and cov(Y|X)
Common principal component assumption
Additive assumption
cov(Y(t1 ), Y(t2 ) | X)
2
= GYY (t1 , t2 ) +
j=1 {k=1 gjk (k ) k=1 fjk (k ) }j (t1 )j (t2 )
send data (time, location, and speed) back through GPS enabled
mobile phones.
The follow-up project Mobile Millennium is generating more
data.
20
0
20
40
60
80
50
100
150
200
250
300
20
0
20
40
60
80
50
100
150
200
250
300
50
100
150
200
250
300
50
100
150
200
250
300
20
0
20
40
60
80
0
50
100
150
200
250
300
20
0
20
40
60
80
20
0
20
40
60
80
0
50
100
150
200
Time (sec)
250
300
Time (sec)
dynamics)
Modeling of functional errors (variance processes, volatility
processes)
Time-synchronization based on pairwise warping
Functional manifold analysis
Modeling of functional correlations
Distance based methods (curve clustering)
Stringing method
p = setOptions()
p2 = setOptions(0 bwmu0 , 3)
http://anson.ucdavis.edu/~mueller/data/programs.html.
THANK YOU!
Yao, F., Muller, H.G., Wang, J.L. (2005), Functional data analysis for sparse
longitudinal data. J. American Statistical Association, 100, 577-590.
Yao, F., Muller, H.G., Wang, J.L. (2005), Functional Linear Regression
Analysis for Longitudinal Data. Annals of Statistics, 33, 2873-2903.
Chiou, J., Muller, H.G. (2007), Diagnostics for functional regression via
residual processes. Computational Statistics and Data Analysis, 51,
4849-4863.
Liu, B., Muller, H.G. (2009), Estimating derivatives for samples of sparsely
observed functions, with application to on-line auction dynamics. J. American
Statistical Association, 104, 704-717.
Muller, H.G., Yao, F. (2010), Empirical dynamics for longitudinal data. Annals
of Statistics, 38, 3458C3486.
Muller, H.G., Sen, R., Stadtmuller, U. (2011), Functional Data Analysis for
Volatility. J. Econometrics 165, 233-245.
Yang, W., Mller, H.G. Muller, H.G., Stadtmller, U. (2011), Functional singular
component analysis. J. Royal Statistical Society B, 73, 303C-324.
Chen, K., Chen, K., Muller, H.G., Wang, J.L. (2011), Stringing
high-dimensional data for functional analysis. J. American Statistical
Association, 106, 275-284.