Academia.eduAcademia.edu

Aurora

2013

Estimating non-linear utility functions of time use in the context of an activity schedule adaptation model

Aurora Estimating non-linear utility functions of time use in the context of an activity schedule adaptation model Chang-Hyeon Joh, Theo Arentze and Harry Timmermans Eindhoven University of Technology Conference paper Session: Scheduling Moving through nets: The physical and social dimensions of travel 10th International Conference on Travel Behaviour Research Lucerne, 10-15. August 2003 10th International Conference on Travel Behaviour Research _______________________________________________________________________ August 10-15, 2003 Estimating non-linear utility functions of time use in the context of an activity schedule adaptation model Chang-Hyeon Joh, Theo Arentze and Harry Timmermans Eindhoven University of Technology Urban Planning Group P.O. Box 513, 5600 MB Eindhoven The Netherlands Phone: +31-40-2472283; Fax: +31-40-2475882; e-mail: eirass@bwk.tue.nl Abstract Progress in activity-based modeling has recently focused on scheduling and rescheduling decisions (e.g. Gärling et al., 1999). In contributing to this line of research, the authors suggested a comprehensive model, called Aurora, of activity rescheduling decisions as a function of time pressure (Timmermans, et al., 2001) . While the original paper focused on duration adjustment and schedule composition, later the proposed theory was elaborated and extended to include many different facets of activity rescheduling behavior (Joh et al., 2001, 2002). Numerical simulations supported the face validity of the model. Given the potential of the model, the next phase in the research project is concerned with the estimation of this complex, non- linear model. This paper develops and tests an appropriate estimation method for the model, using a combination of theory and dedicated genetic algorithms. Results of numerical experiments are discussed. The paper first summarizes the theory underlying the model. Next, a method to estimate the model is proposed. The properties of this method are explored numerically using simulated data. The estimation method is tested using both perfect and noisy data. Finally, some conclusions are drawn and avenues for future research are suggested. Keywords Aurora, (re)scheduling, utility function, heuristic estimation, genetic algorithms Preferred citation Joh, C.H., T.A. Arentze and Harry J.P. Timmermans (2003) Estimating non- linear utility functions of time use in the context of an activity schedule adaptation model, Paper presented at the 10th International Conference on Travel Behavior Research, Lucerne, August 2003. I 10th International Conference on Travel Behaviour Research _______________________________________________________________________ August 10-15, 2003 1. Introduction Progress in activity-based modeling has recently focused on the analysis and modeling of the process of scheduling and rescheduling activities. An example of such research is Gärling et al. (1999), who addressed the problem of how activity scheduling is influenced by time pressure. They suggested that when faced with time pressure, individuals will first try to compress the duration of activities, or try to reschedule them. If this is not sufficient, then individuals are assumed to prioritize activities and eliminate the one with the lowest priority. This process is continued until the total duration is below some threshold. Over the last few years, the authors developed a more comprehensive model, called Aurora. The model can be viewed as a successor of the Albatross model (Arentze and Timmermans, 2000) and is conducted in the context of the Amadeus research program1 . The system is more comprehensive in that it allows modeling the dynamics of activity scheduling and rescheduling decisions as a function of unexpected events during the execution of activity programs. The model incorporates several behavioral principles and decision styles, including riskavoiding and opportunistic behavior. While the original paper focused on duration adjustment and schedule composition (Timmermans et al., 2001), later the proposed theory was elaborated and extended to include many different facets of activity rescheduling behavior (Joh et al., 2001a). Several numerical simulations supported the face validity of the model. Ultimately, however, the model should be derived from empirical data. The question then becomes how the parameters of the model can be estimated. This is certainly not a trivial que stion as the researcher is faced with several problems. First, the model has no algebraic solution. Secondly, the theory underlying the model argues that scheduling decisions are statedependent. Finally, the model should satisfy several sets of discontinuous constraints. 1 Amadeus is a collaborative research program that aims at developing models to assess the long-term, mid-term and short-term implications on activity-travel patterns of multi-modal transportation systems. The program is sponsored by NWO and involves research teams from the universities of Amsterdam, Delft, Eindhoven and Utrecht. The scope and objectives of the program are discussed in Arentze et al. (2001). 1 10th International Conference on Travel Behaviour Research _______________________________________________________________________ August 10-15, 2003 The present paper reports on the development and test of such an estimation method. First, we will summarize the theory underlying the model. Next, a method to estimate the model is proposed and the properties of this method are explored. Finally, some conclusion will be drawn. 2. Theory 2.1 Activity utility function Central to the model is that individuals derive some utility from being involved in activities. This utility U is a function of the duration v of the activity. In particular, the functional form can be expressed as: U= U max (1 + γ exp[ β (α − v)])1 / γ (1) Ux 1 + exp[ β x (α x − T )] (2) and U max = where, α, β, γ, α x, βx and Ux are activity-specific parameters, the values of which differ between activities; v is the duration of the activity; 2 10th International Conference on Travel Behaviour Research _______________________________________________________________________ August 10-15, 2003 T denotes the activity history, i.e. the time elapsed since the last implementation of the activity; Umax asymptotically determines the maximum utility of the activity, which is a function of T. Activity utility is assumed to be a monotonically increasing function of activity duration. Unlike previous authors (e.g., Becker, 1965; Kitamura, 1984), we assume that the utility first increases at an increasing rate and then increases at a decreasing rate. That is, we assume that involvement in an activity first involves an unsaturated, warming- up period where each additional unit of involvement (duration) increases utility at an increasing rate. After reaching the maximum marginal utility at some duration (inflection point), experiences become saturated, and the marginal utility decreases with every additional time unit of further involvement. This notion can be captured in terms of an asymmetric S-shaped curve with an inflection point, as opposed to the commonly assumed logarithmic curve of monotonically decreasing marginal utility. Figure 1 portrays the model and illustrates the effect of the parameters. The α parameter determines the duration at which the marginal utility reaches its maximum value (inflection point). A larger α-value shifts the curve to the right, implying a longer warming- up period. The β parameter determines the slope of the curve. A larger β-value represents a steeper curve, implying that some fixed change in duration results in a larger change in utility. Thus, a larger β-value means that the utility is more sensitive to the duration and hence less flexible in terms of adaptation. The γ parameter determines the relative position of the inflection utility. If the value is close to 1, the curve approximates a symmetric curve, and the inflection point is in the middle between the maximum utility and zero. When the value approximates 0, the utility at the inflection level is close to zero, implying that marginal utility is diminishing at virtually all levels of duration. The maximum utility Umax of an activity is not fixed, but is assumed to be determined by the context of the current schedule. We postulate that Umax depends on the activity history, i.e. the time elapsed since the last implementation of the activity. When an activity has been conducted a long ago, the utility of conducting that activity is assumed to increase. We postulate that the functional form describing the impact of history on the maximum utility is also S- 3 10th International Conference on Travel Behaviour Research _______________________________________________________________________ August 10-15, 2003 shaped. The utility of conducting an activity when it just has been performed will be small. The need to conduct that activity again will then systematically increase, until a certain upper limit of Umax has been reached. We simplify the functional form of Umax to the symmetric case. Figure 1. Impacts of utility parameters Change in α (100 à 150: to the right) Change in β (0.05 à 0.1: to the steeper) 1000 1000 900 900 800 800 700 700 600 600 500 500 400 400 300 300 200 200 100 100 0 0 0 50 100 150 200 0 250 Change in γ (1 à 0.1: to the bottom) 50 100 150 200 Change in Ux (1000 à750: to the bottom) 1000 1000 900 900 800 800 700 700 600 600 500 500 400 400 300 300 200 200 100 100 0 0 0 50 100 150 200 0 50 100 150 200 The utility of an activity can the n be computed, given the estimated α, β, γ, α x, β x and Ux, and the duration v and history T of the activity. Note that the model can be extended by further incorporating into the Umax function other decision dimensions such as activity sequence, location, transport mode, accompanying person, etc. 4 10th International Conference on Travel Behaviour Research _______________________________________________________________________ August 10-15, 2003 2.2 Schedule adjustment Faced with time pressure, an individual is assumed to proceed with a series of mental processes of schedule adjustment and implement the best schedule. In the proposed model, various operators are assumed to accomplish schedule adjustment s (Joh et al., 2002). The schedule composition operator changes the list of activities included in a schedule by deleting, inserting and substituting activities. Sequencing, location, transport mode and accompanying person operators change the means of implementing the concerned activity. The application of the operators leads to incremental mental adjustments of the schedule and continues until no more improvement is possible. 2.3 Problems with estimation Assume that we know in advance the maximum level of utility Umax of each activity, and we can observe the level of utility U of each activity over time. We could the n linearize the suggested activity utility function and, using the information of duration v of the observed schedule data, directly solve the equation for the parameter values α, β and γ for each activity. This is unfortunately not the case because the ut ilities of the activities (U and Umax) are unobservable and unknown. Furthermore, unlike the logarithmic function of ever-diminishing marginality (e.g., Kitamura, 1984), the suggested utility function of the current research does not provide the algebraic solution for the optimum durations of the maximum schedule utility. Therefore, we cannot directly ‘solve’ the estimates but have to ‘find’ the estimates that best fit the observed durations by searching iteratively multiple combinations of parameter values. The number of such combinations would however be prohibitively large for an exhaustive search. 2.4 Estimation method For the above reasons, we suggest a heuristic method that searches only part of the entire solution space, but at the same time, provides good, near optimum solutions. The method is based on the critical assumption that the marginal utility of activities in the schedule is the same. If this would not be the case, an individual would further adjust durations to increase the total utility. The duration of the activity of the higher marginal utility will be increased while the duration of the other activity will be decreased. 5 10th International Conference on Travel Behaviour Research _______________________________________________________________________ August 10-15, 2003 In principle, based on these theoretical decisions, an estimation method could be developed. However, the numerical value of the equal marginal utility of the schedule is unknown in reality, and multiple solutions for the parameter estimates that satisfy the equal marginal utility across activities of the schedule could exist. Therefore, an additional assumption is required. To find a solution, we postulate that the level of equal marginal utility of the schedule is partly reflected by the amount of time pressure of the schedule, measured as the total duration of the fixed activities of the schedule, and the number of activities. The numerical level of equal marginal utility can then be approximated as a function of these variables. The rationale behind this assumption is that given the number of activities, higher time pressure (less available time) would raise the level of equal marginal utility. On the other hand, given the time pressure, a larger number of activities would raise the marginal utility for each activity. These assumptions seem appropriate given our theory especially for saturated activities. Equation (3) expresses this critical assumption. βU max exp[β (α − v)] = Σδ k X k (1 + γ exp[ β (α − v)])1/ γ +1 ∀a∈S (3) where, Xk is the k th attribute of the equal- marginal- utility function (X1 = fixed duration, X2 = number of activities of the schedule); δ k is the marginal contribution of the k th attribute Xk to the level of equal marginal utility, which is attribute-specific; S denotes the current schedule. By solving equation (3) for v, given each of the combinations of possible values of parameters, the overall goodness-of- fit of the predicted set of parameter values can be calculated as: 6 10th International Conference on Travel Behaviour Research _______________________________________________________________________ August 10-15, 2003 Gl = ΣGal (4) Gal =| v oa − v apl | (5) with where, Gl is the goodness-of- fit of the lth predicted combination of parameter estimates, where l = 1, …, L, and L is the number of the parameter values combinations to be examined; Gal is the goodness-of-fit of the lth predicted combination of parameter estimates for activity a; v ao and v apl are respectively the observed duration of activity a and the duration of activity a predicted by the lth predicted combination of parameter values. The set of parameter values that corresponds to the best goodness-of- fit can then be identified. Still, however, some further operational problems need to be solved for obtaining the duration prediction v p in order to compute the goodness-of- fit of the associated set of predicted parameter estimates. First, there is no direct, algebraic solution for v p that satisfies equation (3), and therefore, we used an algorithm of golden section search to ‘find’ the duration instead of ‘solving’ it. Secondly, the predicted set of parameter values may have no cross points between the equal marginal utility line given by the RHS of equation (3) and the marginal utility curve given by 7 10th International Conference on Travel Behaviour Research _______________________________________________________________________ August 10-15, 2003 the LHS of equation (3). That is, it may be the case that the predicted time pressure goes beyond the maximum of the marginal utility. It then has no cross points for the corresponding duration prediction. In reality, it may be that when the time pressure is so high, and the schedule requires very high marginal utilities for activities to be included, the activities with a lower maximum marginal utility will not survive, and therefore, we would not observe such activities in the schedule. In our estimation, however, we do observe the activity that was implemented in the schedule, and therefore, if the predicted parameters result in no cross points, that predicted solution would clearly be wrong. The predicted parameters associated with such ‘illegality’ and their neighbors should not be visited again in the iterative solution search procedure enforced by assigning an appropriate size of penalty. To this end, the goodness-of- fit of an activity should be revised from equation (5) as: | v o − v ap | l G al =  ao | v a − 0 | +1440 + gap for legal solution for illegal solution (6) As implied by this equation, the measure of the goodness-of- fit of the illegal solution for an activity is the sum of three sources of differences. The first source of difference is the pure difference between observation and prediction. As discussed above, if the time pressure is much too high compared with the maximum level of an activity’s marginal utility, that activity should not be included in the schedule, and therefore, the predicted duration v p of this activity is zero. The second source of difference is a penalty for being illegal. In the current study, we used 1 minute as the unit of measurement for duration. The added number 1440 then means the entire time duration of a day in minutes. Equation (6) distinguishes illegal solutions from legal ones by adding this big number, which makes the resulting measure of goodness-of- fit worse than any possible legal solution. The final source of difference for the illegal solutions is the degree of illegality to make the search sensitive for direction. Figure 2 illustrates three different solutions of an activity, where the predicted parameter values of this activity’s marginal utility curve α, β, γ, αx, β x and Ux are the same, and only the time pressure parameter values δ are different. The first solution is legal in the sense that the marginal utility curve and the time pressure line (TP 1 ) crosses at least one point. The second and third solutions are however illegal, because the marginal utility curve and time pressure lines do not cross. The lower part of equation (6) should be applied. The first two terms of the RHS of the lower part of this equation are the same for the second and third solutions, which makes their goodness-of- fit measure much worse than the first solution. The vertical arrows between the 8 10th International Conference on Travel Behaviour Research _______________________________________________________________________ August 10-15, 2003 time pressure lines and the maximum of the marginal utility curve makes a further distinction between these illegal solutions, which states that the third solution is even worse than the second. Figure 2. Illegality of the solutions MU 60 50 TP3 40 TP2 30 20 TP1 10 0 50 60 70 α Figure 3. Decision of predicted duration 9 80 • v 90 100 10th International Conference on Travel Behaviour Research _______________________________________________________________________ August 10-15, 2003 Finally, the existence of the equilibrium durations at both sides of the inflection point raises the question of how to choose the one that is used for prediction. We let the method choose the one that gives the better match. This can be illustrated as in Figure 3. In the figure, the black dots are the predicted durations v p corresponding to the cross points of the observed level of equal marginal utility and the marginal utility curve. The white dot is the observed duration v o of this activity. The black dot on the RHS is chosen in this illustration because it is closer to the observation than the other black dot. Accordingly, the goodness-offit of an activity is again revised from equation (6) as: max[| v oa − v ap |] l G al =  o | v a − 0 | +1440 + gap for legal solution for illegal solution (7) The proposed estimation method can therefore be summarized as: Minimize: G ∈ {Gl } Subject to: MUal = MUa'l (8) ∀a , a '∈ S (9) where, G is the goodness-of- fit of the estimated model of the best parameter values combination; Gl is defined as equations (4) and (7); MU a denotes the marginal utility of activity a, as expressed in the LHS of equation (3). 10 10th International Conference on Travel Behaviour Research _______________________________________________________________________ August 10-15, 2003 Solving equation (3) for v across activities of the schedule meets the constraints of equation (9) and provides the prediction of activity duration used in equation (7). For each activity of the schedule, then the prediction error is computed, and the overall goodness-of-fit of a predicted combination of parameter values is the sum of prediction errors across activities as in equation (4). 3. Algorithm Our problem thus is to estimate from schedule data the activity-specific parameters α, β, γ, α x, β x and Ux, given the information of duration v, history T and time pressure attribute X. A genetic algorithm was applied to solve equations (8) and (9). The following operational decisions were made. 3.1 Representation of the solution candidates We employed a real coding scheme to represent the solution candidates of the real parameter values. The real-coding genetic algorithm (RCGA) does not binary-digitalize the real number information but uses real numbers directly representing the solutions with minimum and maximum possible values (Wright, 1991). Given m parameters to estimate for each of n activities and p time-pressure parameters, a solution candidate is represented as an array of mn+p elements of real numbers, and an element of a real number represents a corresponding parameter. In a preliminary study, this RCGA representation scheme outperformed the ordinary binary representation scheme in terms of precision and the speed. In particular, the increased speed was obtained by the RCGA representation scheme, where the encoded information (genotype) for genetic modification and the real form (phenotype) for candidate evaluation are the same, and hence, a decoding process that transforms the genotype information into the phenotype is not required. 11 10th International Conference on Travel Behaviour Research _______________________________________________________________________ August 10-15, 2003 3.2 Genetic operators Our RCGA also employed crossover and mutation operators for genetic modification of the solution candidates like any other GA. The details of the operators of the RCGA are however different from ordinary binary GAs. After a series of preliminary studies, we chose the BLX0.5 crossover and Mühlenbein mutation (Herrera et al., 1998). As for crossover, assume two 2 solution candidate arrays C1 = [ c11 ,.., c1mn+ p ] and C2 = [ c12 ,.., cmn + p ] that are selected from the current pool of solution candidates to be modified for the next pool. The hi of the offspring H = [ h1,.., hmn+ p ] is determined such that the value randomly lies in-between c min − I ⋅ ω and c max + I ⋅ ω , where 1 ≤ i ≤ mn+p, c min = min[ c i1, ci2 ] , c max = max[c1i , c i2 ] and I = c max − c min . We chose 0.5 for ω. As for mutation, assume a randomly selected solution candidate C = ' [ c1 ,.., c mn+ p ]. The c i' of the offspring C ' = [ c1' ,.., cmn + p ] is determined such that 15 c i' = ci ± 0.1 ⋅ (bi − a i ) ⋅ Σ η k 2 − k , where ai and bi are the minimum and maximum values that the k =0 i parameter can take, ηk is randomly determined to be 1 with a probability of 1/16 and 0 with a probability of 15/16, and the + or – sign is chosen with a probability of 0.5. th 3.3 Genetic parameters After an extensive study, we chose the following operational parameter values for the genetic procedure. - Size of the pool or Number of solution candidates for an iteration = 100 - Stop condition = 10000 iterations after the initialization - Probability of choosing crossover instead of mutation for the current round iteration = 70 % - Number of solution candidates selected for crossover from the previous pool = 50 - Number of solution candidates selected for mutation from the previous pool = 90 - Selection of solution candidates for modification = Random selection with replacement - Probability of mutating a parameter of the solution candidate selected for mutation = 10 % 12 10th International Conference on Travel Behaviour Research _______________________________________________________________________ August 10-15, 2003 3.4 Overall procedure Overall, our RCGA works as follows. • The RCGA randomly initializes 100 solution cand idates, each of which is an array of mn+p real numbers that represent m parameters of n activities and p time pressure parameters. ‚ It then evaluates each candidate, which is a rather complex procedure. Each solution candidate is a prediction of parameter va lues of activity utility function at the current step of iteration. Given these values and the time pressure information of the observed schedules, the system finds the predicted duration of each activity included in the observed schedule using the relatio n of equality between the marginal utilities mathematically derived from the utility function and observed from the time pressure attributes X. The absolute difference between predicted and observed durations of that activity is then computed, and these differences are summed across activities of the schedule and finally across schedules of the entire data as a measure of goodness-of- fit of the solution candidate. ƒ The RCGA selects solution candidates, and the selection probability is proportional to the goodness-of- fit. Better goodness-of- fit increases the chance for a candidate to be selected. The following probabilistic roulette wheel (Joh et al., 2001b) is used at each time of a selection. Pl = Σ Gl ' l' Gl Σ Gl ' Σ l' l' G l' (10) where Pl and Gl are respectively the selection probability and the goodness-of-fit measure of the lth solution candidate (l = 1, …, 100). Note that the goodness-of-fit of a candidate is the sum of prediction errors across schedules, and hence, a smaller value means a better fit. The selection is repeated 100 times with replacement. „ Among the selected candidates, the RCGA modifies randomly selected 50 (for crossover) or 90 (for mutation) candidates. … The RCGA again evaluates the new pool of genetically modified solution candidates. An iteration consists of the steps ƒ-„-… and is repeated 10000 times for a run. 13 10th International Conference on Travel Behaviour Research _______________________________________________________________________ August 10-15, 2003 4. Model Test Results Before dealing with any empirical data, we examined the properties of the suggested estimation method on simulated data. The purpose of this study was to investigate whether the suggested method produced the correct results using a set of simulated schedule data. Furthe rmore, we wanted to better understand the performance of the suggested approach for noisy data. To this end, we prepared a set of simulated activity schedule data, which assumed no noise for activity duration and time pressure attributes, and a set of activity schedule data, using the same value with some added amount of noise. In the following, we first examine whether the suggested estimation method is capable of reproducing the parameters, and then further examine the robustness of the suggested method against various sources of noise in the simulated data. 4.1 Estimation results for simulated activity schedules using exact data The specification of the model that was used for the simulations is largely the same as the one expressed in equations (8) and (9). We however simulated five time pressure attributes, instead of two, and hence we have five time pressure parameters, δ 1 to δ5 for the purpose of model test. We assumed two types of activities, which suggests a total of seventeen activity utility parameters (= 2×6+5) to estimate. The simulated data consists of fifty simulated schedules of these two types of activities. A schedule provides information about the duration v and history T of each activity and the numerical values of five time pressure attributes X of that schedule. The observed history is simulated in the range from 1 to 30 integer values, and the observed time pressure attributes are observed in the range from 0 to 100 real values across cases. The ‘true’ values for parameters were prescribed arbitrarily but reasonably representing assumed activities. Given these simulated observations and the true values of the activity utility function parameters (Table 1), the simulated duration observations for the activities of each schedule can be obtained from equation (3). As mentioned in section 3.1, the RCGA needs to predefine the minimum and maximum possible values to simulate. The following ranges for the parameter values were used: α = 10 ~300; β = 0.01 ~1; γ = 0.1 ~1; α x = 1 ~30; β x = 0.01 ~1; Ux = 50 ~500. 14 10th International Conference on Travel Behaviour Research _______________________________________________________________________ August 10-15, 2003 The proposed estimation method was run 30 times on the same data. Each run was terminated after 10000 iterations. The total of 30 runs took approximately 22 hours to complete, indicating the complexity of the problem. Table 1 shows the estimation results averaged across the 30 runs. Table 1. Estimation results with exact data parameter true value estimated δ1 δ2 δ3 δ4 δ5 75 420 0.15 0.10 0.8 0.1 7 3 0.15 0.20 250 150 0.0125 0.0500 0.0240 0.0800 0.0100 75.31 421.39 0.15 0.11 0.58 0.27 7.64 3.06 0.14 0.19 307.20 173.02 0.01587 0.06253 0.02952 0.09671 0.01213 GOF - 0.09946 α β γ αx βx Ux activity activity activity activity activity activity activity activity activity activity activity activity 1 2 1 2 1 2 1 2 1 2 1 2 Table 1 suggests that the parameter estimates are close to the true parameter values that were used for generating the simulated schedule data. This means that if our assumption that ind ividual activity rescheduling behavior is based on equalizing marginal utilities is true, and data do not exhibit any noise, the suggested estimation method will produce fairly exact estimates of activity utility parameters. A more detailed inspection of the results suggest that the α, β, α x and β x values are estimated more precisely than γ and Ux . This can be explained by examining Figure 4 which portrays the marginal impacts of the parameters and shows that the impact of γ and Ux is relatively small. In case of γ, it is difficult to recognize the difference in the curve between different γ’s 15 10th International Conference on Travel Behaviour Research _______________________________________________________________________ August 10-15, 2003 under a certain level of time pressure. In case of Ux , unless the difference is very large (500 and 100), the curves show little difference between Ux ’s (500 and 450). The GA is therefore less sensitive to these two parameters in the estimation. In contrast, changes in duration are most sensitive to the α parameter, which is consistent with the estimation results where the α value has the highest accuracy. Figure 4. Impacts of parameter values on the marginal utility curve α (60 à 75 à 90: to the right) β (0.5 à 0.3 à 0.1: to the bottom) 3 4.5 4 2.5 3.5 2 3 2.5 1.5 2 1.5 1 1 0.5 0.5 0 0 50 60 70 80 90 50 100 αx (10 à 20 à 25: to the bottom) 60 70 80 90 100 βx (0.05 à 0.1 à 0.15: to the bottom) 16 8 14 7 12 6 10 5 8 4 6 3 4 2 2 1 0 0 50 60 70 80 90 100 50 γ (0.2 à 0.5 à 1: to the bottom) 60 70 80 90 100 Ux (500 à 450 à 100: to the bottom) 3.5 3 3 2.5 2.5 2 2 1.5 1.5 1 1 0.5 0.5 0 0 50 60 70 80 90 100 50 60 70 80 90 100 An additional concern when estimating these types of functions is the possible linear correlation between parameter estimates. Table 2 shows the correlation between parameter estimates over 30 runs of estimation. 16 10th International Conference on Travel Behaviour Research _______________________________________________________________________ August 10-15, 2003 Table 2. Correlations between parameter estimates Activity 1 α β γ αx βx Ux α β γ αx βx Ux 1 -.616** -.616** 1 -.842** .698** -.415* .935** .493* -.943** -.567** .899** -.842** -.415* .698** .935** 1 .410* .410 1 -.453* -.990** .654** .839** .493* -.567** -.943** .899** -.453* .654** -.990** .839** 1 -.842** -.842** 1 α β γ αx βx Ux 1 .610** .610** 1 -.551** .322 -.201 .106 .200 -.097 -.871** -.258 -.551** -.201 .322 .106 1 .340 .340 1 -.341 -.921** .769** .402 .200 -.871** -.097 -.258 -.341 .769** -.921** .402 1 -.424* -.424* 1 Activity 2 α β γ αx βx Ux Note: * and ** denotes that the correlation is significant at the 0.05 and 0.01 levels, respectively. Table 2 shows that there exist significant linear correlations between parameter estimates. These results suggest that when dealing with the empirical data, a sequential estimation strategy may be preferable if we face substantial interaction between parameter estimates, resulting in unstable parameter estimates over runs. As shown in Figure 4, the α parameter is most sensitive to the change in the duration and is thus expected to be most accurate and stable in prediction, which is proven in Table 1. The sequential estimation may therefore begin with the estimation of α given other parameter values of averages of the initial simultaneous estimations. In fact, preliminary tests suggest that indeed considerably more stable parameters were obtained when using a sequential estimation procedure. A detailed discussion of these results is beyond the scope of the present paper. 17 10th International Conference on Travel Behaviour Research _______________________________________________________________________ August 10-15, 2003 4.2 Estimation results for simulated activity schedules using noisy data Three types of noise were considered: (i) rounding of reported activity duration, (ii) inexact observation of the time pressure attributes and (iii) inexact observation of activity duration. These sources of noise can be expressed in a single equation as: ) βU x exp[ − exp[ β x (α x − T )]] exp[ β (α − (v + e vn ))] = Σδ X + en (11) (1 + γ exp[ β (α − (v) + evn ))])1/ γ +1 where, ) v is the duration rounded by respondents ; en is the error term of the time pressure for the nth observed schedule; evn is the error term of the duration observation. The goal of the analysis here was to examine the robustness of the suggested estimation method in the presence of such noise for each source separately, one at a time. 4.2.1 Rounding noise The simulated data represent activity duration to the precision of three digits after decimal points. Given the simulated observed T and X, a search algorithm finds the value of v based on equation (3) at this level of precision. Table 3 presents an example of the simulated data, whereas Table 4 presents the estimation results for different degrees of rounding error. 18 10th International Conference on Travel Behaviour Research _______________________________________________________________________ August 10-15, 2003 Table 3. Example of the simulated schedule data (no rounding, 1- minute and 5- minutes rounding) case 1 2 3 4 5 6 7 8 9 10 … v1 85.616 96.986 88.115 89.351 83.345 92.574 93.965 75.334 91.649 82.997 … v2 422.797 421.676 421.881 427.553 427.412 436.768 438.813 444.023 434.218 425.829 … T1 24 22 17 18 12 26 23 2 27 12 … no rounding T2 X1 25 2.62 1 9.40 7 44.35 8 11.11 23 40.48 22 8.27 16 1.80 22 21.04 15 4.65 17 53.92 … … X2 29.41 5.72 0.35 32.59 9.26 10.79 5.78 7.56 4.73 17.02 … X3 59.2 14.58 45.41 0.34 50.68 22.74 39.95 9.96 34.40 11.46 … X4 23.3 2.44 12.54 12.14 14.46 4.25 3.80 0.91 6.20 24.87 … X5 23.68 21.95 63.88 19.60 86.35 70.86 22.22 24.98 92.67 54.58 … X3 59.2 14.58 45.41 0.34 50.68 22.74 39.95 9.96 34.4 11.46 … X4 23.3 2.44 12.54 12.14 14.46 4.25 3.8 0.91 6.2 24.87 … X5 23.68 21.95 63.88 19.6 86.35 70.86 22.22 24.98 92.67 54.58 … X3 59.2 14.58 45.41 0.34 50.68 22.74 39.95 9.96 34.4 11.46 … X4 23.3 2.44 12.54 12.14 14.46 4.25 3.8 0.91 6.2 24.87 … X5 23.68 21.95 63.88 19.6 86.35 70.86 22.22 24.98 92.67 54.58 … case 1 2 3 4 5 6 7 8 9 10 … v1 86 97 88 89 83 93 94 75 92 83 … v2 423 422 422 428 427 437 439 444 434 426 … T1 24 22 17 18 12 26 23 2 27 12 … 1-minute rounding T2 X1 X2 25 2.62 29.41 1 9.4 5.72 7 44.35 0.35 8 11.11 32.59 23 40.48 9.26 22 8.27 10.79 16 1.8 5.78 22 21.04 7.56 15 4.65 4.73 17 53.92 17.02 … … … case 1 2 3 4 5 6 7 8 9 10 … v1 90 100 90 90 80 90 90 80 90 80 … v2 420 420 420 430 430 440 440 440 430 430 … T1 24 22 17 18 12 26 23 2 27 12 … 10-minutes rounding T2 X1 X2 25 2.62 29.41 1 9.4 5.72 7 44.35 0.35 8 11.11 32.59 23 40.48 9.26 22 8.27 10.79 16 1.8 5.78 22 21.04 7.56 15 4.65 4.73 17 53.92 17.02 … … … As expected, more rounding error results in less accurate estimates, in particular the γ and Ux estimates as discussed in Figure 4. Considering the homogeneous activity durations resulting from the rounding, however, the results are rather accurate and promising. 19 10th International Conference on Travel Behaviour Research _______________________________________________________________________ August 10-15, 2003 Table 4. Estimation results with rounded data parameter true value no rounding δ1 δ2 δ3 δ4 δ5 75.31 421.39 0.15 0.11 0.58 0.27 7.64 3.06 0.14 0.19 307.20 173.02 0.01587 0.06253 0.02952 0.09671 0.01213 75.89 421.80 0.14 0.11 0.27 0.48 6.61 3.02 0.15 0.20 248.52 176.77 0.01603 0.06445 0.03003 0.09735 0.01211 80.11 422.92 0.18 0.13 0.76 0.87 9.14 3.18 0.13 0.19 171.89 142.70 0.00979 0.04069 0.02402 0.07987 0.01197 GOF - 0.09946 0.34374 11.81063 β γ αx βx Ux 1 2 1 2 1 2 1 2 1 2 1 2 10 minutes rounding 75 420 0.15 0.10 0.8 0.1 7 3 0.15 0.20 250 150 0.0125 0.0500 0.0240 0.0800 0.0100 α activity activity activity activity activity activity activity activity activity activity activity activity 1 minute rounding 4.2.2 Time pressure attribute observation noise To study this effect as expressed as en in the RHS of equation (11), two scales of the error were introduced using a normal distribution N (0,σ 2 ). One has a standard deviation size of 10 % of the average time pressure level 2.7, and the other the standard normal distribution. That is, en ~ N (0,0.272 ) and en ~ N(0,1). The results are described in Table 5. Several interesting observations can be made from Table 5. First, the estimates appear to be relatively robust against this type of noise. Secondly, the scale seems to affect the results. If the estimation method is perfect, the introduction of a random error term en in the RHS of equation (11) should not affect the results. Given the limited number of simulated observations and the small size of time pressure, however, the result is not very disappointing. The random error of scale std = 1 seems to confuse the GA too much. Increasing the number of observations should improve the estimation results. Optionally, a sequential estimation of the time pressure parameters would improve the accuracy, which takes a linear regression analy- 20 10th International Conference on Travel Behaviour Research _______________________________________________________________________ August 10-15, 2003 sis of equation (3) with the scalar value of estimated marginal utility in the LHS of the equation. Table 5. Estimation results with noisy time-pressure data parameter true value no noise std = 0.27 std = 1 δ1 δ2 δ3 δ4 δ5 75 420 0.15 0.10 0.8 0.1 7 3 0.15 0.20 250 150 0.0125 0.0500 0.0240 0.0800 0.0100 75.31 421.39 0.15 0.11 0.58 0.27 7.64 3.06 0.14 0.19 307.20 173.02 0.01587 0.06253 0.02952 0.09671 0.01213 77.93 424.66 0.17 0.13 0.58 0.19 6.93 3.42 0.16 0.16 211.23 126.98 0.01767 0.06288 0.03223 0.09972 0.00759 80.37 426.76 0.19 0.13 0.16 0.18 11.16 3.71 0.12 0.27 361.45 178.28 0.03461 0.09891 0.07479 0.09974 0.02435 GOF - 0.09946 4.59547 11.60373 α β γ αx βx Ux activity 1 activity 2 activity 1 activity 2 activity 1 activity 2 activity 1 activity 2 activity 1 activity 2 activity 1 activity 2 4.2.3 Duration measurement noise Measurement error evn is added to the duration both in the numerator and denominator of the LHS of equation (11) as v + e vn . Errors were assumed to be drawn at random from a normal distribution N (0,σ2 ). Table 6 gives an example, where “std10% act1” means that the error is randomly drawn from a normal distribution of size σ, which is 10 % of the average duration of activity 1 in the data. The average simulated duration of activity 1 and activity 2 is 86 and 429 minutes, respectively. Hence the “std10%” condition includes the error evn ~ N(0,8.62 ) for activity 1 and evn ~ N(0,42.92 ) for activity 2 added to the duration v in both the numerator and the denominator of the LHS of equation. The “std5%” condition results in evn ~ N(0,4.32 ) for activity 1 and evn ~ N(0,21.452 ) for activity 2. Table 6 gives an example of the measurement noise simulated in this way. 21 10th International Conference on Travel Behaviour Research _______________________________________________________________________ August 10-15, 2003 Table 6. Data with measurement error of different sizes case 1 2 3 4 5 6 7 8 9 10 … case 1 2 3 4 5 6 7 8 9 10 … error size of 10% of average duration original duration introduced error duration with error activity 1 activity 2 std10% act1 std10% act2 activity 1 activity 2 83.720 442.837 -16.19 -13.34 67.530 429.497 84.856 423.586 -2.76 -76.64 82.096 346.946 93.901 436.775 -20.59 31.91 73.311 468.685 85.396 431.398 -6.86 -19.63 78.536 411.768 91.466 434.846 2.93 -10.68 94.396 424.166 90.889 430.142 10.94 -58.65 101.829 371.492 87.401 427.610 4.53 39.37 91.931 466.980 92.693 422.688 8.47 -41.10 101.163 381.588 90.541 430.078 4.88 28.22 95.421 458.298 89.534 431.238 -4.73 -1.59 84.804 429.648 … … … … … … error size of 5% of average duration original duration introduced error duration with error activity 1 activity 2 std5% act1 std5% act2 activity 1 activity 2 83.720 442.837 -8.09 -6.67 75.630 436.167 84.856 423.586 -1.38 -38.32 83.476 385.266 93.901 436.775 -10.3 15.95 83.601 452.725 85.396 431.398 -3.43 -9.82 81.966 421.578 91.466 434.846 1.47 -5.34 92.936 429.506 90.889 430.142 5.47 -29.32 96.359 400.822 87.401 427.610 2.27 19.68 89.671 447.290 92.693 422.688 4.23 -20.55 96.923 402.138 90.541 430.078 2.44 14.11 92.981 444.188 89.534 431.238 -2.37 -0.79 87.164 430.448 … … … … … … To investigate the effect of the duration measurement noise, we tested two sets of simulated schedule data of different sizes. The estimation results are shown in Table 7. The following observations can be made. As for the sample size, the bigger sample (n=200) returns a better result than the smaller sample (n=50). As for the goodness-of- fit, the size of the GOF of the estimated model is almost the same with the size of the error introduced to the duration measurement, and the estimations are all converged. The figures of Table 8 are the results averaged across n cases. 22 10th International Conference on Travel Behaviour Research _______________________________________________________________________ August 10-15, 2003 Table 7. Estimation results with measurement noise on the data of different sizes parameter true value std = 5% δ1 δ2 δ3 δ4 δ5 75.31 421.39 0.15 0.11 0.58 0.27 7.64 3.06 0.14 0.19 307.20 173.02 0.01587 0.06253 0.02952 0.09671 0.01213 83.20 410.46 0.47 0.04 0.64 0.53 17.38 6.47 0.22 0.34 207.95 430.42 0.04447 0.01751 0.05218 0.05001 0.00807 82.74 418.40 0.24 0.08 0.64 0.62 11.09 6.45 0.32 0.33 166.70 390.40 0.04911 0.04327 0.06489 0.05607 0.01505 GOF - 0.09946 44.00144 22.10836 200-cases data true value no error std = 10% std = 5% β γ αx βx Ux parameter δ1 δ2 δ3 δ4 δ5 75 420 0.15 0.10 0.8 0.1 7 3 0.15 0.20 250 150 0.0125 0.0500 0.0240 0.0800 0.0100 75.57 421.31 0.15 0.11 0.51 0.44 7.65 3.02 0.14 0.19 288.17 174.95 0.01522 0.06057 0.02835 0.09343 0.01176 79.03 421.55 0.18 0.05 0.70 0.55 8.91 6.13 0.14 0.30 288.00 326.01 0.02678 0.06283 0.03579 0.09786 0.01231 78.58 423.48 0.17 0.06 0.63 0.62 7.42 6.05 0.15 0.29 279.12 271.48 0.02406 0.07522 0.04032 0.09879 0.01332 GOF - 0.17417 36.42288 18.13449 α β γ αx βx Ux activity activity activity activity activity activity activity activity activity activity activity activity 1 2 1 2 1 2 1 2 1 2 1 2 std = 10% 75 420 0.15 0.10 0.8 0.1 7 3 0.15 0.20 250 150 0.0125 0.0500 0.0240 0.0800 0.0100 α activity activity activity activity activity activity activity activity activity activity activity activity 50-cases data no error 1 2 1 2 1 2 1 2 1 2 1 2 23 10th International Conference on Travel Behaviour Research _______________________________________________________________________ August 10-15, 2003 Table 8. Goodness-of- fit across error sizes sample size error size GOF std=10% 44.00 n = 50 std=5% 22.11 std=10% 36.42 n = 200 std=5% 18.13 Note: The introduced error sum = |e1n |+|e2n |. error sum 47.97 23.98 37.41 18.70 As for the accuracy of the estimates, the estimates are not very accurate for some parameters for noisy data. Moreover, the inaccuracy is amplified with the size of the introduced error across parameters of both activities in both data sets. The reason may be the following. When measurement error evn is introduced from a normal distribution, duration for the actual estimation is changed to v + e vn . If the estimation method is insensitive to the introduced random duration error, the marginal utility (the LHS of equation) should be averaged to the true values. In other words, given the marginal utility MU = f (v ) , f (v ) ≈ ( f (v + e) + f (v − e) )/ 2 . For exa mple, the marginal utility of the true duration 440 should (more or less) be the same as the average of two marginal utilities of duration 440+40 and duration 440-40. Assume that α, β, γ, αx , βx and Ux are 75, 0.15, 0.8, 7, 0.15 and 250, respectively, T is 15, and the average duration v is 86. Then, the std of the random error is 8.6 and 4.3, which are 10 % and 5 % of the average duration, respectively. The average of the absolute values of the error randomly drawn from these sizes then was 6.7 and 3.3, respectively. Figure 5 shows the error-size impacts in terms of the difference between f (v ) and ( f (v + 6 .7 ) + f (v − 6.7) ) / 2 for error size of 10% and the difference between f (v ) and ( f (v + 3.3) + f (v − 3.3)) / 2 for error size of 5%. The error- averaged marginal utility is drawn as bold lines. The difference between the two is explained in Figure 6. In both figures, the thin, dotted and bold lines respectively denote the true value, the value with negative error, and the value with the positive error. 24 10th International Conference on Travel Behaviour Research _______________________________________________________________________ August 10-15, 2003 Figure 5 Average marginal utility curve with exact measurement and with measurement error of 10% (LHS) and 5% (RHS) of average duration 8 8 7 7 6 6 5 5 4 4 3 3 2 2 1 1 0 0 0 Figure 6 20 40 60 80 100 0 20 40 60 80 100 Marginal utility curve with exact measurement and with measurement error of ±10% (LHS) and ±5% (RHS) of average duration 8 7 6 5 4 3 2 1 0 8 7 6 5 4 3 2 1 0 0 20 40 60 80 100 0 20 40 60 80 100 The introduction of a single measurement error changes the α value, and hence, the inflection point of the critical point of the function. As long as the introduced measurement error is from a symmetrical normal distribution, however, the averaged marginal curve maintains the inflection point as before, which also keeps the predicted α values as before. Meanwhile, all other aspects of the functional curve changes as can be seen in Figure 5. As a result, the error introduction from a symmetric (normal) distribution implies the disruption of estimates of β, 25 10th International Conference on Travel Behaviour Research _______________________________________________________________________ August 10-15, 2003 αx and βx to some extent, but not α. In fact, throughout the estimation simulation, the α estimate was relatively irrelevant to the measurement error size. 5. Conclusions This paper has reported some main findings of a study, which aimed at developing an estimation method for deriving the parameters of the Aurora model. Unlike other utility models of time use, Aurora is based on a complex asymmetric S-shaped utility function. While we argue that this specification has some clear theoretical advantages, the estimation of the model becomes highly complex. Before applying a method to real empirical data, we felt it was important to first study the performance of the suggested approach to simulated data. The suggested estimation uses a combination of searching the solution space, using a tailored genetic algorithm and some theoretical concepts. In particular, a key assumption is that an activity schedule is the result of equalizing the marginal utilities of activities subject to time pressure. The method was specifically developed for the case where duration (time use) data are available. The results of the simulations suggest that the proposed estimation method performs well on the exact data, and reasonably good on the noisy data but with some exceptions of particular parameters. The simulated noises were time pressure, duration rounding and overall measurement error in duration. Among the simulated errors, the overall measurement error in duration has biggest impacts on the accuracy of the parameter estimation, bigger than the error in measuring the time pressure variables and the systematic rounding in reporting the duration. This suggests the need to attempt to increase the quality of duration data to the extent possible. It also shows the typical problems of non-linear models, especially the interaction between parameter estimates. Considering the latter, we recommend to use a sequential estimation strategy where each parameter is estimated in turn. It did provide the most stable results for the present model, and these results can probably be generalized to similar models. 26 10th International Conference on Travel Behaviour Research _______________________________________________________________________ August 10-15, 2003 Having developed this estimation model, we can apply and test the model to real, empirical data. The results of that endeavor will be reported in future publications. 6. References Arentze, T.A. and H.J.P. Timmermans (2000) Albatross: A Learning-Based Transportation Oriented Simulation System, European Institute of Retailing and Services Studies, The Hague. Arentze, T.A., M. Dijst, E. Dugundji, C.H. Joh, L. Kapoen, S. Krijgsman, C. Maat, H.J.P. Timmermans and J. Veldhuisen (2001) Amadeus : Scope and conceptual development, Paper presented at the 9th World Conference on Transportation Research, Seoul, July 2001. Becker, G. S. (1965) A theory of the allocation of time, The Economic Journal, 75 493-517. Davis, L. (ed.) (1991) Handbook of Genetic Algorithms, Van Nostrand Reinhold, New York. Gärling, T., R. Gillholm and W. Montgomery (1999) The role of anticipated time pressure in activity scheduling, Transportation, 26 173-191. Goldberg, D.E. (1989) A gentle introduction to genetic algorithms, in D.E. Goldberg, Genetic Algorithms in Search, Optimization, and Machine Learning, 1–25, Addison-Wesley, Amsterdam. Herrera F., M. Lozano and J.L. Verdegay (1998) Tackling real-coded genetic algorithms: Operators and tools for behavioral analysis, Artificial Intelligence Review, 12 265-319. Joh, C.H., T.A. Arentze and H.J.P. Timmermans (2001a) A theory and simulation model of activity-travel rescheduling behavior, Paper presented at the 9th World Conference on Transportation Research, Seoul, July 2001. Joh, C.H., T.A. Arentze and H.J.P. Timmermans (2001b) Multidimensional sequence alignment methods for activity-travel pattern analysis: A comparison of dynamic programming and genetic algorithms, Geographical Analysis, 33 247-270. Joh, C.H., T.A. Arentze and H.J.P. Timmermans (2002) Modeling individuals' activity-travel rescheduling heuristics: Theory and numerical experiments, Transportation Research Record, 1807 16-25. Kitamura, R. (1984) A model of daily time allocation to discretionary out-of- home activities and trips, Transportation Research B, 18 255-266. Richards, F.J. (1959) A flexible growth function for empirical use, Journal of Experimental Botany, 10 290-300. 27 10th International Conference on Travel Behaviour Research _______________________________________________________________________ August 10-15, 2003 Timmermans, H.J.P., T.A. Arentze and C.H. Joh (2001) Modeling the effects of anticipated time pressure on the execution of activity programs, Transportation Research Record, 1752 8-15. Wright, A. (1991) Genetic algorithms for real parameter optimization, in G.J.E. Rawlin (ed.), Foundations of Genetic Algorithms I, 205-218, Morgan Kaufmann, San Mateo. 28