资源描述
交通统计与分析同济课件
学习—————好资料
Tongji University
School of Transportation Engineering
Homework 2
Discrete Choice Model
You are asked to develop a mode choice model for work trips in the City of Toronto, Canada. For this purpose, a one-day travel diary survey A subset of the data in the 1986 Transportation Tomorrow Survey (TTS) for the Greater Toronto Area
was conducted for a sample of 49 workers. The survey includes the worker’s socio-economic characteristics, mode characteristics and the worker’s mode choice among the four modes of travel (auto-drive, auto-passenger, local transit, and commuter rail). This survey data is stored in Homework 2.txt in the following format:
t
mode
choice
ivtt
ovtt
tc
nveh
Inc
female
age
hhsize
1
1
0
57
8.23
7.78
2
38
1
32
2
2
1
57
8.23
3.89
2
38
1
32
2
3
0
64.81
18.47
1.42
2
38
1
32
2
4
0
33.99
18.54
0.2
2
38
1
32
2
2
1
0
:
:
:
:
:
:
:
:
2
0
:
:
:
:
:
:
:
:
3
0
:
:
:
:
:
:
:
:
4
1
:
:
:
:
:
:
:
:
Note:
t = worker number
mode = 1 – auto-drive, 2 – auto-passenger, 3 – local transit, 4 – commuter rail
choice = 1 – selected mode, 0 – unselected modes
ivtt = in-vehicle travel time for each mode (min) Same travel time for auto-drive and auto-passenger
ovtt = out-of-vehicle travel time for each mode (min)2
tc = travel cost (dollars) Travel cost for auto-passenger is assumed to be a half of the travel cost for auto-drive.
nveh = number of vehicles in the worker’s household
inc = worker’s income (thousand dollars)
female = worker’s gender (1 – female, 0 – male)
age = worker’s age
hhsize = number of persons in the worker’s household
The following multinomial logit model was used to estimate the worker’s mode choice:
Where
Pit = probability that worker t chooses mode i
Vit = utility of mode i for worker t
V1t,V2t,V3t,V4t = utilities of auto-drive, auto-passenger, local transit and commuter rail,
respectively, for worker t
For four travel modes, four different systematic (observable) utility functions were specified as follows:
Auto-drive:
Auto-passenger:
Local transit:
Commuter rail:
Coefficients of the variables in the above utility functions were estimated using the LIMDEP 7.0 software (Greene, 1998). The coefficients are summarized as follows:
Estimated parameters
Variable
Coefficient
t-statistics
q1 (constant)
10.17
2.04
q2 (constant)
5.72
1.25
q3 (constant)
5.61
1.87
a1 (auto drive in-vehicle travel time)
-0.07
-1.65
a2 (auto passenger in-vehicle travel time)
-0.07
-1.20
a3 (local transit in-vehicle travel time)
-0.14
-3.00
a4 (commuter rail in-vehicle travel time)
0.05
0.68
b (out-of-vehicle travel time)
-0.06
-1.69
c (travel cost)
-1.60
-2.60
d (household size)
0.98
2.47
e (female worker)
1.63
1.76
The goodness-of-fit of the logit model is as follows:
Goodness of fit statistics:
Number of observations = 49
Log-likelihood at convergence = -46.4632
Log-likelihood at b=0 = -67.9284
Log-likelihood ratio index (r2) = 0.32
t23 = t32 = 2
t13 = t31 = 4
Question 1:
Discuss the results of the above logit model estimation in terms of
(a) statistical significance of variables at a 90% confidence level;
(b) the signs of the coefficients (e.g. why is the sign of out-of-vehicle travel time negative or the sign of household size positive?, etc.);
(c) model fit; and
(d) additional data (not included in TDS_sample.txt) which you would have liked to have had for inclusion in the utility functions.
Question 2:
Using the logit model, calculate the following:
(a) Calculate the probabilities of choosing the four modes for each worker.
(b) Calculate the average probabilities for all workers using the results in (a).
(c) Calculate the averages of variables in utility functions for all workers. Use these average values to estimate the probabilities of choosing the four modes for all workers (i.e. naive aggregation).
(d) Describe the characteristics of the mode with the highest probability compared to the other modes in terms of the averages of variables obtained in part (c).
(e) Classify the workers into 10 homogeneous worker groups according to the number of persons in the worker’s household and the worker’s gender. For instance, let the first group be a male worker whose household size is 1 (i.e. female = 0, hhsize = 1), the second group be a male worker whose household size is 2 (i.e. female = 0, hhsize =2), and so on. Calculate the averages of variables in utility functions for each worker group. Use these average values to estimate the probabilities of choosing the four modes for each worker group. Finally, calculate the “weighted” average probabilities using these probabilities for groups (i.e. classification with naive aggregation).
(f) Compare the results in (b), (c) and (e) with the “observed” percentages of mode choice from the original survey data. Evaluate the accuracy based on the root-mean-square errors as follows:
where Pi = observed percentage of mode i, and P¢i = predicted average probability of choosing mode i for all workers.
Which level of aggregation is the most accurate? Why?
(g) The local transit company plans to reduce transit fare to attract more passengers. Thus, the average transit travel cost for all workers will be reduced to $0.70. Estimate the forecasted modal split using naive aggregation, assuming that all other conditions remain the same.
精品资料
展开阅读全文