Abstract

Connected and automated vehicles (CAVs), particularly those with a hybrid electric powertrain, have the potential to significantly improve vehicle energy savings in real-world driving conditions. In particular, the ecodriving problem seeks to design optimal speed and power usage profiles based on available information from connectivity and advanced mapping features to minimize the fuel consumption over an itinerary. This paper presents a hierarchical multilayer model predictive control (MPC) approach for improving the fuel economy of a 48 V mild-hybrid powertrain in a connected vehicle environment. Approximate dynamic programing (DP) is used to solve the receding horizon optimal control problem, whose terminal cost is approximated with the base policy obtained from the long-term optimization. The controller was tested virtually (with deterministic and Monte Carlo simulation) across multiple real-world routes, demonstrating energy savings of more than 20%. The controller was then deployed on a test vehicle equipped with a rapid prototyping embedded controller. In-vehicle testing confirm the energy savings obtained in simulation and demonstrate the real-time ability of the controller.

1 Introduction

Connected and automated vehicles (CAVs) have the potential to increase safety, driving comfort, as well as fuel economy, by exploiting look-ahead driving information available via advanced navigation systems, vehicle-to-vehicle, and vehicle-to-infrastructure (V2I) communication [13]. Intuitively, with these connectivity technologies, a controller can plan a speed trajectory that minimizes unnecessary acceleration and braking events, thereby improving driver comfort and fuel-efficiency [4]. Meanwhile, powertrain electrification can increase the vehicle fuel (or energy) efficiency, by including battery packs and electric motors as alternative energy storage and power generation devices, respectively [5]. While combining these two technologies could further compound their efficiency improvements, they present a greater challenge from a planning and control perspective [6].

Ecodriving, defined in literature as the control of vehicle velocity for minimizing the fuel or energy consumption over an itinerary, is gaining prominence [7]. Based on the powertrain configuration considered in each application, the methods developed for determining the optimized velocity profile can vary, depending on whether the powertrain is equipped with a single power source [810] or a hybrid electric drivetrain [11,12]. The latter involves modeling multiple power sources and devising optimal control algorithms that can split the power demand in a synergistic manner to efficiently utilize the electric energy stored in the battery. Most of the existing literature consider the ecodriving problem in a decentralized fashion, i.e., the speed trajectory optimization and powertrain control are performed sequentially, and then integrated within a hierarchical control structure [13]. Based on overall system efficiency and calibration effort considerations, this paper explores an alternative approach, where the speed trajectory and powertrain torque split are jointly optimized.

Incorporation of V2I information enables the processing and use of signal phase and timing (SPaT) information in real-time for performing eco-approach and departure (eco-AND) at signalized intersections. Eco-AND refers to velocity control for increasing the likelihood to pass through a signalized intersection during a green window. Studies from literature have shown that such maneuvers can improve the overall fuel-efficiency [1416]. When eco-AND is combined with ecodriving, the added complexity means that the controller may not always be able to compute a speed trajectory without violating traffic rules (i.e., optimization constraints) [17]. This warrants a unified optimization framework combining ecodriving and eco-AND that can improve the fuel-efficiency and ensure constraint satisfaction in a broad range of driving scenarios, while remaining computationally tractable on embedded control hardware platforms.

The objective of the work is to conduct energy optimization of an individual vehicle (noncooperative case). The vehicle is however intended to operate in an environment with traffic and infrastructure (communicating and noncommunicating). This paper focuses on the development of a hierarchical multilayer ecodriving controller based on model predictive control (MPC), that jointly optimizes the vehicle speed and battery state of charge (SoC). The proposed control framework is a novel alternative to state-of-the-art [3,13], in that the problem of velocity optimization and energy management is solved as a centralized optimization problem.

To account for route uncertainty and limited V2I communication range, and to mitigate the intensive onboard computation, the optimization routine is split into a long-term and short-term optimal control problem as per the availability of the look-ahead information. The long-term optimization is performed over the entire route with the look-ahead information such as speed limits and position of stop signs available from an advanced navigation system. To account for variability in route conditions such as variable speed limits and SPaT information, the full-route optimization is cast as a receding horizon optimal control problem and evaluated periodically in real-time via MPC over a short horizon. While MPC has been widely used in real-time control applications [18], a key challenge still remains in its implementation—the definition of an appropriate terminal cost and/or terminal state constraints for performance and stability [19]. Using principles from approximate dynamic programing (DP), specifically the rollout algorithm, this paper provides a novel methodology for appropriately selecting the terminal cost of the MPC.

The proposed solution method yields optimal closed-loop strategies that are robust against external disturbances and has its theoretical foundations in the traditional DP [2022]. This hierarchical optimization framework is well-suited for in-vehicle implementation, where advanced mapping systems can provide long-term route information and vehicle-to-everything (V2X) technologies can update this information in the short-term. Further, this work proposes a systematic and comprehensive verification framework for virtual and experimental evaluation of fuel savings from the CAV optimization algorithms. Simulations and track testing results validate the developed rollout algorithm-based optimization framework making it suitable for real-time applications containing environmental disturbances and modeling uncertainties.

2 Model of Parallel Mild-Hybrid Electric Vehicle

The P0 mild-hybrid electric vehicle (mHEV) architecture studied in this work is illustrated in Ref. [23]. A belted starter generator (BSG) is connected to the crankshaft of a 1.8L turbocharged gasoline engine equipped with dynamic skip fire, DSF [24,25], and a 48V battery pack.

A forward-looking vehicle simulator was developed in matlabsimulink to evaluate fuel consumption and compare different control strategies over prescribed routes. The inputs to the plant model in Fig. 1 are the desired BSG torque (Tbsg,tdes) and desired engine torque (Teng,tdes), which are obtained from a simplified model of the software in the electronic control module (ECM). This contains a baseline torque split strategy and essential functions that convert the driver's input (pedal positions) to commands, which are fed to the powertrain components. Note that in the descriptions of the equations that follow, the arguments of certain terms may be suppressed for brevity.

Fig. 1
Block diagram of 48 V P0 mild-hybrid drivetrain
Fig. 1
Block diagram of 48 V P0 mild-hybrid drivetrain
Close modal

2.1 Vehicle Dynamics.

The vehicle acceleration is modeled with the road-load equation, considering only the longitudinal dynamics and disregarding the lateral dynamics [5]
(1)

where vt is the velocity of the vehicle, M is the effective mass of the vehicle (including driveline rotational inertia and the total payload), Tout,t is the net output torque, Fbrk,t is the braking force, Rw is the rolling radius of the wheel, and Froad,t is the road-load, which is defined as the force imparted on a vehicle from aerodynamic drag, tire rolling resistance, and road grade.

2.2 Powertrain.

The engine fuel consumption is modeled using static nonlinear maps ψt(·,·,·), as functions of the gear, engine speed, and torque
(2)

where ngr,t is the selected gear number, Teng,t is the engine torque, and ωeng,t is the engine speed.

The BSG is modeled using an efficiency map, which is a quasi-static representation of torque production
(3)

where ωbsg,t is the BSG speed, rbelt is the belt ratio, Pbsg,t is the electrical power required to produce a torque Tbsg,t at speed ωbsg,t, and ηbsg,t is the BSG efficiency.

The battery is modeled as a zeroth‐order equivalent circuit, which comprises an ideal voltage source and a resistor in series. The voltage across the circuit is considered the terminal battery voltage. The model equations are
(4)

where Voc,t is the battery open-circuit voltage, R0 is an approximation of the battery internal resistance, Ibatt,t is the battery current, ξt is the battery SoC, and Cnom is the nominal capacity of the battery. Further, a calibration term Ibias is introduced as a highly simplified representation of the onboard electrical auxiliary loads.

Power at the flywheel is transmitted to the wheels via a torque converter and six-speed automatic gearbox. A simplified torque converter model is developed with the purpose of computing the power losses during DSF activity. The desired clutch slip, ωslip,tdes, is determined by the transmission control module using maps as a function of the gear, engine speed, and torque. The model equations are
(5)

where ωp,t is the speed of the torque converter pump, ωturb,t is the speed of the turbine, ωeng,stall is the speed at which the engine stalls, ωidle is the idle speed (target) of the engine, stop is a flag from the ECM indicating engine shut-off when the vehicle is stationary, Tturb,t is the turbine torque, and Tpt,t is the powertrain torque.

The transmission is modeled as a static gearbox with efficiency ηtran,t, which is determined empirically from vehicle test data
(6)

where rf is the final drive ratio, rgr,t is the gear ratio, and Tout,t is the transmission output shaft torque.

The model was validated on experimental data [23]. Figure 2 presents results over the Federal Test Procedure (FTP) drive cycle, where the vehicle velocity, battery SoC, and fuel consumption are compared against experimental data. The predicted vehicle velocity, battery SoC, and fuel consumption profiles adequately match the experimental data, with slight mismatches that can be attributed to the simplifications made to the powertrain model.

Fig. 2
Validation of forward vehicle model over FTP cycle
Fig. 2
Validation of forward vehicle model over FTP cycle
Close modal

3 Problem Formulation

The objective of the nonlinear dynamic optimization problem, formulated in the spatial domain, is to minimize the fuel consumption of the vehicle over an entire itinerary. A key benefit of a spatial trajectory formulation is that it naturally lends itself to the incorporation of route-related information, such as posted speed limit signs, location of traffic lights, and stop signs, whose positions along the route remain fixed.

Consider a dynamic control problem discretized in the spatial domain having the form
(7)

where s is the discrete position, xsXp is the state, usUq is the input or control, and fs is a function that describes the state dynamics. In this work, the state variables chosen are the vehicle velocity and the battery SoC: xs=[vs,ξs]T. The engine torque and BSG torque are chosen as the control variables: us=[Teng,s,Tbsg,s]T.

The control and the state are constrained, and the constraint function hs:X×Ur is expressed as hs(xs,us)0,s=1,,N, which includes the route speed limits, operating limits of physical actuators and subsystems, constraints for drive comfort, and so on. An admissible control map at position s is a map μs:XU such that h(x,μs(x))0,xX. The collection of admissible control maps is denoted by M:=(μ1,,μN), which is referred to as the policy of the controller.

The controller aims at minimizing a cost, given by
(8)
where cs:X×U is the per stage cost function, defined in this work as a weighted average of the fuel consumption and travel time
(9)

where the weight γ(0,1) is a tunable penalty factor that can be used to tradeoff between the amount of fuel consumed and time taken to complete the route; effectively, it constitutes a driving aggressiveness parameter, m˙f,s is the fuel consumption rate, m˙fnorm is a cost normalizing weight, and ts is the travel time per step.

4 Full-Route Optimization Using Dynamic Programing

To perform the constrained optimization problem described in Sec. 3, a custom DP algorithm was developed and employed [26]. DP uses the Bellman principle of optimality equation to break the optimization problem into smaller subproblems, solving them via backward recursion
(10)
Further, the policy M*=(μ1*,,μN*) is optimal if for each xs and s, μs*(xs) minimizes the right side of Eq. (10) [27]. Here, Js(xs) is interpreted as the optimal cost for the (N+1s)-stage problem starting at state xs and position s, and ending at position N + 1. For use in Sec. 5, the following assignment is made:
(11)

where Vs is termed the value function, equal to the cost-to-go function at position s.

The constraints of the N-step optimization are defined as follows:
(12)
where vsmin,vsmax are the minimum and maximum route speed limits, respectively; ξmin,ξmax represent the static limits applied on battery SoC variation; amin, amax are the limits imposed on the vehicle acceleration for comfort; Teng,smin,Teng,smax are the state-dependent minimum and maximum torque limits of the engine, respectively; and Tbsg,smin,Tbsg,smax are the state-dependent minimum and maximum BSG torque limits, respectively. To ensure SoC-neutrality over the global optimization, a terminal constraint is applied on the battery SoC: ξ1=ξN+1. Here, as the dynamic optimization problem is solved by the DP algorithm, the state dynamics introduced in Eqs. (1) and (4) are discretized and transformed to spatial domain
(13)

where Δds is the distance over one step (i.e., Δds=ds+1ds, where ds is the distance traveled along the route at position s), and v¯s(=(vs+vs+1)/2) is the average velocity over one step.

Fig. 3
Process for verification of proposed optimization strategies
Fig. 3
Process for verification of proposed optimization strategies
Close modal

5 Model Predictive Control Using Rollout Algorithm (Ecodriving)

In the presence of variability in route conditions and/or uncertainty in route information, the vehicle velocity and powertrain optimization will need to be rerun with the updated information to reflect these changes. This serves as the primary motivation to construct a receding horizon optimization problem by truncating the full-route horizon of N-steps to NHN steps, formulated at a position s=1,,NNH+1 as
(14)

where NH is the number of steps in the receding horizon. This MPC or look-ahead optimization problem is subject to the same constraints introduced in Eq. (12).

A key challenge in Eq. (14) is the definition of an appropriate terminal cost and/or terminal state constraints that approximate the optimal solution provided by DP in a full-information scenario. Some other methods adopted in the literature for constructing approximations of the value function include local linear approximation [27,28], Monte Carlo-based [29], and Q-learning-based approaches [3032]. This work introduces a terminal cost (or equivalently, the cost to complete the remaining route) approximation strategy based on the use of approximate dynamic programing, specifically the rollout algorithm.

Rollout algorithms form an important class of DP-based online suboptimal control techniques based on approximation in the value space. Here, the optimal cost-to-go function is replaced by an approximation, resulting in a suboptimal DP policy. Using the rollout algorithm framework, the following one-step look-ahead optimization problem is solved from position k+NH1 to k,k=1,,NNH+1:
(15)

where the approximation J̃k+NH (and as a result J̃s+1) is the cost-to-go of a known suboptimal policy, termed as the base policy or base heuristic, and M̂*:=(μ̂k*,,μ̂k+NH1*) is the rollout policy. One of the properties that make the rollout algorithm attractive for onboard optimization is the cost improvement property [33], namely, if the base heuristic produces a feasible solution, the rollout algorithm also produces a feasible solution whose cost is no worse than the cost corresponding to the base heuristic (proof in Ref. [27]).

For cost improvement to be valid, it is important that the base heuristic and the rollout policy are computed over the same constraint set. Here, the base heuristic is chosen as the value function of the corresponding full-route DP

where Vk is the value function of the full-route DP solution at k, the global position along the route. The rationale behind this is explained using the Bellman principle of optimality equation. Setting the terminal cost in Eq. (15) to the value function of the full-route DP results in Eqs. (10) and (11) for the NH-step problem. Solving this system of equations thus yields the optimal cost for the look-ahead optimization problem. This claim is valid in the absence of traffic or other uncertainties en route, i.e., as long as the per stage cost remains the same for both the NH-step and N-step DP.

The formulation of the ecodriving problem using the rollout algorithm is summarized as follows. At each global position k along the route, the problem below is solved from local position k+NH1 to k,k=1,,NNH+1:
subject to: k=1,,NNH+1
(16)

5.1 Incorporation of Signal Phase and Timing Information and Eco-Approach and Departure.

Incorporating SPaT information in the ecodriving algorithm can be addressed by adding time as a state variable in the optimization problem formulation [34]. However, this is accompanied by an exponential increase in the computational effort required [35].

In this work, a rule-based approach is developed to increase the likelihood of passing-in-green by using SPaT information broadcast to the vehicle within communication range to determine kinematically feasible vehicle velocity constraints that are then imposed to the MPC routine.

Feasible velocity constraints are applied as offsets to the speed limits, determined by first examining the possibility to execute a pass-in-green maneuver while respecting vehicle dynamics constraints and route speed limits. If deemed infeasible, the traffic light is treated as a stop sign and the vehicle decelerates to a stop according to the fuel-optimal profile computed by the rollout algorithm. If a pass-in-green maneuver is deemed feasible, the eco-AND algorithm uses the current vehicle velocity, distance to the upcoming intersection, and the SPaT information to calculate the offsets voff,smin or voff,smax that are applied to the minimum and/or maximum route speed limit, respectively. The effective velocity constraints fed to the MPC are
(17)

To ensure a feasible solution, the constraints fed to the rollout algorithm are shaped by recursively applying vs+12=vs22aminΔds, from the current speed till the calculated (modified) speed limit is reached. Constraint shaping increases the likelihood with which the vehicle crosses the intersection in the green window while ensuring kinematic feasibility for use in the optimization routine.

6 Verification Framework

The process for verification of the vehicle velocity and powertrain optimization strategy is summarized in Fig. 3. The virtual evaluation of the ecodriving case involves the generation of multiple simulation scenarios, where the parameter γ in the optimization is varied to determine and quantify the Pareto-optimal fronts among the objectives (total fuel consumption and trip travel time). This evaluation is initially performed by assuming that all the traffic lights along the route are stop signs. The optimizer is benchmarked against a realistic baseline, which is assumed as the same demonstration vehicle. Further, no longitudinal automation is assumed, and for this reason all the simulations were conducted by including a validated enhanced driver model (EDM), a modified car-following model that mimics the behavior of a human driver in presence of speed limits and signalized intersections [36].

Fig. 4
Speed limits and locations of traffic lights and stop signs along the representative urban route selected for comparison
Fig. 4
Speed limits and locations of traffic lights and stop signs along the representative urban route selected for comparison
Close modal

Following this initial virtual verification, experimental testing was conducted on a closed test track at the Transportation Research Center (TRC) Inc. in East Liberty, OH, where the ecodriving algorithm was demonstrated through real-time implementation in the demonstration vehicle. To obtain a realistic baseline for benchmarking the results, the vehicle was fitted with a brake and throttle robot (BTR) [37], programed to follow pre-established velocity profiles generated in simulation with the EDM. This deterministic scenario was used to validate the simulation tools on specific scenarios, offering an initial estimate of the fuel-saving potentials of the developed ecodriving algorithm.

Finally, a more comprehensive virtual verification (evaluation of ecodriving algorithm with eco-AND, in Fig. 4) was performed using a Monte Carlo simulation framework, in which the driver aggressiveness and the SPaT information are treated as random variables. From the results of this simulation, in-vehicle verification at TRC was conducted by extracting and testing sample cases from corresponding scenarios selected from the Monte Carlo simulations. Here, SPaT variability, including communication latencies, was realistically emulated during testing. Benchmarking was performed using a BTR that follows the velocity profiles generated in simulation by the EDM, where the decision to approach a signalized intersection is based on the concept of line-of-sight (LoS) [38].

7 Virtual Evaluation

7.1 Test Route.

Figure 4 shows the features of the representative route over which the optimization routine is tested. This urban route set in Columbus, OH is 7.4km in length and comprises 22 traffic lights and three stop signs.

Fig. 5
Sample results from the ecodriving algorithm: Comparison of Pareto curve with the full-route DP and the MPC (Ecodriving algorithm)
Fig. 5
Sample results from the ecodriving algorithm: Comparison of Pareto curve with the full-route DP and the MPC (Ecodriving algorithm)
Close modal

7.2 Evaluation of Ecodriving Algorithm.

Figure 5 shows sample results from the ecodriving algorithm over the urban test route with no traffic, in which all the traffic lights are assumed to be stop signs. The figure compares the MPC ecodriving algorithm against the full-route solution obtained offline via DP. It is evident that the solution of the multi-objective optimization problem (9) yields a γ-dependent Pareto front. Along the Pareto curve, lower values of γ depict an increasingly aggressive driving style, while higher γ values represent more conservative behavior with smoother accelerations and braking maneuvers. For each γ, the vehicle velocity profile was smooth, and the torque split strategy determined by the optimization routine was charge-sustaining in nature. The comparison in Fig. 5 is obtained by imposing identical constraints on the full-route DP optimization and the MPC (ecodriving algorithm), and the value function from the full-route DP solution is applied as the terminal cost of each NH-horizon problem. The step sizes of the discretized distance are 10m, states are 1.36m/s,2%, and engine torque and BSG torque are 13.2m and 4.2Nm, respectively.

7.3 Evaluation of Ecodriving Algorithm With Eco-Approach and Departure.

The ecodriving algorithm with eco-AND is evaluated by performing large-scale Monte Carlo simulations over real-world routes in which each traffic light has time-varying SPaT information.

7.3.1 Baseline Case.

The baseline considered is the EDM, a deterministic reference velocity predictor that uses route characteristics to generate velocity profiles representing different levels of driver aggressiveness [36]. The EDM predicts the response of a human driver when operating a vehicle in presence of traffic, stop signs, and traffic lights by using a realistic LoS-based heuristic strategy [38]. For simulations over urban and mixed routes, a LoS of 100m is considered to be reasonable. The velocity reference from the EDM is fed to a tracking controller that generates the necessary inputs to the validated vehicle model described in Sec. 2.

An additional case is now constructed to quantify the fuel-saving potentials of eco-AND. The LoS approach (as introduced in Ref. [38]) is adapted and used in conjunction with the rollout algorithm to interact with traffic signals. Here, the signal phase information is available to the driver only within the LoS, and at any point beyond it, the traffic light is assumed to be a stop sign. It should be noted that unlike eco-AND, the ecodriving with LoS does not receive any timing information from the traffic light.

7.3.2 Variability in Signal Phase and Timing Information.

To recreate the degree of variability typically associated with SPaT at subsequent signalized intersections, a base SPaT sequence is extracted from the traffic simulation program sumo (simulationofurbanmobility) [39]. Starting from this base SPaT, all the traffic lights are offset by the same uniformly random value such that the phase difference between the traffic lights remains constant, i.e., the signals are synchronized. Alternately, this can be interpreted as considering the departure time as a uniform random variable: tdepU(0,tcyc), where tdep is the departure time and tcyc is the traffic cycle time, chosen in this study as 90s (fixed).

7.3.3 Monte Carlo Simulations.

Monte Carlo simulations are then performed over the test route by considering the SPaT information as a random variable. Here, 2000 different scenarios are generated by randomly changing the departure time (as described above). The different cases being compared in this simulation study are baseline (EDM with LoS of 100m), ecodriving with LoS (no timing information, current phase of each traffic signal is known within a LoS of 100m, NH = 20), and ecodriving with eco-AND (NH = 20). For both the ecodriving cases, three calibrations are considered: γ={0.4,0.7,0.82}, to represent an aggressive, normal, and relaxed driving behavior, respectively. Note that the choice of grid discretization used for this evaluation is shown in Table 1. For fair comparison, the baseline EDM for each simulation is calibrated to represent a driver aggressiveness comparable to that value of γ. In total, 6000 simulations are executed for each case.

Table 1

Choice of grid discretization in DP

Discretized variableStep size
Distance10m
Velocity1.36m/s
Battery SoC2%
Engine torque13.2Nm
BSG torque4.2Nm
Discretized variableStep size
Distance10m
Velocity1.36m/s
Battery SoC2%
Engine torque13.2Nm
BSG torque4.2Nm

Figure 6 shows the distributions of the fuel consumption for the baseline (inline image), ecodriving with LoS (inline image), and ecodriving with eco-AND (inline image) cases, corresponding to γ={0.4,0.7,0.82}. It is to be noted that the travel times obtained for each γ remain the same across the three cases being evaluated. A nonparametric probability density function known as the kernel density estimator is used to obtain the fitted distribution for each of the cases shown.

Fig. 6
Comparison of fuel consumption between baseline, ecodriving with LoS, and ecodriving with eco-AND cases (urban test route, γ={0.4,0.7,0.82}, all results at same travel time)
Fig. 6
Comparison of fuel consumption between baseline, ecodriving with LoS, and ecodriving with eco-AND cases (urban test route, γ={0.4,0.7,0.82}, all results at same travel time)
Close modal

Over the mixed test route, the ecodriving algorithm with eco-AND reduces the fuel consumption of the baseline 48V hybrid by 18%,18%, and 19% corresponding to γ={0.4,0.7,0.82}, respectively. Further, there is a significant reduction in the standard deviation of the fuel consumption distribution compared to the baseline. This is a notable result as this means that the ecodriving algorithm equipped with eco-AND can more consistently achieve a fuel consumption close to the mean of the distribution, i.e., the spread of the fuel consumption due to variability in the driver aggressiveness and SPaT information is significantly reduced. Note that the fuel savings from the ecodriving with eco-AND algorithms are not only dependent on the γ value and can vary significantly depending on the route features—particularly, the nature of the route (urban or extra-urban or mixed, affecting the average speeds), and number and density of traffic lights and stop signs. For the urban route shown, the high density of traffic lights (3 traffic lights/km) and low average speed limit reduce the influence of driver aggressiveness on the fuel consumption, thereby resulting in comparable fuel savings across the three γ values.

To quantify the benefits obtained from eco-AND specifically, the results from ecodriving with LoS are compared against that with eco-AND. For the urban route with 22 traffic lights, the eco-AND provides an incremental 6% fuel saving over the LoS implementation, with similar mean travel time. Simulations were also performed over other urban and mixed routes, the eco-AND results in 210% additional fuel savings. The reason for the increased benefits from the pass-in-green algorithm over urban routes can be attributed to the higher density of traffic lights.

8 In-Vehicle Implementation and Results

The MPC framework was implemented in the demonstration vehicle, a 2016 VW Passat 1.8L with a six-speed automatic transmission and turbocharged gasoline engine, which was retrofitted as a mHEV by installing a BSG and a 48V battery pack. The demonstration vehicle was also equipped with CAV technologies and DSF (shown in Fig. 7(a)). For all baseline testing, the 48V mild-hybrid system is active and DSF is disabled, while for all optimizer testing both the mild-hybrid system and DSF are enabled. The onboard CAV technologies include an advanced GPS module for enhanced navigation, a dedicated short range communication module that enables V2X communication, camera, and radar modules to support adaptive cruise control (ACC) functionality. These CAV technologies enable the ability of full longitudinal control of the vehicle to achieve SAE Level 1+ functionality (in accordance with SAE J3016).

Fig. 7
Images of demonstration vehicle and experimental testing location: (a) NEXTCAR test vehicle and (b) aerial view of high-speed test track at TRC
Fig. 7
Images of demonstration vehicle and experimental testing location: (a) NEXTCAR test vehicle and (b) aerial view of high-speed test track at TRC
Close modal

The real-time ecodriving algorithm (with eco-AND), integrated with V2X communication and ACC, has been implemented using rapid prototyping hardware (dSPACE MicroAutoBox II, or MABx in short) in the test vehicle. Online calibration of control parameters and data logging are performed using dSPACE controldesk software via the host Ethernet interface. A significant result to be noted here is that the implemented optimizer can compute the solution to a 20-step receding horizon optimization problem via the two-state, two-input DP in 200ms on the MABx.

8.1 Experimental Test Setup.

All vehicle tests were performed in a single lane with 0% grade on the 7.5mi high-speed test track located at TRC (refer to Fig. 7(b)). The results presented in this section are related to tests run over an urban route, as shown in Fig. 5. For each of the tests conducted, the measured and calculated variables are vehicle speed, battery SoC, cumulative fuel consumed, and travel time. Here, the reader is encouraged to refer to Ref. [40], in which the in-vehicle implementation of the optimizer is compared and verified against the corresponding simulation models over a mixed (urban-highway) route.

8.1.1 Test Setup for Evaluation of Ecodriving Algorithm.

For evaluation of the ecodriving case, three calibrations for the driver aggressiveness parameter are considered: γ={0.3,0.7,0.75}. In this scenario, all the traffic lights along the route are assumed to be flashing red, i.e., treated as stop signs. Table 2 summarizes the test setup for the ecodriving test scenario. Note that the horizon length corresponding to NH = 20 in this case takes a range of values (150750m), as the step size of the discretized distance grid is not fixed. This enables the usage of a fine discretization in certain sections of the route and coarser in other sections, allowing the ecodriving algorithm to satisfactorily capture the model dynamics while running in real-time on the MABx.

Table 2

Summary of ecodriving and baseline test setup

VariableEcodrivingBaseline
HEV state48V mHEV48V mHEV
DSF stateONOFF
Vehicle mass1850kg1850kg
Grade0%0%
Initial SoC50%50%
ControlsEcodriving controlECM
DriverACCBTR
VariableEcodrivingBaseline
HEV state48V mHEV48V mHEV
DSF stateONOFF
Vehicle mass1850kg1850kg
Grade0%0%
Initial SoC50%50%
ControlsEcodriving controlECM
DriverACCBTR

8.1.2 Test Setup for Evaluation of Ecodriving Algorithm With Eco-Approach and Departure.

In order to evaluate the ecodriving algorithm with eco-AND, a test environment was designed where traffic light SPaT information is broadcast to the optimizer as the vehicle approaches a signalized intersection. To significantly reduce the number of tests and the resulting testing time without compromising the validity of the results, sample cases from the Monte Carlo simulations (Fig. 6) were selected and reconstructed at TRC.

For each tested condition, the values of γ and the SPaT scenario were selected to represent a dominant mode of the respective Monte Carlo simulation over that route. In this work, γ=0.7 is chosen for the reconstructed route as it compactly represents normal driver aggressiveness. The SPaT scenarios are replicated in real-time by broadcasting them on a roadside unit (RSU). For convenience in setup and testing, this RSU is mounted on the rear seat and connected to a supplementary 12V battery as shown in Fig. 8. The V2I communication is emulated via the in-vehicle RSU that broadcasts SPaT and MAP information in accordance with the SAE J2735 standard to the onboard unit mounted on the vehicle.

Fig. 8
RSU (mounted in the rear seat) that broadcasts SPaT information within dedicated short range communication range of the upcoming traffic light: (a) RSU apparatus and (b) illustration of test vehicle approaching a signalized intersection
Fig. 8
RSU (mounted in the rear seat) that broadcasts SPaT information within dedicated short range communication range of the upcoming traffic light: (a) RSU apparatus and (b) illustration of test vehicle approaching a signalized intersection
Close modal

Note that the travel time recorded for testing the ecodriving with eco-AND case includes the time spent at stop signs and the wait time at red traffic lights along the route. Here, the test setup for evaluation is identical to that for the ecodriving case in Table 2 but for the “controls” category, where the ecodriving algorithm equipped with eco-AND capability is used.

8.1.3 Test Setup for Baseline Evaluation.

To ensure repeatability of the experiments, a BTR is programed and calibrated to follow reference velocity profiles which are generated in simulation using the EDM calibrated to represent a variety of driving styles. This ensures repeatability and controlled variability across baseline tests. The BTR accurately tracks the reference velocity profile by robotic actuation of the accelerator and brake pedals with a human driver only responsible for the steering action. Table 2 summarizes the test setup adopted to evaluate the baseline. Here, note that the processing of the baseline test results differ depending on whether the ecodriving or the ecodriving with eco-AND is being benchmarked. For fair evaluation of the ecodriving case, the travel time recorded excludes the time spent at stops along the route, while for the ecodriving with eco-AND case, the travel time includes the time spent at stop signs as well as the wait time at red traffic lights.

8.2 Experimental Results for Ecodriving.

To evaluate the benefits of the ecodriving algorithm over a representative baseline, multiple trips are executed over the same reconstructed urban route for the chosen γ values (refer Table 2). The travel times from the ecodriving tests are used to generate the EDM parameter sets with similar (driver) aggressiveness and travel times for each γ. The generated EDM velocity profiles are then used as reference inputs to the BTR.

Figure 9(a) compares the vehicle testing results for the baseline (executed using the BTR) against the ecodriving case, for each γ. Here, the error bars represent the maximum deviation of the data points from their respective mean and are determined using data from at least five test runs. Over the urban route tested, featuring 23 stops, the ecodriving algorithm with DSF enabled reduces the fuel consumption by over 25% (and up to 33%), relative to the baseline, with comparable travel time.

Fig. 9
In-vehicle testing of the ecodriving algorithm and comparison against baseline over urban route: (a) Pareto curve, (b) vehicle velocity and battery SoC profile, γ = 0.7, and (c) cumulative fuel consumption, γ = 0.7
Fig. 9
In-vehicle testing of the ecodriving algorithm and comparison against baseline over urban route: (a) Pareto curve, (b) vehicle velocity and battery SoC profile, γ = 0.7, and (c) cumulative fuel consumption, γ = 0.7
Close modal

In Figs. 9(b) and 9(c), a run-to-run comparison of the states (vehicle velocity and battery SoC) and cumulative fuel consumption along the urban route are, respectively, shown for γ=0.7. A noticeable contrast is observed between the baseline and optimizer tests, with the ecodriving algorithm resulting in 33% fuel savings for similar travel time. Here, the DSF-enabled optimizer increases the operation of the engine within the DSF fly-zones and thereby leverages the fuel-saving potential of DSF.

Figure 10 shows the vehicle speed trajectories measured on the demonstration vehicle for the urban route and indicates the different operating modes of the mHEV powertrain for the optimizer and baseline experiments. The operating modes are engine only, E-assist, regenerative braking, engine-on battery recharge, deceleration fuel cutoff, DCCO, and start–stop. Here, the engine only mode is self-explanatory. In the E-assist mode, the BSG provides a torque assist to the internal combustion engine. In the regenerative braking or regen mode, the BSG acts as a generator to slow the vehicle down by converting the (excess) kinetic energy from the motion of the wheels to electrical energy, which is then stored in the 48V battery. In the engine-on battery recharge mode, the BSG acts as a generator again, and here the engine produces additional power to supply the required mechanical energy for conversion (to electrical energy). In deceleration fuel cutoff mode, based on certain conditions (such as current gear, engine speed, and so on), no fuel is injected into the engine during deceleration events, while the valves still cyclically open and close. The negative engine brake torque from the pumping losses here contributes to “engine braking,” which is apparent as long as the engine remains mechanically engaged to the transmission. Under braking events in Deceleration Cylinder Cut-Off (DCCO) mode, the fuel can be cut-off, and additionally the intake and exhaust valves can be closed such that the pistons act as air springs, thereby reducing the pumping losses. Finally, a start–stop system automatically shuts down (when the vehicle is stationary) and restarts the engine in the start–stop mode to reduce the time spent in idling.

Fig. 10
Comparison of operating modes between ecodriving experimental and baseline tests: (a) ecodriving algorithm test operating modes over urban route and (b) baseline test operating modes over urban route
Fig. 10
Comparison of operating modes between ecodriving experimental and baseline tests: (a) ecodriving algorithm test operating modes over urban route and (b) baseline test operating modes over urban route
Close modal

The results show that the ecodriving algorithm increases the use of the BSG to expand powertrain operation in the E-assist and regen modes. Further, in contrast to the baseline, the optimizer increases the engine only operation while significantly reducing powertrain operation in the highly fuel-inefficient engine-on battery recharge mode. This indicates that the ecodriving algorithm, which utilizes look-ahead information for optimal control, is less conservative than the baseline torque split controls. The fuel savings can be further increased by improving the tracking performance of the ACC to achieve a smoother velocity profile and optimize the use of DCCO.

8.3 Experimental Results for Ecodriving With Eco-Approach and Departure.

The specific case (γ and SPaT scenario tuple) selected from the Monte Carlo simulations for vehicle testing corresponds to a dominant mode of the statistical distribution. Over the reconstructed urban route considered, the average (over five runs) in-vehicle fuel consumption and travel time are observed to lie within the range of their corresponding distributions. Further, as seen in Table 3, they are closely comparable to their respective mean values from the Monte Carlo simulations (refer to Fig. 6).

Table 3

Mean travel time and fuel consumption results (virtual and experimental) using MPC ecodriving algorithm with eco-AND over urban route, γ=0.7

CaseTravel time (s)Fuel consumed (g)
Virtual (Monte Carlo simulation)712404
Experimental tests735417
CaseTravel time (s)Fuel consumed (g)
Virtual (Monte Carlo simulation)712404
Experimental tests735417

The baseline case considered to benchmark the optimizer is constructed such that the driver aggressiveness (i.e., resulting travel time) matches a dominant mode of the statistical distribution obtained through Monte Carlo simulations while being comparable to the mean travel time from the ecodriving with eco-AND testing. Figure 11 compares the in-vehicle test results from the baseline (using the BTR) and the ecodriving with eco-AND cases. The enhanced range (compared to typical human LoS) and information (signal time in addition to the current phase) enabled by V2I technologies is used by the eco-AND feature to provide modified speed constraints to the rollout algorithm-based optimization. While the number of stop-at-red scenarios remains the same in this specific case, the resulting velocity profile is much smoother with fewer acceleration–deceleration events.

Fig. 11
Run-to-run comparison of vehicle velocity and time–space plots between baseline and ecodriving with eco-AND experimental tests over urban route: (a) vehicle velocity, γ = 0.7 and (b) time–space plot, γ = 0.7
Fig. 11
Run-to-run comparison of vehicle velocity and time–space plots between baseline and ecodriving with eco-AND experimental tests over urban route: (a) vehicle velocity, γ = 0.7 and (b) time–space plot, γ = 0.7
Close modal

The torque split strategies of the baseline and ecodriving with eco-AND cases over the urban route are evaluated by comparing the resulting battery SoC and cumulative fuel consumption trajectories, shown in Fig. 12. A key benefit from the designed ecodriving algorithm with eco-AND is the resulting charge-sustaining torque split strategy. In contrast, the terminal SoC for the baseline case is 40% (while its initial SoC was 50%).

Fig. 12
Run-to-run comparison of battery SoC and cumulative fuel consumption profiles between baseline and ecodriving with eco-AND experimental tests over urban route: (a) battery SoC, γ = 0.7 and (b) cumulative fuel consumption, γ = 0.7
Fig. 12
Run-to-run comparison of battery SoC and cumulative fuel consumption profiles between baseline and ecodriving with eco-AND experimental tests over urban route: (a) battery SoC, γ = 0.7 and (b) cumulative fuel consumption, γ = 0.7
Close modal
Table 4

Comparison of cumulative fuel consumption and travel time from baseline (without DSF) and ecodriving algorithm with eco-AND (with DSF), obtained from experimental tests

CaseTravel time (s)Fuel consumed (g)
Baseline714538
Ecodriving with eco-AND735417
CaseTravel time (s)Fuel consumed (g)
Baseline714538
Ecodriving with eco-AND735417

Table 4 summarizes the fuel consumption and travel time obtained by the baseline and the ecodriving algorithm with eco-AND, as obtained from the in-vehicle tests. For the specific route tested here, the optimizer saves over 22% in fuel with a marginal increase of 2.9% in trip time while ensuring SoC-neutrality over the entire trip.

9 Conclusions and Future Work

In this paper, a multilayer hierarchical MPC framework is developed to solve the ecodriving problem for a 48V mild-hybrid powertrain in a connected vehicle environment. The control framework comprises a long-term co-optimization of vehicle speed and SoC performed over the entire route itinerary, and a short-term optimization where the variability in route conditions and/or uncertainty in route information are considered by transforming the full-route optimization into a receding horizon optimal control problem, solved periodically over shorter horizons. The terminal cost for the receding horizon optimization is approximated as the residual cost (or cost to complete the remaining route) from the full-route optimization and solved using approximate dynamic programing, specifically the rollout algorithm.

A comprehensive verification that includes virtual and experimental evaluations on a test vehicle demonstrates the fuel-saving potential of the hierarchical optimization framework. Simulation and track testing data show more than 20% reduction in fuel consumption compared to the baseline. Extensions of the algorithm are part of the ongoing work, including the implementation of the algorithm with the consideration of traffic information via vehicle-to-vehicle and signal phase and timing information via V2I communication.

Acknowledgment

The authors gratefully acknowledge BorgWarner Inc. (formerly Delphi Technologies) for providing continued technical support and for the insightful discussions.

Funding Data

  • U.S. Department of Energy, Advanced Research Projects Agency—Energy (Award No. DE-AR0000794; Funder ID: 10.13039/100000015).

References

1.
Guanetti
,
J.
,
Kim
,
Y.
, and
Borrelli
,
F.
,
2018
, “
Control of Connected and Automated Vehicles: State of the Art and Future Challenges
,”
Annu. Rev. Control
,
45
, pp.
18
40
.10.1016/j.arcontrol.2018.04.011
2.
Gupta
,
S.
,
Deshpande
,
S. R.
,
Tufano
,
D.
,
Canova
,
M.
,
Rizzoni
,
G.
,
Aggoune
,
K.
,
Olin
,
P.
, and
Kirwan
,
J.
,
2020
, “
Estimation of Fuel Economy on Real-World Routes for Next-Generation Connected and Automated Hybrid Powertrains
,”
SAE
Paper No. 2020-01-0593.10.4271/2020-01-0593
3.
Dong
,
S.
,
Gao
,
B.
,
Chen
,
H.
,
Huang
,
Y.
, and
Liu
,
Q.
,
2022
, “
Real-Time Velocity Optimization for Energy-Efficient Control of Connected and Automated Vehicles
,”
ASME J. Dyn. Syst., Meas., Control
,
144
(
1
), p.
011102
.10.1115/1.4052765
4.
Xu
,
S.
, and
Peng
,
H.
,
2018
, “
Design and Comparison of Fuel-Saving Speed Planning Algorithms for Automated Vehicles
,”
IEEE Access
,
6
, pp.
9070
9080
.10.1109/ACCESS.2018.2805883
5.
Guzzella
,
L.
, and
Sciarretta
,
A.
,
2007
,
Vehicle Propulsion Systems
, Vol.
1
,
Springer
, Berlin.
6.
Alam
,
M. S.
, and
McNabola
,
A.
,
2014
, “
A Critical Review and Assessment of Eco-Driving Policy & Technology: Benefits & Limitations
,”
Transp. Policy
,
35
, pp.
42
49
.10.1016/j.tranpol.2014.05.016
7.
Sciarretta
,
A.
,
De Nunzio
,
G.
, and
Ojeda
,
L. L.
,
2015
, “
Optimal Ecodriving Control: Energy-Efficient Driving of Road Vehicles as an Optimal Control Problem
,”
IEEE Control Syst. Mag.
,
35
(
5
), pp.
71
90
.10.1109/MCS.2015.2449688
8.
Jin
,
Q.
,
Wu
,
G.
,
Boriboonsomsin
,
K.
, and
Barth
,
M. J.
,
2016
, “
Power-Based Optimal Longitudinal Control for a Connected Eco-Driving System
,”
IEEE Trans. Intell. Transp. Syst.
,
17
(
10
), pp.
2900
2910
.10.1109/TITS.2016.2535439
9.
Ozatay
,
E.
,
Onori
,
S.
,
Wollaeger
,
J.
,
Ozguner
,
U.
,
Rizzoni
,
G.
,
Filev
,
D.
,
Michelini
,
J.
, and
Di Cairano
,
S.
,
2014
, “
Cloud-Based Velocity Profile Optimization for Everyday Driving: A Dynamic-Programming-Based Solution
,”
IEEE Trans. Intell. Transp. Syst.
,
15
(
6
), pp.
2491
2505
.10.1109/TITS.2014.2319812
10.
Han
,
J.
,
Vahidi
,
A.
, and
Sciarretta
,
A.
,
2019
, “
Fundamentals of Energy Efficient Driving for Combustion Engine and Electric Vehicles: An Optimal Control Perspective
,”
Automatica
,
103
, pp.
558
572
.10.1016/j.automatica.2019.02.031
11.
Mensing
,
F.
,
Trigui
,
R.
, and
Bideaux
,
E.
,
2012
, “
Vehicle Trajectory Optimization for Hybrid Vehicles Taking Into Account Battery State-of-Charge
,”
2012 IEEE Vehicle Power and Propulsion Conference
, Seoul, Korea,
IEEE
, pp.
950
955
.
12.
Guo
,
L.
,
Gao
,
B.
,
Gao
,
Y.
, and
Chen
,
H.
,
2017
, “
Optimal Energy Management for HEVs in Eco-Driving Applications Using Bi-Level MPC
,”
IEEE Trans. Intell. Transp. Syst.
,
18
(
8
), pp.
2153
2162
.10.1109/TITS.2016.2634019
13.
Amini
,
M. R.
,
Gong
,
X.
,
Feng
,
Y.
,
Wang
,
H.
,
Kolmanovsky
,
I.
, and
Sun
,
J.
,
2019
, “
Sequential Optimization of Speed, Thermal Load, and Power Split in Connected HEVs
,” 2019 American Control Conference (
ACC
), Philadelphia, PA,
July 10–12
, pp.
4614
4620
.10.23919/ACC.2019.8815158
14.
Hao
,
P.
,
Wu
,
G.
,
Boriboonsomsin
,
K.
, and
Barth
,
M. J.
,
2019
, “
Eco-Approach and Departure (EAD) Application for Actuated Signals in Real-World Traffic
,”
IEEE Trans. Intell. Transp. Syst.
,
20
(
1
), pp.
30
40
.10.1109/TITS.2018.2794509
15.
Ye
,
F.
,
Hao
,
P.
,
Qi
,
X.
,
Wu
,
G.
,
Boriboonsomsin
,
K.
, and
Barth
,
M. J.
,
2019
, “
Prediction-Based Eco-Approach and Departure at Signalized Intersections With Speed Forecasting on Preceding Vehicles
,”
IEEE Trans. Intell. Transp. Syst.
,
20
(
4
), pp.
1378
1389
.10.1109/TITS.2018.2856809
16.
Altan
,
O. D.
,
Wu
,
G.
,
Barth
,
M. J.
,
Boriboonsomsin
,
K.
, and
Stark
,
J. A.
,
2017
, “
Glidepath: Eco-Friendly Automated Approach and Departure at Signalized Intersections
,”
IEEE Trans. Intell. Veh.
,
2
(
4
), pp.
266
277
.10.1109/TIV.2017.2767289
17.
Sun
,
C.
,
Guanetti
,
J.
,
Borrelli
,
F.
, and
Moura
,
S. J.
,
2018
, “
Optimal Eco-Driving Control of Connected and Autonomous Vehicles Through Signalized Intersections
,” IEEE Internet of Things J., 7(5), pp. 3759–3773
.
18.
Qin
,
S. J.
, and
Badgwell
,
T. A.
,
2003
, “
A Survey of Industrial Model Predictive Control Technology
,”
Control Eng. Pract.
,
11
(
7
), pp.
733
764
.10.1016/S0967-0661(02)00186-7
19.
Borrelli
,
F.
,
Bemporad
,
A.
, and
Morari
,
M.
,
2017
,
Predictive Control for Linear and Hybrid Systems
,
Cambridge University Press
, Cambridge, UK.
20.
Bellman
,
R.
,
1966
, “
Dynamic Programming
,”
Science
,
153
(
3731
), pp.
34
37
.10.1126/science.153.3731.34
21.
Lee
,
J. H.
, and
Wong
,
W.
,
2010
, “
Approximate Dynamic Programming Approach for Process Control
,”
J. Process Control
,
20
(
9
), pp.
1038
1048
.10.1016/j.jprocont.2010.06.007
22.
Yin
,
J.
,
Tang
,
T.
,
Yang
,
L.
,
Gao
,
Z.
, and
Ran
,
B.
,
2016
, “
Energy-Efficient Metro Train Rescheduling With Uncertain Time-Variant Passenger Demands: An Approximate Dynamic Programming Approach
,”
Transp. Res. Part B: Methodol.
,
91
, pp.
178
210
.10.1016/j.trb.2016.05.009
23.
Olin
,
P.
,
Aggoune
,
K.
,
Tang
,
L.
,
Confer
,
K.
,
Kirwan
,
J.
,
Deshpande
,
S. R.
,
Gupta
,
S.
,
Tulpule
,
P.
,
Canova
,
M.
, and
Rizzoni
,
G.
,
2019
, “
Reducing Fuel Consumption by Using Information From Connected and Automated Vehicle Modules to Optimize Propulsion System Control
,”
SAE
Paper No. 2019-01-1213.10.4271/2019-01-1213
24.
Wilcutts
,
M.
,
Switkes
,
J.
,
Shost
,
M.
, and
Tripathi
,
A.
,
2013
, “
Design and Benefits of Dynamic Skip Fire Strategies for Cylinder Deactivated Engines
,”
SAE Int. J. Engines
,
6
(
1
), pp.
278
288
.10.4271/2013-01-0359
25.
Wilcutts
,
M.
,
Nagashima
,
M.
,
Eisazadeh-Far
,
K.
,
Younkins
,
M.
, and
Confer
,
K.
,
2018
, “
Electrified Dynamic Skip Fire (EDSF): Design and Benefits
,”
SAE
Paper No. 2018-01-0864.10.4271/2018-01-0864
26.
Zhu
,
Z.
,
Gupta
,
S.
,
Pivaro
,
N.
,
Deshpande
,
S. R.
, and
Canova
,
M.
,
2021
, “
A GPU Implementation of a Look-Ahead Optimal Controller for Eco-Driving Based on Dynamic Programming
,” 2021 European Control Conference (ECC), Delft, The Netherlands.
27.
Bertsekas
,
D. P.
,
1995
,
Dynamic Programming and Optimal Control
, Vol.
1
,
Athena Scientific
,
Belmont, MA
.
28.
Johannesson
,
L.
, and
Egardt
,
B.
,
2008
, “
Approximate Dynamic Programming Applied to Parallel Hybrid Powertrains
,”
IFAC Proc. Vol.
,
41
(
2
), pp.
3374
3379
.10.3182/20080706-5-KR-1001.00573
29.
Hellström
,
E.
,
Åslund
,
J.
, and
Nielsen
,
L.
,
2010
, “
Design of an Efficient Algorithm for Fuel-Optimal Look-Ahead Control
,”
Control Eng. Pract.
,
18
(
11
), pp.
1318
1327
.10.1016/j.conengprac.2009.12.008
30.
Bae
,
S.
,
Choi
,
Y.
,
Kim
,
Y.
,
Guanetti
,
J.
,
Borrelli
,
F.
, and
Moura
,
S.
,
2019
, “
Real-Time Ecological Velocity Planning for Plug-In Hybrid Vehicles With Partial Communication to Traffic Lights
,” 2019 IEEE 58th Conference on Decision and Control (
CDC
), Nice, France,
Dec. 11–13
, pp.
1279
1285
.10.1109/CDC40024.2019.9030166
31.
Bertsekas
,
D. P.
, and
Tsitsiklis
,
J. N.
,
1996
,
Neuro-Dynamic Programming
,
Athena Scientific
, Belmont, MA.
32.
Melo
,
F. S.
,
Meyn
,
S. P.
, and
Ribeiro
,
M. I.
,
2008
, “
An Analysis of Reinforcement Learning With Function Approximation
,”
Proceedings of the 25th International Conference on Machine Learning
, Helsinki, Finland, pp.
664
671
.
33.
Bertsekas
,
D.
,
2005
, “
Rollout Algorithms for Constrained Dynamic Programming
,” Lab. for Information and Decision Systems, Report No. 2646.
34.
Sun
,
C.
,
Guanetti
,
J.
,
Borrelli
,
F.
, and
Moura
,
S. J.
,
2020
, “
Optimal Eco-Driving Control of Connected and Autonomous Vehicles Through Signalized Intersections
,”
IEEE Internet Things J.
,
7
(
5
), pp.
3759
3773
.10.1109/JIOT.2020.2968120
35.
Larson
,
R. E.
,
Casti
,
J. L.
, and
Casti
,
J. L.
,
1978
,
Principles of Dynamic Programming
, Vol.
7
,
M. Dekker
,
New York
.
36.
Gupta
,
S.
,
Deshpande
,
S. R.
,
Tulpule
,
P.
,
Canova
,
M.
, and
Rizzoni
,
G.
,
2019
, “
An Enhanced Driver Model for Evaluating Fuel Economy on Real-World Routes
,”
IFAC-PapersOnLine
,
52
(
5
), pp.
574
579
.10.1016/j.ifacol.2019.09.091
37.
Coovert
,
D. A.
,
Heydinger
,
G. J.
,
Bixel
,
R. A.
,
Andreatta
,
D.
,
Guenther
,
D. A.
,
Sidhu
,
A. S.
, and
Mikesell
,
D. R.
,
2009
, “
Design and Operation of a Brake and Throttle Robot
,”
SAE Int. J. Passenger Cars - Mech. Syst.
,
2
(
1
), pp.
613
621
.10.4271/2009-01-0429
38.
Rajakumar Deshpande
,
S.
,
Gupta
,
S.
,
Kibalama
,
D.
,
Pivaro
,
N.
, and
Canova
,
M.
,
2020
, “
Benchmarking Fuel Economy of Connected and Automated Vehicles in Real World Driving Conditions Via Monte Carlo Simulation
,”
ASME
Paper No. DSCC2020-3250.10.1115/DSCC2020-3250
39.
Behrisch
,
M.
,
Bieker
,
L.
,
Erdmann
,
J.
, and
Krajzewicz
,
D.
,
2011
, “
SUMO—Simulation of Urban Mobility: An Overview
,”
Proceedings of SIMUL 2011, The Third International Conference on Advances in System Simulation
, Barcelona, Spain, Oct.
23
29
.
40.
Deshpande
,
S. R.
,
Gupta
,
S.
,
Kibalama
,
D.
,
Pivaro
,
N.
,
Canova
,
M.
,
Rizzoni
,
G.
,
Aggoune
,
K.
,
Olin
,
P.
, and
Kirwan
,
J.
,
2021
, “
In-Vehicle Test Results for Advanced Propulsion and Vehicle System Controls Using Connected and Automated Vehicle Information
,”
SAE
Paper No. 2021-01-0430.10.4271/2021-01-0430