Latin American applied research
versión ISSN 0327-0793
Lat. Am. appl. res. v.36 n.2 Bahía Blanca abr./jun. 2006
Supervisory control of an HEV using an inventory control approach
L. V. Pérez1, G. R. Bossio1, D. Moitre2 and G. O. García1
Abstract Hybrid electric vehicles (HEV) are those equipped with two or more energy sources, usually, a fuel tank with its associated internal combustion engine (ICE) and an electrical storage system (ESS), typically a bank of batteries. In order to efficiently operate the system it is necessary to determine the instantaneous power split between the two sources when the vehicle performs a predetermined duty cycle. In this work, this problem is posed as an optimal control problem with constraints, specifically, as an inventory control problem and solved using dynamic programming (DP). Results obtained for the HEV being developed in the Applied Electronics Group, School of Engineering, National University of Río Cuarto are shown.
Keywords Optimal Control with Constraints. Dynamic Programming. Supervisory Control of Hybrid Electric Vehicles.
HEVs are those whose architecture includes two or more energy sources, usually a fuel tank and a bank of batteries. These energy sources are associated to energy converters such as an ICE and an electric motor respectively. These vehicles take advantage of the cleanliness and high efficiency of electrical traction and overcome its main drawback that is its low range. ESSs currently available have a low energy density. This is compensated in HEVs by the high energy density of fossil fuels, usually two or three orders higher than that of ESSs.
Energy storage elements and converters can be arranged following different topologies. Figure 1 shows a scheme of the so-called "series" configuration. In this configuration, an electric motor moves the wheels. An ESS that may consist of a bank of batteries and/or ultra capacitors feeds this motor. On the other hand, the ICE is fed by the fuel tank and drives an electric generator. This generator provides electric power to the traction motor when the power demanded by the driver exceeds that provided by the ESS. On the contrary, when the power provided by this generator exceeds that demanded by the driver, this excess is used to recharge the ESS.
Figure 1. Scheme of a series HEV.
Hybrid electric as well as purely electric powertrains have the advantage of "regenerative braking". This involves using the electric motors as generators during braking, transforming the mechanical energy into electrical energy. In this way the kinetic energy stored by the vehicle is recaptured by the ESS. The double arrows that connect the wheels to the ESS (see Fig. 1), represent this reverse energy flow.
For the same performance target, the ICE and ESS of a HEV can be of smaller size than those of a conventional or a pure electric vehicle. However, the whole system performance will also depend on how they interact. At first sight, it seems that the ESS should mainly perform velocity changes, taking advantage of the reversibility of the electrical path, whereas the ICE should supply the rest of the power. The nominal power of the latter should be such that it could be used most of the time near its optimal operation point. In this way, consumption as well as gas and sound emissions would be reduced.
HEVs need an electronic power manager that must determine at each instant the amount and direction of the flow in each path. This higher-level control is usually known as "supervisory control". Power electronics devices control each particular power converter according to the commands from the supervisory control.
The coordination between sources and physical and operational limitations of the many devices involved force trade-offs. Hence for an efficient operation of a hybrid powertrain it is necessary to optimize the supervisory control strategy. An HEV designed for city use is being developed by our research group. It is in city use where the advantages of HEVs are most noticeable, because of the frequent acceleration and deceleration. The purpose of this work is to contribute to the definition of an optimal strategy for the supervisory control of this vehicle.
This problem, as an optimal control problem, may be posed in different forms depending on the objective desired, the model considered, the control action and the constraints imposed. However, there are some common features to all approaches. Concerning the control objective, it is natural to consider minimizing fuel consumption while not degrading the vehicle dynamical response. Concerning the dynamical models, they unavoidably include combinations of linear and nonlinear, discrete and continuous, algebraic and dynamical systems. Moreover, they are subject to constraints not only on the control variables but also in the state variables as it is explained in Section II. Hence, most of the reported optimal supervisory control strategies use either intelligent control techniques such as rule-based, fuzzy logic and neural networks, or optimal control approaches. Within the latter, Delprat et al. (2001; 2004), Steinmauer and del Re (2001) and Daniels and Kumar (1999) use Pontryagin Maximum Principle optimality conditions. Zaremba et al. (2002), Brahma et al. (2000a;b) and Sciaretta et al., (2004) use a discrete approach and a dynamic programming algorithm.
In previous work (Pérez et al., 2004) we followed Brahma et al. (2000,a; b) and posed the problem as a shortest path problem. The drawback to this approach is the treatment of a constraint of integral form. This constraint arises from the need to preserve the batteries from depletion or overcharge, and/or if a "charge sustaining" operation of the vehicle is imposed. Brahma et al. (2000a;b) proposes a method using a penalization term added to the objective function. We applied this approach and proposed an alternative method based on checking this constraint as the algorithm proceeds. Both methods include heuristics and consequently give sub-optimal results.
In this paper, we have tried to overcome the use of heuristics by posing the problem as a deterministic inventory control problem. In this way we can find the optimum, up to the discretization step considered (Bertsekas, 2001). Section III refers to the algorithm used to find the solution.
This approach was applied to find the optimal power split between the bank of batteries and a hypothetical ICE and generator to be used in the HEV being developed. A brief example of some of the results obtained is included in Section IV.
We consider this work, one of the first steps towards the definition of a supervisory control strategy for our prototype. Finally, we include some conclusions and a prospective of our future work in Section V.
II. STATEMENT OF THE PROBLEM
A. Abstracted model for the vehicle
In order to solve the supervisory control problem, it will be enough considering an abstracted scheme for the system that represents the vehicle (Brahma et al., 2000a, 2000b; Rizzoni et al., 1999) where the intermediate devices of the powertrain such as rectifiers and converters, are replaced by the net power flow in each path. PFT(t) will indicate the power flow at time t in the fuel tank/engine/generator path (which we shall call FTpath henceforth) and PESS(t) the power flow at time t in the ESS-path (Fig. 2). The energy losses that take place in the intermediate converters will be represented by an efficiency factor.
Figure 2. Abstracted scheme for an HEV.
The following convention is also established: a positive power flow means power flowing away from the ESS. Consequently, during regenerative braking a negative flow will take place in the electrical path. Besides, the power flow from the fuel tank cannot be negative, as it cannot absorb any power.
The required vehicle velocity profile is considered a given function. The required power can be computed from this profile using a model of the vehicle longitudinal dynamics. Hence, it is also considered a known function that will be denoted by Preq(t). This is indeed not true for the real case, where the future velocity is not known but depends on the transit and road conditions, but we hope this approach will be able to be extended to real driving conditions by using a stochastic approach and/or short-term horizons.
B. Power balance
A balance equation can naturally be established, since the sum of the power from both sources has to be equal to the required power at all times:
PFT(t) + PESS(t) = Preq(t). (1)
In the series configuration this addition takes place in the form of electrical power.
C. Energy consumption and efficiencies
Regarding the net energy consumed from each source in a time interval, we must take into account that not all the power delivered by the source can be actually used to supply the demand, since in every energy conversion process there are losses. Therefore, only a portion of the power delivered will reach the summing junction in Fig. 2. In our abstracted model, this portion will be represented by two functions ηFT and ηESS that take values between 0 and 1, and will be referred to as the efficiencies associated to each path. Indeed, efficiencies depend on the power flows. We consider them known functions that in practice are experimentally determined for all possible power values. Hence, the net energy contained in the FT at time t can be computed as follows:
where EFTo is the initial energy in the fuel tank.
To compute the net energy consumed from the ESS it has to be considered that during acceleration (PESS >0) the effect of losses is represented by dividing the delivered power by ηESS as in the previous case. But during regenerative braking, because of losses, only a portion of the kinetic energy coming from the wheels will reach the ESS, and so the inverse situation has to be represented. Let us define
Then, the net energy contained in the ESS at time t is
where EESSo is the initial energy in the ESS.
D. Control objective
The control objective is to minimize fuel consumption in a time interval [0,T], T known. That is, to minimize the net energy consumed from the chemical source in the interval, i.e.:
E. Control action, state variable and state equation
Our purpose is to determine for each t in [0,T] the values of PESS and PFT that minimize the objective. Using the balance equation (1), we can eliminate one of these functions. Hence, the problem can be posed in two alternative forms. Either PFT or PESS can be taken as the control action or the independent variable, over which the minimization will be performed. The remaining one is considered the dependent variable and is obtained from the first. Physically, this implies that the supervisory control will be exerted either by the engine acceleration command or by the ESS power controller. The alternative device should provide the necessary remaining power to satisfy the demanded power Preq. Since both problems are similar only the case where PFT(t) is the control action will be described.
From (1) and (4) we arrive at
This will be considered the state equation and EESS the state variable, with initial condition EESSo. Expression (6) is indeed a simple integral rather than a state equation in the usual sense, since the right hand side does not depend on the state variable but only on the control input. However, it is convenient to consider it a state equation in order to use DP algorithms for optimal control in a straightforward manner.
Note that considering that/is defined piecewise and that ηFT and ηESS are non-linear functions, the state equation also results non linear.
Although it is not necessary to impose a final condition to the state, in order to simplify the presentation, we will set EESS(T) = EESSo. This represents a "charge sustaining operation" of the ESS, which may be a desired feature.
Clearly, the power flows are physically limited, hence:
In addition, if the ESS is a bank of batteries, it has to be protected from depletion and overcharge. This implies that the net consumed energy from the ESS has to be maintained between proper limits at each instant t. Then,
In summary, there are constraints on the control action PFT(t) and on the state variable EESS(t).
III. DYNAMIC PROGRAMMING SOLUTION
Because of the non-linearities of the dynamic system and the many constraints involved, we choose a discrete approach and a dynamic programming solution (Chiang, 1992). From this point of view the problem is similar to an inventory control problem (Bertsekas, 2001), where for the deterministic case, the demand is known, the cost function only considers the purchase inventory and there is the possibility of returning goods to stock.
A. Discrete formulation
Let us divide the interval [0, T] in N stages of length Δ.t. Let also PFTk = PFT(kΔt), k = 0, 1, ... N-l be the discrete control sequence that is being searched, i.e., the sequence of decisions on the power flow in the FT- path at each stage. Let EESSk represent the possible states of the system that satisfy the discrete state equation
The sequence Preqk, is known and in the usual form, Preqk=Preq(kΔt). Finally, let Vd be the discrete cost functional
Then the discrete problem statement is as follows:
B. Network and arc costs
In order to solve the problem, let us consider a network like the one shown in Fig. 3. The horizontal axis corresponds to time stages tk, k = 0, 1, ..., N, and the vertical axis corresponds to the possible discrete values for the state variable EESSi, i = 0,1, ..., M, equally spaced between EESSmin and EESSmax. Then, each node corresponds to a possible state EESSik at time k. Each node EESSik, i = 0, 1, ..., M, at stage k is connected to each node EESSjk+1, j = 0, l, ..., M at the following stage k+1. In the algorithm used, all possible connections between nodes of successive stages have been considered, except for the initial and final stages, since they are fixed. Hence, a discrete state trajectory is formed by a sequence of nodes, one for each stage k, such that connects the initial and final states. In the network there are, in principle, MN-1 possible trajectories. Because of the way in which the network is built, all of them satisfy (15).
Figure 3. Network considered in the supervisory control problem, exemplified for the case N=6, M=4.
Now, state trajectories are completely determined by PFTk. according to the state equation (10). Considering that PFTk is subject to the constraints (13) y (14), not all the trajectories of the network will be feasible. Let then Sk be the set of nodes of stage k, such that at least one feasible trajectory passes trough it.
As usually in DP, we call "arc" the segment that connects a state EESSik of stage k to another state EESSjk+1 of the following stage. Each arc has an associated "arc cost" that will be denoted by aijk and is the contribution to the total cost that is produced if the state changes from node EESSik to node EESSjk+1. For this problem, the arc cost is the energy consumed from the fuel tank when the energy contained in the ESS changes from EESSik to EESSjk+1 and is computed as follows:
where PFTkij is the control input needed to make the state go bom EESS ik to EESS jk+1, according to Eq. (10).
To solve this problem using DP we also need a function that maps states to controls. This function is usually named "policy". In this approach, this function is obtained from (10) and can be formally expressed as:
This will allow computing, for each arc between two possible successive nodes EESS ik and EESS jk+1, the value for PFTkij that drives from one to the other. From this value the corresponding arc cost is computed using (16).
It is worth noting that the formal expression (17) is a result of the discretization considered in (10). It is also the key point that allows to solve the problem by an algorithm which, even though it is of the same order of complexity than the one presented in the work of Brahma et al., (2000a;b), it does ensure that the constraint (9) is satisfied, without using any heuristics.
The expression (17) makes sense because of the form that the non-linear function f takes in the range of interest. Figure 4 shows the efficiency function ηESS(PESS) used in this work. It was obtained by fitting two polynomials to experimental data. The corresponding graph for f is shown in Fig. 5. It can be seen that because of the form of ηESS, f results strictly increasing and hence, setting f-1(0) = 0, it is possible to recover PFT for all pair of states in the network.
Figure 4. ESS efficiency as a function of electrical power
Figure 5. Net power entering or leaving the ESS as a function of the net power used for traction.
D. DP Algorithm
Regarding dynamic programming algorithms, we refer to Bertsekas (2001) or Kwong and Rogers (2004), and only will say here that they consist essentially in looking over every node in the network and computing the minimum cost to go from each particular node to the final node (tail subproblem). The complexity is reduced using a recursive algorithm that usually goes backwards in time and computes all the tail subproblems of each stage using the solutions found for the tail subproblems of the following stage. The minimum cost trajectory is found at the last recursive step, after looking over all nodes in the network.
In the particular implementation made in this work, arc costs are computed as the recursion proceeds. For each node ik, the value PFTkij capable of driving the state EESSik to the state EESSjk+1 is computed using (17) for all 0≤j≤M. Then, PFTkij is checked to see if it satisfies constraints (13) and (14) and, if so, aijk is computed using PFTkij in Eq.(16). Otherwise, the corresponding arc is discarded from the network. Let Sk+1 be the set of the surviving j's. This is the set over which the minimum indicated in expression (18) below is taken. We include the algorithm excluding, for brevity, the special treatment of the initial and final stages and writing E instead of EESS for short.
IV. SIMULATION RESULTS
The above method has been applied to the case of the experimental HEV being developed in our group. This prototype is currently powered in a purely electric form by a bank of batteries and is equipped by a 32 kW electric motor for traction. In previous work, the longitudinal dynamics of this prototype has been modeled and validated through road tests (Pérez et al. 2002). This model allows the computation of the power demanded by the vehicle to follow a velocity cycle, including mechanical losses and losses from the electrical motor. The model has been linked to the DP algorithm to provide the values for Preq.
A hypothetical fuel converter system capable of delivering a maximum power of 40 kW has been considered. Its efficiency function, ηFT, was taken from Brahma et al. (2000b). The discretization step for the energy was ΔEESS = 0.0028kWh and the time discretization was Δt = 2.5seg, since it was observed that the results did not change substantially for finer grids. In addition, the solution is currently being checked by using an alternative approach based on "direct transcription" where the state variable need not be quantized.
The electric storage system is a bank of 20 Yuasa-Exide EV-5 batteries (in series), with nominal charge equal to 197 Amph each. The minimum charge needed for this system to work properly is 20% of the nominal charge. The voltage can be, in a first approximation considered constant, equal to its nominal value (Unom = 120V). Then, EESSmin = 1.31kWh and EESSmax = 6.57kWh. However, this bank is excessively large for an HEV, since it has been sized for a pure electric vehicle. If the ESS has enough energy to perform the cycle and compensate for the losses, the solution of minimum consumption will be the trivial solution, PFT(t) = 0. That is, no usage of the FT-path and full usage of the ESS.
Hence, to show illustrative results, we used for the simulation in Fig. 6, a hypothetical smaller ESS, with EESSmin = l.25kWh and EESSmax = 2.5kWh, and an initial state EESSo = 2.0kWh.
Figure 6. Velocity profile and power split obtained for the European Normalized Driving Cycle
Figure 6-(a) shows the velocity profile corresponding to the European Normalized Driving Cycle that has to be limited to a maximum velocity of 60 km/h to fit the design constraints of this vehicle. Figure 6-(b) and (c) show in dashed line the power demanded by this vehicle to perform the velocity cycle in (a). The solid line shows in (b), the algorithm output, i.e., the optimal PFT profile, and in (c) the complementary PESS profile. The graph in (d) is the corresponding ESS energy profile. It can be seen that it moves within the required bounds and that performs a charge sustaining cycle.
Regarding consumption, it is difficult to establish a comparison with other control strategies, since strategies that are defined by rules, often fail in following the required velocity profile and, therefore, the comparison is not relevant. Nevertheless we include in Table I consumptions obtained using some other strategies, just as reference values. The algorithm we proposed in Perez (2004) fails to render a result for an initial ESS energy initial value so low as the used in this case, so it is not included. The only strategy that is comparable with our results is that corresponding to the method proposed by Brahma et al. (2000,a). It consists of adding a term to the objective functional to penalize the use of energy from the ESS. This term includes a parameter that regulates this penalization. This parameter has to be determined by trial and error in such a way that the operation over the whole time interval resulted "charge sustaining". For the data of the example in Fig. 4, a value for this parameter was searched so that the final energy in the ESS resulted as close as possible to EESSo. Then, we run our algorithm with that final condition and so we obtained comparable results. Note that although the ESS energy consumption is almost equal, the FT consumption is lower for our algorithm (see Table I). Finally, note that the optimal consumption shows a 27% reduction respect to the one that would have been obtained if only the FT-path had been used (as in the case of a conventional vehicle).
Table I. Consumption for different supervisory control approaches
* This strategy does not follow the velocity profile
** This strategy (Emadi et al., 2004) violates constraints (13) and (14).
Concerning the time consumed by this algorithm, it can be seen that its order of complexity is O(M2N). This is the same order than that of Brahma's algorithm. In this case, the execution time will depend mainly on the state discretization, while in the former depended on the discretization of the control function. In either case the choice is arbitrary. If the same power resolution than that of Brahma's algorithm is to be obtained, then M may result larger. However, the power resolution also depends on the choice of N. Hence, a trade-off may then be used to regulate the computation time. In the example of the figure, M = 450, N = 480 and so the whole network has 216000 nodes. This implies power steps greater than 4 kW. For these values, the "crude" main loop took about 30 min to run, using MATLAB on a standard PC with a 2.08 GH Athlon Processor. Clearly, this time can be reduced by translating the code to C language, interrupting internal loops when it is clear that constraints will be thereafter violated and replacing repetitive computations by look-up tables.
V. CONCLUSIONS AND FUTURE WORK
We think that this problem statement and the solution proposed have been successful for the off-line optimization of the power split between the two sources, without the need of heuristics used in previous works. Its main drawback is its high computational cost. We are now working on improving the algorithm to reduce this cost. In any case, the algorithm outputs provide a template to learn from, which can be used to develop rule based control laws.
In the presented approach, it seems that including stochastic features and additional controls would be conceptually simple, though, again, computationally heavy. Nevertheless, we think that considering short-term horizons reduces the computation time and thus, we will be able to extend this algorithm to on-line applications and to the problem of including an additional ESS such as a bank of ultra capacitors. This is an interesting point for hybrid traction, since ultra capacitors in contrast to batteries, can deliver high bursts of power, though for short periods of time. Therefore, they are usually included to improve the vehicle dynamical response.
Finally, we think that this algorithm may apply to other systems including sources, storage elements and consumers, of interest to our group. Examples of these are power electronic devices, stand-alone wind or solar stations and power networks including hydro electrical and nuclear power stations.
1. Bertsekas, D., Dynamic programming and optimal control, 2nd.ed., I and II, Athena Scientific, (2001). [ Links ]
2. Brahma, A., Y. Guezennec and G. Rizzoni, "Optimal energy management in series hybrid vehicle", Proc. of the Amer. Control Conf., 60-64 (2000a). [ Links ]
3. Brahma, A., Y. Guezennec and G. Rizzoni, "Dynamic optimization of mechanical/electric power flow in parallel hybrid electric vehicles", Proc. of AVEC 2000, 5th. Intern. Symp. on Advanced Vehicle Control, Ann Arbor, Michigan (2000b). [ Links ]
4. Chiang, A., Elements of dynamic optimization, McGraw-Hill, NY, (1992). [ Links ]
5. Daniels, M. and P. Kumar,"Racing with the sun: the optimal use of the solar powered automobile", IEEE Control Systems Magazine, 19, 12-22 (1999). [ Links ]
6. Delprat, S., T.M. Guerra, G. Paganelli, J. Lauber and M. Delhom, "Control strategy optimization for an hybrid parallel powertrain", Proc. of the American Control Conference, 1315-1320, (2001). [ Links ]
7. Delprat, S., J. Lauber, T.M. Guerra and J. Rimaux, "Control of a Parallel Hybrid Powertrain: Optimal Control", IEEE Transactions on Vehicular Technology, 53, 872-881, (2004). [ Links ]
8. Emadi, A., M. Ehsani and J. Miller, Vehicular Electric Power Systems, M. Dekker Inc., NY, (2004). [ Links ]
9. Kwong, R. and E. Rogers, "ECE 1652-Stochastic Processes with Applications", available in http://www.control.utoronto.edu/people/profs/Kwong, Comp. Eng., Univ. of Toronto (2004). [ Links ]
10. Pérez, L., E. Oliva, C. Careglio, R. Leidhold, C. Falco and G. García, "Simulación de un Vehículo Eléctrico Experimental", Anales del XVIII Congr. Arg. de Control Autom., Bs. As., (2002). [ Links ]
11. Perez, L. V., G.R. Bossio, D. Moitre and G.O. García, "Optimization of Power Management in an Hybrid Electric Vehicle Using Dynamic Programming", accepted for publication in Applied Numerical Mathematics, (2004). [ Links ]
12. Rizzoni, G., L. Guzzella and B. Baumann, "Unified modeling of Hybrid Electric Vehicles", IEEE/ASME Transactions on Mechatronics, 4, 246-257 (1999). [ Links ]
13. Sciaretta, A., M. Back and L. Guzzella, "Optimal control of parallel hybrid electric vehicles", IEEE Transactions on Control System Technology, 12, 352-363 (2004). [ Links ]
14. Steinmauer, G. and L. del Re, "Optimal control of dual power sources", Proceedings IEEE International Conference on Control Applications, 422-427 (2001). [ Links ]
15. Zaremba, A.T., R.M. Stuntz and A. Borichev, "Control strategy development for HEV in Matlab/Simulink using constrained optimization", preprint sent by the authors, (2002). [ Links ]
Received: September 21, 2005.
Accepted for publication: March 3, 2006.
Recommended by Guest Editors C. De Angelo, J. Figueroa and J. Solsona.