【摘 要】
:
In order to improve the agility and applicability of trajectory planning algorithm for autonomous vehicles,this paper proposes a novel actor-critic based learning method for decision-making and planning in multi-vehicle complex traffic.It is the coupling
【机 构】
:
Department of Vehicle Engineering,Nanjing University of Aeronautics and Astronautics,Nanjing 210016,
论文部分内容阅读
In order to improve the agility and applicability of trajectory planning algorithm for autonomous vehicles,this paper proposes a novel actor-critic based learning method for decision-making and planning in multi-vehicle complex traffic.It is the coupling planning of vehicle\'s path and speed thus to make the trajectory more flexible.First,generations from the decided action to the planned trajectory are described by the end-point of the trajectory.Then,the actor-critic based learning method is built to learn an optimal policy for the decision process.It can update the policy by the gradient of the current policy\'s advantage.In this process,features of the real traffic are carefully extracted by time headway (TH) and speed distribution.Reward function is built by the safety,efficiency and driving comfort.Furthermore,to make the policy network have better convergency,the policy network is modularized in two parts:the lane-changing network and the lane-keeping network,which decide the optimal end-point of the path and speed candidates respectively.Finally,the curved overtaking scenario and the interaction process with human driver are conducted to illustrate the feasibility and superiority.The results show that the proposed method has better real-time performance and can make the planned coupling trajectory more continuous and smoother than the existing rule-based method.
其他文献
Metal foam material,which serves as an alternative replacement of the conventional flow distributor of proton exchange membrane (PEM) fuel cell,has been attracting much attention over last few decades.In this work,three-dimensional modeling work for PEM f
Bimodal carbon nanotube reinforced 7055Al (CNT/7055A1) composites containing coarse grain bands and ultra-fine grain zones were fabricated by high energy ball milling,vacuum hot pressing followed by hot extrusion.The effect of extrusion temperature varied
Developing high sensitive organic semiconductors (OSCs) in organic thin-film transistors (OTFTs) is the key for OTFT based gas sensors.Herein,we report a simple processing route of highly sensitive OSCs for high performance OTFT based nitrogen dioxide (NO
The perfectly matched layer (PML) boundary condition has been proven to be effective for attenuating reflections from model boundaries during wavefield simulation.As such,it has been widely used in time-domain finite-difference wavefield simulations.The c
Electromagnetic induction effect caused by neuron potential can be mimicked using memristor.This paper considers a flux-controlled memristor to imitate the electromagnetic induction effect of adapting feedback synapse and presents a memristive neuron mode
For the detection of shallow underwater targets by the hyperspectrum,the information of the target is carried in the water-leaving radiation.However,the differences between the water-leaving radiation of the underwater target and the ocean background radi
Landslides are recurrent geological phenomena on Earth that cause heavy casualties and property losses annually.In this study,we use the Vp-k stacking and nonlinear waveform inversion methods of high-frequency receiver functions extracted from local earth
Predicting the failure time of unstable slopes is one of the most pivotal issues.In this paper,the inverse square root acceleration(INSRA) method was proposed to estimate the time-of-failure (TOF) of landslides.Four collapsed slopes were presented in the
High-quality graphene is prepared by arc discharge with low cost under hydrogen atmosphere.However,the growth mechanism of graphene synthesis by arc discharge remains unclear.In this paper,the hydrogen-induced marginal growth (HIMG) model is deduced to st
Carbonate reservoirs have complex pore structures,which not only significantly affect the elastic properties and seismic responses of the reservoirs but also affect the accuracy of the prediction of the physical parameters.The existing rock-physics invers