To improve the energy efficiency (EE) in underlay cognitive radio (CR) networks, a power allocation strategy based on an actor-critic reinforcement
6G IoT networks aim to provide significantly higher data rates and extremely low latency. However, due to the increasingly scarce spectrum bands and the ever-growing number of IoT devices (IoDs)