In addition, the domain randomization (DR) training strategy is employed to learn a universal control policy, which can access the approximate optimal trajectory under nonideal conditions. In this way ...
Atmospheric Chemistry Department (ACD), Leibniz Institute for Tropospheric Research (TROPOS), Permoserstraße 15, 04318 Leipzig, Germany ...