Regularization of Soft Actor-Critic Algorithms with Automatic Temperature Adjustment
This work provides incremental improvements to SAC for reinforcement learning practitioners.
The authors tackled the regularization of Soft Actor-Critic algorithms with automatic temperature adjustment by reformulating policy evaluation, improvement, and temperature adjustment to enhance theoretical clarity.
This work presents a comprehensive analysis to regularize the Soft Actor-Critic (SAC) algorithm with automatic temperature adjustment. The the policy evaluation, the policy improvement and the temperature adjustment are reformulated, addressing certain modification and enhancing the clarity of the original theory in a more explicit manner.