Reinforcement learning for robot manipulation tasks in human-robot collaboration using the CQL/SAC algorithms