Reinforcement learning for robot manipulation tasks in human-robot collaboration using the CQL/SAC algorithms

Contibutor(s)

Husaković, A. (author)
Banjanović-Mehmedović, Lejla (author)
Gurdić-Ribić, A. (author)
Prljača, Naser (author)
Karabegović, Isak (author)

Language

english

Source

Advances in production engineering and management

Year

2025

Numbering

volume 20, issue 1

Content

Conservative Q-learning deep reinforcement learning human-robot collaboration robot learning robot manipulation tasks soft actor-critic algorithm

Publisher

Fakulteta za strojništvo, Inštitut za proizvodno strojništvo

Provider

Univerza v Mariboru, Fakulteta za strojništvo
(Obvezni izvod spletne publikacije)

description

The integration of human-robot collaboration (HRC) into industrial and service environments demands efficient and adaptive robotic systems capable of executing diverse tasks, including pick-and-place operations. This paper investigates the application of Soft Actor-Critic (SAC) and Conservative Q-Learning (CQL)—two deep reinforcement learning (DRL) algorithms—for the learning and optimization of pick-and-place actions within HRC scenarios. By leveraging SAC’s capability to balance exploration and exploitation, the robot autonomously learns to perform pick-and-place tasks while adapting to dynamic environments and human interactions. Moreover, the integration of CQL ensures more stable learning by mitigating Q-value overestimation, which proves particularly advantageous in offline and suboptimal data scenarios. The combined use of CQL and SAC enhances policy robustness, facilitating safer and more efficient decision-making in continually evolving environments. The proposed framework combines simulation-based training with transfer learning techniques, enabling seamless deployment in real-world environments. The critical challenge of trajectory completion is addressed through a meticulously designed reward function that promotes efficiency, precision, and safety. Experimental validation demonstrates a 100 % success rate in simulation and an 80 % success rate on real hardware, confirming the practical viability of the proposed model. This work underscores the pivotal role of DRL in enhancing the functionality of collaborative robotic systems, illustrating its applicability across a range of industrial environments.

Rights

URN

URN:NBN:SI:DOC-V6OH4GKA

COBISSID

264960259

DOI

10.14743/apem2025.1.523

Added

20.01.2026

Metadata

Citation

APA:

Husaković, A., Banjanović-Mehmedović, Lejla, Gurdić-Ribić, A., Prljača, Naser, Karabegović, Isak (2025). Reinforcement learning for robot manipulation tasks in human-robot collaboration using the CQL/SAC algorithms. Advances in production engineering and management, volume 20, issue 1, str. 5-17. URN:NBN:SI:DOC-V6OH4GKA from http://www.dlib.si

MLA:

Husaković, A., Banjanović-Mehmedović, Lejla, Gurdić-Ribić, A., Prljača, Naser, Karabegović, Isak. "Reinforcement learning for robot manipulation tasks in human-robot collaboration using the CQL/SAC algorithms." Advances in production engineering and management volume 20. issue 1 (2025) str. 5-17.
<http://www.dlib.si/?URN=URN:NBN:SI:DOC-V6OH4GKA>

Advances in production engineering & management
2006-
(Assembly record)