Multimodal reinforcement learning for partner specific adaptation in robot-multi-robot interaction

Kırtay, M.Hafner, V. V.Asada, M.Kuhlen, A. K.Öztop, Erhan2023-08-042023-08-042022979-835030979-92164-0572http://hdl.handle.net/10679/8569https://doi.org/10.1109/Humanoids53995.2022.10000205Successful and efficient teamwork requires knowledge of the individual team members' expertise. Such knowledge is typically acquired in social interaction and forms the basis for socially intelligent, partner-Adapted behavior. This study aims to implement this ability in teams of multiple humanoid robots. To this end, a humanoid robot, Nao, interacted with three Pepper robots to perform a sequential audio-visual pattern recall task that required integrating multimodal information. Nao outsourced its decisions (i.e., action selections) to its robot partners to perform the task efficiently in terms of neural computational cost by applying reinforcement learning. During the interaction, Nao learned its partners' specific expertise, which allowed Nao to turn for guidance to the partner who has the expertise corresponding to the current task state. The cognitive processing of Nao included a multimodal auto-Associative memory that allowed the determination of the cost of perceptual processing (i.e., cognitive load) when processing audio-visual stimuli. In turn, the processing cost is converted into a reward signal by an internal reward generation module. In this setting, the learner robot Nao aims to minimize cognitive load by turning to the partner whose expertise corresponds to a given task state. Overall, the results indicate that the learner robot discovers the expertise of partners and exploits this information to execute its task with low neural computational cost or cognitive load.engrestrictedAccessMultimodal reinforcement learning for partner specific adaptation in robot-multi-robot interactionconferenceObject84385000092589430011210.1109/Humanoids53995.2022.100002052-s2.0-85146360474