Actor-critic reinforcement learning for bidding in bilateral negotiation

Arslan, Furkan; Aydoğan, Reyhan

Publication:
Actor-critic reinforcement learning for bidding in bilateral negotiation

dc.contributor.author	Arslan, Furkan
dc.contributor.author	Aydoğan, Reyhan
dc.contributor.department	Computer Science
dc.contributor.ozuauthor	AYDOĞAN, Reyhan
dc.contributor.ozugradstudent	Arslan, Furkan
dc.date.accessioned	2023-08-11T06:21:07Z
dc.date.available	2023-08-11T06:21:07Z
dc.date.issued	2022
dc.description.abstract	Designing an effective and intelligent bidding strategy is one of the most compelling research challenges in automated negotiation, where software agents negotiate with each other to find a mutual agreement when there is a conflict of interests. Instead of designing a hand-crafted decision-making module, this work proposes a novel bidding strategy adopting an actor-critic reinforcement learning approach, which learns what to offer in a bilateral negotiation. An entropy reinforcement learning framework called Soft Actor-Critic (SAC) is applied to the bidding problem, and a self-play approach is employed to train the model. Our model learns to produce the target utility of the coming offer based on previous offer exchanges and remaining time. Furthermore, an imitation learning approach called behavior cloning is adopted to speed up the learning process. Also, a novel reward function is introduced that does take not only the agent’s own utility but also the opponent’s utility at the end of the negotiation. The developed agent is empirically evaluated. Thus, a large number of negotiation sessions are run against a variety of opponents selected in different domains varying in size and opposition. The agent’s performance is compared with its opponents and the performance of the baseline agents negotiating with the same opponents. The empirical results show that our agent successfully negotiates against challenging opponents in different negotiation scenarios without requiring any former information about the opponent or domain in advance. Furthermore, it achieves better results than the baseline agents regarding the received utility at the end of the successful negotiations.	en_US
dc.description.sponsorship	Scientific and Research Council of Turkey ; TÜBİTAK
dc.description.version	Publisher version	en_US
dc.identifier.doi	10.55730/1300-0632.3899	en_US
dc.identifier.endpage	1714	en_US
dc.identifier.issn	1300-0632	en_US
dc.identifier.issue	5	en_US
dc.identifier.scopus	2-s2.0-85139306597
dc.identifier.startpage	1695	en_US
dc.identifier.uri	http://hdl.handle.net/10679/8624
dc.identifier.uri	https://doi.org/10.55730/1300-0632.3899
dc.identifier.volume	30	en_US
dc.identifier.wos	000904725600002
dc.language.iso	eng	en_US
dc.peerreviewed	yes	en_US
dc.publicationstatus	Published	en_US
dc.publisher	TÜBİTAK	en_US
dc.relation	info:eu-repo/grantAgreement/TUBITAK/1001 - Araştırma/118E197
dc.relation.ispartof	Turkish Journal of Electrical Engineering and Computer Sciences
dc.relation.publicationcategory	International Refereed Journal
dc.rights	Attribution 4.0 International
dc.rights	openAccess
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/
dc.subject.keywords	Automated bilateral negotiation	en_US
dc.subject.keywords	Bidding strategy	en_US
dc.subject.keywords	Deep reinforcement learning	en_US
dc.subject.keywords	Entropy reinforcement learning	en_US
dc.subject.keywords	Imitation learning	en_US
dc.subject.keywords	Multi-agent systems	en_US
dc.title	Actor-critic reinforcement learning for bidding in bilateral negotiation	en_US
dc.type	article	en_US
dspace.entity.type	Publication
relation.isOrgUnitOfPublication	85662e71-2a61-492a-b407-df4d38ab90d7
relation.isOrgUnitOfPublication.latestForDiscovery	85662e71-2a61-492a-b407-df4d38ab90d7

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Actor-critic reinforcement learning for bidding in bilateral negotiation.pdf
Size:: 918.13 KB
Format:: Adobe Portable Document Format
Description:

Download