Publication:
Maintaining connectivity for multi-UAV multi-target search using reinforcement learning

dc.contributor.authorGüven, İslam
dc.contributor.authorAdam, Evşen Yanmaz
dc.contributor.departmentElectrical & Electronics Engineering
dc.contributor.ozuauthorADAM, Evşen Yanmaz
dc.contributor.ozugradstudentGüven, İslam
dc.date.accessioned2024-01-29T12:45:53Z
dc.date.available2024-01-29T12:45:53Z
dc.date.issued2023-10-30
dc.description.abstractWe propose a dynamic path planner that uses a multi-Agent reinforcement learning (MARL) model with novel reward functions for multi-drone search and rescue (SAR) missions. We design a mission environment where a multi-drone team covers an area to detect randomly distributed targets and inform the ground base station (BS) by continuously forming relay chains between the targets and the BS. The training procedure of the agents includes a convolutional neural network (CNN) that uses images which represent trajectory histories and connectivity states of each environment entity such as drones, targets, BS. Agents take actions and get feedback from the environment until the mission is completed. The model is trained with multiple missions with randomized target locations. Our results show that the trained model successfully produces mission plans such that the multi-drone system searches the area efficiently while dynamically forming relay chains. The proposed dynamic method leads up to 45% better total detection and mission times in comparison to a pre-planned optimized path planner.en_US
dc.description.sponsorshipTÜBİTAK
dc.identifier.doi10.1145/3616392.3623414en_US
dc.identifier.endpage114en_US
dc.identifier.isbn979-840070369-0
dc.identifier.scopus2-s2.0-85178375702
dc.identifier.startpage109en_US
dc.identifier.urihttp://hdl.handle.net/10679/9114
dc.identifier.urihttps://doi.org/10.1145/3616392.3623414
dc.identifier.wos001122484300015
dc.language.isoengen_US
dc.publicationstatusPublisheden_US
dc.publisherAssociation for Computing Machinery, Incen_US
dc.relationinfo:eu-repo/grantAgreement/TUBITAK/1001 - Araştırma/121E408
dc.relation.ispartofDIVANet '23: Proceedings of the Int'l ACM Symposium on Design and Analysis of Intelligent Vehicular Networks and Applications
dc.relation.publicationcategoryInternational
dc.rightsrestrictedAccess
dc.subject.keywordsConvolutional neural networksen_US
dc.subject.keywordsDrone networksen_US
dc.subject.keywordsMaintaining connectivityen_US
dc.subject.keywordsMulti-Agent reinforcement learningen_US
dc.subject.keywordsMulti-UAV path planningen_US
dc.subject.keywordsReinforcement learningen_US
dc.subject.keywordsUnmanned aerial vehiclesen_US
dc.titleMaintaining connectivity for multi-UAV multi-target search using reinforcement learningen_US
dc.typeconferenceObjecten_US
dspace.entity.typePublication
relation.isOrgUnitOfPublication7b58c5c4-dccc-40a3-aaf2-9b209113b763
relation.isOrgUnitOfPublication.latestForDiscovery7b58c5c4-dccc-40a3-aaf2-9b209113b763

Files

License bundle

Now showing 1 - 1 of 1
Placeholder
Name:
license.txt
Size:
1.45 KB
Format:
Item-specific license agreed upon to submission
Description: