Language inference with multi-head automata through reinforcement learning

Şekerci, Alper; Köken, Özlem Salehi

dc.contributor.author	Şekerci, Alper
dc.contributor.author	Köken, Özlem Salehi
dc.date.accessioned	2021-06-14T13:06:59Z
dc.date.available	2021-06-14T13:06:59Z
dc.date.issued	2020
dc.identifier.isbn	978-172816926-2
dc.identifier.uri	http://hdl.handle.net/10679/7432
dc.identifier.uri	https://ieeexplore.ieee.org/document/9207156
dc.description.abstract	The purpose of this paper is to use reinforcement learning to model learning agents that can recognize formal languages. Agents are modeled as simple multi-head automata, a new model of finite automaton that uses multiple heads, and six different languages are formulated as reinforcement learning problems. Two different algorithms are used for optimization. The first is Q-learning, which trains gated recurrent units to learn optimal policies. The second is a genetic algorithm, which searches for the optimal solution using evolution-inspired operations. The results show that the genetic algorithm performs better than Q-learning in general, but Q-learning finds solutions faster for regular languages.	en_US
dc.language.iso	eng	en_US
dc.publisher	IEEE	en_US
dc.relation.ispartof	2020 International Joint Conference on Neural Networks (IJCNN)
dc.rights	restrictedAccess
dc.title	Language inference with multi-head automata through reinforcement learning	en_US
dc.type	Conference paper	en_US
dc.publicationstatus	Published	en_US
dc.contributor.department	Özyeğin University
dc.contributor.authorID	(ORCID 0000-0003-2033-2881 & YÖK ID ) Salehi, Özlem
dc.contributor.ozuauthor	Köken, Özlem Salehi
dc.identifier.wos	WOS:000626021404062
dc.identifier.doi	https://doi.org/10.1109/IJCNN48605.2020.9207156	en_US
dc.subject.keywords	Finite automata	en_US
dc.subject.keywords	Reinforcement learning	en_US
dc.subject.keywords	Neural network	en_US
dc.subject.keywords	Q-learning	en_US
dc.subject.keywords	Genetic algorithm	en_US
dc.identifier.scopus	SCOPUS:2-s2.0-85093867777
dc.contributor.ozugradstudent	Şekerci, Alper
dc.relation.publicationcategory	Conference Paper - International - Institutional Academic Staff and Undergraduate Student