Language inference with multi-head automata through reinforcement learning
dc.contributor.author | Şekerci, Alper | |
dc.contributor.author | Köken, Özlem Salehi | |
dc.date.accessioned | 2021-06-14T13:06:59Z | |
dc.date.available | 2021-06-14T13:06:59Z | |
dc.date.issued | 2020 | |
dc.identifier.isbn | 978-172816926-2 | |
dc.identifier.uri | http://hdl.handle.net/10679/7432 | |
dc.identifier.uri | https://ieeexplore.ieee.org/document/9207156 | |
dc.description.abstract | The purpose of this paper is to use reinforcement learning to model learning agents that can recognize formal languages. Agents are modeled as simple multi-head automata, a new model of finite automaton that uses multiple heads, and six different languages are formulated as reinforcement learning problems. Two different algorithms are used for optimization. The first is Q-learning, which trains gated recurrent units to learn optimal policies. The second is a genetic algorithm, which searches for the optimal solution using evolution-inspired operations. The results show that the genetic algorithm performs better than Q-learning in general, but Q-learning finds solutions faster for regular languages. | en_US |
dc.language.iso | eng | en_US |
dc.publisher | IEEE | en_US |
dc.relation.ispartof | 2020 International Joint Conference on Neural Networks (IJCNN) | |
dc.rights | restrictedAccess | |
dc.title | Language inference with multi-head automata through reinforcement learning | en_US |
dc.type | Conference paper | en_US |
dc.publicationstatus | Published | en_US |
dc.contributor.department | Özyeğin University | |
dc.contributor.authorID | (ORCID 0000-0003-2033-2881 & YÖK ID ) Salehi, Özlem | |
dc.contributor.ozuauthor | Köken, Özlem Salehi | |
dc.identifier.wos | WOS:000626021404062 | |
dc.identifier.doi | https://doi.org/10.1109/IJCNN48605.2020.9207156 | en_US |
dc.subject.keywords | Finite automata | en_US |
dc.subject.keywords | Reinforcement learning | en_US |
dc.subject.keywords | Neural network | en_US |
dc.subject.keywords | Q-learning | en_US |
dc.subject.keywords | Genetic algorithm | en_US |
dc.identifier.scopus | SCOPUS:2-s2.0-85093867777 | |
dc.contributor.ozugradstudent | Şekerci, Alper | |
dc.relation.publicationcategory | Conference Paper - International - Institutional Academic Staff and Undergraduate Student | |