Vis enkel innførsel

dc.contributor.authorAbolpour Mofrad, Asieh
dc.contributor.authorYazidi, Anis
dc.contributor.authorAbolpour Mofrad, Samaneh
dc.contributor.authorHammer, Hugo Lewi
dc.contributor.authorArntzen, Erik
dc.date.accessioned2021-02-15T14:27:28Z
dc.date.available2021-02-15T14:27:28Z
dc.date.created2021-02-03T14:59:44Z
dc.date.issued2021
dc.identifier.citationMofrad, A. A., Yazidi, A., Mofrad, S. A., Hammer, H. L., & Arntzen, E. (2021). Enhanced equivalence projective simulation: A framework for modeling formation of stimulus equivalence classes. Neural Computation, 33(2), 483-527.en_US
dc.identifier.issn0899-7667
dc.identifier.urihttps://hdl.handle.net/11250/2728180
dc.descriptionThis is the accepted manusctipt version (postprint) of an article published by The MIT Press in Neural Computation on 2 February 2021, available from: https://doi.org/10.1162/neco_a_01346en_US
dc.description.abstractFormation of stimulus equivalence classes has been recently modeled through equivalence projective simulation (EPS), a modified version of a projective simulation (PS) learning agent. PS is endowed with an episodic memory that resembles the internal representation in the brain and the concept of cognitive maps. PS flexibility and interpretability enable the EPS model and, consequently the model we explore in this letter, to simulate a broad range of behaviors in matching-to-sample experiments. The episodic memory, the basis for agent decision making, is formed during the training phase. Derived relations in the EPS model that are not trained directly but can be established via the network's connections are computed on demand during the test phase trials by likelihood reasoning. In this letter, we investigate the formation of derived relations in the EPS model using network enhancement (NE), an iterative diffusion process, that yields an offline approach to the agent decision making at the testing phase. The NE process is applied after the training phase to denoise the memory network so that derived relations are formed in the memory network and retrieved during the testing phase. During the NE phase, indirect relations are enhanced, and the structure of episodic memory changes. This approach can also be interpreted as the agent's replay after the training phase, which is in line with recent findings in behavioral and neuroscience studies. In comparison with EPS, our model is able to model the formation of derived relations and other features such as the nodal effect in a more intrinsic manner. Decision making in the test phase is not an ad hoc computational method, but rather a retrieval and update process of the cached relations from the memory network based on the test trial. In order to study the role of parameters on agent performance, the proposed model is simulated and the results discussed through various experimental settings.en_US
dc.language.isoengen_US
dc.publisherThe MIT Pressen_US
dc.subjectstimulus equivalence classesen_US
dc.subjectequivalence projective simulationen_US
dc.subjectreinforcement learningen_US
dc.subjectnetwork enhancementen_US
dc.titleEnhanced Equivalence Projective Simulation: A Framework for Modeling Formation of Stimulus Equivalence Classesen_US
dc.typePeer revieweden_US
dc.typeJournal articleen_US
dc.description.versionacceptedVersionen_US
dc.source.pagenumber483-527en_US
dc.source.volume33en_US
dc.source.journalNeural Computationen_US
dc.source.issue2en_US
dc.identifier.doi10.1162/neco_a_01346
dc.identifier.cristin1886392
cristin.ispublishedtrue
cristin.fulltextpostprint
cristin.qualitycode2


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel