Maryam Hashemzadeh

From Language to Language-ish: How Brain-Like is an LSTM's Representation of Nonsensical Language Stimuli?

Oct 14, 2020
Maryam Hashemzadeh, Greta Kaufeld, Martha White, Andrea E. Martin, Alona Fyshe

The representations generated by many models of language (word embeddings, recurrent neural networks and transformers) correlate with brain activity recorded while people read. However, these decoding results are usually based on the brain's reaction to syntactically and semantically sound language stimuli. In this study, we asked: how does an LSTM (long short-term memory) language model, trained (by and large) on semantically and syntactically intact language, represent a language sample with degraded semantic or syntactic information? Does the LSTM representation still resemble the brain's reaction? We found that, even for some kinds of nonsensical language, there is a statistically significant relationship between the brain's activity and the representations of an LSTM. This indicates that, at least in some instances, LSTMs and the human brain handle nonsensical data similarly.

* 12 pages

Exploiting generalization in the subspaces for faster model-based learning

Oct 25, 2017
Maryam Hashemzadeh, Reshad Hosseini, Majid Nili Ahmadabadi

Because of insufficient generalization in the state-space, common methods in Reinforcement Learning (RL) suffer from slow learning, especially in the early learning trials. This paper introduces a model-based method for discrete state-spaces that increases learning speed in terms of required experience (but not required computational time) by exploiting generalization in the experiences of the subspaces. A subspace is formed by choosing a subset of features in the original state representation (full-space). Generalization and faster learning in a subspace are due to the many-to-one mapping of experiences from the full-space to each state in the subspace. Nevertheless, due to inherent perceptual aliasing in the subspaces, the policy suggested by each subspace does not generally converge to the optimal policy. Our approach, called Model Based Learning with Subspaces (MoBLeS), calculates confidence intervals of the estimated Q-values in the full-space and in the subspaces. These confidence intervals are used in decision making, such that the agent benefits the most from the possible generalization while avoiding the detriment of perceptual aliasing in the subspaces. Convergence of MoBLeS to the optimal policy is theoretically investigated. Additionally, we show through several experiments that MoBLeS improves the learning speed in the early trials.
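
The subspace idea in this abstract can be illustrated with a small sketch. This is not the MoBLeS algorithm: the abstract does not give the interval formula or the decision rule, so everything below is an assumption made for illustration. The names `project`, `IntervalTable` and `pick_estimate` are hypothetical, the interval is a plain normal approximation, the rule "use whichever estimate has the narrower interval" is a stand-in, and observed value samples are used in place of Q-values computed from a learned model.

```python
import math
from collections import defaultdict


def project(state, feature_idx):
    """Many-to-one map: keep only the chosen features of a full-space state."""
    return tuple(state[i] for i in feature_idx)


class IntervalTable:
    """Running mean and a normal-approximation confidence interval per key.

    Assumption: a 95% normal interval (z = 1.96); the paper's interval
    construction is not stated in the abstract.
    """

    def __init__(self, z=1.96):
        self.z = z
        self.n = defaultdict(int)
        self.mean = defaultdict(float)
        self.m2 = defaultdict(float)  # sum of squared deviations (Welford)

    def update(self, key, value):
        self.n[key] += 1
        delta = value - self.mean[key]
        self.mean[key] += delta / self.n[key]
        self.m2[key] += delta * (value - self.mean[key])

    def interval(self, key):
        n = self.n[key]
        if n < 2:
            # Too few samples to estimate a spread: no information yet.
            return (-math.inf, math.inf)
        std = math.sqrt(self.m2[key] / (n - 1))
        half = self.z * std / math.sqrt(n)
        return (self.mean[key] - half, self.mean[key] + half)


def pick_estimate(full, sub, state, action, feature_idx):
    """Hypothetical rule: use the estimate whose interval is narrower."""
    sub_key = (project(state, feature_idx), action)
    lo_f, hi_f = full.interval((state, action))
    lo_s, hi_s = sub.interval(sub_key)
    if hi_s - lo_s < hi_f - lo_f:
        return "subspace", sub.mean[sub_key]
    return "full", full.mean[(state, action)]


# Three different full-space states share the same first feature, so they
# all map to one subspace state and pool their experience there.
full_table, sub_table = IntervalTable(), IntervalTable()
kept_features = (0,)
samples = [((0, 0), "a", 1.0), ((0, 1), "a", 1.2), ((0, 2), "a", 0.8)]
for state, action, value in samples:
    full_table.update((state, action), value)
    sub_table.update((project(state, kept_features), action), value)

print(pick_estimate(full_table, sub_table, (0, 0), "a", kept_features))
```

In this toy run each full-space state has been visited once, so its interval is unbounded, while the pooled subspace state already has three samples and a finite interval. That is the early-trial advantage the abstract describes. With perceptual aliasing, the pooled samples would disagree, the subspace interval would stay wide, and a rule of this kind would shift toward the full-space estimate as it accumulates visits.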
