The document summarizes recent research related to "theory of mind" in multi-agent reinforcement learning. It discusses three papers that propose methods for agents to infer the intentions of other agents by applying concepts from theory of mind:
1. The papers propose that in multi-agent reinforcement learning, being able to understand the intentions of other agents could help with cooperation and increase success rates.
2. The methods aim to estimate the intentions of other agents by modeling their beliefs and private information, using ideas from theory of mind in cognitive science. This involves inferring information about other agents that is not directly observable.
3. Bayesian inference is often used to reason about the beliefs, goals and private information of other agents based
The document discusses hyperparameter optimization in machine learning models. It introduces various hyperparameters that can affect model performance, and notes that as models become more complex, the number of hyperparameters increases, making manual tuning difficult. It formulates hyperparameter optimization as a black-box optimization problem to minimize validation loss and discusses challenges like high function evaluation costs and lack of gradient information.
【DL輪読会】Efficiently Modeling Long Sequences with Structured State SpacesDeep Learning JP
This document summarizes a research paper on modeling long-range dependencies in sequence data using structured state space models and deep learning. The proposed S4 model (1) derives recurrent and convolutional representations of state space models, (2) improves long-term memory using HiPPO matrices, and (3) efficiently computes state space model convolution kernels. Experiments show S4 outperforms existing methods on various long-range dependency tasks, achieves fast and memory-efficient computation comparable to efficient Transformers, and performs competitively as a general sequence model.
The document discusses hyperparameter optimization in machine learning models. It introduces various hyperparameters that can affect model performance, and notes that as models become more complex, the number of hyperparameters increases, making manual tuning difficult. It formulates hyperparameter optimization as a black-box optimization problem to minimize validation loss and discusses challenges like high function evaluation costs and lack of gradient information.
【DL輪読会】Efficiently Modeling Long Sequences with Structured State SpacesDeep Learning JP
This document summarizes a research paper on modeling long-range dependencies in sequence data using structured state space models and deep learning. The proposed S4 model (1) derives recurrent and convolutional representations of state space models, (2) improves long-term memory using HiPPO matrices, and (3) efficiently computes state space model convolution kernels. Experiments show S4 outperforms existing methods on various long-range dependency tasks, achieves fast and memory-efficient computation comparable to efficient Transformers, and performs competitively as a general sequence model.
This document discusses using R and RStudio to simulate reinforcement learning models. It demonstrates simulating a Rescorla-Wagner model to update action values Q_A and Q_B based on payoffs from actions A and B over time. The model is expanded to select actions stochastically using a softmax function of the difference between Q_A and Q_B. Plots show the evolution of Q_A and Q_B over time for different learning rate and temperature parameters. The document provides an example code implementation of this reinforcement learning model in R.
Karl Fristonが提唱している「自由エネルギー原理(free-energy principle = FEP)」について、北大文学部の聴衆を対象にして、物理学や機械学習の知識の前提抜きにして、説明を行い、その意義を説明したものです。FEPの意識研究への応用に向けて、FEPとエナクション説の近接性について強調したものとなっております。
This document summarizes two lectures about consciousness and neuroscience:
1) It discusses theories of consciousness such as qualia and awareness, and the distinction between the dorsal and ventral visual pathways and their roles in vision for action vs perception. It also covers blindsight and the idea that a "feeling of something" without qualia may arise from saliency computation.
2) It discusses using bistable percepts like binocular rivalry to study neural correlates of awareness. It introduces the ideas of neurophenomenology and heterophenomenology to study first-person experience through intersubjective methods. It provides an example of neurophenomenology applied to the aura experience before epileptic seizures.
駒場学部講義2015 総合情報学特論III 「意識の神経科学:「気づき」と「サリエンシー」を手がかりに」Masatoshi Yoshida
1. The document summarizes a lecture about the neural basis of consciousness, focusing on awareness, attention, and the study of blindsight.
2. It discusses evidence that the dorsal visual pathway is involved in vision for action while the ventral pathway is involved in vision for perception.
3. In blindsight, there is a "feeling of something happening" in the blind field that can be explained by saliency computation and sensorimotor contingencies rather than conscious visual experience.
4. Friston K, Breakspear M, Deco G. Perception and self-
organized instability. Front Comput Neurosci. 2012 Jul 6;6:44.
FIg.1 (CC BY 3.0) http://www.frontiersin.org/files/Articles/
23035/fncom-06-00044-r4/image_m/fncom-06-00044-g001.jpg
自由エネルギー最小化原理
44. 自由エネルギー原理 (FEP)
x (or psi)とsは時間的なシークエンス
視覚サリエンスが高い
= そこを見ればqのエン
トロピーを最小化する
場所 => その場所を
control state u(x)のprior
として採用する。
Control state u(x)によって運動aを計画
実際のモデルはこの図のようにもっと複雑なのだけど、
エッセンスはこれでつかめたと思う。
Perceptions as hypotheses: saccades as
experiments. Friston K, Adams RA,
Perrinet L, Breakspear M. Front Psychol.
2012 May 28;3:151.
45. 参考文献
Friston K, Adams RA, Perrinet L, Breakspear M. (2012) Perceptions as hypotheses:
saccades as experiments. Front Psychol. 3:151. (視線移動のモデル及び反実仮想の概
念の初出)
McGregor, Simon; Baltieri, Manuel; Buckley, Christopher L. (2015) A Minimal Active
Inference Agent. arXiv:1503.04187 https://arxiv.org/abs/1503.04187 (このスライドの
説明の元ネタ。離散バージョンのFEPの説明などがあり、これがもっともわかりやす
い。)
Buckley, Christopher L.; Kim, Chang Sub; McGregor, Simon; Seth, Anil K. (2017) The
free energy principle for action and perception: A mathematical review. arXiv:
1705.09156 (省略無しで丁寧な説明。)
Friston, K. (2010). The free-energy principle: a unified brain theory? Nature Reviews
Neuroscience, 11(2), 127–138. (FEPの全体像について網羅的な記述)
Friston K, Rigoli F, Ognibene D, Mathys C, Fitzgerald T, Pezzulo G. (2015) Active
inference and epistemic value. Cogn Neurosci. 2015;6(4):187-214. (視線移動を
epistemic valueと捉えて、他のvalueと対置して扱っている)
Ryota Kanai (2017) Creating Consciousness. https://www.slideshare.net/ryotakanai/
creating-consciousness (FEPの反実仮想についての言及あり。)