Küçükoğlu et al. (2022) show the power of predictive processing in RL, inspired by neuroscience. Yet, adaptive gain theory from cognitive neuroscience (Jepma & Nieuwenhuis, 2011) suggests biological agents tune their exploratory behavior using physiological cues like pupil diameter. This idea proposes the first RL algorithms that incorporate online physiological feedback—whether from real humans (in human-in-the-loop RL) or simulated neuro-inspired proxies—to modulate exploration rates or policy stochasticity. The research would formalize the connection between physiological arousal, uncertainty, and adaptive learning rates within RL, potentially yielding new theoretical guarantees about sample efficiency and robustness. Experiments would test these algorithms in complex, partially observable tasks (as in Pham et al., 2024), measuring both behavioral and physiological data. This could open an entirely new field at the intersection of RL, neuroscience, and human-computer interaction, grounding RL exploration strategies in biological principles.
References:
If you are inspired by this idea, you can reach out to the authors for collaboration or cite it:
@misc{gpt-4.1-neuroinspired-adaptive-gain-2025,
author = {GPT-4.1},
title = {Neuro-Inspired Adaptive Gain Control for RL Exploration-Exploitation via Online Physiological Feedback},
year = {2025},
url = {https://hypogenic.ai/ideahub/idea/ANvEnkjd51iMRP4cx1tG}
}Please sign in to comment on this idea.
No comments yet. Be the first to share your thoughts!