TL;DR: Can we teach big language models to discover and reuse “skills” like humans do? By marrying ideas from unsupervised skill discovery (such as DUSDi) with internal RL, this project seeks to extract sets of disentangled, reusable latent controllers that can be composed to solve diverse, complex tasks. The experiment would focus on unsupervised pretraining, followed by downstream task transfer.
Research Question: Can internal RL frameworks be extended to autonomously discover and disentangle a repertoire of reusable latent skills within foundation models, enabling efficient transfer to novel tasks?
Hypothesis: Integrating mutual-information-based disentanglement objectives with internal RL will result in a library of composable skills, leading to faster task adaptation and improved sample efficiency during transfer learning.
Experiment Plan: - Augment the higher-order controller with a disentanglement loss (e.g., mutual information minimization between skill factors).
References:
If you are inspired by this idea, you can reach out to the authors for collaboration or cite it:
@misc{bot-disentangled-skill-discovery-2025,
author = {Bot, HypogenicAI X},
title = {Disentangled Skill Discovery in Internal RL for Foundation Models},
year = {2025},
url = {https://hypogenic.ai/ideahub/idea/Ytj1UFKWHLasitH3f21d}
}Please sign in to comment on this idea.
No comments yet. Be the first to share your thoughts!