TL;DR: LLMs can follow specific instructions, write code, and use tools given detailed usages. Existing LLM agents are highly capable, and a lot of research agents emerged, but none of them really produce high quality research. Can we inject knowledge about good research standards into LLMs?
Hypothesis: LLMs do not have the "meta intelligence" to do good research, for example:
Research plan:
Note: Optimizing the prompts and context engineering for existing research agents can potentially lead to significant improvement, and is much easier to implement and experiment with compared to the orchestrator training approach. It is a plausible approach for AI/CS research, but the concern here is that it's unlikely one can exhaust the good standards in all domains and connect that effectively to LLM agents.
Challenges: Aside from the difficulty for curating a high quality dataset for representing good research standards, it's also unclear how we can define a good reward signal, e.g., how to evaluate a series of actions.
If you are inspired by this idea, you can reach out to the authors for collaboration or cite it:
@misc{liu-injecting-meta-intelligence-2025,
author = {Liu, Haokun},
title = {Injecting meta intelligence about research in LLMs},
year = {2025},
url = {https://hypogenic.ai/ideahub/idea/nOLFY3lqbtjXha6aDlPE}
}Please sign in to comment on this idea.
No comments yet. Be the first to share your thoughts!