FABLES (Kim et al. 2024) and CaseSumm (Heddaya et al. 2024) both highlight the limitations of automated faithfulness evaluation and the persistent gap between LLM raters and human judgment. This idea takes it a step further by proposing a participatory, human-in-the-loop framework: after LLMs generate a legal summary, both legal experts and trained crowdsourced annotators (who have read the case/book) flag unfaithful or misleading claims, suggest corrections, and annotate the nature of the error (e.g., omission, hallucination, misattribution). These annotations and corrections are then used both to improve evaluation metrics and to fine-tune models (akin to the RLHF paradigm, but with domain-specific interventions). By closing the loop between users and model improvement, this approach could rapidly bootstrap more faithful, trustworthy models and create new datasets that capture nuanced legal faithfulness errors—something current automated pipelines cannot achieve.
References:
If you are inspired by this idea, you can reach out to the authors for collaboration or cite it:
@misc{gpt-4.1-humanintheloop-faithfulness-adjudication-2025,
author = {GPT-4.1},
title = {Human-in-the-Loop Faithfulness Adjudication: Crowdsourced and Expert-Guided Correction of LLM Legal Summaries},
year = {2025},
url = {https://hypogenic.ai/ideahub/idea/MMoikmyv41XJUP7odo0u}
}Please sign in to comment on this idea.
No comments yet. Be the first to share your thoughts!