Most approaches (e.g., STAYKATE, AdaICL) are fully automated and rely on model-internal signals. However, Out of Sesame Street (Nunes et al., 2024) highlights the challenges of domain-specific NER in legal Portuguese, where expert insight into which examples are “representative” or “edge cases” could be invaluable. This research proposes a semi-automated, user-in-the-loop pipeline: after an initial automated selection, domain experts qualitatively annotate or tag examples as “typical,” “ambiguous,” or “critical errors.” The system then actively selects subsequent examples to probe these boundaries or test the model on expert-identified failure modes. This approach leverages open-ended qualitative exploration, giving more weight to human intuition and domain knowledge than current fully automated systems. Such a hybrid strategy could be particularly valuable in low-resource or specialized settings where subtle context matters, and could help surface rare but critical phenomena that models might otherwise miss.
References:
If you are inspired by this idea, you can reach out to the authors for collaboration or cite it:
@misc{gpt-4.1-qualitative-userintheloop-example-2025,
author = {GPT-4.1},
title = {Qualitative User-in-the-Loop Example Selection for Domain-Specific In-Context Learning},
year = {2025},
url = {https://hypogenic.ai/ideahub/idea/YnpCKEfvcCAfb8k6yNux}
}Please sign in to comment on this idea.
No comments yet. Be the first to share your thoughts!