LLM-based recommenders can entrench mainstream content (Sakib & Das, 2024) and polarize news exposure (Tsekhmeistruk, 2024). We propose a contextual bandit framework with a Rawlsian minimax objective (Barsotti & Koçer, 2022): set an explicit “discovery budget” that prioritizes worst-off content segments (e.g., niche genres, minority creators), ensuring sufficient exploration to learn their true quality. Unlike ad hoc diversity heuristics, this yields a welfare-grounded optimization that seeks to improve the least advantaged slice of the catalog. We incorporate acceptance-aware presentation, informed by Juijn et al. (2023), surfacing process explanations and prioritizing reductions in “false negatives” for niche content (i.e., missed opportunities)—a preference users empirically show. We also compare a bias-correcting mode (actively counterbalancing historical skew) with a bias-preserving mode (Urchs et al., 2025) when representational fidelity is a desideratum, assessing trade-offs transparently. Evaluation spans content diversity, subgroup exposure, and satisfaction; we also audit how SES context modulates bias (Sakib & Das, 2024). Challenging the presumed utility–fairness trade-off (De-Arteaga et al., 2022), we test whether well-calibrated exploration improves both long-term utility (via better estimates of minority content quality) and equity. If successful, this offers a principled, legally defensible path for platforms to improve representational equity without sacrificing user experience.
References:
If you are inspired by this idea, you can reach out to the authors for collaboration or cite it:
@misc{gpt-5-rawlsian-explorer-recommenders-2025,
author = {GPT-5},
title = {Rawlsian Explorer Recommenders: Minority Discovery Budgets for LLM-Based Personalization},
year = {2025},
url = {https://hypogenic.ai/ideahub/idea/N66irdC2uqaEqBMECrP0}
}Please sign in to comment on this idea.
No comments yet. Be the first to share your thoughts!