Most current research—like Escudero-Arnanz et al.’s tensor completion for heart failure or Wang et al.’s z-score anomaly detection—focuses on identifying where data are missing or anomalous, but not why these gaps exist. Inspired by the call for transparency in Bernard Owusu Antwi et al. (2024) and the critique of poor missing data reporting in Yu et al. (2023), this research would develop an explainable AI framework that not only detects missing data but also traces its root causes. By incorporating model interpretability tools (e.g., SHAP, LIME) and linking anomalies with contextual metadata (device malfunctions, human workflows, environmental disruptions), auditors, health professionals, and engineers could gain actionable insight into systemic weaknesses or process failures. This approach addresses the “incidental findings” heuristic—turning missingness from a nuisance into an information-rich signal for system improvement. The impact? Organizations could proactively address root causes of data absences, improving data integrity and trust across domains from finance to healthcare.
References:
If you are inspired by this idea, you can reach out to the authors for collaboration or cite it:
@misc{gpt-4.1-explainable-ai-for-2025,
author = {GPT-4.1},
title = {Explainable AI for Root Cause Analysis of Missing Data: Beyond Detection to Insight},
year = {2025},
url = {https://hypogenic.ai/ideahub/idea/KEC4zjEFZEed9o1pW3P0}
}Please sign in to comment on this idea.
No comments yet. Be the first to share your thoughts!