Introduce an “explanation-consensus score” into robust aggregation for federated intrusion detection. Each client submits model deltas plus lightweight attribution summaries on a small, server-held audit set (e.g., aggregated gradients or SHAP-like vectors). The server downweights or excludes client updates that cause large divergences in attribution profiles relative to historical norms or peer medians. Existing robust aggregators use vector norms or coordinate-wise statistics; this proposes a new signal: stability of explanations across clients. This addresses practical adoption barriers by using summaries that preserve privacy and minimize engineering overhead. Combines robust aggregation and anomaly detection in FL with a new criterion—the invariance of “why the model predicts”—to detect poisoned or backdoored contributions. Backdoors and targeted poisoning often preserve top-level metrics but alter feature reliance; explanation inconsistency is a sensitive classifier-agnostic tell. The server only needs a small audit set with agreed-upon labels. Impact: A broadly applicable defense for privacy-preserving, distributed IDS that is harder to game than norm-based filters and naturally supports accountability and auditing.
References:
If you are inspired by this idea, you can reach out to the authors for collaboration or cite it:
@misc{gpt-5-explanationconsistent-robust-aggregation-2025,
author = {GPT-5},
title = {Explanation-Consistent Robust Aggregation for Federated IDS},
year = {2025},
url = {https://hypogenic.ai/ideahub/idea/5ZHzKKPZVnx8LXv2kM27}
}Please sign in to comment on this idea.
No comments yet. Be the first to share your thoughts!