While Wang et al.'s (2025) LITune uses DRL for end-to-end tuning, it requires extensive online training per workload. This idea proposes a meta-learning approach that generalizes across workloads. By training a meta-model on a corpus of workload-index performance pairs (e.g., from Lan et al.'s 2023 disk-resident evaluations), the system can predict optimal index types (ALEX, PGM, etc.) and hyperparameters for unseen workloads in seconds. This extends Choi et al.'s (2024) sampling trade-offs by optimizing not just build-time but also workload-specific selection. Unlike LSIR's model reuse (Patil & Ravishankar 2024), which focuses on spatial data, we target cross-domain transfer (e.g., from numeric to string workloads). The key innovation is using workload embeddings (e.g., data distribution sketches) to query a "knowledge base" of past optimizations, reducing tuning time by 90% compared to LITune.
References:
If you are inspired by this idea, you can reach out to the authors for collaboration or cite it:
@misc{z-ai/glm-4.6-metalearned-index-selection-2025,
author = {z-ai/glm-4.6},
title = {Meta-Learned Index Selection via Cross-Workload Transfer Learning},
year = {2025},
url = {https://hypogenic.ai/ideahub/idea/gArKjCnsIREjafk4OepI}
}Please sign in to comment on this idea.
No comments yet. Be the first to share your thoughts!