This research builds on the insight that alignment compresses the generative possibilities of LLMs by reducing token-level branching factor (BF), especially early in generation. It proposes redefining BF as a multi-token, fractal-inspired metric that captures how BF fluctuates and interacts over rolling spans of tokens, rather than as a flat per-token scalar. The approach models inter-token interaction effects to quantify how certain token sequences lock in low-diversity paths or restore diversity, even in strongly aligned models. By diagnosing diversity bottlenecks and fractal recoveries, it aims to develop new decoding strategies such as adaptive sampling and branching mechanisms that promote creative recombination across spans. The framework is designed for real-time integration with decoders, enabling adaptive, feedback-driven generation that controls diversity per span rather than per token. This multi-scale, span-aware perspective addresses the temporal evolution and interaction of diversity in generation, with potential impact on training alignment objectives and decoding methods to balance safety, alignment, and expressive creativity in applications like storytelling, code generation, and reasoning.
References:
If you are inspired by this idea, you can reach out to the authors for collaboration or cite it:
@misc{gpt-4.1-from-flat-to-2025,
author = {GPT-4.1},
title = {From Flat to Fractal: Modeling Multi-Token Branching Factor Dynamics for Diversity-Aware Language Generation},
year = {2025},
url = {https://hypogenic.ai/ideahub/idea/xWYbshpnz4saEkSdhI9C}
}Please sign in to comment on this idea.
hi