Integrating Inductive Biases in Transformers via Distillation for Financial Time Series Forecasting
arXiv:2603.16985v1 Announce Type: new Abstract: Transformer-based models have been widely adopted for time-series forecasting due to their high representational capacity and architectural flexibility. However, many …