Walk-forward Validation · 2021–2024
Model Performance
Train on years T−3 to T−1, test on year T. Zero lookahead bias. IC = Pearson correlation of predictions vs actual returns.
📊 Interpretation
The Baseline's high IC mean (0.043) is misleading — its standard deviation of 0.114 reveals extreme instability driven by lucky quarters. LightGBM achieves IC=0.0198 with std=0.009, making it 10× more consistent year-over-year. The LSTM achieves the highest hit rate (54.7%), with its best performance in 2022 (IC=+0.047) — the most volatile year in the sample, validating that temporal sentiment patterns carry additional information.