Quiz: ML in Signal Generation
Module 3 of 4 · Hard
Quick Quiz
1. You have 500 assets, 20 features, and 10 years of daily data (2,520 observations per asset). You want to predict 1-month (21-day) forward returns using a ridge regression. Why is standard -fold cross-validation invalid here, and what is the minimum purge gap needed?
2. Ridge regression with penalty has closed-form solution . As , . What is the bias and variance of relative to OLS , and under what condition does ridge dominate OLS in MSE?
3. A quant researcher reports a gradient boosted tree model with in-sample IC = 0.12 and out-of-sample IC = 0.02 over a purged walk-forward evaluation. What does this IC ratio indicate, and what hyperparameter adjustments would you recommend?
4. Impurity-based feature importance in gradient boosted trees (total reduction in objective function across all splits using each feature) is an unbiased estimator of each feature's true predictive contribution, even when features have different cardinalities or are correlated with each other.
5. You construct a cross-sectional equity momentum feature: 12-month return excluding the last month (12-1 momentum). Before training, you winsorise at and cross-sectionally standardise. A colleague suggests also using the full 12-month return (including the last month). Why is the 12-1 formulation standard, and what is the danger of using the 12-month return including the last month?
6. The Fundamental Law of Active Management (Grinold 1989) states that the annualised information ratio satisfies , where is the number of independent bets per year. A strategy has IC = 0.05 and trades monthly across 200 stocks (). What is the theoretical IR, and why might the realised IR be much lower?