Glossary term
Adjusted R-Squared
Adjusted R-squared is a regression statistic that modifies R-squared to account for the number of explanatory variables in the model.
Updated
Read time
What Is Adjusted R-Squared?
Adjusted R-squared is a regression statistic that modifies R-squared to account for the number of explanatory variables in the model. It is designed to reduce the temptation to add variables that make a model look better without actually improving explanatory power.
R-squared usually rises or stays the same when more variables are added. Adjusted R-squared can fall if the added variable does not improve the model enough to justify the added complexity.
Key Takeaways
- Adjusted R-squared modifies R-squared for model complexity.
- It penalizes adding explanatory variables that do not add enough value.
- It is useful when comparing regression models with different numbers of variables.
- A higher adjusted R-squared can indicate a better-fitting model, but it does not prove causation.
- The statistic should be read with residuals, assumptions, and out-of-sample performance.
The Formula
One common adjusted R-squared formula is:
In this expression, R2 is ordinary R-squared, n is the number of observations, and k is the number of explanatory variables. The adjusted value reflects both fit and the number of variables used.
For example, if a two-factor model and a five-factor model have similar explanatory power, adjusted R-squared may favor the simpler model. The statistic is asking whether the extra variables earn their keep.
R-Squared vs. Adjusted R-Squared
Measure | What it shows | What to watch |
|---|---|---|
R-squared | Share of variation explained by the model. | Can rise when variables are added, even weak ones. |
Adjusted R-squared | Model fit after adjusting for variable count. | Better for comparing models with different complexity. |
Residual analysis | What the model fails to explain. | Can reveal bias, nonlinearity, or outliers. |
How Investors Use It
Adjusted R-squared can help compare factor models, return models, valuation models, or economic regressions. For example, an analyst testing whether several factors explain a fund's returns can use adjusted R-squared to see whether additional factors improve the model enough to justify their inclusion.
A high adjusted R-squared means the model explains a large share of the variation in the sample after considering complexity. It does not mean the model is true, stable, predictive, or economically meaningful.
The statistic is most useful as a comparison tool. It helps ask whether a more complicated model earns its complexity, which is different from asking whether the model is good enough to trust for decisions.
Where It Can Mislead
Adjusted R-squared still relies on the quality of the regression. A model can have a strong adjusted R-squared and still suffer from omitted variables, unstable relationships, multicollinearity, outliers, nonlinearity, or overfitting. It also says little about whether the model performs well on new data.
In finance, this is especially important because relationships can look strong in one market regime and weaken in another. Adjusted R-squared should help screen models, not replace judgment about whether the variables actually belong together.
The Bottom Line
Adjusted R-squared improves on basic R-squared by penalizing unnecessary complexity. It is useful for comparing regression models, but it should be read alongside economic logic, residual diagnostics, and out-of-sample testing.