Glossary term
Simple Linear Regression
Simple linear regression is a statistical method for estimating a straight-line relationship between one dependent variable and one independent variable.
Updated
Read time
What Is Simple Linear Regression?
Simple linear regression is a statistical method for estimating a straight-line relationship between one dependent variable and one independent variable. It asks how one outcome tends to change as one explanatory variable changes.
In finance, simple linear regression can be used to estimate market beta, compare a stock's return with an index, test the relationship between sales and advertising, or explore how one variable moves with another.
Key Takeaways
- Simple linear regression uses one explanatory variable.
- It estimates a straight-line relationship between two variables.
- The slope shows the estimated change in the dependent variable for a one-unit change in the independent variable.
- Residuals show what the line does not explain.
- A regression relationship does not automatically prove causation.
The Basic Equation
A simple linear regression is commonly written as:
In this expression, y is the dependent variable, x is the independent variable, α is the intercept, β is the slope, and ε is the error term or residual component.
For example, a simple regression can compare a stock's monthly returns with market-index returns. If the slope is 1.2, the stock has tended to move about 1.2% for each 1% market move in the sample, before considering the residual.
What the Parts Mean
Part | Meaning | Financial example |
|---|---|---|
Dependent variable | The outcome being explained. | A stock's return. |
Independent variable | The variable used to explain the outcome. | A market index return. |
Slope | Estimated sensitivity. | Market beta in a simple return regression. |
Residual | Unexplained difference. | Company-specific return not explained by the market. |
How Investors Use It
Simple linear regression is often the first step in understanding relationships. If a fund's returns are regressed against a benchmark, the slope can show market sensitivity and the residuals can show what the benchmark does not explain.
The method is useful because it is simple and interpretable. It can also be too simple. Many financial outcomes are affected by multiple drivers, nonlinear relationships, changing regimes, and outliers. A straight line may summarize the sample without capturing the real economic process.
A simple regression is strongest when the relationship is easy to explain and the data pattern roughly supports a straight line. If the scatterplot bends, fans out, or contains influential outliers, the simple model may be telling an overly neat story.
What to Watch
Important checks include the scatterplot, residual pattern, sample period, outliers, R-squared, and whether the relationship makes economic sense. A high R-squared does not prove causation. A low R-squared does not mean the relationship is irrelevant if the variable is only one part of a broader process.
The model is best used as a clear first lens. If one variable cannot plausibly explain the outcome on its own, simple regression can still be useful as a baseline before moving to a richer model.
The Bottom Line
Simple linear regression estimates a straight-line relationship between two variables. It is useful for interpreting sensitivity and fit, but it should be paired with residual analysis, economic judgment, and caution about causation.