Update time:
- 02/03/2020 orginal post
- 02/18/2020 update linear regression knowledge
Conceptual Questions
Prediction VS Forecasting
Forecasting would be a subset of prediction. Any time you predict into the future it is a forecast.
All forecasts are predictions, but not all predictions are forecasts, as when you would use regression to explain the relationship between two variables.
Correlation vs Casuation
Correlation does not imply causation. Causation explicitly applies to cases where action A causes outcome B.
Further read: causation-correlation
Linear Regression
What’s the assumption of linear regression
-
Regression model is linear in parameters
-
The mean of residuals is zero
- No auto-correlation ( previous data points)
-
No perfect multicollinearity
-
Normality of residuals
QQ-plot
What is Covariance
Covariance is a measure of the relationship between two random variables. The metric evaluates the variance between two variables.
- Positive covariance
- Negative covariance
Covariance vs Correlation
Covariance measures the total variation of two random variables from their expected values. Using covariance, we can only gauge the direction of the relationship (whether the variables tend to move in tandem or show an inverse relationship). However, it does not indicate the strength of the relationship, nor the dependency between the variables.
On the other hand, correlation measures the strength of the relationship between variables. Correlation is the scaled measure of covariance. It is dimensionless. In other words, the correlation coefficient is always a pure value and not measured in any units.
The relationship between the two concepts can be expressed using the formula below:
To Be Continued …
I’ll keep updating this post along the time