SST, or Total Sum of Squares, is a measure of the total variability in a dataset. It’s commonly used in statistical analysis.
Here’s how to calculate SST in Excel:
Organize Your Data
Make sure your data is organized in an Excel worksheet. For example, you might have your data in cells A1 to A10.
Calculate the Mean
This formula calculates the mean of the data in cells A1 to A10. Replace this range with your actual data range if necessary.
Calculate the Squared Differences
In an adjacent column, calculate the squared differences between each data point and the mean. Use the following formula (assuming your mean is in cell B1):
=(A1 – B1)^2
Drag this formula down to apply it to all data points. This calculates the squared difference for each data point.
Sum the Squared Differences
Sum up all the squared differences to calculate SST. Use the SUM function:
Replace the range C1:C10 with the actual range where you calculated the squared differences.
The result of the last formula is your Total Sum of Squares (SST) for the given dataset. It represents the total variability in your data.
SST is used in model assessment and selection. When comparing different models (e.g., linear regression vs. polynomial regression), you can evaluate how much each model reduces SST. Lowering SST indicates a better fit to the data, but you should also consider model complexity and overfitting.