In the context of this exercise, how can we tell if a value best represents the majority of the data?
One possible way to do this is as follows:
First remove the outliers from the dataset.
After removing the outliers, the mean can then be calculated with the remaining values for a more accurate value representing the majority of the data.
Then, we can check whether either value is closer to this mean, and whichever is closer can be thought to better represent the majority of the data.