I am a beginner in data analysis project and would like to know how to identify outliers in a dataset like this.
I have a “Satisfaction” column which refers to the overall experience reported by participants. However, when I calculated the averages from other variables, they didn’t align with the “Satisfaction” column.
example:
data
I did identify outliers for each value in the “Satisfaction” column, but i think it doesn’t justify the “calculated satisfaction” column.
Should I create outliers for each value in the “Satisfaction” column, or for the overall dataset? Alternatively, should I create another column for “Satisfaction” based on calculated values and then identify outliers?
What do you recommend for handling this type of data? Please guide me ‘._.
heyhey asea is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.