I’m working with a relatively small data set of several hundred social media posts. Each post has key engagement metrics and up to 10 content “tags” that describe its image. We built the tags using the Google Vision API plus a manual review. Here is an example of what we’re working with: http://imgur.com/vcZkWi9
What I’m trying to do: I would like to use a statistically valid method to identify which tags, alone or in combination, tend to perform best across the data set. It’s easy enough to take an individual tag and calculate the mean KPI of the posts that carry it, but how should I evaluate combinations of tags that yield high performance? It needn’t be all of a post’s tags in combination; it could be that some 3 of the 10 perform best together.
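To make the question concrete, here is a minimal sketch of the "mean KPI per tag" calculation extended to small tag combinations. The posts, tag names, KPI values and the `combo_means` helper are all made up for illustration; the real data has several hundred posts and up to 10 tags each.

```python
from collections import defaultdict
from itertools import combinations
from statistics import mean

# Hypothetical toy data: (set of tags on the post, KPI value).
posts = [
    ({"dog", "beach"}, 120.0),
    ({"dog", "text"}, 80.0),
    ({"beach", "sunset"}, 100.0),
    ({"dog", "beach", "sunset"}, 140.0),
    ({"text"}, 40.0),
    ({"dog", "beach", "text"}, 110.0),
]

def combo_means(posts, max_size=3, min_posts=2):
    """Mean KPI for every tag combination of up to max_size tags.

    A post counts toward a combination if it carries all of its tags.
    Combinations seen on fewer than min_posts posts are dropped, because
    a mean over one or two posts says very little.
    """
    kpis = defaultdict(list)
    for tags, kpi in posts:
        for size in range(1, max_size + 1):
            # sorted() gives each combination one canonical tuple key
            for combo in combinations(sorted(tags), size):
                kpis[combo].append(kpi)
    results = [
        (combo, mean(values), len(values))
        for combo, values in kpis.items()
        if len(values) >= min_posts
    ]
    # Highest mean KPI first
    return sorted(results, key=lambda r: r[1], reverse=True)

ranked = combo_means(posts)
for combo, avg, n in ranked:
    print(f"{' + '.join(combo):<20} mean KPI = {avg:7.2f}  (n = {n})")
```

This only ranks raw means. With 10 tags there are many possible combinations and only a few hundred posts, so some combinations will look strong by chance alone, which is why a plain ranking like this doesn't feel statistically valid on its own.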
What approach would you recommend for understanding which tags are most closely associated with the highest mean KPI score? I’ve been debating whether multiple regression is the best fit, and I’m looking for some insight on this.
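For reference, this is roughly what the multiple regression idea would look like: each tag becomes a 0/1 indicator column and the KPI is regressed on those columns by ordinary least squares. This is a sketch under stated assumptions, not a recommendation. The tag names, KPI values and the `ols` helper are hypothetical, and the toy KPIs are constructed to be exactly additive (10 + 5 for dog + 3 for beach - 2 for text) so that the fit recovers those numbers. It is written in plain Python only to stay self-contained; a real analysis would more likely use a statistics library that also reports standard errors.

```python
def ols(X, y):
    """Ordinary least squares via the normal equations (X'X) b = X'y,
    solved by Gauss-Jordan elimination with partial pivoting."""
    k = len(X[0])
    # Augmented matrix [X'X | X'y]
    A = [
        [sum(row[i] * row[j] for row in X) for j in range(k)]
        + [sum(row[i] * yi for row, yi in zip(X, y))]
        for i in range(k)
    ]
    for col in range(k):
        # Swap in the row with the largest pivot for numerical stability
        pivot = max(range(col, k), key=lambda r: abs(A[r][col]))
        A[col], A[pivot] = A[pivot], A[col]
        if abs(A[col][col]) < 1e-12:
            raise ValueError("design matrix is rank deficient")
        # Eliminate this column from every other row
        for r in range(k):
            if r != col:
                factor = A[r][col] / A[col][col]
                A[r] = [a - factor * b for a, b in zip(A[r], A[col])]
    return [A[i][k] / A[i][i] for i in range(k)]

# Hypothetical toy data: (set of tags on the post, KPI value).
tags = ["dog", "beach", "text"]
posts = [
    (set(), 10.0),
    ({"dog"}, 15.0),
    ({"beach"}, 13.0),
    ({"text"}, 8.0),
    ({"dog", "beach"}, 18.0),
    ({"dog", "text"}, 13.0),
    ({"beach", "text"}, 11.0),
    ({"dog", "beach", "text"}, 16.0),
]

# Design matrix: intercept column plus one 0/1 indicator per tag
X = [[1.0] + [1.0 if t in post_tags else 0.0 for t in tags]
     for post_tags, _ in posts]
y = [kpi for _, kpi in posts]

effects = dict(zip(["intercept"] + tags, ols(X, y)))
for name, coef in effects.items():
    print(f"{name:<10} {coef:+.2f}")
```

Each coefficient is the estimated change in KPI associated with a tag while holding the other tags fixed. Combinations could in principle be handled by adding product columns (for example dog × beach) as interaction terms, though with only a few hundred posts the number of interactions that can be estimated reliably is limited.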
I’ve had a tough time finding any other resources online so any help would be greatly appreciated!