1 : Journey of a Sexist Tweet
Sexism is a social issue that is widespread on different social media platforms such as twitter
Here, we aim to understand how sexism is responded by people on social media platforms by using a mixture of social media scraping and behavioral experiments (interviews/surveys)
Hypothesis
- Sexist tweets garner more attention (in terms of number of likes and retweets) compared to non-sexist tweets.
- Replies to sexist tweets are significantly more sexist than replies to non-sexist tweets.
RESEARCH QUESTIONS
- Observing how much attention does a sexism tweet garner on social media platforms, how people react to it in general.
- What kind of replies do people give to a sexist tweet, what proportion of user support and what proportion stand up against it. What are the general public’s opinions regarding the people who post such sexist content on social media?
- Understanding difference in reactions and opinion of people regarding their thoughts on replies to such sexist tweet in two cases: On social media When asked to express thoughts in a anonymous survey
- Understanding difference in opinion of people on basis on age and gender
Public Opinion
EXPERIMENTATIONS
Data Collection
We have scraped Twitter (100000 tweets) for sexist tweets by using the keywords such as ‘sexism’ etc.
For each tweet, we got all the replies to the tweet. Each tweet has its public metrics (retweet_count, reply_count, like_count, quote_count), author, created_at, and conversation_id )
Methodology
Search for tweets based on particular tags. Employ a sexism detection model to find if a given tweet's text is sexist.
Analyze the text of the replies to a tweet, and number of replies to the tweet, based on the level of sexism
Preprocess the tweets and apply the model on each tweet and reply to get a sexism score/level from 0 to 1.
Perform analysis on the tweets and replies by splitting the dataset based on the sexism score
GENDER
MAXIMUM DEGREE LEVEL
CURRENT OCCUPATION
RESULTS
- Group 0 represents non sexist tweets and group 1 represents sexist tweets. Graph shows that the number of likes on sexist tweets were less than that of non-sexist tweets which is obvious. However, on performing the permutation testing on this data, we found out that there was not much significant difference between the two groups.
- Above graph shows the sexism level along with the likes obtained by tweets under the respective category. This sexism level was predicted by our AI model. The pearson correlation is -0.66 which is moderately good but the spearman correlation value is -0.26 which is quite low.
- Group 0 represents non sexist tweets and group 1 represents sexist tweets. Graph shows that the number of likes on sexist tweets were less than that of non-sexist tweets which is obvious. However, on performing the permutation testing on this data, we found out that there was not much significant difference between the two groups.
- Above graph shows the sexism level along with the likes obtained by tweets under the respective category. This sexism level was predicted by our AI model. The pearson correlation is -0.66 which is moderately good but the spearman correlation value is -0.26 which is quite low.
- Here we observe the number of times a particular tweet was retweeted. This surprisingly is almost similar for both sexist and non-sexist tweets.
- For the sexism level vs like obtained graph, the pearson and spearman correlation values are -0.4 and -0.26 which are both quite low.
- Here we observe the number of times a particular tweet was retweeted. This surprisingly is almost similar for both sexist and non-sexist tweets.
- For the sexism level vs like obtained graph, the pearson and spearman correlation values are -0.4 and -0.26 which are both quite low.
- In this case, we analyze how many users reply to both the categories of tweets. We observe that the sexist tweets have somewhat more replies than the non-sexist tweets.
- For the sexism level vs like obtained graph, the pearson and spearman correlation values are 0.68 and 0.64 which are both moderately good in this scenario.
- In this case, we analyze how many users reply to both the categories of tweets. We observe that the sexist tweets have somewhat more replies than the non-sexist tweets.
- For the sexism level vs like obtained graph, the pearson and spearman correlation values are 0.68 and 0.64 which are both moderately good in this scenario.
- The average sexist replies of the sexist tweets are significantly high than the non-sexist replies.
- For the sexism level vs like obtained graph, the pearson and spearman correlation values are 0.93 and 0.98 which are both very good in this scenario.
- The average sexist replies of the sexist tweets are significantly high than the non-sexist replies.
- For the sexism level vs like obtained graph, the pearson and spearman correlation values are 0.93 and 0.98 which are both very good in this scenario.
DISCUSSIONS
We conducted a survey among 50 students from IIITH campus. Following are our observations -
- We asked them to fill a form which consisted of a sexist tweet and asked their opinion about the sexist level of tweet on the scale of 1 to 5 .
- They were also asked to fill an appropriate reply to that tweet.
- They also must categorize each tweet in one of the following categories:
- Derogatory
- Non-derogatory
- Sexist
- Non-sexist
- Counter-sexist.
- The survey results were transcribed and summarized for qualitative analysis.
ANALYSIS
PROPOSALS TO DEAL WITH IT 
The good thing was most people did not find the tweet appropriate and were offended by the same but their thoughts were on the extreme end. Most people demanded banning them from the media. After giving them options, some of them reach the middle ground. They agreed to shadow ban them from social media. There were comments like -Sorry to burst your bubble but these duties are not just for women but also for men too. Morning! PS: Its 21st century and not the eighties.. Well you shouldn't even have said morning men cause they would have been sleeping.There were also some neutral comments It should be treated as a joke This isn’t the way of describing a gender Household responsibilities are for both men and women equally.
66% people think of this as a sexist tweet and 4% find it neutral. Almost 80% of the people were students and out of which 50% post-graduated.

CONCLUSIONS
- We observe the number of times a particular tweet was retweeted was surprisingly similar for both sexist and non-sexist tweets.
- We observe that the sexist tweets have somewhat more replies than the non-sexist tweets.
- We observe the number of times a particular tweet was retweeted was surprisingly similar for both sexist and non-sexist tweets.
- We observe that the sexist tweets have somewhat more replies than the non-sexist tweets.
FUTURE WORK
- We can extend our survey from interviewing 50 people to some thousands of people whether offline or with some forms.
- We are just running our models based on some sexist keywords, but there might be some tweets which don't have those keywords and still can be highly sexist. We can include those in any further iterations.
- We can also include image and video tweets in these experiments which will make the experiments more inclusive.
- We can extend our survey from interviewing 50 people to some thousands of people whether offline or with some forms.
- We are just running our models based on some sexist keywords, but there might be some tweets which don't have those keywords and still can be highly sexist. We can include those in any further iterations.
- We can also include image and video tweets in these experiments which will make the experiments more inclusive.










Comments
Post a Comment