Team 8 : Detecting the Approval Rating of Political Actors Tweets
Computational Social Science
Detecting the
Approval Rating of
Political Actors Tweets
September 4, 20XX
Introduction
Surveys of political opinions and, more specifically, the approval of political players have the potential to be supplemented by social media platforms like Twitter. However, fresh questions about the validity and reliability of social-media-based estimations have arisen. Measuring public opinion accurately and without systematic errors is as vital for a functioning democracy as it is for scholars to understand society. Survey methodologists have developed techniques over several decades to precisely quantify public opinion. The major goal is to analyze the tweets, posts on famous political actors on twitter,facebook or any social media platform and rank them based on the approval rating.
Methods & Findings
We created a google form to conduct the survey to find out the opinion of people on most acknowledged political actors in our domain.
The main aim of this data collection was to find the 8-10 political actors who are liked by the majority of people.
Then analyze the tweets and work on their approval ratings.
Survey Snaps
Tweet Analysis
For this we used elevated access to the twitter api, we accumulated the tweets of political actors,as those accumulated tweets were in various indian languages so we used google translator API for the purpose
Analysis Methods
In our work we focus on the specific case of stance detection, closely related to TS, which is the task of inferring whether a document is written in favor or against the given target.
We use Linear SVM-based method (SVM-SD) trained on a stance-annotated tweet corpus with character and word n-grams.
We use a large collection of tweets containing keywords relevant to the political actor, weakly labeled based on the presence of certain keywords or hashtags.
SVM-SD is fine-tuned through five-fold cross validation and grid search
The model is trained with SemEval 2016 Stance Dataset.
The hyperparameter C was tuned to 100 and the training time was 1053 seconds.
Analysis Results
Relationship between increasing training data and performance
(Mean Macro F1)
Future Work
We would like to analyze the geographic diversity in the approval ratings.
We would also try to get better training datasets and train on different models for better results.
We will extrapolate this code to access posts from other social media websites as well.
Comments
Post a Comment