Exploring the political pulse on Telegram
by
How does offline public perception compare with discussion on telegram groups, regarding the government's handling of specific issues in the country?
Overall, this research question requires a mixed-methods approach, combining data collected from online sources (Telegram groups) with data collected from offline sources (surveys). The analysis of both datasets helped determine the sentiment towards specific political topics and compare online and offline opinions.
Motivation
Unprecedentedly high ratings of a democratic leader! But what does the public really feel about the situation in the country? Are people freer with opinions when anonymous? These are the questions we wanted to find answers to. We wanted to look at what the public and political perception on and off social media was towards key issues of the country.
Why choose telegram as the social media platform to analyze? Telegram is a popular messaging app and a well-liked messaging service. It has over 70.48 million users in India, making it a potential source of valuable data. Telegram being an end-to-end encrypted service, is known for its security and privacy features. Political organizations and activists now frequently use Telegram to organize and mobilize their supporters.
We joined relevant (public) groups to collect data. It allows export of all historical messages in the group, making it suitable for study in a project of limited time frame and scope.
Analysis and results
Overview: The goal of the project was to conduct a comprehensive analysis of politically affiliated Telegram groups, with the aim of identifying the topics being discussed and extracting sentiment towards those topics. In order to compare attitudes between online and offline, the project also included the release of a survey to elicit offline opinions on the same subjects. The analysis of the Telegram groups involved collecting and clustering text message data. The subjects and sentiment contained in the text data have been identified and classified using machine learning and natural language processing techniques.
Data collection: Exported data from 20 official and unofficial telegram groups over time period of 2017-2023. Parties include BJP, AAP and Congress. Languages are English and Hindi.
Topic Modeling: Used BERTopic to cluster the aggregate message data and used multilingual sentence embeddings (paraphrase-multilingual-MiniLM-L12-v2) to process Hindi messages.
About BERTopic: it is a topic modeling technique that leverages BERT embeddings and a class-based TF-IDF to create dense clusters allowing for easily interpretable topics whilst keeping important words in the topic descriptions.
Main topics found were regarding repealed farmers' laws, Coronavirus situation and response to China.
=> Temporal topic modeling
2018: bank_delhi; evms; petrol oil price
=> Sentiment analysis of aggregate data:
Used zero-shot-classifiier to classify with respect to labels of positive, negative and neutral. The model is Multilingual mDeBERTa-v3-base-mnli-xnli. This multilingual model can perform natural language inference (NLI) on 100 languages and is therefore also suitable for multilingual zero-shot classification.


Comments
Post a Comment