Loading Now

Summary of Persian Slang Text Conversion to Formal and Deep Learning Of Persian Short Texts on Social Media For Sentiment Classification, by Mohsen Khazeni et al.


Persian Slang Text Conversion to Formal and Deep Learning of Persian Short Texts on Social Media for Sentiment Classification

by Mohsen Khazeni, Mohammad Heydari, Amir Albadvi

First submitted to arxiv on: 9 Mar 2024

Categories

  • Main: Computation and Language (cs.CL)
  • Secondary: Machine Learning (cs.LG)

     Abstract of paper      PDF of paper


GrooveSquid.com Paper Summaries

GrooveSquid.com’s goal is to make artificial intelligence research accessible by summarizing AI papers in simpler terms. Each summary below covers the same AI paper, written at different levels of difficulty. The medium difficulty and low difficulty versions are original summaries written by GrooveSquid.com, while the high difficulty version is the paper’s original abstract. Feel free to learn from the version that suits you best!

Summary difficulty Written by Summary
High Paper authors High Difficulty Summary
Read the original abstract here
Medium GrooveSquid.com (original content) Medium Difficulty Summary
The researchers address a crucial gap in the field of sentiment analysis for Persian conversational texts. They develop PSC (Persian Slang Converter), a tool that formalizes informal texts, enabling better machine learning-based sentiment analysis. By leveraging recent deep learning methods and integrating PSC, they improve the accuracy of sentiment classification for short Persian language texts. The team utilizes over 10 million unlabeled texts from various sources and about 10 million news articles to train unsupervised models. They also employ supervised data from Instagram comments (60,000 labeled texts) to fine-tune their emotion classification model. The formalized text conversion yields an impressive 57% accuracy rate. Furthermore, the FastText model and deep LSTM network achieve an accuracy of 81.91% on test data.
Low GrooveSquid.com (original content) Low Difficulty Summary
The researchers created a tool called PSC that helps machines understand Persian texts better. They used old and new ways of learning to make a program that can recognize if someone is happy, sad, or neutral when talking about something. To train the program, they looked at over 10 million texts from different places like social media and movies. They also used labeled texts from Instagram comments to help the program learn. The tool works well, converting most of the informal text into formal language. This helps machines understand Persian texts more accurately.

Keywords

* Artificial intelligence  * Classification  * Deep learning  * Fasttext  * Lstm  * Machine learning  * Supervised  * Unsupervised