Dimitrios Zaikis, Nikolaos Stylianou, and Ioannis Vlahavas. “PIMA: Parameter-Shared Intelligent Media Analytics Framework for Low Resource Languages.” In: Applied Sciences 13.5 (Mar. 2023). IF: 2.7, p. 3265. issn: 2076-3417. doi: 10.3390/app13053265. url: https://www.mdpi.com/2076-3417/13/5/3265.

Author(s): Dimitrios Zaikis, Nikolaos Stylianou, and Ioannis Vlahavas

Keywords: natural language processing; media analysis; low resource languages; language model; domain adaption

Tags:

Abstract: Media analysis (MA) is an evolving area of research in the field of text mining and an important research area for intelligent media analytics. The fundamental purpose of MA is to obtain valuable insights that help to improve many different areas of business, and ultimately customer experience, through the computational treatment of opinions, sentiments, and subjectivity on mostly highly subjective text types. These texts can come from social media, the internet, and news articles with clearly defined and unique targets. Additionally, MA-related fields include emotion, irony, and hate speech detection, which are usually tackled independently from one another without leveraging the contextual similarity between them, mainly attributed to the lack of annotated datasets. In this paper, we present a unified framework to the complete intelligent media analysis, where we propose a shared parameter layer architecture with a joint learning approach that takes advantage of each separate task for the classification of sentiments, emotions, irony, and hate speech in texts. The proposed approach was evaluated on Greek expert-annotated texts from social media posts, news articles, and internet articles such as blog posts and opinion pieces. The results show that this joint classification approach improves the classification effectiveness of each task in terms of the micro-averaged F1-score.