Accepted Papers

Accepted papers will be presented online and in-person. WiNLP provided financial support to authors from around the world who chose to attend EMNLP in-person. Note that some papers for authors who wished to remain anonymous or not publicly publish a camera-ready have been omitted.

No. Title Author
3 Lexical methods for bias exploration from a Latin American perspective Luciana Benotti, Laura Alonso Alemany and Lucía Gonzalez
8 Developing Language Technology and NLP tools for endangered languages: Torwali Naeem Uddin Hadi 
9 Low Resourced Multilingual Neural Machine Translation for Ometo-English Michael Melese Woldeyohannis, Atnafu Lambebo Tonja and Mesay Gemeda Yigezu
12 Improving neural machine translation for low-resource languages using related language resources Atnafu Lambebo Tonja 
13 Transformer Based Amharic Headline Generation using Sub-word2Vec Representation Mahlet Taye, Yaregal Assabie and Abebaw Eshetu
14 DistillEmb: Distilling word embeddings via contrastive learning Amanuel N. Mersha and Stephen Wu 
15 Perturbation-based Active Learning for Question Answering Fan Luo and Mihai Surdeanu 
16 Short Comparative Analysis on Pretrained BART and RoBERTa in Detecting Hate Speech on YouTube and Reddit Platforms Dinuja Ratnayake Perera and Nisansa de Silva
17 A Primer on Synthesis and Evaluation of a Domain-specific Large Data Set for Dungeons & Dragons Akila Peiris and Nisansa de Silva 
22 Amharic Fake News Detection on Social Media Using Feature Fusion Menbere Worku and Michael Melese Woldeyohannis 
28 MBTI Personality Prediction Approach on Persian Twitter Samin Fatehi, Zahra Anvarian, Yasmin Madani, MohammadJavad Mehditabar and Sauleh Eetemadi
29 Afaan Oromo Hate Speech Detection and Classification on Social Media Teshome Mulugeta Ababu and Michael Melese Woldeyohannis
30 Amharic-Kistanigna Bi-directional Machine Translation using Deep Learning Mengistu Negia and Rahel Mekonen Tamiru
34 Exploiting Available Resources for the Training of Manglish Language Models Meisin Lee and Lay-Ki Soon
35 Towards a general purpose machine translation system for Sranantongo Just Zwennicker and David Stap
36 Transfer Learning and Word Sense Disambiguation for Low-resource Language, the Case of Amharic Neima Ahmed and Million Meshesha
38 An Annotated Social Media Health Corpus for Bengali Salim Sazzed
40 Boosting the Performance of Gender Subspace in Domain-Specific Gender Bias Analysis Yanqin Tan, Cassandra L. Jacobs, Mimi Zhang, Marvin Thielk and Yi Chu
41 Contextual Embeddings Can Distinguish Homonymy from Polysemy in a Human-Like Way Kyra Wilson and Alec Marantz
42 The BERT Walked Down the Garden Path Assigned Semantic Roles Tovah Irwin, Kyra Wilson and Alec Marantz
43 The Effect of Normalization for Bi-directional Amharic-English Neural Machine Translation Tadesse Destaw Belay, Atnafu Lambebo Tonja, Olga Kolesnikova, Seid Muhie Yimam and Abinew Ali Ali Ayele
46 Automatic Speech Recognition using Self-Supervised Learning Approach RAHEL MEKONEN TAMIRU and Rosa Tsegaye Aga
47 Prosody Based Automatic Speech segmentation for Amharic RAHEL MEKONEN TAMIRU and Hana Mekonen Tamiru
48 Before and beyond MeToo: Measuring changes in power and agency within sexual abuse news stories over time Gyulim Kang and Hope Schroeder
53 Challenges of Amharic Hate Speech Data Annotation Using Yandex Toloka Crowdsourcing Platform Abinew Ali Ayele, Tadesse Destaw Belay, Seid Muhie Yimam, Skadi Dinter, Tesfa Tegegne Asfaw and Chris Biemann
54 Question Answering Classification for Amharic Social Media Community Based Questions Tadesse Destaw Belay, Seid Muhie Yimam, Abinew Ayele and Chris Biemann
57 On Genitalia, Reproduction and Pleasure: Biases in the Representation of Sexes Sadhi Vornberger and Peter Bourgonje
58 An Unsupervised Learning Approach for Categorising Research Proposals and Recommending Papers Annie En-Shiun Lee, Mariia Ponomarenko and Peiyuan Zhou
59 Detecting Depression on Twitter with a Time-Aware Multimodal Transformer Ana-Maria Bucur, Adrian Cosma, Paolo Rosso and Liviu P. Dinu
61 Detecting Adverse Drug Events from social media: A brief literature review Imane Guellil, Nidhaleddine Chenni, Yousra Berrachedi, Massinissa Abboud, Jinge Wu, Beatrice Alex and Honghan Wu
62 Data Augmentation to Address the Out-of-Vocabulary Problem in Low-Resource Sinhala-English Neural Machine Translation Aloka Fernando and Surangika Ranathunga
63 Evaluating Gender Bias in Pre-trained Indic Language Models Neeraja Kirtane, V Manushree and Aditya Kane
65 Leveraging Bias in Pre-trained Word Embeddings for Unsupervised Microaggression Detection Tolulope Ogunremi, Valerio Basile and Tommaso Caselli
66 Tackling Gender Microaggressions in Hindi Text Vishakha Agrawal
68 ParsVQA-Caps: A Benchmark for Visual Question Answering and Image Captioning in Persian Shaghayegh Mobasher, Ghazal Zamaninejad, Maryam Hashemi, Melika Nobakhtian and Sauleh Eetemadi
69 HERDPhobia: A Dataset for Hate Speech Detection against Fulani Herdsmen in Nigeria Saminu Mohammad Aliyu, Gregory Maksha Wajiga, Muhammad Murtala, Shamsuddeen Muhammad, Idris Abdulmumin and Ibrahim Said Ahmad
70 Experiments on Generalizability of BERTopic on Short Text Muriël de Groot, Mohammad Aliannejadi and Marcel R. Haas
71 Domain-Specific Lexicon-Based Sentiment Analysis using Contextual Shifter Patterns Shamsuddeen Muhammad and Idris Abdulmumin
72 HMIST: Hierarchical Multilingual Isometric Speech Translation using Multi-Task Learning Framework for Automatic Dubbing Nidhir Bhavsar, Aakash Bhatnagar and Muskaan Singh
73 Generate Answer to Visual Questions with Pre-trained Vision-and-Language Embeddings Hadi Sheikhi, Maryam Hashemi and Sauleh Eetemadi