Natural Language Processing


Publications

Title Authors Conference / Journal
2023
Entropy-guided Vocabulary Augmentation of Multilingual Language Models for Low-resource Tasks. Arijit Nag, Bidisha Samanta, Animesh Mukherjee, Niloy Ganguly, Soumen Chakrabarti ACL (Findings) 2023: 8619-8629
Transfer Learning for Low-Resource Multilingual Relation Classification. Arijit Nag, Bidisha Samanta, Animesh Mukherjee, Niloy Ganguly, Soumen Chakrabarti ACM Trans. Asian Low Resour. Lang. Inf. Process. 22(2): 50:1-50:24 (2023)
Systematic Investigation of Strategies Tailored for Low-Resource Settings for Low-Resource Dependency Parsing. Jivnesh Sandhan, Laxmidhar Behera, Pawan Goyal: EACL 2023: 2156-2163
Intent Identification and Entity Extraction for Healthcare Queries in Indic Languages. Ankan Mullick, Ishani Mondal, Sourjyadip Ray, R. Raghav, G. Sai Chaitanya, Pawan Goyal EACL (Findings) 2023: 1825-1836
SanskritShala: A Neural Sanskrit NLP Toolkit with Web-Based Interface for Pedagogical and Annotation Purposes. Jivnesh Sandhan, Anshul Agarwal, Laxmidhar Behera, Tushar Sandhan, Pawan Goyal ACL (demo) 2023: 103-112
Meta-ED: Cross-lingual Event Detection Using Meta-learning for Indian Languages. Aniruddha Roy, Isha Sharma, Sudeshna Sarkar, Pawan Goyal ACM Trans. Asian Low Resour. Lang. Inf. Process. 22(2): 46:1-46:22 (2023)
Fairness for both Readers and Authors: Evaluating Summaries of User Generated Content. Garima Chhikara, Kripabandhu Ghosh, Saptarshi Ghosh, Abhijnan Chakraborty SIGIR 2023: 1996-2000
Nonet at SemEval-2023 Task 6: Methodologies for Legal Evaluation. Shubham Kumar Nigam, Aniket Deroy, Noel Shallum, Ayush Kumar Mishra, Anup Roy, Shubham Kumar Mishra, Arnab Bhattacharya, Saptarshi Ghosh, Kripabandhu Ghosh SemEval@ACL 2023: 1293-1303
How Ready are Pre-trained Abstractive Models and LLMs for Legal Case Judgement Summarization? Aniket Deroy, Kripabandhu Ghosh, Saptarshi Ghosh LegalAIIA@ICAIL 2023: 8-19
DeepRhole: deep learning for rhetorical role labeling of sentences in legal case documents. Paheli Bhattacharya, Shounak Paul, Kripabandhu Ghosh, Saptarshi Ghosh, Adam Wyner Artif. Intell. Law 31(1): 53-90 (2023)
2022
ECTSum: A New Benchmark Dataset For Bullet Point Summarization of Long Earnings Call Transcripts. Rajdeep Mukherjee, Abhinav Bohra, Akash Banerjee, Soumya Sharma, Manjunath Hegde, Afreen Shaikh, Shivani Shrivastava, Koustuv Dasgupta, Niloy Ganguly, Saptarshi Ghosh, Pawan Goyal EMNLP 2022: 10893-10906
Fast Few Shot Self-attentive Semi-supervised Political Inclination Prediction. Souvic Chakraborty, Pawan Goyal, Animesh Mukherjee ICADL 2022: 3-20
Hate Speech and Offensive Language Detection in Bengali. Mithun Das, Somnath Banerjee, Punyajoy Saha, Animesh Mukherjee AACL/IJCNLP (1) 2022: 286-296
HateCheckHIn: Evaluating Hindi Hate Speech Detection Models. Mithun Das, Punyajoy Saha, Binny Mathew, Animesh Mukherjee LREC 2022: 5378-5387
Is This Bug Severe? A Text-Cum-Graph Based Model for Bug Severity Prediction. Mithun Das, Punyajoy Saha, Binny Mathew, Animesh Mukherjee ECML/PKDD (6) 2022: 236-252
A Novel Multi-Task Learning Approach for Context-Sensitive Compound Type Identification in Sanskrit. Jivnesh Sandhan, Ashish Gupta, Hrishikesh Terdalkar, Tushar Sandhan, Suvendu Samanta, Laxmidhar Behera, Pawan Goyal COLING 2022: 4071-4083
Does Meta-learning Help mBERT for Few-shot Question Generation in a Cross-lingual Transfer Setting for Indic Languages? Aniruddha Roy, Rupak Kumar Thakur, Isha Sharma, Ashim Gupta, Amrith Krishna, Sudeshna Sarkar, Pawan Goyal COLING 2022: 4251-4257
TransLIST: A Transformer-Based Linguistically Informed Sanskrit Tokenizer. Jivnesh Sandhan, Rathin Singha, Narein Rao, Suvendu Samanta, Laxmidhar Behera, Pawan Goyal EMNLP (Findings) 2022: 6902-6912
ArgGen: Prompting Text Generation Models for Document-Level Event-Argument Aggregation. Debanjana Kar, Sudeshna Sarkar, Pawan Goyal AACL/IJCNLP (Findings) 2022: 399-404
Legal Case Document Summarization: Extractive and Abstractive Methods and their Evaluation. Abhay Shukla, Paheli Bhattacharya, Soham Poddar, Rajdeep Mukherjee, Kripabandhu Ghosh, Pawan Goyal, Saptarshi Ghosh AACL/IJCNLP (1) 2022: 1048-1064
Linguistically Informed Post-processing for ASR Error correction in Sanskrit. Rishabh Kumar, Devaraja Adiga, Rishav Ranjan, Amrith Krishna, Ganesh Ramakrishnan, Pawan Goyal, Preethi Jyothi INTERSPEECH 2022: 2293-2297
Using Sentence-level Classification Helps Entity Extraction from Material Science Literature. Ankan Mullick, Shubhraneel Pal, Tapas Nayak, Seung-Cheol Lee, Satadeep Bhattacharjee, Pawan Goyal LREC 2022: 4540-4545
Effectiveness of Data Augmentation to Identify Relevant Reviews for Product Question Answering. Kalyani Roy, Avani Goel, Pawan Goyal WWW (Companion Volume) 2022: 298-301
A sequence labeling model for catchphrase identification from legal case documents. Arpan Mandal, Kripabandhu Ghosh, Saptarshi Ghosh, Sekhar Mandal Artif. Intell. Law 30(3): 325-358 (2022)
Legal case document similarity: You need both network and text. Paheli Bhattacharya, Kripabandhu Ghosh, Arindam Pal, Saptarshi Ghosh Inf. Process. Manag. 59(6): 103069 (2022)
A Framework to Generate High-Quality Datapoints for Multiple Novel Intent Detection. Ankan Mullick, Sukannya Purkayastha, Pawan Goyal, Niloy Ganguly NAACL-HLT (Findings) 2022: 282-292
CAVES: A Dataset to facilitate Explainable Classification and Summarization of Concerns towards COVID Vaccines. Soham Poddar, Azlaan Mustafa Samad, Rajdeep Mukherjee, Niloy Ganguly, Saptarshi Ghosh SIGIR 2022: 3154-3164
MTLTS: A Multi-Task Framework To Obtain Trustworthy Summaries From Crisis-Related Microblogs. Rajdeep Mukherjee, Uppada Vishnu, Hari Chandana Peruri, Sourangshu Bhattacharya, Koustav Rudra, Pawan Goyal, Niloy Ganguly WSDM 2022: 755-763
Data Bootstrapping Approaches to Improve Low Resource Abusive Language Detection for Indic Languages. Mithun Das, Somnath Banerjee, Animesh Mukherjee HT 2022: 32-42
CounterGeDi: A Controllable Approach to Generate Polite, Detoxified and Emotional Counterspeech. Punyajoy Saha, Kanishk Singh, Adarsh Kumar, Binny Mathew, Animesh Mukherjee IJCAI 2022: 5157-5163
CRUSH: Contextually Regularized and User anchored Self-supervised Hate speech Detection. Souvic Chakraborty, Parag Dutta, Sumegh Roychowdhury, Animesh Mukherjee NAACL-HLT (Findings) 2022: 1874-1886
Augmenting Video Lectures: Identifying Off-topic Concepts and Linking to Relevant Video Lecture Segments. Krishnendu Ghosh, Sharmila Reddy Nangi, Yashasvi Kanchugantla, Pavan Gopal Rayapati, Plaban Kumar Bhowmick, Pawan Goyal Int. J. Artif. Intell. Educ. 32(2): 382-412 (2022)
Network embeddings from distributional thesauri for improving static word representations. Abhik Jana, Siddhant Haldar, Pawan Goyal Expert Syst. Appl. 187: 115868 (2022)
LeSICiN: A Heterogeneous Graph-Based Approach for Automatic Legal Statute Identification from Indian Legal Documents. Shounak Paul, Pawan Goyal, Saptarshi Ghosh AAAI 2022: 11139-11146
Representation Learning for Conversational Data using Discourse Mutual Information Maximization. Bishal Santra, Sumegh Roychowdhury, Aishik Mandal, Vasu Gurram, Atharva Naik, Manish Gupta, Pawan Goyal NAACL-HLT 2022: 1718-1734
AR-BERT: Aspect-relation enhanced Aspect-level Sentiment Classification with Multi-modal Explanations. Sk Mainul Islam, Sourangshu Bhattacharya WWW 2022: 987-998
2021
A Hierarchical VAE for Calibrating Attributes while Generating Text using Normalizing Flow. Bidisha Samanta, Mohit Agrawal, Niloy Ganguly ACL/IJCNLP (1) 2021: 2405-2415
Knowledge-Aware Neural Networks for Medical Forum Question Classification. Soumyadeep Roy, Sudip Chakraborty, Aishik Mandal, Gunjan Balde, Prakhar Sharma, Anandhavelu Natarajan, Megha Khosla, Shamik Sural, Niloy Ganguly CIKM 2021: 3398-3402
A Data Bootstrapping Recipe for Low-Resource Multilingual Relation Classification. Arijit Nag, Bidisha Samanta, Animesh Mukherjee, Niloy Ganguly, Soumen Chakrabarti CoNLL 2021: 575-587
Question Answering over Electronic Devices: A New Benchmark Dataset and a Multi-Task Learning based QA Framework. Abhilash Nandy, Soumya Sharma, Shubham Maddhashiya, Kapil Sachdeva, Pawan Goyal, Niloy Ganguly EMNLP (Findings) 2021: 4600-4609
tWT-WT: A Dataset to Assert the Role of Target Entities for Detecting Stance of Tweets. Ayush Kaushal, Avirup Saha, Niloy Ganguly NAACL-HLT 2021: 3879-3889
Understanding the Role of Affect Dimensions in Detecting Emotions from Tweets: A Multi-task Approach. Rajdeep Mukherjee, Atharva Naik, Sriyash Poddar, Soham Dasgupta, Niloy Ganguly SIGIR 2021: 2303-2307
HateXplain: A Benchmark Dataset for Explainable Hate Speech Detection. Binny Mathew, Punyajoy Saha, Seid Muhie Yimam, Chris Biemann, Pawan Goyal, Animesh Mukherjee AAAI 2021: 14867-14875
Debiasing Multilingual Word Embeddings: A Case Study of Three Indian Languages. Srijan Bansal, Vishal Garimella, Ayush Suhane, Animesh Mukherjee HT 2021: 27-34
Deep Neural Approaches to Relation Triplets Extraction: a Comprehensive Survey. Tapas Nayak, Navonil Majumder, Pawan Goyal, Soujanya Poria Cogn. Comput. 13(5): 1215-1232 (2021)
Automatic Speech Recognition in Sanskrit: A New Speech Corpus and Modelling Insights. Devaraja Adiga, Rishabh Kumar, Amrith Krishna, Preethi Jyothi, Ganesh Ramakrishnan, Pawan Goyal ACL/IJCNLP (Findings) 2021: 5039-5050
A Little Pretraining Goes a Long Way: A Case Study on Dependency Parsing Task for Low-resource Morphologically Rich Languages. Jivnesh Sandhan, Amrith Krishna, Ashim Gupta, Laxmidhar Behera, Pawan Goyal EACL (Student Research Workshop) 2021: 111-120
Reproducibility, Replicability and Beyond: Assessing Production Readiness of Aspect Based Sentiment Analysis in the Wild. Rajdeep Mukherjee, Shreyas Shetty, Subrata Chattopadhyay, Subhadeep Maji, Samik Datta, Pawan Goyal ECIR (2) 2021: 92-106
PASTE: A Tagging-Free Decoding Framework Using Pointer Networks for Aspect Sentiment Triplet Extraction. Rajdeep Mukherjee, Tapas Nayak, Yash Butala, Sourangshu Bhattacharya, Pawan Goyal EMNLP (1) 2021: 9279-9291
COMPARE: A Taxonomy and Dataset of Comparison Discussions in Peer Reviews. Shruti Singh, Mayank Singh, Pawan Goyal JCDL 2021: 238-241
Hierarchical Transformer for Task Oriented Dialog Systems. Bishal Santra, Potnuru Anusha, Pawan Goyal NAACL-HLT 2021: 5649-5658
Unsupervised approaches for measuring textual similarity between legal court case reports. Arpan Mandal, Kripabandhu Ghosh, Saptarshi Ghosh, Sekhar Mandal Artif. Intell. Law 29(3): 417-451 (2021)
An Unsupervised Normalization Algorithm for Noisy Text: A Case Study for Information Retrieval and Stance Detection. Anurag Roy, Shalmoli Ghosh, Kripabandhu Ghosh, Saptarshi Ghosh ACM J. Data Inf. Qual. 13(3): 17:1-17:25 (2021)
Incorporating domain knowledge for extractive summarization of legal case documents. Paheli Bhattacharya, Soham Poddar, Koustav Rudra, Kripabandhu Ghosh, Saptarshi Ghosh ICAIL 2021: 22-31
Improving Legal Case Summarization Using Document-Specific Catchphrases. Arpan Mandal, Paheli Bhattacharya, Sekhar Mandal, Saptarshi Ghosh JURIX 2021: 76-81
An Analytical Study of Algorithmic and Expert Summaries of Legal Cases. Aniket Deroy, Paheli Bhattacharya, Kripabandhu Ghosh, Saptarshi Ghosh JURIX 2021: 90-99