My research focuses mainly on text mining. The research problems I am tackling include developing effective dialog models. I am also actively working in Sanskrit Computational Linguistics, where the focus is on developing a generic framework for solving a variety of NLP tasks, such as word segmentation, morphological analysis, dependency parsing, and poetry-to-prose conversion, with little task-specific annotated data.
- The role of citation context in predicting long-term citation profiles: an experimental study based on a massive bibliographic text dataset by Singh, Mayank, Patidar, Vikas, Kumar, Suhansanu, Chakraborty, Tanmoy, Mukherjee, Animesh and Goyal, Pawan 24th ACM Conference on Information and Knowledge Management (CIKM) (2015)
- Relay-Linking Models for Prominence and Obsolescence in Evolving Networks by Singh M., Sarkar R., Goyal P., Mukherjee A., Chakrabarti S. KDD (2017)
- Extracting Situational Information from Microblogs during Disaster Events: A Classification-Summarization Approach by Rudra, Koustav, Ghosh, Shubham, Ganguly, Niloy, Goyal, Pawan and Ghosh, Saptarshi 24th ACM Conference on Information and Knowledge Management (CIKM) (2015)
- On the formation of circles in co-authorship networks by Chakraborty, Tanmoy, Patranabis, Sikhar, Goyal, Pawan and Mukherjee, Animesh 21st ACM SIGKDD (2015)
- That's sick dude!: Automatic identification of word sense change across different timescales by Mitra, Sunny, Mitra, Ritwik, Riedl, Martin, Biemann, Chris, Mukherjee, Animesh and Goyal, Pawan 52nd Annual Meeting of the Association for Computational Linguistics (ACL) 1020-1029 (2014)
- On the categorization of scientific citation profiles in computer sciences by Chakraborty, Tanmoy, Kumar, Suhansanu, Goyal, Pawan, Ganguly, Niloy and Mukherjee, Animesh Communications of the ACM Vol. 58, No. 9 82-90 (2015)
- A Context based Word Indexing Model for Document Summarization by Goyal, Pawan, Behera, Laxmidhar and McGinnity, TM IEEE Transactions on Knowledge and Data Engineering Vol. 25, No. 8 1693-1705 (2013)
- Query Representation through Lexical Association for Information Retrieval by Goyal, Pawan, Behera, Laxmidhar and McGinnity, TM IEEE Transactions on Knowledge and Data Engineering Vol. 24, No. 12 2260-2273 (2012)
- Towards a Stratified Learning Approach to Predict Future Citation Counts by Chakraborty, Tanmoy, Kumar, Suhansanu, Goyal, Pawan, Ganguly, Niloy and Mukherjee, Animesh ACM/IEEE Joint Conference on Digital Libraries (JCDL) 351-360 (2014)
- An automatic approach to identify word sense changes in text media across timescales by Mitra, Sunny, Mitra, Ritwik, Maity, Suman Kalyan, Riedl, Martin, Biemann, Chris, Goyal, Pawan and Mukherjee, Animesh JNLE special issue on Graph Methods for NLP Cambridge University Press (2014)
Principal Investigator
- A Novel Framework to Compress Multimodal Dialogue Contexts and Identify User Satisfaction Index Merlyn Mind, Inc
- Advancement of NLP Techniques for Indian Languages with Focus on Bangla and Hindi Science and Engineering Research Board (SERB)
- IoE Seed Grant: AI for Accelerated Materials Development IIT KHARAGPUR
- Large Language Model for Legal Assistance IIT Mandi iHUB and HCi Foundation
- Ranking and Analytics for the Institution Sponsored Research and Industrial Consultancy (SRIC)
- Sanskrit Knowledge Accessor Ministry of Electronics and Information Technology
- TCS IoN Elective Course - Social Media and Text Analytics TATA CONSULTANCY SERVICES LTD
- Unrestricted Grant for Research in Sanskrit Computational Linguistics Svarupa Inc.
- Unrestricted grant for research in social networks and user-generated content Dr. M Chelliah, Head, Academic Relations, Yahoo India R & D
- Unrestricted Travel Grant Various Institutes/Organisations
- Using Large Language Models to Enhance Learning Efficiency and Student Engagement in Indian Education System IIT KHARAGPUR AI4ICPS I HUB FOUNDATION
Co-Principal Investigator
- Cognitive Stimuli: Detecting and Generating Exaggeration in Online and Social Media Content Adobe Systems Inc
- CrysLDM: Latent Diffusion Model for Crystal Material Generation INDO KOREA SCIENCE AND TECHNOLOGY CENTER
- Google Unrestricted Fund for Social Computing GOOGLE INDIA PRIVATE LIMITED
- MSR India PhD Award Unrestricted Grant MICROSOFT RESEARCH LAB. INDIA PVT. LTD., BANGALORE
- Targeted Bias in Indian Media Outlets FACEBOOK INDIA ONLINE SERVICES PRIVATE LIMITED
Ph. D. Students
Abhilash Nandy
Area of Research: Information Retrieval
Hari Kishore Kusumakar
Area of Research: AI Governance and Law Enforcement
J Manoj Balaji
Area of Research: Sanskrit Computational Linguistics
Kavin R V
Area of Research: Natural Language Processing
Mohit Agrawal
Area of Research: Natural Language Processing
Omprakash Sonie
Area of Research: Natural Language Processing
Rahul Mehta
Area of Research: Information Retrieval
Shivraj Anand
Area of Research: Natural Language Processing
Shounak Paul
Area of Research: Natural Language Processing on Legal Text
Sombit Bose
Area of Research: Natural Language Processing
Subha Mondal
Area of Research: Reasoning, NLP, Education and LLMs
Subhendu Khatuya
Area of Research: Natural Language Processing
Subhojyoti Khastagir
Area of Research: Machine Learning
Sujeet Kumar
Area of Research: Natural Language Processing
Sujoy Sarkar
Area of Research: Text Mining
MS Students
Aritra Dutta
Area of Research: Reasoning, Vision and Language
Kunal Kingkar Das
Area of Research: Vision and Language
Pretam Ray
Area of Research: Natural Language Processing
Sourjyadip Ray
Area of Research: Vision and Language