Specialization : Artificial Intelligence, NLP and Information Retrieval

Natural language processing (NLP) is a field of computer science, artificial intelligence and computational linguistics concerned with the interactions between computers and human (natural) languages, and, in particular, concerned with programming computers to fruitfully process large natural language corpora. Challenges in natural language processing frequently involve natural language understanding, natural language generation (frequently from formal, machine-readable logical forms), connecting language and machine perception, dialog systems, or some combination thereof.
Information Retrieval (IR) is finding material (usually documents) of an unstructured nature (usually text) that satisfies an information need from within large collections (usually stored on computers). The application in this field focuses on searching and retrieving (mainly textual) information from desktop, web, mobile sources.

Sub Areas:
Artificial Intelligence (AI)
Natural Language Processing (NLP)
Information Retrieval

Recent Publications:

Learning Cross-Lingual Phonological and Orthagraphic Adaptations: A Case Study in Improving Neural Machine Translation between Low-Resource Languages. Saurav Jha, Akhilesh Sudhakar, Anil Kumar Singh. Journal of Language Modelling. 2019 (Accepted for publication).
SWOW-8500: Word Association task for Intrinsic Evaluation of Word Embeddings. Avijit Thawani, Biplav Srivastava and Anil Singh. The Third Workshop on Evaluating Vector Space Representations for NLP at NAACL 2019. Minneapolis, USA.
Di-LSTM Contrast: A Deep Neural Network for Metaphor Detection. Krishnkant Swarnkar and Anil Kumar Singh. Proceedings of the Workshop on Figurative Language Processing. NAACL 2018. Pages 115-120. New Orleans, Louisiana, US.
NLPRL-IITBHU at SemEval-2018 Task 3: Combining Linguistic Features and Emoji pre-trained CNN for Irony Detection in Tweets. Harsh Rangwani, Devang Kulshreshtha and Anil Kumar Singh. Proceedings of SemEval 2018. Pages 638-642. New Orleans, Louisiana, US.
IIT (BHU) System for Indo-Aryan Language Identification (ILI) at VarDial 2018. Divyanshu Gupta, Gourav Dhakad, Jayprakash Gupta and Anil Kumar Singh. Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects. Pages 185-190. Santa Fe, New Mexico, USA.
Experiments on Deep Morphological Inflection. Akhilesh Sudhakar, Rajesh Kumar Mundotiya and Anil Kumar Singh. Proceedings of the International Conference on Computational Linguistics and Intelligent Text Processing. 2018. Hanoi, Vietnam.
How emotional are you? Neural Architectures for Emotion Intensity Prediction in Microblogs. Devang Kulshreshtha, Pranav Goel, and Anil Kumar Singh. Proceedings of the 27th International Conference on Computational Linguistics. Pages 2914–2926. Santa Fe, New Mexico, USA.
IIT (BHU) Submission for the ACL Shared Task on Named Entity Recognition on Code-switched Data. Trivedi, Shashwat and Rangwani, Harsh and Kumar Singh, Anil. Proceedings of the Third Workshop on Computational Approaches to Linguistic Code-Switching. Pages 148-153. Melbourne, Australia.
IIT (BHU) Varanasi at MSR-SRST 2018: A Language Model Based Approach for Natural Language Generation. Avi Chawla, Ayush Sharma, Shreyansh Singh and A.K. Singh. Proceedings of the First Workshop on Multilingual Surface Realisation. Pages 29-34. Melbourne, Australia.
Evaluating Opinion Summarization in Ranking. Anil Kumar Singh, Avijit Thawani, Anubhav Gupta, Rajesh Kumar Mundotiya. In Proceedings of the Asian Information Retrieval Symposium (AIRS 2017). 2017. Jeju, South Korea. pp 222-234.
Experiments on Morphological Reinflection: CoNLL-2017 Shared Task. Akhilesh Sudhakar, Anil Kumar Singh. In Proceedings of the CoNLL 2017. 2017. Vancouver, Canada. pp 71-78.
Word Transduction for Addressing the OOV Problem in Machine Translation for Similar Resource-Scarce Languages. Shashikant Sharma, Anil Kumar Singh. In Proceedings of the Conference on Finite State Methods in NLP (FSMNLP 2017). 2017. Umea, Sweden. pp 56-63.
IJCNLP-2017 Task 3: Review Opinion Diversification (RevOpiD-2017). Anil Kumar Singh, Avijit Thawani, Mayank Panchal, Anubhav Gupta, Julian McAuley. In Proceedings of the IJCNLP (Shared Tasks) 2017. 2017. Teipei, Taiwan. pp 17-25.
IIT (BHU): System description for LSDSem’17. Pranav Goel and Anil Kumar Singh. In Proceedings of the 2nd Workshop on Linking Models of Lexical, Sentential and Discourse-level Semantics (LSDSem). 2017. Valencia, Spain. Association for Computational Linguistics.
Reference Scope Identification for Citances Using Convolutional Neural Networks. Saurav Jha, Aanchal Chaurasia, Akhilesh Sudhakar and Anil Kumar Singh. In Proceedings of the International Conference on Natural Language Processing (ICON 2017). 2017. Kolkata, India.
Neural Morphological Disambiguation Using Surface and Contextual Morphological Awareness. Akhilesh Sudhakar and Anil Kumar Singh. In Proceedings of the International Conference on Natural Language Processing (ICON 2017). 2017. Kolkata, India.
Manpreet Kaur, Nishu Kumari, Anil Kumar Singh, Rajeev Sangal. IIT (BHU) Submission for the CoNLL-2016 Shared Task. Shallow Discourse Parsing using Semantic Lexicon. Berlin, August, 2016.
Shallow Discourse Parsing with Syntactic and (a Few) Semantic Features. Shubham Mukherjee, Abhishek Tiwari, Mohit Gupta and Anil Kumar Singh. CoNLL Shared Task on Shallow Discourse Parsing. Beijing, China, July, 2015.
Responding to Retrieval: A Proposal to Use Retrieval Information for Better Presentation of Website Content. Ravindranath Chowdary, Anil Kumar Singh and Anil Nelakanti. ICWE Workshop on PErvasive WEb Technologies (PEWET 2015): Trends and Challenges. Rotterdam, Netherlands, June, 2015.
Centrality based Document Ranking. Anil Kumar Singh and C. Ravindranath Chowdary. In Proceedings of The Twenty-Third Text REtrieval Conference, TREC 2014, Gaithersburg, Maryland, USA, November 19-21, 2014.
A Language Identification Method Applied to Twitter Data. Anil Kumar Singh and Pratya Goyal. In Proceedings of the Twitter Language Identification Workshop at the SEPLN Conference. Girona, Spain. September, 2014.
SSF: A Common Representation Scheme for Language Analysis for Language Technology Infrastructure Development. Akshar Bharati, Rajeev Sangal, Dipti Sharma and Anil Kumar Singh. In Proceedings of the COLING Workshop on Open Infrastructures and Analysis Frameworks for HLT, pp66–76. Dublin, Ireland. August, 2014.
Dinesh Kumar Prabhakar and Sukomal Pal, A Survey on Transliterated Text Processing, Sadhana-Academy Proceedings in Engineering Science, (Online version available).
R. Soni and Sukomal Pal, Gold Standard Creation for Microblog Retrieval: Challenges of Completeness in IRMiDis 2017, Companion of the Web Conference 2018 on The Web Conference 2018, 1639-1642.
A Kanapala, Sukomal Pal, R Pamula, Design of a meta search system for legal domain, Advanced Computing and Communication Systems (ICACCS), 2017.
A Kanapala, Sukomal Pal, R Pamula, Text summarization from legal documents: a survey, Artificial Intelligence Review (2017). https://doi.org/10.1007/s10462-017-9566-2, pp 1-32
R. Soni and Sukomal Pal, Microblog Retrieval for Disaster Relief: How To Create Ground Truths?, SMERP@ ECIR, pp. 42-51.
H. Mehrotra, R. Soni, Sukomal Pal, IIT BHU at FIRE 2017 IRMiDis Track-Fully Automatic Approaches to Information Retrieval., FIRE (Working Notes), pp. 59-60.
R. Soni, Sukomal Pal, IIT BHU at FIRE 2016 Microblog Track: A Semi-automatic Microblog Retrieval System., FIRE (Working Notes), pp. 74-75
M. Chakraborty, Sukomal Pal, R. Pramanik, C. R. Chowdary, Recent developments in social spam detection and combating techniques: A survey, Information Processing & Management52 (6), pp.1053-1073.
M. Niyogi, Sukomal Pal, IR-IITBHU at TREC 2016 Open Search Track: Retrieving documents using Divergence From Randomness model in Terrier, TREC.
J. Kumar, S. S. Prasad, Sukomal Pal, IRISM@ NTCIR-12 Temporalia Task: Experiments with MaxEnt, Naive Bayes and Decision Tree Classifiers., NTCIR.
SS Prasad, J Kumar, DK Prabhakar, Sukomal Pal, Sentiment classification: An approach for Indian language tweets using decision tree, International Conference on Mining Intelligence and Knowledge Exploration.
R Pramanik, Sukomal Pal, M Chakraborty, What the user does not want?: query reformulation through term inclusion-exclusion, Proceedings of the Second ACM IKDD Conference on Data Sciences, pp. 116-117.
DK Prabhakar, Sukomal Pal, ISM@ FIRE-2015: Mixed Script Information Retrieval., FIRE Workshops, pp.55-58.
DK Prabhakar, S Dubey, B Goel, Sukomal Pal, ISM@ FIRE-2014: Named Entity Recognition for Indian Languages, Proceedings of the Forum for Information Retrieval Evaluation, pp. 98-102.
A Kanapala, Sukomal Pal, Test collection for legal ir from online discussion forums, Proceedings of the Forum for Information Retrieval Evaluation, pp. 126-129.
P Yadav, Sukomal Pal, R Kumar, S Singh, H Singh, Popular Acronym Retrieval through Text Messaging, Proceedings of the Forum for Information Retrieval Evaluation, pp. 142-145.
R. Kumar, Sukomal Pal, Social Book Search Track: ISM@ INEX'14 Suggestion Task, CLEF (Working Notes), pp. 521-524.
D Yadav, CR Chowdary, “OOIMASP: Origin based association rule mining with order independent mostly associated sequential patterns” Expert Systems with Applications 93 (C), 62-71, 2018.
Dinesh Kumar Prabhakar, Sukomal Pal : “A Survey on Transliterated Text Processing”, Sadhana - Academy Proceedings in Engineering Science, (Online version available).
Ambedkar Kanapala, Sukomal Pal, Rajendra Pamula: “Summarization and Information Access in Legal Domain: A Survey”, Artificial Intelligence Review, DOI 10.1007/s10462-017-9566-2, 2017 (online version published June 2017).
International Conference on Natural Language Processing (ICON-2017). Kolkata, India. 14th and Anil Kumar Singh. Proceedings of the Sudhakar AkhileshNeural Morphological Disambiguation Using Surface and Contextual Morphological Awareness. December, 2017.
Multilingual Akshar Based Transducer for South and South East Asian Languages which Use Indic Scripts. Anil Kumar Singh and Harshit Surana. In Proceedings of the Seventh International Symposium on Natural Language Processing. Pattaya, Thailand. 2007.
Manu Agrawal, Kartik Manchanda, Ribhav Soni, Anurag Lal, C. Ravindranath Chowdary, “Parallel Implementation of Local Similarity Search for Unstructured Text using Prefix Filtering”, Proceedings of the 18th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT), Taiwan, IEEE 2017.
Sukomal Pal, Mandar Mitra, Jaap Kamps: Evaluating the INEX Focused Task Evaluation Process, Journal of the American Society for Information Science and Technology.
Manu Agrawal, Kartik Manchanda, Ribhav Soni, Anurag Lal, C. Ravindranath Chowdary, “Parallel Implementation of Local Similarity Search for Unstructured Text using Prefix Filtering”, Proceedings of the 18th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT), Taiwan, IEEE 2017
Vivek Sourabh, Parth Pahariya, Isha Agarwal, Ankit Gautam, C. Ravindranath Chowdary, “Parallel Implementation of Dynamic Programming Problems using Wavefront and Rank Convergence with Full Resource Utilization”, Proceedings of the 18th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT), Taiwan, IEEE 2017

Faculty:

News updates

Project shortlisted under Innovation incubator, a CSR intitiative by HP :

Mr. Pranav Goel (B.Tech 4th Yr) - Emotional Intensity detection.
Mr. Harsh Rangwani (B.Tech 3rd Yr) - Questioning Answering system for tourism domain
Runner up prize for Mr. Trivikram Pradhan (Ph.D.) in poster presentation on topic "SNAVER : A social network analysis based scholarly venue recommender system" on Institute day.

Facilities

Sponsored Projects

Topic: Efficient Generation of a Query-Specific Extractive Summary on Multiple Documents – A Distributive Approach, PI: Dr. Ravindra Nath Chowdary, Sponsoring Agency: SERB, Govt. of India
Topic: Building a Sanskrit Text Collection for Information Retrieval, PI: Dr. Sukomal Pal, Co-PI: Prof. Gopabandhu Mishra, Sponsoring Agency: Project Varanasi, Duration: 1 Year (2018-2019)

Research Scholars

Mr. Rajesh Kumar Mundotiya
Mr. Bhavana Srivastava
Mr. Amit Kumar
Mr. Rupjyoti Baruah
Mr. Naina Yadav
Mr. Ashwini kumar singh
Mr. Sushant Kumar Pandey
Mr. Amit Kumar
Ms. Akanksha Mishra
Mr. Rajesh Kumar Mundotiya
Ms. Shivang Agarwal
Mr. Chintoo Kumar
Ms. Anita Saroj
Mr. Supriya Chanda
Mr. Tribikram Pradhan
Mr. Siba Sankar Sahu
Mr. Sushil Kulkarni
Mr. Rupjyoti Baruah

PhD Students (from Humanisitic Studies Department)

Ashish Ranjan
Samapika Roy
Prashant Priyadarshi

Quick links

Events

Regional ICON (regICON-2015)
International Conference on Natural Language Processing (ICON-2016)
Regional ICON (regICON-2016)
GIAN course on Machine Translation, 2016
Second Workshop on Experimental and Empirical Linguistics

Specialization : Artificial Intelligence, NLP and Information Retrieval

Recent Publications:

Faculty:

Search form