Natural Language Processing or Computational Linguistics (NLP/CL) deals with understanding and developing computational theories of human language. Such theories allow us to understand the structure of language and build computer software that can process language.
NLP/CL is expected to play a major role in facilitating man-machine communication as well as man-man communication. The goals are to create computer systems that can speak with and listen to users, and machines that can translate from one language to another, thus bringing about a virtual revolution in access to information.
In the NLP-MT Lab at LTRC, IIIT-H, work is undertaken in many different sub-areas of NLP, including syntax and parsing, semantics and word sense disambiguation, discourse and treebanking, and machine translation. Computational models inspired by linguistics are built and combined with machine learning techniques.
The Lab, and the Centre as a whole, has done original work on developing the Computational Paninian Grammar (CPG) framework for Indian languages. Using this framework, treebanks for Indian languages have been developed. These provide a rich testbed for studying and understanding language in actual use, and are also used for developing parsers using machine learning. This has given rise to full-sentence parsers with broad coverage for Indian languages.
Semantic processing work includes developing the semantic PurposeNet, building semantic category assigners, and identifying semantic relations in nominals. Work is also under way in discourse processing, including the development of a discourse treebank and dialog processing.
Machine translation (MT) has been a driving application and the subject of intense research. Work is under way on English-to-Hindi MT as well as MT from one Indian language to another.
Sub Areas
The following are the major sub-areas:
1. Computational Grammatical Model
- Treebank for Hindi/Urdu
- Computational Paninian Grammar framework
- Language Typology
2. Parsing
- Constraint based parsers for Indian languages
- Data-driven parsers for Indian languages
- Shallow parsers, Part-of-Speech taggers, Morphological analyzers
3. Machine Translation
- Transfer based approaches
- Automatic learning of transfer rules
- Statistical machine translation (English to Indian languages)
4. Semantics
- PurposeNet
- Unsupervised/semi-supervised word category disambiguation
5. Dialogue and Discourse analysis
- Anaphora resolution in text
- Generation of sentences from words
Major Funded Projects
- Indian language to Indian language machine translation: Consortium project (2006-2013) (Department of Information Technology (DIT), Govt. of India (GoI))
- English to Indian language machine translation: Consortium project (2006-2009) (DIT, GoI)
- Multi-Representational and Multi-Layered Treebank for Hindi and Urdu (2008-2011) (NSF, USA)
- Development of Sanskrit Computational Toolkit and Sanskrit-Machine Translation system (2008-2011) (DIT, GoI)
- Discourse and Dialog Management (2008-2011) (Tata Consultancy Services)
- Language Database Development for Example Based Machine Translation (2003-2005) (DIT, GoI)
- Multilingual Morphological Analysis and Chunking/Phrasing modules for Text-to-Speech (2003-2004) (Outside Echo Limited, UK)
- Indian Language to Indian Language Machine Translation System (ILMT) Phase-II (2010-2013) (Department of Information Technology, Govt of India)
- Dashboard Development Environment for NLP Applications (2009-2011) (Department of Information Technology, Govt of India)
- Development of English to Indian Language Machine Translation System Phase-II (2010-2013) (Department of Information Technology, Govt of India)
Achievements
Contests
- Contest on Shallow Parsing for 3 Indian Languages (Hindi, Bengali & Telugu) at IJCAI-07: International Joint Conference on Artificial Intelligence; secured first place in the shared task held in the SPSAL Workshop during IJCAI-2007 (6-12 Jan 2007)
- Shared Task Contest held as part of the Workshop on Named Entity Recognition for South and South East Asian Languages in IJCNLP-08: 3rd International Joint Conference on Natural Language Processing (7-12 Jan 2008)
- IIIT students participated in and won the NLP Tools Contest and the Student Paper Competition at the ICON conference (International Conference on Natural Language Processing), an annual event of the Centre.
Conferences Organized
- ICON-2009: Seventh International Conference on Natural Language Processing at IIIT-H and University of Hyderabad, Dec 2009.
- ICON-2008: Sixth International Conference on Natural Language Processing at CDAC Pune, Dec 2008
- IJCNLP-08: 3rd International Joint Conference on Natural Language Processing, Hyderabad, Jan 2008
- IASNLP: IIIT-Hyderabad Advanced School on Natural Language Processing, May 2008
- TCS NLP Winter School, Dec 2007
- ICON-2007: Fifth International Conference on Natural Language Processing at IIIT Hyderabad, Jan 2007
- ICON-2005: Fourth International Conference on Natural Language Processing at IIT Kanpur, Dec 2005
- ICON-2004: Third International Conference on Natural Language Processing at IIIT Hyderabad, Dec 2004
- ICON-2003: Second International Conference on Natural Language Processing at CIIL Mysore, Dec 2003
- ICON-2002: First International Conference on Natural Language Processing at NCST, Mumbai (presently called CDAC-Mumbai), Dec 2002
- ICON-2010: Eighth International Conference on Natural Language Processing at IIT Kharagpur, Dec 2010.
- ICON-2011: Ninth International Conference on Natural Language Processing at Anna University, Chennai, Dec 2011.
Faculty
- Radhika Mamidi
- Soma Paul
- Rajeev Sangal (Head)
- Dipti Misra Sharma
- Manish Shrivastava
- Sriram Venkatapathy
Past Faculty
- Prashanth Mannem
Past Lecturers/Research Lecturers
- Samar Husain
- Prashanth Mannem
- Sriram Venkatapathy
Past Research Scientists/Engineers
- anil.singh@iiit.ac.in Singh
Adjunct Faculty
- B Lakshmi Bai
Students
- K Varun
- Pranav Garg
- Sai Kiran Gorthi
- Bhagwat Nachiket Kishor
- Akshay Kulkarni
- Mudit Maheshwari
- Hardeep Singh Rajpal
- Y Jayendra Rakesh
- A Vinay Bhargav Reddy
- Himanshu Rustagi
- Nitesh Surtani
- Rahul Gupta
- E Anil Krishna
- Himanshu Sharma
- Ajay Dubey
- Sambhav Jain
- M A Rafiya Begum
- Susanta Kisore Mahakunda
- Himani Chaudhry
- Riyaz Ahmad Bhat
Past Students
- Karthik Gali
- Manohar Reddy
- Ravikiran Vadlapudi
- Bharat Ram Ambati
- Abhilash Inumella
- Prashant Mathur
- Vipul Mittal
- Siva Reddy
- phani_gadde@students.iiit.ac.in
- meher_vijay@students.iiit.ac.in
- Aswarth Abhilash
- Sriram Anuroop
- G S K Chaitanya
- Kalyan Deepak
- Raghu Pujitha Gade
- Hemant Sagar
- Nagaraju Yedlapalli
- harika@students.iiit.ac.in
- Manish Agarwal
- Rahul Agrawal
- B Krishna Chaitanya
- Rahul Goutam
- Ankush Gupta
- B Indupriya
- Karan Jindal
- Prudhvi Kosaraju
- Shashikant Muktyar
- K V S Nikhil
- D Nithin
- K Sruthilaya Reddy
- Rakshit Shah
- R S Sruti
- Navni Bhojwani
- Pranav Goyal
- Gupta Jayant
- Puneeth Kukkadapu
- Hitesh Kumar
- Aman Mahajan
- Sarvesh Ajit Ranade
- Akula Arjun Reddy
- R D Chinmayananda Reddy
- Piyush Shukla
- Ashok Vardhan
- Ujjaval Verma
- Suman Yelati
- Itisree Jena
- Viswanath Naidu
- Sapna Sharma
- gowri@students.iiit.ac.in
- Katkar Geeta Mallappa
- Nitin Kumar Hardeniya
- Prasanth Kolachina
- D V Sri Ram
- Abhijeet Gupta
- Sudheer Kolachina
- R Ravi Teja
- PVS Avinesh
- Anil Kumar Singh
- Arafat Ahsan
- Renjini Narendranath
- Serajul Afreen
- Sriram Venkatapathy
- Prashanth Mannem
- P Kiran Mayee
- Ritu Notani
- Aman Kumar Bahl
- Harshit Jain