acm - an acm publication


The E2B machine translation
a new approach to HLT

Ubiquity, Volume 2005 Issue August | BY Goutam Kumar Saha 


Full citation in the ACM Digital Library

This article describes a new approach to machine translator that translates English text into Bangla text with disambiguation. The translated Bengali text in English scripts is also useful for learning Bengali or Bangla language as a foreign language. At the same time the Bengali rural people who do not know English language well can understand the English matter with the translated output. The proposed approach is a new one that uses both the rule-based and transformation-based machine translation schemes along with three level parsing approaches. This is a significant contribution towards creation of a low-cost Human Language Technology (HLT). About two hundred million people in the West Bengal, Tripura (two states in India) and in Bangladesh (a country), speak and write Bangla as their first language. This English to Bangla (E2B)-ANUBAD or translator system or E2B takes a paragraph of English sentences as input sentences and produces equivalent Bangla sentences. The E2B-ANUBAD system compries of a preprocessor, morphological parser, semantic parser using English word ontology for context disambiguation, an electronic lexicon associated with grammatical information and a discourse processor. It also employs a lexical disambiguation analyzer. This system does not rely on a stochastic approach. Rather, it is based on a special kind of hybrid architecture of transformer and rule-based NLE architectures along with various linguistic knowledge components of both English and Bangla for creation of a low-cost HLT.

Connection Failure


English to Bengali tanslation

— gobinda tarafdar, Thu, 13 Nov 2014 13:04:36 UTC

References: Akshar Bharati, et al., "Natural Language Processing," PHI, 2000. Wingrad, Terry, "Understanding Natural Language," Academic Press, New York, 1972. D. Jurafsky and J.H. Martin, "Speech and Language Processing," Pearson Education, 2000. Ma Quing, "Natural Language Processing with Neural Networks," Language Engineering Conference 2002, Hyderabad. Goutam Kumar Saha, et al, "Computer Assisted Bangla POS Tagging," Proceedings of the International Symposium ISTRANS 2004, Tata McGraw-Hill, New Delhi, 2004. K.C. Dash (ed.), " Indian Semantics," Agamakala Publications, Delhi, 1994. Bamondeb Chakroborty, "Uchchotoro Bangla Byakaron," Akshay Malancha, 2003. Akshar Bharati, et al, "A Computational Grammar for Indian Languages Processing, " Indian Linguistics Journal, 52, 91-103, 1991a. Goutam Kumar Saha, "BANGANUBAD - An English to Bangla Translator," in press, International Journal CPOL, 2005, USA. Goutam Kumar Saha, "Bangla Text Parsing with Intelligence," Proceedings of the International Conference MS'05, 2005, Morocco. Author's Biography: Goutam Kumar Saha has been working as a computer scientist in various premier R&D organizations in India for last seventeen years. He has worked in LRDE, DRDO, Bangalore, ER&DCI, Calcutta, and at present he is with the Centre for Development of Advanced Computing, Kolkata, India, as a Scientist-F. He has authored around hundred of research papers in various International Journals, Conference etc. He is a referee for CSI Journal, AMSE-Modeling Journal (France), IJCPOL and IEEE Magazine. His field of interest is on fault tolerant computing softw

— harun, Sat, 18 Aug 2012 12:02:53 UTC

Leave this field empty