Recognition and Segmentation of English Long and Short Sentences Based on Machine Translation

Authors

  • Tiehu Zhang School of Foreign Languages, Xi'an Aeronautical University

DOI:

https://doi.org/10.3991/ijet.v14i19.10182

Keywords:

machine translation, long sentence, regular match, error-driven method

Abstract


With the advent of the information age, long sentences which include many words and have more complex structures.. The translation of long sentences in English-Chinese machine translation has always been the focus of research. In this study, 400 long sentences were randomly selected from NTCIR-9 patent corpus for testing the recognition and segmentation effects of regular match method and error-driven method, and the accuracy rate of the translation was compared on Baidu Online Translation Platform. The results demonstrated that the regular matching method was effective in recognizing and segmenting long sentences, nevertheless there were many defects; the error-driven method was more effective in recognizing and segmenting long sentences; the former increased by 4.8% of the BLEU value of the translated text on Baidu Online Translation Platform and the latter increased by 12.1%, which showed that the error-driven method was more effective in machine translation.

Downloads

Published

2020-01-15

How to Cite

Zhang, T. (2020). Recognition and Segmentation of English Long and Short Sentences Based on Machine Translation. International Journal of Emerging Technologies in Learning (iJET), 15(01), pp. 152–162. https://doi.org/10.3991/ijet.v14i19.10182

Issue

Section

Papers