CROSS SENTENCE ALIGNMENT BASED ON SINGULAR VALUE DECOMPOSITION
This paper describes a method for aligning Portuguese-Chinese sentence pairs drawn from two given documents. Extracting aligned pairs is a critical step in building a Portuguese-Chinese bilingual corpus for Example-Based Machine Translation (EBMT) systems. In short, the proposed alignment system performs four steps: it breaks each document down to the sentence level, scores each sentence pair using several features, applies Singular Value Decomposition to the resulting scores, and extracts the aligned pairs based on a similarity function.
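The four steps named in the abstract can be illustrated with a minimal sketch. The abstract does not specify the features, how the decomposition is applied, or the similarity function, so everything concrete here is an assumption made for illustration: the function names (`split_sentences`, `pair_score`, `align`), the two features (character-length ratio and shared numerals), the equal feature weights, the `len_ratio` value, the rank-`k` truncation, and the threshold rule are all hypothetical and should not be read as the authors' method.

```python
import re
import numpy as np


def split_sentences(text):
    """Step 1: split a document on Latin and CJK sentence-final punctuation."""
    found = re.findall(r"[^.!?。！？]+[.!?。！？]?", text)
    return [s.strip() for s in found if s.strip()]


def pair_score(src, tgt, len_ratio=3.0):
    """Step 2: score one sentence pair with two assumed features."""
    # Feature 1: length similarity, scaling the Chinese length by an
    # assumed Portuguese-to-Chinese character ratio (hypothetical value).
    a, b = len(src), len(tgt) * len_ratio
    length_sim = min(a, b) / max(a, b)
    # Feature 2: Jaccard overlap of the numerals found in both sentences.
    na = set(re.findall(r"\d+", src))
    nb = set(re.findall(r"\d+", tgt))
    number_sim = len(na & nb) / len(na | nb) if (na | nb) else 0.0
    return 0.5 * length_sim + 0.5 * number_sim


def align(src_text, tgt_text, k, threshold=0.5):
    """Steps 3 and 4: rank-k SVD of the score matrix, then pair extraction."""
    src = split_sentences(src_text)
    tgt = split_sentences(tgt_text)
    scores = np.array([[pair_score(s, t) for t in tgt] for s in src])
    # Step 3: keep the k largest singular values (assumed use of SVD).
    u, sv, vt = np.linalg.svd(scores, full_matrices=False)
    smoothed = (u[:, :k] * sv[:k]) @ vt[:k, :]
    # Step 4: assumed similarity rule, best target per source above a threshold.
    pairs = []
    for i, row in enumerate(smoothed):
        j = int(np.argmax(row))
        if row[j] >= threshold:
            pairs.append((i, j))
    return pairs


pt = "Tenho 2 gatos. O livro custa 30 patacas."
zh = "这本书卖30元。我有2只猫。"
print(align(pt, zh, k=2))
```

In this toy input the Chinese sentences appear in the opposite order to the Portuguese ones, so the sketch has to pair sentences across positions rather than in sequence. With `k` equal to the full rank of the score matrix the decomposition leaves the scores unchanged; a smaller `k` on a larger matrix would act as a smoothing step, which is one plausible reading of the abstract rather than a confirmed detail.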
Bilingual document alignment; Singular value decomposition; Segmentation; Cross sentence alignment
ANNA HO; FAI WONG; FRANCISCO DE OLIVEIRA; YI-PING LI
Faculty of Science and Technology of University of Macau, PO Box 3001 Macao SAR
International conference
2008 International Conference on Machine Learning and Cybernetics
Kunming
English
2524-2529
2008-07-12 (date first made available online on the Wanfang platform; not the paper's publication date)