会议专题

CROSS SENTENCE ALIGNMENT BASED ON SINGULAR VALUE DECOMPOSITION

This paper describes a way of performing alignment of Portuguese-Chinese bilingual pairs from the given two documents. Extracting the alignment pairs is a critical step for building Portuguese-Chinese bilingual corpus for Example Based Machine Translation Systems (EBMT). In short, the proposed alignment system performs four steps: break down the document into sentences level, score each pair of sentence by different features, apply Singular Value Decomposition on the results, extract the aligned pair base on a similarity function.

Bilingual document alignment Singular value decomposition Segmentation Cross sentence alignment

ANNA HO FAI WONG FRANCISCO DE OLIVEIRA YI-PING LI

Faculty of Science and Technology of University of Macau, PO Box 3001 Macao SAR

国际会议

2008 International Conference on Machine Learning and Cybernetics(2008机器学习与控制论国际会议)

昆明

英文

2524-2529

2008-07-12(万方平台首次上网日期,不代表论文的发表时间)