MFSM: Chinese-English sentence alignment based on multi-feature self-attention mechanism fusion
Bilingual parallel corpora are a fundamental resource for statistical natural language processing. Chinese-English bilingual text contains cross alignments and empty alignments, both of which can easily degrade the quality of Chinese-English sentence alignment. Therefore, we propose a novel Chinese-English sentence alignment