This paper studies transliteration alignment, its evaluation metrics and applications. We propose a new evaluation metric, alignment entropy, grounded in information theory, to evaluate alignment quality without the need for a gold standard reference, and compare the metric with F-score. We study the use of phonological features and affinity statistics for transliteration alignment at phoneme and grapheme levels. The experiments show that better alignment consistently leads to more accurate transliteration. In the transliteration modeling application, we achieve a mean reciprocal rate (MRR) of 0.773 on the Xinhua personal name corpus, a significant improvement over other reported results on the same corpus. In transliteration valid...
This paper proposes a novel noise-aware character alignment method for bootstrapping statistical m...
Transliteration has been a challenging problem in natural language processing, especially in...
Most current machine transliteration systems employ a corpus of known source-target word pairs to tra...
Foreign name transliterations typically include multiple spelling variants. These variants c...
We propose a language-independent method for the automatic extraction of transliteration pairs from ...
In this paper we present a statistical transliteration technique that is language independent. This ...
Transcribing named entities from one language into another is called transliteration. This thesis pr...
Most foreign names are transliterated into Chinese, Japanese or Korean with approximate phonetic equ...
Machine transliteration is a method for automatically converting words in one language into phonetic...
In this paper, we first carry out an investigation on two existing pivot strategies for statistical...
This paper describes a framework for modeling the machine transliteration problem. The p...
Transliteration is the process of proper name translation based on pronunciation. It is an important...
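The first abstract in this list reports a mean reciprocal rate (MRR) of 0.773. The following is a minimal sketch of how such a score could be computed. It assumes the usual definition of MRR: the average, over all test names, of 1/rank of the first correct candidate in a ranked output list, counted as 0 when no candidate is correct. The function name and the toy inputs are hypothetical and are not taken from any of the listed papers.

```python
from typing import Sequence


def mean_reciprocal_rank(
    ranked_candidates: Sequence[Sequence[str]],
    references: Sequence[str],
) -> float:
    """Average of 1/rank of the first correct candidate per test item.

    ranked_candidates[i] is the system's ranked output list for item i
    (best first); references[i] is the single correct answer for item i.
    Items with no correct candidate contribute 0 to the average.
    """
    if len(ranked_candidates) != len(references):
        raise ValueError("need one candidate list per reference")
    if not references:
        return 0.0

    total = 0.0
    for candidates, reference in zip(ranked_candidates, references):
        # Ranks are 1-based: the top candidate has rank 1.
        for rank, candidate in enumerate(candidates, start=1):
            if candidate == reference:
                total += 1.0 / rank
                break  # only the first correct candidate counts
    return total / len(references)


if __name__ == "__main__":
    # Hypothetical placeholder data, not real transliteration output.
    outputs = [["a", "b"], ["x", "y", "z"], ["p"]]
    gold = ["a", "z", "q"]
    print(mean_reciprocal_rank(outputs, gold))
```

In this toy run the three items contribute 1, 1/3 and 0, so the score is their mean, 4/9. A system that always ranks the correct transliteration first would score 1.0.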