A High-Accurate Chinese-English NE Backward Translation System Combining Both Lexical Information and Web Statistics

Conrad Chen    Hsin-Hsi Chen
Department of Computer Science and Information Engineering, National Taiwan University, Taipei, Taiwan
drchen@nlg.csie.ntu.edu.tw    hhchen@csie.ntu.edu.tw

Abstract
Named entity translation is indispensable in cross-language information retrieval nowadays. We propose an approach that combines lexical information, web statistics, and inverse search based on Google to backward translate a Chinese named entity (NE) into English. Our system achieves a high Top-1 accuracy of 87.6%, which is a relatively good performance among those reported in this area to date.

1 Introduction
Translation of named entities (NEs) attracts much attention due to its practical applications on the World Wide Web. The most challenging issues are that the genres of NEs are various, that NEs are open vocabulary, and that their translations are very flexible. Some previous approaches use phonetic similarity to identify corresponding transliterations, i.e., translation by phonetic values (Lin and Chen, 2002; Lee and Chang, 2003). Other approaches combine lexical (phonetic and meaning) and semantic information to find corresponding translations of NEs in bilingual corpora (Feng et al., 2004; Huang et al., 2004; Lam et al., 2004). These studies focus on the alignment of NEs in parallel or comparable corpora, which is called "close-ended" NE translation.

In "open-ended" NE translation, an arbitrary NE is given, and we want to find its corresponding translations. Most previous approaches exploit web search engines to help find translation candidates on the Internet. Al-Onaizan and Knight (2002) adopt language models to generate possible candidates first, and then verify these candidates by web statistics. They achieve a Top-1 accuracy of about 72.6% on Arabic-to-English translation. Lu et al. (2004) use statistics of anchor texts in web search results to identify translations and obtain a Top-1 accuracy of about 63.6% in translating English out-of-vocabulary (OOV) words into Traditional Chinese. Zhang et al. (2005) use query expansion to retrieve candidates and then use lexical information, frequencies, and distances to find the correct translation. They achieve a Top-1 accuracy of 81.0% and claim that they outperform the state-of-the-art OOV translation techniques of the time.

In this paper, we propose a three-step approach based on Google to deal with open-ended Chinese-to-English translation. Our system integrates various features which have been used by previous approaches in a novel way. We observe that most foreign Chinese NEs have their corresponding English translations appearing in the snippets returned by Google. We therefore combine lexical information and web statistics to find corresponding translations of given Chinese foreign NEs in the returned snippets. A highly effective verification process, inverse search, is then adopted and raises the performance significantly. Our approach achieves an overall Top-1 accuracy of 87.6% and a relatively high Top-4 accuracy of 94.7%.

2 Background
Translating NEs, unlike translating common words, is an "asymmetric" translation. The translations of an NE in various languages can be organized as a tree according to the relations of the translation language pairs, as shown in Figure 1.
The root of the translating tree is the NE in its original language, i.e., the initially denominated name. We call translating an NE downward along the tree "forward translation"; on the contrary, "backward translation" is translating an NE upward along the tree.

Figure 1. Translating tree of "Cien años de soledad".

Generally speaking, forward translation is easier than backward translation. On the one hand, there is no unique answer in forward translation: many alternatives can be adopted to forward translate an NE from one language to another. For example, "Jordan" can be transliterated as "Qiao-Dan", "Qiao-Deng", "Yue-Dan", and so on. On the other hand, there is generally one unique corresponding term in backward translation, especially when the target language is the root of the translating tree. In addition, in forward translation, when the original NE appears in documents in the target language, it often comes together with a corresponding translation in the target language (Cheng et al., 2004). That makes forward translation less challenging. In this paper, we focus our study on Chinese-English backward translation, i.e., the original language of the NE and the target language of the translation is English, and the source language to be translated is Chinese.

There are two important issues in backward translation of NEs or OOV words:
· Where can we find the corresponding translation?
· How can we identify the correct translation?

NEs seldom appear in multilingual or even monolingual dictionaries, i.e., they are OOV or unknown words. For unknown words, where can we find corresponding translations? A bilingual corpus might be a possible solution. However, NEs appear in vastly varied contexts, and the available bilingual corpora can only cover a small proportion of them. Most text resources are monolingual. Can we find translations of NEs in monolingual corpora? When mentioning a translated name in writing, we sometimes annotate it with its original name in the original foreign language, especially when the name is less commonly known. But how often does this happen? In our test data, which is introduced in Section 4, over 97% of translated NEs have their original NEs appearing in the first 100 snippets returned by Google. Figure 2 shows several snippets returned by Google which contain the original NE of a given foreign NE.

Figure 2. Several Traditional Chinese snippets of the Chinese title of "The Old Man and the Sea" returned by Google which contain the translation "The Old Man and the Sea".

When translations can be found in snippets, the next task is to identify which name is the correct translation of the NE. First we should know how NEs are translated. The commonest case is translation by phonetic values, or so-called transliteration. Most personal names and location names are transliterated. NEs may also be translated by meaning; this is the way most titles and nicknames and some organization names are translated.
Another common case is to translate some parts by phonetic values and the others by meaning. For example, "Sears Tower" is translated into Chinese by transliterating "Sears" (pronounced "Xi-Er-Si") and translating "Tower" by its meaning. NEs are also sometimes translated by the semantics or content of the entity they refer to, especially movie titles. Table 1 summarizes the possible ways of translating NEs.

Table 1. Possible translating ways of NEs.
Translating Way | Example | Description
Translating by phonetic values | "New York" and its Chinese transliteration (pronounced Niu-Yue) | The translation has a pronunciation similar to that of its original NE.
Translating by meaning | the Chinese title "(red) (chamber) (dream)" and "The Dream of the Red Chamber" | The translation has a meaning similar or related to that of its original NE.
Translating by phonetic values for some parts and by meaning for the others | "Uncle Tom's Cabin" and its Chinese translation "(pronounced Tang-Mu) (uncle's) (cabin)" | The entire NE is translated by its meaning while the name parts are transliterated.
Translating by both phonetic values and meaning | "New Yorker" and its Chinese translation "(Niu-Yue) (people, pronounced Ke)" | The translation has both a pronunciation and a meaning similar to those of its original NE.
Translating by heterography | the Chinese renderings of "Yokohama" and "Ichiro Suzuki" | The NE is translated by heterographic words shared between neighboring languages.
Translating by semantics or content | "The Mask" and its Chinese title "(modern) (great) (saint)" | The NE is translated by its semantics or by the content of the entity it refers to.
Parallel names | "Sun Zhong-Shan" and "Sun Yat-Sen" | The NE is initially denominated as more than one name or in more than one language.

From the above discussion, we may use similarities in phonetic values, meanings of constituent words, semantics, and so on to identify corresponding translations. Besides these linguistic features, non-linguistic features such as statistical information may also help. We discuss how to combine these features to identify corresponding translations in detail in the next section.

3 Chinese-to-English NE Translation
As mentioned in the last section, we can find most English translations in Chinese web page snippets. We thus base our system on a web search engine: we retrieve candidates from the returned snippets and combine both linguistic and statistical information to find the correct translation. Our system can be split into three steps: candidate retrieving, candidate evaluating, and candidate verifying. An overview of our system is given in Figure 3.

Figure 3. An Overview of the System.

In the first step, the NE to be translated, GN, is sent to Google to retrieve Traditional Chinese web pages, and a simple English NE recognition method and several preprocessing procedures are applied to obtain possible candidates from the returned snippets. In the second step, four features (i.e., phonetic values, word senses, recurrences, and relative positions) are exploited to score these candidates. In the last step, the candidates with higher scores are sent to Google again; recurrence information and the relative positions of GN with respect to the candidate being verified in the returned snippets are counted, along with the scores, to decide the final ranking of the candidates. These three steps are detailed in the following subsections.
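To make the flow of the three steps concrete, the following is a minimal Python sketch of the first two steps (candidate retrieving and candidate evaluating). The callable parameters stand in for the snippet extraction and the LScore/SScore functions defined in Sections 3.1 and 3.2, the multiplication of the two scores follows the Score equation given there, and all names here are illustrative rather than part of the original system.

```python
from typing import Callable, Iterable, List

def rank_candidates(
    gn: str,
    snippets: Iterable[str],
    extract: Callable[[Iterable[str]], List[str]],
    lscore: Callable[[str, str], float],
    sscore: Callable[[str, str], float],
) -> List[str]:
    """Steps 1 and 2 of the pipeline: pull candidates out of the snippets
    retrieved for GN, then score each one by LScore * SScore.  The third
    step (inverse-search verification) re-weights the top candidates and
    is sketched separately in Section 3.3."""
    scores = {c: lscore(c, gn) * sscore(c, gn) for c in extract(snippets)}
    return sorted(scores, key=scores.get, reverse=True)

# Toy usage with stand-in scoring functions.
ranked = rank_candidates(
    "GN",
    ["... The Old Man and the Sea ..."],
    extract=lambda snips: ["THE OLD MAN AND THE SEA"],
    lscore=lambda c, gn: 1.0,
    sscore=lambda c, gn: 2.0,
)
print(ranked)
```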
3.1 Retrieving Candidates
Before we can identify possible candidates, we must retrieve them first. In the Traditional Chinese snippets returned by Google, there are still many English fragments. Therefore, the first task of our system is to separate these English fragments into NEs and non-NEs. We propose a simple method to recognize possible NEs. All fragments conforming to the following properties are recognized as NEs:
· The first and the last word of the fragment are numerals or capitalized.
· There are no three or more consecutive lowercase words in the fragment.
· The whole fragment is within one sentence.

After retrieving possible NEs from the returned snippets, there is still some work to do to make a finer candidate list for verification. First, there might be many different forms of the same NE. For example, "Mr. & Mrs. Smith" may also appear in the forms "Mr. and Mrs. Smith", "Mr. And Mrs. Smith", and so on. To deal with these aliasing forms, we transform all different forms into a standard form for later ranking and identification. The standard form follows these rules:
· All letters are transformed into upper case.
· Words containing apostrophes are split.
· Symbols are rewritten into words.
For example, all forms of "Mr. & Mrs. Smith" are transformed into "MR. AND MRS. SMITH".

The second piece of work we should complete before ranking is filtering useless substrings. An NE may comprise many single words. These component words may all be capitalized, and thus all substrings of the NE would be fetched as candidates for our translation work. Therefore, substrings which always appear with the same preceding and following words are discarded here, since they would receive a zero recurrence score in the next step, as detailed in the next subsection.
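The recognition and normalization rules above are simple enough to state directly in code. The sketch below is one possible reading of them in Python; the apostrophe-splitting and symbol-rewriting details (only "&" is handled) are our interpretation of the rules, not the authors' exact implementation.

```python
import re

def looks_like_ne(fragment: str) -> bool:
    """Heuristic NE test following the three properties listed above.
    The fragment is assumed to be taken from a single sentence."""
    words = fragment.split()
    if not words:
        return False
    def boundary_ok(w: str) -> bool:
        return w[0].isdigit() or w[0].isupper()
    if not (boundary_ok(words[0]) and boundary_ok(words[-1])):
        return False
    run = 0
    for w in words:
        run = run + 1 if w[0].islower() else 0
        if run >= 3:                  # three or more consecutive lowercase words
            return False
    return True

def standard_form(fragment: str) -> str:
    """Normalize aliasing forms, e.g. 'Mr. & Mrs. Smith' -> 'MR. AND MRS. SMITH'."""
    s = fragment.replace("&", " and ")        # rewrite symbols into words
    s = re.sub(r"(\w)'(\w)", r"\1 '\2", s)    # split words containing apostrophes
    s = re.sub(r"\s+", " ", s).strip()
    return s.upper()

print(looks_like_ne("The Old Man and the Sea"))   # True
print(standard_form("Mr. & Mrs. Smith"))          # MR. AND MRS. SMITH
print(standard_form("Mr. and Mrs. Smith"))        # MR. AND MRS. SMITH
```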
3.2 Evaluating Candidates
After candidate retrieving, we obtain a sequence of m candidates, C1, C2, ..., Cm. An integrated evaluating model is introduced to exploit four features (phonetic values, word senses, recurrences, and relative positions) to score these m candidates, as the following equation suggests:

Score(Ci, GN) = SScore(Ci, GN) × LScore(Ci, GN)

LScore(Ci, GN) combines phonetic values and word senses to evaluate the lexical similarity between Ci and GN. SScore(Ci, GN) concerns both recurrence information and relative positions to evaluate the statistical relationship between Ci and GN. These two scores are then combined to obtain Score(Ci, GN). How to estimate LScore(Ci, GN) and SScore(Ci, GN) is discussed in detail in the following subsections.

3.2.1 Lexical Similarity
The lexical similarity concerns both phonetic values and word senses. An NE may consist of many single words. These component words may be translated either by phonetic values or by word senses. Given a translation pair, we can split the two names into fragments which can be bipartite matched according to their translation relationships, as Figure 4 shows.

Figure 4. The translation relationships of an example translation pair.

To identify the lexical similarity between two NEs, we estimate the similarity scores between the matched fragment pairs first, and then sum them up as a total score. We postulate that the matching with the highest score is the correct matching. Therefore the problem becomes a weighted bipartite matching problem, i.e., given the similarity scores between all fragment pairs, find the bipartite matching with the highest score. In this way, our next problem is how to estimate the similarity scores between fragments.

We treat an English single word as a fragment unit, i.e., each English single word corresponds to one fragment. An English candidate Ci consisting of n single words is split into n fragment units, Ci1, Ci2, ..., Cin. A Chinese fragment unit may comprise one to four characters, and fragment units may overlap each other. A fragment unit of GN is written as GNab, which denotes the ath to bth characters of GN, where b - a < 4. The linguistic similarity score between two fragments is:

LSim(GNab, Cij) = max{PVSim(GNab, Cij), WSSim(GNab, Cij)}

where PVSim() estimates the similarity in phonetic values and WSSim() estimates it in word senses.

Phonetic Value
In this paper, we adopt a simple but novel method to estimate the similarity in phonetic values. Unlike many approaches, we do not introduce an intermediate phonetic alphabet system for comparison. We first transform the Chinese fragments into possible English strings, and then estimate the similarity between the transformed strings and the English candidates on surface strings, as Figure 5 shows. However, similar pronunciations do not imply similar surface strings: two quite dissimilar strings may have very similar pronunciations. We therefore take the following strategy: generate all possible transformations, and regard the one with the highest similarity as the English candidate.

Figure 5. Phonetic similarity estimation of our system.

Edit distances are usually used to estimate the surface similarity between strings. However, the typical edit distance does not completely satisfy the requirements in the context of translation identification. In translation, vowels are an unreliable feature. There are many variations in the pronunciation of vowels, and the combinations of vowels are numerous. Different combinations of vowels may have the same phonetic value, while the same combination may be pronounced quite differently. Worst of all, humans often arbitrarily determine the pronunciation of unfamiliar vowel combinations in translation. For these reasons, we adopt the strategy that vowels can be ignored in transformation. That is to say, when it is hard to determine which vowel combination should be generated from a given Chinese fragment, we can transform only the more certain consonant part. Thus, during the calculation of edit distances, insertions of vowels are not counted. Finally, the modified edit distance between two strings A and B is defined as follows:

ED_AB(0, t) = t
ED_AB(s, 0) = s
ED_AB(s, t) = min{ ED_AB(s, t-1) + Ins(t), ED_AB(s-1, t) + 1, ED_AB(s-1, t-1) + Rep(s, t) }
Ins(t) = 0 if B_t is a vowel, 1 if B_t is a consonant
Rep(s, t) = 0 if A_s = B_t, 1 otherwise

The modified edit distances are then transformed into similarity scores:

PVSim(A, B) = 1 - ED_AB(Len(A), Len(B)) / max{Len(A), Len(B)}

where Len() denotes the length of the string. With the above equation, the similarity scores range from 0 to 1.

We build the transformation table manually. All possible transformations from Chinese transliterating characters to corresponding English strings are built. If we cannot precisely indicate which vowel combination should be transformed, or there are too many possible combinations, we ignore the vowels. We then use a training set of 3,000 transliteration names to examine possible omissions due to human ignorance.
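The modified edit distance and PVSim can be implemented directly from the recurrence above. The following Python sketch assumes uppercase strings, with A the string produced by the (manually built) transformation table and B the English candidate; the sample strings are hypothetical.

```python
VOWELS = set("AEIOU")

def modified_edit_distance(a: str, b: str) -> int:
    """ED_AB from the recurrence above: inserting a vowel of B is free,
    every other insertion, deletion, or mismatching replacement costs 1.
    a is the transformed (largely consonant-only) Chinese string and
    b is the English candidate; both are assumed to be uppercase."""
    m, n = len(a), len(b)
    d = [[0] * (n + 1) for _ in range(m + 1)]
    for t in range(n + 1):
        d[0][t] = t
    for s in range(m + 1):
        d[s][0] = s
    for s in range(1, m + 1):
        for t in range(1, n + 1):
            ins = 0 if b[t - 1] in VOWELS else 1        # Ins(t)
            rep = 0 if a[s - 1] == b[t - 1] else 1      # Rep(s, t)
            d[s][t] = min(d[s][t - 1] + ins,   # insert B_t
                          d[s - 1][t] + 1,     # delete A_s
                          d[s - 1][t - 1] + rep)
    return d[m][n]

def pv_sim(a: str, b: str) -> float:
    """PVSim(A, B) = 1 - ED_AB(Len(A), Len(B)) / max{Len(A), Len(B)}."""
    if not a and not b:
        return 1.0
    return 1.0 - modified_edit_distance(a, b) / max(len(a), len(b))

# A hypothetical consonant-only transformation "JDN" against the candidate "JORDAN".
print(round(pv_sim("JDN", "JORDAN"), 3))   # 0.833
```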
Word Senses
More or less similarly to the estimation of phonetic similarity, we do not use an intermediate representation of meanings to estimate word sense similarity. We treat the English translations in the C-E bilingual dictionary (reference removed for blind review) directly as the word senses of their corresponding Chinese word entries. All the Chinese foreign names appearing in the test data are removed from the dictionary. We adopt a simple 0-or-1 estimation of the word sense similarity between two strings A and B, as the following equation suggests:

WSSim(A, B) = 1 if B is a translation of A in the dictionary, and 0 if it is not.

From the above equations we can derive that LSim() of fragment pairs also ranges from 0 to 1. Candidates to be evaluated may comprise different numbers of component words, and this would result in different scoring bases for the weighted bipartite matching. We should normalize the resulting scores of the bipartite matching. As a result, the following equation is applied:

LScore(Ci, GN) = min{ Σ LSim(GNab, Cij) / (total # of words in Ci), Σ LSim(GNab, Cij) × (b - a + 1) / (total # of characters in GN) }

where both sums run over all matched pairs GNab and Cij.

3.2.2 Statistical Similarity
Two pieces of information are considered together to estimate the statistical similarity: recurrences and relative positions. A candidate Ci might appear l times in the returned snippets, as Ci,1, Ci,2, ..., Ci,l. For each Ci,k, we find the distance between it and the nearest GN in the returned snippets, and then compute the relative position score by the following equation:

RP(Ci,k, GN) = 1 / (Distance(GN, Ci,k) / 4 + 1)

In other words, if the candidate is adjacent to the given NE, it has a relative position score of 1. The relative position scores of all Ci,k are summed up to obtain the primitive statistical score:

PSS(Ci, GN) = Σ_k RP(Ci,k, GN)

As we mentioned before, owing to the imprecision of NE recognition, most substrings of NEs are also recognized as candidates. This causes a problem. There are often typos in the information provided on the Internet. If some component word of an NE is misspelled, the substrings constituted by the remaining words would have a higher statistical score than the correct NE. To prevent such situations, we introduce the entropy of the context of the candidate. If a candidate has a more varied context, it is more likely to be an independent term instead of a substring of another term. Entropy provides exactly this property: the more varied the possible cases are, the higher the entropy, and vice versa. The entropy function here concerns the possible cases of the most adjacent word at both ends of the candidate, as the following equation suggests:

Entropy(Context of Ci) = 0, if the number of possible contexts of Ci is 1
Entropy(Context of Ci) = - Σ_r (NCTr / NCi) × log_NPTi (NCTr / NCi), otherwise

where NCTr and NCi denote the numbers of occurrences of the rth context CTr and of the candidate Ci in the returned snippets, respectively, and NPTi denotes the total number of different contexts of Ci. Since we want to normalize the entropy to the range 0 to 1, we take NPTi as the base of the logarithm function. When considering context combinations, only capitalized English words are discriminated; all other words are viewed as one sort, "OTHER". For example, assuming the context of "David" comprises three occurrences of (Craig, OTHER), three occurrences of (OTHER, Stern), and six occurrences of (OTHER, OTHER), then:

Entropy(Context of "David") = -(3/12 log_3(3/12) + 3/12 log_3(3/12) + 6/12 log_3(6/12)) = 0.946

Next we use Entropy(Context of Ci) to weight the primitive score PSS(Ci, GN) to obtain the final statistical score:

SScore(Ci, GN) = Entropy(Context of Ci) × PSS(Ci, GN)
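The statistical scoring just described can be checked with a few lines of Python. The distance unit (treated here as a token offset to the nearest occurrence of GN) and the data structures are our assumptions, and the single-context case is set to zero as implied by the substring-filtering discussion in Section 3.1; the entropy computation reproduces the 0.946 value of the worked example.

```python
import math
from collections import Counter

def rp(distance: float) -> float:
    """Relative position score: 1 / (Distance/4 + 1); an adjacent candidate gets 1.0."""
    return 1.0 / (distance / 4.0 + 1.0)

def pss(distances_to_gn) -> float:
    """Primitive statistical score: sum of RP over all occurrences of the candidate."""
    return sum(rp(d) for d in distances_to_gn)

def context_entropy(contexts) -> float:
    """Entropy of the candidate's contexts, with the number of distinct contexts
    as the logarithm base so the value is normalized to [0, 1].  A single context
    type yields 0, so substrings with a fixed neighbourhood are suppressed."""
    counts = Counter(contexts)
    n_types, total = len(counts), sum(counts.values())
    if n_types <= 1:
        return 0.0
    return -sum((c / total) * math.log(c / total, n_types) for c in counts.values())

def sscore(distances_to_gn, contexts) -> float:
    """SScore(Ci, GN) = Entropy(Context of Ci) * PSS(Ci, GN)."""
    return context_entropy(contexts) * pss(distances_to_gn)

# The worked example from the text: 3 x (Craig, OTHER), 3 x (OTHER, Stern) and
# 6 x (OTHER, OTHER) give an entropy of about 0.946.
ctx = [("Craig", "OTHER")] * 3 + [("OTHER", "Stern")] * 3 + [("OTHER", "OTHER")] * 6
print(round(context_entropy(ctx), 3))     # 0.946
print(round(sscore([0, 4, 12], ctx), 3))  # entropy * (1 + 0.5 + 0.25) = 1.656
```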
However, since there are too many candidates, we cannot perform this process on all candidates. Therefore, an elimination mechanism is adopted to select candidates for verification. The elimination mechanism works as follows: 1. Send the Top-3 candidates into Google for verification. 2. Count SScore(GN, Ci). (Notice that the order of the parameter is reversed.) Re-weight Score(Ci, GN) by multiplying SScore(GN, Ci) 3. Re-rank candidates 4. After re-ranking, if new candidates become the Top-3 ones, redo the first step. Otherwise end this process. The candidates have been verified would be recorded to prevent duplicate re-weighting and unnecessary verification. There is one problem in verification we should concern. Since we only consider recurrence information in both directions, but not cooccurrence information, this would result some problem when dealing rarely used translations. For example, "Peter Pan" can be translated into "" or "" (both pronounced as BiDe-Pan) in Chinese, but most people would use the former translation. Thus if we send "Peter Pan" to verification when translating "", we would get a very low score. To deal with this situation, we adopt the strategy of disbelieving verification in some situa- Where NCTr and NCi denote the appearing times of the rth context CTr and the candidate Ci in the returned snippets respectively, and NPTi denotes the total number of different cases of the context of Ci. Since we want to normalize the entropy to 0~1, we take NPTi as the base of the logarithm function. While concerning context combinations, only capitalized English word is discriminated. All other words would be viewed as one sort "OTHER". For example, assuming the context of "David" comprises three times of (Craig, OTHER), three times of (OTHER, Stern), and six times of (OTHER, OTHER), then: Entropy (Context of " David") = -( 3 33 36 6 log 3 + log 3 + log 3 ) = 0.946 12 12 12 12 12 12 86 tions. If all candidates have scores lower than the threshold, we presume that the given NE is a rarely used translation. In this situation, we use only Score(Cn, GN) estimated by the evaluation step to rank its candidates, without multiplying SScore(GN, Ci) of the inverse search. The threshold is set to 1.5 by heuristic, since we consider that a commonly used translation is supposed to have their SScore() larger than 1 in both directions. 4 Experiments To evaluate the performance of our system, 15 common users are invited to provide 100 foreign NEs per user. These users are asked to simulate a scenario of using web search machine to perform cross-lingual information retrieval. The proportion of different types of NEs is roughly conformed to the real distribution, except for creation titles. We gathers a larger proportion of creation titles than other types of NEs, since the ways of translating creation titles is less regular and we may use them to test how much help could the web statistics provide. After removing duplicate entries provided by users, finally we obtain 1,119 nouns. Among them 7 are not NEs, 65 are originated from Oriental languages (Chinese, Japanese, and Korean), and the rest 1,047 foreign NEs are our main experimental subjects. Among these 1,047 names there are 455 personal names, 264 location names, 117 organization names, 196 creation titles, and 15 other types of NEs. Table 2 and Figure 5 show the performance of the system with different types of NEs. We could observe that the translating performance is best with location names. 
4 Experiments
To evaluate the performance of our system, 15 common users were invited to provide 100 foreign NEs each. These users were asked to simulate a scenario of using a web search engine to perform cross-lingual information retrieval. The proportions of the different types of NEs roughly conform to the real distribution, except for creation titles. We gathered a larger proportion of creation titles than of other types of NEs, since the ways of translating creation titles are less regular, and we can use them to test how much help the web statistics provide. After removing duplicate entries provided by the users, we finally obtained 1,119 nouns. Among them, 7 are not NEs, 65 originate from Oriental languages (Chinese, Japanese, and Korean), and the remaining 1,047 foreign NEs are our main experimental subjects. Among these 1,047 names there are 455 personal names, 264 location names, 117 organization names, 196 creation titles, and 15 NEs of other types.

Table 2 and Figure 6 show the performance of the system on different types of NEs.

Table 2. Experiment results of our system with different NE types.
Type | Total | Top-1 Num | Top-1 Recall | Top-2 Num | Top-2 Recall | Top-4 Num | Top-4 Recall | Top-M Num | Top-M Recall
PER | 455 | 408 | 89.7% | 430 | 94.5% | 436 | 95.8% | 443 | 97.3%
LOC | 264 | 242 | 91.7% | 252 | 95.5% | 253 | 95.8% | 264 | 100.0%
ORG | 117 | 98 | 83.8% | 106 | 90.6% | 108 | 92.3% | 114 | 97.4%
TITLE | 196 | 151 | 77.0% | 168 | 85.7% | 181 | 92.3% | 189 | 96.4%
Other | 15 | 10 | 66.7% | 13 | 86.7% | 14 | 93.3% | 15 | 100.0%
All NE | 1047 | 909 | 87.6% | 969 | 92.6% | 992 | 94.7% | 1025 | 97.9%
Oriental | 65 | 47 | 72.3% | 52 | 80.0% | 55 | 84.6% | 60 | 92.3%
Non-NE | 7 | 6 | 85.7% | 6 | 85.7% | 6 | 85.7% | 7 | 100.0%
Overall | 1119 | 962 | 86.0% | 1027 | 91.8% | 1053 | 94.1% | 1092 | 97.6%

Figure 6. Curve of recall at Top N versus ranking (1 to 29), for PER, LOC, ORG, Title, Other, Oriental, and Non-NE.

We observe that the translation performance is best on location names. This is within our expectation, since location names are one of the most limited NE types. Humans usually provide location names within a very limited range, and thus there are fewer location names with ambiguous translations and fewer rare location names in the test data. Besides, because most location names are purely transliterated, they give us some clues about the performance of our phonetic model.

Our system performs worst on creation titles. One reason is that the naming and translating styles of creation titles are less formulaic. Many titles are translated not by lexical information, but by semantic information or otherwise. For example, "Mr. & Mrs. Smith" is translated into Chinese as a title meaning roughly "Smiths' Mission", according to the content of the creation it denotes. Another reason is that many titles do not originate from English, such as "le Nozze di Figaro". As a result, the C-E bilingual dictionary cannot be used in recognizing word sense similarity. A more serious problem with titles is that they generally consist of more single words than other types of NEs. Therefore, in the snippets returned by Google, the correct translation is often cut off, which results in a great bias in estimating the statistical scores.

Table 3 compares the results of different feature combinations; it considers only the foreign NEs in the test data. From the results we can conclude that both statistical and lexical features are helpful for finding translations, while the inverse search is the key for our system to achieve a good performance.

Table 3. Experiment results of our system with different feature combinations.
Features | Top-1 Num | Top-1 Recall | Top-2 Num | Top-2 Recall | Top-4 Num | Top-4 Recall
SScore | 540 | 51.6% | 745 | 71.2% | 887 | 84.7%
LScore | 721 | 68.9% | 789 | 75.4% | 844 | 80.6%
SScore + LScore | 837 | 79.9% | 916 | 87.5% | 953 | 91.0%
+ Inverse Search | 909 | 87.6% | 969 | 92.6% | 992 | 94.7%

From the results we can also see that our system has a high recall of 94.7% when considering the top 4 candidates. If we count only the given NEs whose correct translations appear in the returned snippets, the recall rises to 96.8%. This may not yet be good enough for fully automatic applications, but it is certainly a good performance for user querying.
5 Conclusion
In this study we combine several relatively simple implementations of approaches that have been proposed in previous studies and obtain a very good performance. We find that the Internet is quite a good source for discovering NE translations. Using snippets returned by Google, we can efficiently reduce the number of possible candidates and acquire much useful information to verify these candidates. Since the number of candidates is generally smaller than when processing an unaligned corpus, simple models can perform the filtering quite well, and the over-fitting problem is thus prevented.

From the failure cases of our system (see Appendix A), we observe that the performance of this integrated approach could still be boosted by more sophisticated models, more extensive dictionaries, and more delicate training mechanisms. For example, performing stemming or adopting a more extensive dictionary might enhance the accuracy of estimating word sense similarity, and the statistical formula could be replaced by more formal measures such as co-occurrence or mutual information to make a more precise assessment of the statistical relationship. These tasks are our future work toward a more accurate and efficient NE translation system.

Appendix A. Some Failure Cases of Our System
(The GN column, which lists the given Chinese NEs, is omitted here.)
Top-1 | Correct Translation | Rank
CBS | SADDAM HUSSEIN | 2
JERSEY | NEW JERSEY | 2
ONLINE | ARABIAN NIGHTS | 2
ROYCE | ROLLS ROYCE | 2
NBA | JULIUS ERVING | 2
LAVIGNE | AVRIL LAVIGNE | 2
JK | JK. ROWLING | 2
RICKY DAVIS | CELTICS | 8
MONET | IMPRESSION SUNRISE | 9
TUPOLEV TU | USSR | 33
NBA | MEDVENDENKO | N/A
TOS | SYMPHONY NO. 5 | N/A
AROUND03 | CUORE | N/A
JACK LAYTON | DEMOCRATIC PARTY | N/A

References
Al-Onaizan, Yaser and Kevin Knight. 2002. Translating Named Entities Using Monolingual and Bilingual Resources. ACL 2002: 400-408.
Cheng, Pu-Jen, J.W. Teng, R.C. Chen, J.H. Wang, W.H. Lu, and L.F. Chien. 2004. Translating Unknown Queries with Web Corpora for Cross-Language Information Retrieval. SIGIR 2004: 146-153.
Feng, Donghui, Y. Lv, and M. Zhou. 2004. A New Approach for English-Chinese Named Entity Alignment. EMNLP 2004: 372-379.
Huang, Fei, Stephan Vogel, and Alex Waibel. 2004. Improving Named Entity Translation Combining Phonetic and Semantic Similarities. HLT-NAACL 2004: 281-288.
Lam, Wai, Ruizhang Huang, and Pik-Shan Cheung. 2004. Learning Phonetic Similarity for Matching Named Entity Translations and Mining New Translations. SIGIR 2004: 289-296.
Lee, Chun-Jen and Jason S. Chang. 2003. Acquisition of English-Chinese Transliterated Word Pairs from Parallel-Aligned Texts. HLT-NAACL 2003 Workshop on Data Driven MT: 96-103.
Lin, Wei-Hao and Hsin-Hsi Chen. 2002. Backward Machine Transliteration by Learning Phonetic Similarity. Proceedings of CoNLL-2002: 139-145.
Lu, Wen-Hsiang, Lee-Feng Chien, and Hsi-Jian Lee. 2004. Anchor Text Mining for Translation of Web Queries: A Transitive Translation Approach. ACM Transactions on Information Systems 22(2): 242-269.
Zhang, Ying, Fei Huang, and Stephan Vogel. 2005. Mining Translations of OOV Terms from the Web through Cross-lingual Query Expansion. SIGIR 2005: 669-670.
Zhang, Ying and Phil Vines. 2004. Using the Web for Automated Translation Extraction in Cross-Language Information Retrieval. SIGIR 2004: 162-169.