Wikibibliographie ENCYCLEN

WIKINDX Resources

Proceedings Article: BibTeX citation key:  fissahaadafre.651
Fissaha Adafre Sisay & de Rijke Maarten (2006). « Finding Similar Sentences across Multiple Languages in Wikipedia ». In 11th Conference on the European Chapter of the Association for Computational Linguistics, 3-7 avril 2006. Trento, Italy :
Added by: Laure Endrizzi 2007-11-16 07:26:32    Last edited by: Laure Endrizzi 2007-11-16 07:33:43
Categories: 4. interfaces et modes de consultation
Creators: Fissaha Adafre, de Rijke
Collection: 11th Conference on the European Chapter of the Association for Computational Linguistics

Number of views:  944
Popularity index:  7.87%

 
Abstract
We investigate whether the Wikipedia corpus is amenable to multilingual analysis
that aims at generating parallel corpora. We present the results of the application of two simple heuristics for the identification of similar text across multiple languages in Wikipedia. Despite the simplicity of the methods, evaluation carried out on a sample of Wikipedia pages shows encouraging results.
Added by: Laure Endrizzi

 
Further information may be found at:

 
Notes

Added by: Laure Endrizzi

 
>

 

wikindx  v3.4.7 ©2006 VST v 1.0     |     Total Resources:  611     |     Database queries:  33     |     Script execution:  0.28425 secs

 


École normale supérieure de Lyon
Institut français de l'Éducation
Veille et Analyses
15 parvis René-Descartes BP 7000 . 69342 Lyon cedex 07
Standard : +33 (0)4 72 76 61 00
Télécopie : +33 (0)4 72 76 61 93