000 04464nam a22004815i 4500
001 978-3-642-20128-8
003 DE-He213
005 20200421112043.0
007 cr nn 008mamaa
008 131213s2013 gw | s |||| 0|eng d
020 _a9783642201288
_9978-3-642-20128-8
024 7 _a10.1007/978-3-642-20128-8
_2doi
050 4 _aP98-98.5
072 7 _aUYQL
_2bicssc
072 7 _aCOM042000
_2bisacsh
082 0 4 _a006.35
_223
245 1 0 _aBuilding and Using Comparable Corpora
_h[electronic resource] /
_cedited by Serge Sharoff, Reinhard Rapp, Pierre Zweigenbaum, Pascale Fung.
264 1 _aBerlin, Heidelberg :
_bSpringer Berlin Heidelberg :
_bImprint: Springer,
_c2013.
300 _aXII, 335 p. 70 illus., 14 illus. in color.
_bonline resource.
336 _atext
_btxt
_2rdacontent
337 _acomputer
_bc
_2rdamedia
338 _aonline resource
_bcr
_2rdacarrier
347 _atext file
_bPDF
_2rda
505 0 _aPreface - Building and Using Comparable Corpora. S.Sharoff, R.Rapp, P.Zweigenbaum -- Overviewing Important Aspects of the Last 20 Years of Research in Comparable Corpora -- S.Sharoff, R.Rapp, P.Zweigenbaum -- Part I: Compiling and Measuring Comparable Corpora -- Multilingual Corpus Collection. S.Shi, P.Fung -- Automatic Comparable Web Corpora Collection and Bilingual Terminology Extraction for Specialized Dictionary Making. A.Gurrutxaga, I.Leturia, I.San Vicente, X.Saralegi -- Statistical Comparability: Methodological Caveats. R.K�ohler -- Methods for Collection and Evaluation of Comparable Documents. M.Lestari Paramita, D.Guthrie, E.Kanoulas, R.Gaizauskas, P.Clough and M.Sanderson -- Measuring the Distance between Comparable Corpora between Languages. S.Sharoff -- Exploiting Comparable Corpora for Lexicon Extraction: Measuring and Improving Corpus Quality. B.Li, E.Gaussier -- Statistical Corpus and Language Comparison on Comparable Corpora. T.Eckart, U.Quasthoff -- Comparable Multilingual Patents as Large-scale Parallel Corpora. B.Lu and B.Tsou -- Part II: Using Comparable Corpora -- Extracting Parallel Phrases from Comparable Data. S.Hewavitharana, S.Vogel -- Exploiting Comparable Corpora. D.S.Munteanu, D.Marcu -- Paraphrase Detection in Comparable Monolingual Corpora. L.Deleger, B.Cartoni, P.Zweigenbaum -- Information Network Construction and Alignment from Automatically Acquired Comparable Corpora. H.Ji, W.-P.Lin -- Bilingual Terminology Mining from Comparable Corpora. B.Daille, E.Morin -- The Place of Comparable Corpora in Providing Terminological Reference Information to Online Translators: A Strategic Framework. K.Kageura, T.Abekawa -- Old Needs, New Solutions: Comparable Corpora for Language Professionals. S.Bernardini, A.Ferraresi -- Exploiting the Incomparability of Comparable Corpora for Contrastive Linguistics and Translation Studies. S.Neumann, S.Hansen-Schirra.
520 _aThe 1990s saw a paradigm change in the use of corpus-driven methods in NLP. In the field of multilingual NLP (such as machine translation and terminology mining) this implied the use of parallel corpora. However, parallel resources are relatively scarce: many more texts are produced daily by native speakers of any given language than translated. This situation resulted in a natural drive towards the use of comparable corpora, i.e. non-parallel texts in the same domain or genre. Nevertheless, this research direction has not produced a single authoritative source suitable for researchers and students coming to the field. The proposed volume provides a reference source, identifying the state of the art in the field as well as future trends. The book is intended for specialists and students in natural language processing, machine translation and computer-assisted translation.
650 0 _aComputer science.
650 0 _aComputational linguistics.
650 1 4 _aComputer Science.
650 2 4 _aLanguage Translation and Linguistics.
650 2 4 _aComputational Linguistics.
650 2 4 _aInformation Systems Applications (incl. Internet).
700 1 _aSharoff, Serge.
_eeditor.
700 1 _aRapp, Reinhard.
_eeditor.
700 1 _aZweigenbaum, Pierre.
_eeditor.
700 1 _aFung, Pascale.
_eeditor.
710 2 _aSpringerLink (Online service)
773 0 _tSpringer eBooks
776 0 8 _iPrinted edition:
_z9783642201271
856 4 0 _uhttp://dx.doi.org/10.1007/978-3-642-20128-8
912 _aZDB-2-SCS
942 _cEBK
999 _c56776
_d56776