000 | 04553nam a22005655i 4500 | ||
---|---|---|---|
001 | 978-3-031-02174-9 | ||
003 | DE-He213 | ||
005 | 20240730165216.0 | ||
007 | cr nn 008mamaa | ||
008 | 220601s2020 sz | s |||| 0|eng d | ||
020 |
_a9783031021749 _9978-3-031-02174-9 |
||
024 | 7 |
_a10.1007/978-3-031-02174-9 _2doi |
|
050 | 4 | _aQ334-342 | |
050 | 4 | _aTA347.A78 | |
072 | 7 |
_aUYQ _2bicssc |
|
072 | 7 |
_aCOM004000 _2bisacsh |
|
072 | 7 |
_aUYQ _2thema |
|
082 | 0 | 4 |
_a006.3 _223 |
100 | 1 |
_aDror, Rotem. _eauthor. _4aut _4http://id.loc.gov/vocabulary/relators/aut _987852 |
|
245 | 1 | 0 |
_aStatistical Significance Testing for Natural Language Processing _h[electronic resource] / _cby Rotem Dror, Lotem Peled-Cohen, Segev Shlomov, Roi Reichart. |
250 | _a1st ed. 2020. | ||
264 | 1 |
_aCham : _bSpringer International Publishing : _bImprint: Springer, _c2020. |
|
300 |
_aXVII, 98 p. _bonline resource. |
||
336 |
_atext _btxt _2rdacontent |
||
337 |
_acomputer _bc _2rdamedia |
||
338 |
_aonline resource _bcr _2rdacarrier |
||
347 |
_atext file _bPDF _2rda |
||
490 | 1 |
_aSynthesis Lectures on Human Language Technologies, _x1947-4059 |
|
505 | 0 | _aPreface -- Acknowledgments -- Introduction -- Statistical Hypothesis Testing -- Statistical Significance Tests -- Statistical Significance in NLP -- Deep Significance -- Replicability Analysis -- Open Questions and Challenges -- Conclusions -- Bibliography -- Authors' Biographies. | |
520 | _aData-driven experimental analysis has become the main evaluation tool of Natural Language Processing (NLP) algorithms. In fact, in the last decade, it has become rare to see an NLP paper, particularly one that proposes a new algorithm, that does not include extensive experimental analysis, and the number of involved tasks, datasets, domains, and languages is constantly growing. This emphasis on empirical results highlights the role of statistical significance testing in NLP research: If we, as a community, rely on empirical evaluation to validate our hypotheses and reveal the correct language processing mechanisms, we better be sure that our results are not coincidental. The goal of this book is to discuss the main aspects of statistical significance testing in NLP. Our guiding assumption throughout the book is that the basic question NLP researchers and engineers deal with is whether or not one algorithm can be considered better than another one. This question drivesthe field forward as it allows the constant progress of developing better technology for language processing challenges. In practice, researchers and engineers would like to draw the right conclusion from a limited set of experiments, and this conclusion should hold for other experiments with datasets they do not have at their disposal or that they cannot perform due to limited time and resources. The book hence discusses the opportunities and challenges in using statistical significance testing in NLP, from the point of view of experimental comparison between two algorithms. We cover topics such as choosing an appropriate significance test for the major NLP tasks, dealing with the unique aspects of significance testing for non-convex deep neural networks, accounting for a large number of comparisons between two NLP algorithms in a statistically valid manner (multiple hypothesis testing), and, finally, the unique challenges yielded by the nature of the data and practices of the field. | ||
650 | 0 |
_aArtificial intelligence. _93407 |
|
650 | 0 |
_aNatural language processing (Computer science). _94741 |
|
650 | 0 |
_aComputational linguistics. _96146 |
|
650 | 1 | 4 |
_aArtificial Intelligence. _93407 |
650 | 2 | 4 |
_aNatural Language Processing (NLP). _931587 |
650 | 2 | 4 |
_aComputational Linguistics. _96146 |
700 | 1 |
_aPeled-Cohen, Lotem. _eauthor. _4aut _4http://id.loc.gov/vocabulary/relators/aut _987854 |
|
700 | 1 |
_aShlomov, Segev. _eauthor. _4aut _4http://id.loc.gov/vocabulary/relators/aut _987856 |
|
700 | 1 |
_aReichart, Roi. _eauthor. _4aut _4http://id.loc.gov/vocabulary/relators/aut _987857 |
|
710 | 2 |
_aSpringerLink (Online service) _987859 |
|
773 | 0 | _tSpringer Nature eBook | |
776 | 0 | 8 |
_iPrinted edition: _z9783031001857 |
776 | 0 | 8 |
_iPrinted edition: _z9783031010460 |
776 | 0 | 8 |
_iPrinted edition: _z9783031033025 |
830 | 0 |
_aSynthesis Lectures on Human Language Technologies, _x1947-4059 _987861 |
|
856 | 4 | 0 | _uhttps://doi.org/10.1007/978-3-031-02174-9 |
912 | _aZDB-2-SXSC | ||
942 | _cEBK | ||
999 |
_c86160 _d86160 |