000 03828nam a22005295i 4500
001 978-3-031-01912-8
003 DE-He213
005 20240730163444.0
007 cr nn 008mamaa
008 220601s2018 sz | s |||| 0|eng d
020 _a9783031019128
_9978-3-031-01912-8
024 7 _a10.1007/978-3-031-01912-8
_2doi
050 4 _aQA76.9.D343
072 7 _aUNF
_2bicssc
072 7 _aUYQE
_2bicssc
072 7 _aCOM021030
_2bisacsh
072 7 _aUNF
_2thema
072 7 _aUYQE
_2thema
082 0 4 _a006.312
_223
100 1 _aRen, Xiang.
_eauthor.
_4aut
_4http://id.loc.gov/vocabulary/relators/aut
_978640
245 1 0 _aMining Structures of Factual Knowledge from Text
_h[electronic resource] :
_bAn Effort-Light Approach /
_cby Xiang Ren, Jiawei Han.
250 _a1st ed. 2018.
264 1 _aCham :
_bSpringer International Publishing :
_bImprint: Springer,
_c2018.
300 _aXV, 183 p.
_bonline resource.
336 _atext
_btxt
_2rdacontent
337 _acomputer
_bc
_2rdamedia
338 _aonline resource
_bcr
_2rdacarrier
347 _atext file
_bPDF
_2rda
490 1 _aSynthesis Lectures on Data Mining and Knowledge Discovery,
_x2151-0075
505 0 _aAcknowledgments -- Introduction -- Background -- Literature Review -- Entity Recognition and Typing with Knowledge Bases -- Fine-Grained Entity Typing with Knowledge Bases -- Synonym Discovery from Large Corpus -- Joint Extraction of Typed Entities and Relationships -- Pattern-Enhanced Embedding Learning for Relation Extraction -- Heterogeneous Supervision for Relation Extraction -- Indirect Supervision: Leveraging Knowledge from Auxiliary Tasks -- Mining Entity Attribute Values with Meta Patterns -- Open Information Extraction with Global Structure Cohesiveness -- Open Information Extraction with Global Structure Cohesiveness -- Applications -- Conclusions -- Vision and Future Work -- Bibliography -- Authors' Biographies.
520 _aThe real-world data, though massive, is largely unstructured, in the form of natural-language text. It is challenging but highly desirable to mine structures from massive text data, without extensive human annotation and labeling. In this book, we investigate the principles and methodologies of mining structures of factual knowledge (e.g., entities and their relationships) from massive, unstructured text corpora. Departing from many existing structure extraction methods that have heavy reliance on human annotated data for model training, our effort-light approach leverages human-curated facts stored in external knowledge bases as distant supervision and exploits rich data redundancy in large text corpora for context understanding. This effort-light mining approach leads to a series of new principles and powerful methodologies for structuring text corpora, including (1) entity recognition, typing and synonym discovery, (2) entity relation extraction, and (3) open-domain attribute-valuemining and information extraction. This book introduces this new research frontier and points out some promising research directions.
650 0 _aData mining.
_93907
650 0 _aStatisticsĀ .
_931616
650 1 4 _aData Mining and Knowledge Discovery.
_978641
650 2 4 _aStatistics.
_914134
700 1 _aHan, Jiawei.
_eauthor.
_4aut
_4http://id.loc.gov/vocabulary/relators/aut
_92109
710 2 _aSpringerLink (Online service)
_978642
773 0 _tSpringer Nature eBook
776 0 8 _iPrinted edition:
_z9783031001079
776 0 8 _iPrinted edition:
_z9783031007842
776 0 8 _iPrinted edition:
_z9783031030406
830 0 _aSynthesis Lectures on Data Mining and Knowledge Discovery,
_x2151-0075
_978643
856 4 0 _uhttps://doi.org/10.1007/978-3-031-01912-8
912 _aZDB-2-SXSC
942 _cEBK
999 _c84625
_d84625