000 03889nam a22005055i 4500
001 978-3-031-01853-4
003 DE-He213
005 20240730163732.0
007 cr nn 008mamaa
008 220601s2015 sz | s |||| 0|eng d
020 _a9783031018534
_9978-3-031-01853-4
024 7 _a10.1007/978-3-031-01853-4
_2doi
050 4 _aTK5105.5-5105.9
072 7 _aUKN
_2bicssc
072 7 _aCOM043000
_2bisacsh
072 7 _aUKN
_2thema
082 0 4 _a004.6
_223
100 1 _aDong, Xin Luna.
_eauthor.
_4aut
_4http://id.loc.gov/vocabulary/relators/aut
_980266
245 1 0 _aBig Data Integration
_h[electronic resource] /
_cby Xin Luna Dong, Divesh Srivastava.
250 _a1st ed. 2015.
264 1 _aCham :
_bSpringer International Publishing :
_bImprint: Springer,
_c2015.
300 _aXX, 178 p.
_bonline resource.
336 _atext
_btxt
_2rdacontent
337 _acomputer
_bc
_2rdamedia
338 _aonline resource
_bcr
_2rdacarrier
347 _atext file
_bPDF
_2rda
490 1 _aSynthesis Lectures on Data Management,
_x2153-5426
505 0 _aPreface -- Acknowledgments -- Getting Started -- From Services to Service Worlds -- The Human Condition -- Service Concepts -- Design and its Limits -- Service Design -- An anthropology of Services -- References -- Author Biographies.
520 _aThe big data era is upon us: data are being generated, analyzed, and used at an unprecedented scale, and data-driven decision making is sweeping through all aspects of society. Since the value of data explodes when it can be linked and fused with other data, addressing the big data integration (BDI) challenge is critical to realizing the promise of big data. BDI differs from traditional data integration along the dimensions of volume, velocity, variety, and veracity. First, not only can data sources contain a huge volume of data, but also the number of data sources is now in the millions. Second, because of the rate at which newly collected data are made available, many of the data sources are very dynamic, and the number of data sources is also rapidly exploding. Third, data sources are extremely heterogeneous in their structure and content, exhibiting considerable variety even for substantially similar entities. Fourth, the data sources are of widely differing qualities, with significant differences in the coverage, accuracy and timeliness of data provided. This book explores the progress that has been made by the data integration community on the topics of schema alignment, record linkage and data fusion in addressing these novel challenges faced by big data integration. Each of these topics is covered in a systematic way: first starting with a quick tour of the topic in the context of traditional data integration, followed by a detailed, example-driven exposition of recent innovative techniques that have been proposed to address the BDI challenges of volume, velocity, variety, and veracity. Finally, it presents merging topics and opportunities that are specific to BDI, identifying promising directions for the data integration community.
650 0 _aComputer networks .
_931572
650 0 _aData structures (Computer science).
_98188
650 0 _aInformation theory.
_914256
650 1 4 _aComputer Communication Networks.
_980267
650 2 4 _aData Structures and Information Theory.
_931923
700 1 _aSrivastava, Divesh.
_eauthor.
_4aut
_4http://id.loc.gov/vocabulary/relators/aut
_980268
710 2 _aSpringerLink (Online service)
_980269
773 0 _tSpringer Nature eBook
776 0 8 _iPrinted edition:
_z9783031007255
776 0 8 _iPrinted edition:
_z9783031029813
830 0 _aSynthesis Lectures on Data Management,
_x2153-5426
_980270
856 4 0 _uhttps://doi.org/10.1007/978-3-031-01853-4
912 _aZDB-2-SXSC
942 _cEBK
999 _c84929
_d84929