Multimodal Location Estimation of Videos and Images [electronic resource] / edited by Jaeyoung Choi, Gerald Friedland. - XII, 191 p. 80 illus. in color. online resource.

Introduction -- The Benchmark as a Research Catalyst: Charting the Progress of Geo-Prediction for Social Multimedia -- Large-scale Image Geolocalization -- Vision-based Fine-Grained Location Estimation -- Image-Based Positioning of Mobile Devices in Indoor Environments -- Application of Large-Scale Classification Techniques for Simple Location Estimation Experiments -- Collaborative Multimodal Location Estimation of Consumer Media -- Georeferencing Flickr resources based on multimodal features -- Human vs Machine: Establishing a Human Baseline for Multimodal Location Estimation -- Personalized Travel Navigation and Photo-Shooting Navigation Using Large-Scale Geotags.

This book presents an overview of the field of multimodal location estimation, i.e. using acoustic, visual, and/or textual cues to estimate the shown location of a video recording. The authors' sample research results in this field in a unified way integrating research work on this topic that focuses on different modalities, viewpoints, and applications. The book describes fundamental methods of acoustic, visual, textual, social graph, and metadata processing as well as multimodal integration methods used for location estimation. In addition, the text covers benchmark metrics and explores the limits of the technology based on a human baseline. �         Discusses localization of multimedia data; �         Examines fundamental methods of establishing location metadata for images and videos (other than GPS tagging); �         Covers Data-Driven as well as Semantic Location Estimation.

9783319098616

10.1007/978-3-319-09861-6 doi


Engineering.
Multimedia information systems.
Electrical engineering.
Engineering.
Signal, Image and Speech Processing.
Communications Engineering, Networks.
Multimedia Information Systems.

TK5102.9 TA1637-1638 TK7882.S65

621.382