Normal view MARC view ISBD view

Real-time Speech and Music Classification by Large Audio Feature Space Extraction [electronic resource] / by Florian Eyben.

By: Eyben, Florian [author.].
Contributor(s): SpringerLink (Online service).
Material type: materialTypeLabelBookSeries: Springer Theses, Recognizing Outstanding Ph.D. Research: Publisher: Cham : Springer International Publishing : Imprint: Springer, 2016Description: XXXVIII, 298 p. 41 illus., 39 illus. in color. online resource.Content type: text Media type: computer Carrier type: online resourceISBN: 9783319272993.Subject(s): Engineering | User interfaces (Computer systems) | Computational linguistics | Acoustical engineering | Engineering | Signal, Image and Speech Processing | User Interfaces and Human Computer Interaction | Engineering Acoustics | Computational LinguisticsAdditional physical formats: Printed edition:: No titleDDC classification: 621.382 Online resources: Click here to access online
Contents:
Abstract -- Introduction -- Acoustic Features and Modelling -- Standard Baseline Feature Sets -- Real-time Incremental Processing -- Real-life Robustness -- Evaluation -- Discussion and Outlook -- Appendix -- Mel-frequency Filterbank Parameters.
In: Springer eBooksSummary: This book reports on an outstanding thesis that has significantly advanced the state-of-the-art in the automated analysis and classification of speech and music. It defines several standard acoustic parameter sets and describes their implementation in a novel, open-source, audio analysis framework called openSMILE, which has been accepted and intensively used worldwide. The book offers extensive descriptions of key methods for the automatic classification of speech and music signals in real-life conditions and reports on the evaluation of the framework developed and the acoustic parameter sets that were selected. It is not only intended as a manual for openSMILE users, but also and primarily as a guide and source of inspiration for students and scientists involved in the design of speech and music analysis methods that can robustly handle real-life conditions.
    average rating: 0.0 (0 votes)
No physical items for this record

Abstract -- Introduction -- Acoustic Features and Modelling -- Standard Baseline Feature Sets -- Real-time Incremental Processing -- Real-life Robustness -- Evaluation -- Discussion and Outlook -- Appendix -- Mel-frequency Filterbank Parameters.

This book reports on an outstanding thesis that has significantly advanced the state-of-the-art in the automated analysis and classification of speech and music. It defines several standard acoustic parameter sets and describes their implementation in a novel, open-source, audio analysis framework called openSMILE, which has been accepted and intensively used worldwide. The book offers extensive descriptions of key methods for the automatic classification of speech and music signals in real-life conditions and reports on the evaluation of the framework developed and the acoustic parameter sets that were selected. It is not only intended as a manual for openSMILE users, but also and primarily as a guide and source of inspiration for students and scientists involved in the design of speech and music analysis methods that can robustly handle real-life conditions.

There are no comments for this item.

Log in to your account to post a comment.