000 | 08583nam a22006615i 4500 | ||
---|---|---|---|
001 | 978-3-031-48309-7 | ||
003 | DE-He213 | ||
005 | 20240730202734.0 | ||
007 | cr nn 008mamaa | ||
008 | 231121s2023 sz | s |||| 0|eng d | ||
020 |
_a9783031483097 _9978-3-031-48309-7 |
||
024 | 7 |
_a10.1007/978-3-031-48309-7 _2doi |
|
050 | 4 | _aQ334-342 | |
050 | 4 | _aTA347.A78 | |
072 | 7 |
_aUYQ _2bicssc |
|
072 | 7 |
_aCOM004000 _2bisacsh |
|
072 | 7 |
_aUYQ _2thema |
|
082 | 0 | 4 |
_a006.3 _223 |
245 | 1 | 0 |
_aSpeech and Computer _h[electronic resource] : _b25th International Conference, SPECOM 2023, Dharwad, India, November 29 - December 2, 2023, Proceedings, Part I / _cedited by Alexey Karpov, K. Samudravijaya, K. T. Deepak, Rajesh M. Hegde, Shyam S. Agrawal, S. R. Mahadeva Prasanna. |
250 | _a1st ed. 2023. | ||
264 | 1 |
_aCham : _bSpringer Nature Switzerland : _bImprint: Springer, _c2023. |
|
300 |
_aXXV, 642 p. 226 illus., 158 illus. in color. _bonline resource. |
||
336 |
_atext _btxt _2rdacontent |
||
337 |
_acomputer _bc _2rdamedia |
||
338 |
_aonline resource _bcr _2rdacarrier |
||
347 |
_atext file _bPDF _2rda |
||
490 | 1 |
_aLecture Notes in Artificial Intelligence, _x2945-9141 ; _v14338 |
|
505 | 0 | _aAutomatic Speech Recognition -- Extreme Learning Layer: A Boost for Spoken Digit Recognition with Spiking Neural Networks -- EMO-AVSR: Two-Level Approach for Audio-Visual Emotional Speech Recognition -- Significance of Audio Quality in Speech-to-Text Translation Systems -- Everyday Conversations: a Comparative Study of Expert Transcriptions and ASR Outputs at a Lexical Level -- Improving Automatic Speech Recognition with Dialect-Specific Language Models -- Emotional speech recognition of Holocaust survivors with deep neural network models for Russian language -- Computational Paralinguistics -- Aggregation Strategies of Wav2vec 2.0 Embeddings for Computational Paralinguistic Tasks -- Rhythm Formant Analysis for Automatic Depression Classification -- Determining Alcohol Intoxication Based on Speech and Neural Networks -- Linear Frequency Residual Cepstral Coefficients for Speech Emotion Recognition -- Enhancing Stutter Detection in Speech using Zero Time Windowing Cepstral Coefficients and Phase Information -- Source and System-based Modulation Approach for Fake Speech Detection -- Digital Signal Processing -- Investigation of Different Calibration Methods for Deep Speaker Embedding based Verification Systems -- Learning to Predict Speech Intelligibility from Speech Distortions -- Sparse Representation Frameworks for Acoustic Scene Classification -- Driver Speech Detection in Real Driving Scenario -- Regularization based Incremental Learning in TCNN for Robust Speech Enhancement Targeting Effective Human Machine Interaction -- Candidate Speech Extraction from Multi-Speaker Single-Channel Audio Interviews -- Post-Processing of Translated Speech by Pole Modification and Residual Enhancement to Improve Perceptual Quality -- Region Normalized Capsule Network based Generative Adversarial Network for Non-Parallel Voice Conversion -- Speech Enhancement using LinkNet Architecture -- ATT:Adversarial Trained Transformer for Speech Enhancement -- Human Identification by Dynamics of Changes in Brain Frequencies Using Artificial Neural Networks -- Speech Prosody -- Analysis of Formant Trajectories of a Speech Signal for the Purpose of Forensic Identification of a Foreign Speaker -- Gestures vs. Prosodic Structure in Laboratory Ironic Speech -- Sounds of < sil > ence: Acoustics of Inhalation in Read Speech -- Prolongations as Hesitation Phenomena in Spoken Speech in First and Second Language -- Study of Indian English Pronunciation Variabilities Relative to Received Pronunciation -- Multimodal Collaboration in Expository Discourse: Verbal and Nonverbal Moves Alignment -- Association of Time Domain Features with Oral Cavity Configuration during Vowel Production and its Application in Vowel Recognition -- Prosodic Interaction Models in a Conversation -- Natural Language Processing -- Development and Research of Dialogue Agents with Long-Term Memory and Web Search -- Pre- and Post-Textual Contexts in Assessment of a Message as Offensive or Defensive Aggression Verbalization -- Boosting Rule-based Grapheme-to-Phoneme Conversion with Morphological Segmentation and Syllabification in Bengali -- Revisiting Assessment of Text Complexity: Lexical and Syntactic Parameters Fluctuations -- Analysis of Natural Language Understanding Systems with L2 Learner Specific Synthetic Grammatical Errors based on Parts-of-Speech -- On the Most Frequent Sequences of Words in Russian Spoken Everyday Language (Bigrams and Trigrams): An Experience of Classification -- Child Speech Processing -- Recognition of the Emotional State of Children by Video and Audio Modalities by Indian and Russian Experts -- Effect of Linear Prediction Order to Modify Formant Locations for Children Speech Recognition -- Gammatone-Filterbank based Pitch-Normalized Cepstral Coefficients for Zero-Resource Children's ASR -- System Assisted Vocal Response Analysis and Assessment of Autism in Children: A Machine Learning Based Approach -- Addressing Effects of Formant Dispersion and Pitch Sensitivity for the Development of Children's KWS System -- Development of Children's KWS System Perceptual Experiment and Automatic Recognition by Video, Audio and Text Modalities -- Linear Frequency Residual Features for Infant Cry Classification -- Speech Processing for Medicine -- Identification of Voice Disorders: A Comparative Study of Machine Learning Algorithms -- Transfer Learning using Whisper for Dysarthric Automatic Speech Recognition -- Significance of Duration Modification in Reducing Listening Effort of Slurred Speech from Patients with Traumatic Brain Injury -- Significance of Duration Modification in Reducing Listening Effort of Slurred Speech from Patients with Traumatic Brain Injury -- Respiratory Sickness Detection from Audio Recordings using CLIP Models -- Investigating the Effect of Data Impurity on the Detection Performances of Mental Disorders through Spoken Dialogues. | |
520 | _aThe two-volume proceedings set LNAI 14338 and 14339 constitutes the refereed proceedings of the 25th International Conference on Speech and Computer, SPECOM 2023, held in Dharwad, India, during November 29-December 2, 2023. The 94 papers included in these proceedings were carefully reviewed and selected from 174 submissions. They focus on all aspects of speech science and technology: automatic speech recognition; computational paralinguistics; digital signal processing; speech prosody; natural language processing; child speech processing; speech processing for medicine; industrial speech and language technology; speech technology for under-resourced languages; speech analysis and synthesis; speaker and language identification, verification and diarization. | ||
650 | 0 |
_aArtificial intelligence. _93407 |
|
650 | 0 |
_aComputer engineering. _910164 |
|
650 | 0 |
_aComputer networks . _931572 |
|
650 | 0 |
_aApplication software. _9172799 |
|
650 | 0 |
_aImage processing _xDigital techniques. _94145 |
|
650 | 0 |
_aComputer vision. _9172800 |
|
650 | 1 | 4 |
_aArtificial Intelligence. _93407 |
650 | 2 | 4 |
_aComputer Engineering and Networks. _9172801 |
650 | 2 | 4 |
_aComputer and Information Systems Applications. _9172802 |
650 | 2 | 4 |
_aComputer Imaging, Vision, Pattern Recognition and Graphics. _931569 |
700 | 1 |
_aKarpov, Alexey. _eeditor. _4edt _4http://id.loc.gov/vocabulary/relators/edt _9172803 |
|
700 | 1 |
_aSamudravijaya, K. _eeditor. _4edt _4http://id.loc.gov/vocabulary/relators/edt _9172804 |
|
700 | 1 |
_aDeepak, K. T. _eeditor. _4edt _4http://id.loc.gov/vocabulary/relators/edt _9172805 |
|
700 | 1 |
_aHegde, Rajesh M. _eeditor. _4edt _4http://id.loc.gov/vocabulary/relators/edt _9172806 |
|
700 | 1 |
_aAgrawal, Shyam S. _eeditor. _4edt _4http://id.loc.gov/vocabulary/relators/edt _9172807 |
|
700 | 1 |
_aPrasanna, S. R. Mahadeva. _eeditor. _4edt _4http://id.loc.gov/vocabulary/relators/edt _9172808 |
|
710 | 2 |
_aSpringerLink (Online service) _9172809 |
|
773 | 0 | _tSpringer Nature eBook | |
776 | 0 | 8 |
_iPrinted edition: _z9783031483080 |
776 | 0 | 8 |
_iPrinted edition: _z9783031483103 |
830 | 0 |
_aLecture Notes in Artificial Intelligence, _x2945-9141 ; _v14338 _9172810 |
|
856 | 4 | 0 | _uhttps://doi.org/10.1007/978-3-031-48309-7 |
912 | _aZDB-2-SCS | ||
912 | _aZDB-2-SXCS | ||
912 | _aZDB-2-LNC | ||
942 | _cELN | ||
999 |
_c97154 _d97154 |