Catalogue of the Deutsche Nationalbibliothek
Link to this record | https://d-nb.info/1378795296 |
Title | Speech and Computer : 27th International Conference, SPECOM 2025, Szeged, Hungary, October 13–15, 2025, Proceedings, Part I / edited by Alexey Karpov, Gábor Gosztolya |
Person(s) |
Karpov, Aleksej (editor); Gosztolya, Gábor (editor) |
Organisation(s) | SpringerLink (Online service) (other) |
Edition | 1st ed. 2026 |
Publisher | Cham : Springer Nature Switzerland, Imprint: Springer |
Date | Publication date: 2026 |
Extent/Format | Online resource, XXIII, 347 p., 105 illus., 93 illus. in color |
Other edition(s) |
Printed edition: ISBN 978-3-032-07955-8; Printed edition: ISBN 978-3-032-07957-2 |
Contents | Invited Paper. -- Towards Responsible Multimodal Modeling for Mental Healthcare. -- Speech Perception and Synthesis. -- When Voice Matters: Evidence of Gender Disparity in Positional Bias of SpeechLLMs. -- WhiSQA: Non-Intrusive Speech Quality Prediction using Whisper Encoder Features. -- Prompting the Mind: EEG-to-Text Translation with Multimodal LLMs and Semantic Contro. -- Effectiveness of Tacotron2 for Intonation Model Synthesis in Russian. -- Enhancing Sinhala Text-to-Speech with End-to-End VITS Architecture. -- Computational Paralinguistics. -- Spoken Emotion Recognition using Soft Labels. -- NAMTalk: From Muscle Vibrations to Emotional Speech. -- What Do LLMs Know about Human Emotions? The Russian Case Study. -- Emotions Manifestation by Adolescents with Intellectual Disabilities. -- Retention-Augmented Voice Assistant: A Lightweight Architecture for Stateful Interaction with Comprehensive Evaluation and Privacy-Preserving Design. -- Speech Processing for Healthcare. -- Investigation of Explainable Multimodal Methods for Detecting Mental Disorders. -- Attention Deficit Hyperactivity Disorder: Identifying Approaches for Early Diagnosis, a Pilot Study. -- Text-to-Dysarthric-Speech Generation for Dysarthric Automatic Speech Recognition: Is Purely Synthetic Data Enough? -- Colour Preferences in Schizophrenic Speech. -- Automated Assessment of Phrase Intelligibility for Russian Speech Based on Esophageal Voice. -- Speech and Language Resources. -- Subtle Changes in L1 Stops of Late Salento Italian-French Bilinguals: An Acoustic Study using AutoVOT Adapted for Italian and French. -- Sound and Colour in Phonosemantics: Perceptual and Acoustic Correlates of Mongolian Vowels. -- Rhythmic Diglossia Based on Discourse Types and Dialects of English: Australian and New Zealand Corpora. -- Automatic Annotation of Discourse and Speech Formulas in Internet Communication: A Telegram Comment Corpus. -- Speaker Recognition. -- Effect of Spoof Speech on Forensic Voice Comparison using Deep Speaker Embeddings. -- Source Vendor Tracing of Audio Deepfakes. -- Language-Specific Adaptation Strategies for Speaker Recognition using MobileNet. -- Enhancing Audio Replay Attack Detection with Silence-based Blind Channel Impulse Response Estimation |
Persistent Identifier |
URN: urn:nbn:de:101:1-2510130419267.253456856249; DOI: 10.1007/978-3-032-07956-5 |
URL | https://doi.org/10.1007/978-3-032-07956-5 |
ISBN/Binding/Price | 978-3-032-07956-5 |
Language(s) | English (eng) |
Relations | Lecture Notes in Artificial Intelligence ; 16187 |
DDC notation | 006.45 (machine-generated DDC short notation) |
Subject group(s) | 004 Computer science |
