Katalog der Deutschen Nationalbibliothek
Ergebnis der Suche nach: "Image"
![]() |
|
Link zu diesem Datensatz | https://d-nb.info/1378795334 |
Titel | Speech and Computer : 27th International Conference, SPECOM 2025, Szeged, Hungary, October 13–15, 2025, Proceedings, Part II / edited by Alexey Karpov, Gábor Gosztolya |
Person(en) |
Karpov, Aleksej (Herausgeber) Gosztolya, Gábor (Herausgeber) |
Organisation(en) | SpringerLink (Online service) (Sonstige) |
Ausgabe | 1st ed. 2026 |
Verlag | Cham : Springer Nature Switzerland, Imprint: Springer |
Zeitliche Einordnung | Erscheinungsdatum: 2026 |
Umfang/Format | Online-Ressource, XXIII, 347 p. 92 illus., 81 illus. in color. : online resource. |
Andere Ausgabe(n) |
Printed edition:: ISBN: 978-3-032-07958-9 Printed edition:: ISBN: 978-3-032-07960-2 |
Inhalt | -- Automatic Speech Recognition. -- In-Domain SSL Pre-Training and Streaming ASR: Application to Air Traffic Control Communications. -- Evaluating the Performance of Several ASR Systems in Environmental and Industrial Noise. -- Ground Truth-Free WER Prediction for ASR via Audio Quality and Model Confidence Features. -- Enhancing Speech Recognition through Text-to-Speech and Voice Conversion Augmentation. -- Best Data is more Supervised Data - Even for Hungarian ASR. -- Arabic ASR on the SADA Large-Scale Arabic Speech Corpus with Transformer-based Models. -- Speech Processing for Under-Resourced Languages. -- Effect of Increased Temporal Resolution on Speech Recognition for French Quebec using Features from Speech Self-Supervised Learning Models. -- Modeling Intra-Word Code-Switching for Karelian ASR. -- Improving Whisper-based Serbian ASR using Synthetic Speech. -- Domain Knowledge and Language Embeddings for Low-Resource Multilingual Phoneme ASR. -- Whistler Identification in Whistled Spanish (Silbo): A Case Study. -- Digital Speech Processing. -- PinkVocalTransformer: Neural Acoustic-to-Articulatory Inversion based on the Pink Trombone. -- CrossMP-SENet: Transformer-based Cross-Attention for Joint Magnitude-Phase Speech Enhancement. -- Adaptive Singing Voice Enhancement for Live Stages. -- Revealing the Hidden Temporal Structure of HubertSoft Embeddings based on the Russian Phonetic Corpus. -- Natural Language Processing. -- Analyzing Web-Scraped and Generated Inputs for Automatic and Scalable Intent Classification. -- Enhancing Retrieval Performance via LLM Hard-Negative Filtering. -- Sector-Wise Backpropagation for Low-Resource Text Classification in Deep Models. -- High-Frequency Multiword Units and the Typological Distribution of Multiword Units in Spoken Russian. -- Estimation of the Genre Composition of the English Subcorpus of the Google Books Ngram. -- Multimodal Systems. -- Ensembling Synchronisation-based and Face-Voice Association Paradigms for Robust Active Speaker Detection in Egocentric Recordings. -- Phonetic and Visual Characteristics of Cognitive Load. -- Cognitive Humor Processing in the Russian and English Internet Meme Chatting: EEG Study. -- Saudi Sign Language Translation Using T5 |
Persistent Identifier |
URN: urn:nbn:de:101:1-2510130420170.682311780670 DOI: 10.1007/978-3-032-07959-6 |
URL | https://doi.org/10.1007/978-3-032-07959-6 |
ISBN/Einband/Preis | 978-3-032-07959-6 |
Sprache(n) | Englisch (eng) |
Beziehungen | Lecture Notes in Artificial Intelligence ; 16188 |
DDC-Notation | 006.45 (maschinell ermittelte DDC-Kurznotation) |
Sachgruppe(n) | 004 Informatik |
Online-Zugriff | Archivobjekt öffnen |
