Catalogue of the Deutsche Nationalbibliothek (German National Library)


Search results for: "Image"



Result 46 of 98304



Online resources
Link to this record https://d-nb.info/1378795334
Title Speech and Computer : 27th International Conference, SPECOM 2025, Szeged, Hungary, October 13–15, 2025, Proceedings, Part II / edited by Alexey Karpov, Gábor Gosztolya
Person(s) Karpov, Aleksej (editor)
Gosztolya, Gábor (editor)
Organisation(s) SpringerLink (Online service) (other)
Edition 1st ed. 2026
Publisher Cham : Springer Nature Switzerland, Imprint: Springer
Date Publication date: 2026
Extent/Format Online resource, XXIII, 347 p., 92 illus., 81 illus. in color
Other edition(s) Printed edition: ISBN 978-3-032-07958-9
Printed edition: ISBN 978-3-032-07960-2
Contents Automatic Speech Recognition. -- In-Domain SSL Pre-Training and Streaming ASR: Application to Air Traffic Control Communications. -- Evaluating the Performance of Several ASR Systems in Environmental and Industrial Noise. -- Ground Truth-Free WER Prediction for ASR via Audio Quality and Model Confidence Features. -- Enhancing Speech Recognition through Text-to-Speech and Voice Conversion Augmentation. -- Best Data is more Supervised Data - Even for Hungarian ASR. -- Arabic ASR on the SADA Large-Scale Arabic Speech Corpus with Transformer-based Models.
Speech Processing for Under-Resourced Languages. -- Effect of Increased Temporal Resolution on Speech Recognition for French Quebec using Features from Speech Self-Supervised Learning Models. -- Modeling Intra-Word Code-Switching for Karelian ASR. -- Improving Whisper-based Serbian ASR using Synthetic Speech. -- Domain Knowledge and Language Embeddings for Low-Resource Multilingual Phoneme ASR. -- Whistler Identification in Whistled Spanish (Silbo): A Case Study.
Digital Speech Processing. -- PinkVocalTransformer: Neural Acoustic-to-Articulatory Inversion based on the Pink Trombone. -- CrossMP-SENet: Transformer-based Cross-Attention for Joint Magnitude-Phase Speech Enhancement. -- Adaptive Singing Voice Enhancement for Live Stages. -- Revealing the Hidden Temporal Structure of HubertSoft Embeddings based on the Russian Phonetic Corpus.
Natural Language Processing. -- Analyzing Web-Scraped and Generated Inputs for Automatic and Scalable Intent Classification. -- Enhancing Retrieval Performance via LLM Hard-Negative Filtering. -- Sector-Wise Backpropagation for Low-Resource Text Classification in Deep Models. -- High-Frequency Multiword Units and the Typological Distribution of Multiword Units in Spoken Russian. -- Estimation of the Genre Composition of the English Subcorpus of the Google Books Ngram.
Multimodal Systems. -- Ensembling Synchronisation-based and Face-Voice Association Paradigms for Robust Active Speaker Detection in Egocentric Recordings. -- Phonetic and Visual Characteristics of Cognitive Load. -- Cognitive Humor Processing in the Russian and English Internet Meme Chatting: EEG Study. -- Saudi Sign Language Translation Using T5
Persistent identifier URN: urn:nbn:de:101:1-2510130420170.682311780670
DOI: 10.1007/978-3-032-07959-6
URL https://doi.org/10.1007/978-3-032-07959-6
ISBN/Binding/Price 978-3-032-07959-6
Language(s) English (eng)
Relations Lecture Notes in Artificial Intelligence ; 16188
DDC notation 006.45 (machine-generated DDC short notation)
Subject group(s) 004 Computer science

Online access Open archive object





