Katalog der Deutschen Nationalbibliothek

Ergebnis der Suche nach: "Image"

Zurück zur Trefferliste

Treffer 46 von 98304


Link zu diesem Datensatz	https://d-nb.info/1378795334
Titel	Speech and Computer : 27th International Conference, SPECOM 2025, Szeged, Hungary, October 13–15, 2025, Proceedings, Part II / edited by Alexey Karpov, Gábor Gosztolya
Person(en)	Karpov, Aleksej (Herausgeber) Gosztolya, Gábor (Herausgeber)
Organisation(en)	SpringerLink (Online service) (Sonstige)
Ausgabe	1st ed. 2026
Verlag	Cham : Springer Nature Switzerland, Imprint: Springer
Zeitliche Einordnung	Erscheinungsdatum: 2026
Umfang/Format	Online-Ressource, XXIII, 347 p. 92 illus., 81 illus. in color. : online resource.
Andere Ausgabe(n)	Printed edition:: ISBN: 978-3-032-07958-9 Printed edition:: ISBN: 978-3-032-07960-2
Inhalt	-- Automatic Speech Recognition. -- In-Domain SSL Pre-Training and Streaming ASR: Application to Air Traffic Control Communications. -- Evaluating the Performance of Several ASR Systems in Environmental and Industrial Noise. -- Ground Truth-Free WER Prediction for ASR via Audio Quality and Model Confidence Features. -- Enhancing Speech Recognition through Text-to-Speech and Voice Conversion Augmentation. -- Best Data is more Supervised Data - Even for Hungarian ASR. -- Arabic ASR on the SADA Large-Scale Arabic Speech Corpus with Transformer-based Models. -- Speech Processing for Under-Resourced Languages. -- Effect of Increased Temporal Resolution on Speech Recognition for French Quebec using Features from Speech Self-Supervised Learning Models. -- Modeling Intra-Word Code-Switching for Karelian ASR. -- Improving Whisper-based Serbian ASR using Synthetic Speech. -- Domain Knowledge and Language Embeddings for Low-Resource Multilingual Phoneme ASR. -- Whistler Identification in Whistled Spanish (Silbo): A Case Study. -- Digital Speech Processing. -- PinkVocalTransformer: Neural Acoustic-to-Articulatory Inversion based on the Pink Trombone. -- CrossMP-SENet: Transformer-based Cross-Attention for Joint Magnitude-Phase Speech Enhancement. -- Adaptive Singing Voice Enhancement for Live Stages. -- Revealing the Hidden Temporal Structure of HubertSoft Embeddings based on the Russian Phonetic Corpus. -- Natural Language Processing. -- Analyzing Web-Scraped and Generated Inputs for Automatic and Scalable Intent Classification. -- Enhancing Retrieval Performance via LLM Hard-Negative Filtering. -- Sector-Wise Backpropagation for Low-Resource Text Classification in Deep Models. -- High-Frequency Multiword Units and the Typological Distribution of Multiword Units in Spoken Russian. -- Estimation of the Genre Composition of the English Subcorpus of the Google Books Ngram. -- Multimodal Systems. -- Ensembling Synchronisation-based and Face-Voice Association Paradigms for Robust Active Speaker Detection in Egocentric Recordings. -- Phonetic and Visual Characteristics of Cognitive Load. -- Cognitive Humor Processing in the Russian and English Internet Meme Chatting: EEG Study. -- Saudi Sign Language Translation Using T5
Persistent Identifier	URN: urn:nbn:de:101:1-2510130420170.682311780670 DOI: 10.1007/978-3-032-07959-6
URL	https://doi.org/10.1007/978-3-032-07959-6
ISBN/Einband/Preis	978-3-032-07959-6
Sprache(n)	Englisch (eng)
Beziehungen	Lecture Notes in Artificial Intelligence ; 16188
DDC-Notation	006.45 (maschinell ermittelte DDC-Kurznotation)
Sachgruppe(n)	004 Informatik

Online-Zugriff

Archivobjekt öffnen

Treffer 46 von 98304

Aktionen

MARC21-XML-Repräsentation dieses Datensatzes

Korrekturanfrage

buchhandel.de