Catalogue of the Deutsche Nationalbibliothek (German National Library)


Search results for: "Image"



Result 275 of 115169



Online resources
Link to this record https://d-nb.info/1347219633
Content type Conference proceedings
Title Computer Vision – ECCV 2024 : 18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part X / edited by Aleš Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol
Person(s) Leonardis, Aleš (Editor)
Ricci, Elisa (Editor)
Roth, Stefan (Editor)
Russakovsky, Olga (Editor)
Sattler, Torsten (Editor)
Varol, Gül (Editor)
Organization(s) SpringerLink (Online service) (Other)
Edition 1st ed. 2025
Publisher Cham : Springer Nature Switzerland, Imprint: Springer
Date Publication date: 2025
Extent/Format Online resource, LXXXV, 497 p. 179 illus., 173 illus. in color.
Other edition(s) Printed edition: ISBN: 978-3-031-72683-5
Printed edition: ISBN: 978-3-031-72685-9
Contents Modeling and Driving Human Body Soundfields through Acoustic Primitives -- m&m’s: A Benchmark to Evaluate Tool-Use for multi-step multi-modal Tasks -- Label-anticipated Event Disentanglement for Audio-Visual Video Parsing -- High-Fidelity 3D Textured Shapes Generation by Sparse Encoding and Adversarial Decoding -- Semi-Supervised Video Desnowing Network via Temporal Decoupling Experts and Distribution-Driven Contrastive Regularization -- I-MedSAM: Implicit Medical Image Segmentation with Segment Anything -- ReMamber: Referring Image Segmentation with Mamba Twister -- TalkingGaussian: Structure-Persistent 3D Talking Head Synthesis via Gaussian Splatting -- CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenarios -- Segmentation-guided Layer-wise Image Vectorization with Gradient Fills -- Implicit Style-Content Separation using B-LoRA -- OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models -- ActionVOS: Actions as Prompts for Video Object Segmentation -- FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance -- U-COPE: Taking a Further Step to Universal 9D Category-level Object Pose Estimation -- Integrating Markov Blanket Discovery into Causal Representation Learning for Domain Generalization -- Rotary Position Embedding for Vision Transformer -- Local All-Pair Correspondence for Point Tracking -- MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection -- ReALFRED: An Embodied Instruction Following Benchmark in Photo-Realistic Environments -- S^3D-NeRF: Single-Shot Speech-Driven Neural Radiance Field for High Fidelity Talking Head Synthesis -- ActionSwitch: Class-agnostic Detection of Simultaneous Actions in Streaming Videos -- Hierarchically Structured Neural Bones for Reconstructing Animatable Objects from Casual Videos -- PQ-SAM: Post-training Quantization for Segment Anything Model -- CPM: Class-conditional Prompting Machine for Audio-visual Segmentation -- Optimizing Factorized Encoder Models: Time and Memory Reduction for Scalable and Efficient Action Recognition -- DVLO: Deep Visual-LiDAR Odometry with Local-to-Global Feature Fusion and Bi-Directional Structure Alignment
Persistent Identifier URN: urn:nbn:de:101:1-2411050323224.675971214886
DOI: 10.1007/978-3-031-72684-2
URL https://doi.org/10.1007/978-3-031-72684-2
ISBN/Binding/Price 978-3-031-72684-2
Language(s) English (eng)
Relationships Lecture Notes in Computer Science ; 15068
DDC notation 006.3 (machine-generated DDC short notation)
Subject group(s) 004 Computer science




