Katalog der Deutschen Nationalbibliothek
Ergebnis der Suche nach: "Image"
![]() |
|
Link zu diesem Datensatz | https://d-nb.info/1347219633 |
Art des Inhalts | Konferenzschrift |
Titel | Computer Vision – ECCV 2024 : 18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part X / edited by Aleš Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol |
Person(en) |
Leonardis, Aleš (Herausgeber) Ricci, Elisa (Herausgeber) Roth, Stefan (Herausgeber) Russakovsky, Olga (Herausgeber) Sattler, Torsten (Herausgeber) Varol, Gül (Herausgeber) |
Organisation(en) | SpringerLink (Online service) (Sonstige) |
Ausgabe | 1st ed. 2025 |
Verlag | Cham : Springer Nature Switzerland, Imprint: Springer |
Zeitliche Einordnung | Erscheinungsdatum: 2025 |
Umfang/Format | Online-Ressource, LXXXV, 497 p. 179 illus., 173 illus. in color. : online resource. |
Andere Ausgabe(n) |
Printed edition:: ISBN: 978-3-031-72683-5 Printed edition:: ISBN: 978-3-031-72685-9 |
Inhalt | Modeling and Driving Human Body Soundfields through Acoustic Primitives -- m&m’s: A Benchmark to Evaluate Tool-Use for multi-step multi-modal Tasks -- Label-anticipated Event Disentanglement for Audio-Visual Video Parsing -- High-Fidelity 3D Textured Shapes Generation by Sparse Encoding and Adversarial Decoding -- Semi-Supervised Video Desnowing Network via Temporal Decoupling Experts and Distribution-Driven Contrastive Regularization -- I-MedSAM: Implicit Medical Image Segmentation with Segment Anything -- ReMamber: Referring Image Segmentation with Mamba Twister -- TalkingGaussian: Structure-Persistent 3D Talking Head Synthesis via Gaussian Splatting -- CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenarios -- Segmentation-guided Layer-wise Image Vectorization with Gradient Fills -- Implicit Style-Content Separation using B-LoRA -- OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models -- ActionVOS: Actions as Prompts for Video Object Segmentation -- FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance -- U-COPE: Taking a Further Step to Universal 9D Category-level Object Pose Estimation -- Integrating Markov Blanket Discovery into Causal Representation Learning for Domain Generalization -- Rotary Position Embedding for Vision Transformer -- Local All-Pair Correspondence for Point Tracking -- MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection -- ReALFRED: An Embodied Instruction Following Benchmark in Photo-Realistic Environments -- S^3D-NeRF: Single-Shot Speech-Driven Neural Radiance Field for High Fidelity Talking Head Synthesis -- ActionSwitch: Class-agnostic Detection of Simultaneous Actions in Streaming Videos -- Hierarchically Structured Neural Bones for Reconstructing Animatable Objects from Casual Videos -- PQ-SAM: Post-training Quantization for Segment Anything Model -- CPM: Class-conditional Prompting Machine for Audio-visual Segmentation -- Optimizing Factorized Encoder Models: Time and Memory Reduction for Scalable and Efficient Action Recognition -- DVLO: Deep Visual-LiDAR Odometry with Local-to-Global Feature Fusion and Bi-Directional Structure Alignment |
Persistent Identifier |
URN: urn:nbn:de:101:1-2411050323224.675971214886 DOI: 10.1007/978-3-031-72684-2 |
URL | https://doi.org/10.1007/978-3-031-72684-2 |
ISBN/Einband/Preis | 978-3-031-72684-2 |
Sprache(n) | Englisch (eng) |
Beziehungen | Lecture Notes in Computer Science ; 15068 |
DDC-Notation | 006.3 (maschinell ermittelte DDC-Kurznotation) |
Sachgruppe(n) | 004 Informatik |
Online-Zugriff | Archivobjekt öffnen |
