Catalogue of the Deutsche Nationalbibliothek (German National Library)
Search result for: "Image"
Link to this record | https://d-nb.info/1346772282 |
Content type | Conference proceedings |
Title | Computer Vision – ECCV 2024 : 18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part LXIV / edited by Aleš Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol |
Person(s) |
Leonardis, Aleš (editor); Ricci, Elisa (editor); Roth, Stefan (editor); Russakovsky, Olga (editor); Sattler, Torsten (editor); Varol, Gül (editor) |
Organization(s) | SpringerLink (Online service) (other) |
Edition | 1st ed. 2025 |
Publisher | Cham : Springer Nature Switzerland, Imprint: Springer |
Date | Publication date: 2025 |
Extent/Format | Online resource, LXXXV, 492 p., 167 illus., 163 illus. in color |
Other edition(s) |
Printed edition: ISBN 978-3-031-73038-2; Printed edition: ISBN 978-3-031-73040-5 |
Contents | Depth-guided NeRF Training via Earth Mover’s Distance -- INTRA: Interaction Relationship-aware Weakly Supervised Affordance Grounding -- DEPICT: Diffusion-Enabled Permutation Importance for Image Classification Tasks -- Meerkat: Audio-Visual Large Language Model for Grounding in Space and Time -- Diagnosing and Re-learning for Balanced Multimodal Learning -- Contribution-based Low-Rank Adaptation with Pre-training Model for Real Image Restoration -- Elucidating the Hierarchical Nature of Behavior with Masked Autoencoders -- BeyondScene: Higher-Resolution Human-Centric Scene Generation With Pretrained Diffusion -- SpaRP: Fast 3D Object Reconstruction and Pose Estimation from Sparse Views -- MMEarth: Exploring Multi-Modal Pretext Tasks For Geospatial Representation Learning -- Discovering Unwritten Visual Classifiers with Large Language Models -- LITA: Language Instructed Temporal-Localization Assistant -- MARs: Multi-view Attention Regularizations for Patch-based Feature Recognition of Space Terrain -- Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs -- Bridging the Pathology Domain Gap: Efficiently Adapting CLIP for Pathology Image Analysis with Limited Labeled Data -- AugUndo: Scaling Up Augmentations for Monocular Depth Completion and Estimation -- CARB-Net: Camera-Assisted Radar-Based Network for Vulnerable Road User Detection -- SAH-SCI: Self-Supervised Adapter for Efficient Hyperspectral Snapshot Compressive Imaging -- Minimalist Vision with Freeform Pixels -- All You Need is Your Voice: Emotional Face Representation with Audio Perspective for Emotional Talking Face Generation -- LatentEditor: Text Driven Local Editing of 3D Scenes -- Single-Photon 3D Imaging with Equi-Depth Photon Histograms -- Asynchronous Bioplausible Neuron for Spiking Neural Networks for Event-Based Vision -- Viewpoint textual inversion: discovering scene representations and 3D view control in 2D diffusion models -- POET: Prompt Offset Tuning for Continual Human Action Adaptation -- Domain Generalization of 3D Object Detection by Density-Resampling -- IG Captioner: Information Gain Captioners are Strong Zero-shot Classifiers |
Persistent Identifier |
URN: urn:nbn:de:101:1-2410310322357.869135825553; DOI: 10.1007/978-3-031-73039-9 |
URL | https://doi.org/10.1007/978-3-031-73039-9 |
ISBN/Binding/Price | 978-3-031-73039-9 |
Language(s) | English (eng) |
Relations | Lecture Notes in Computer Science ; 15122 |
DDC notation | 006.3 (machine-generated DDC short notation) |
Subject group(s) | 004 Computer science |
Online access | Open archive object |
