Katalog der Deutschen Nationalbibliothek
Ergebnis der Suche nach: "Image"
![]() |
|
Link zu diesem Datensatz | https://d-nb.info/1346770700 |
Art des Inhalts | Konferenzschrift |
Titel | Computer Vision – ECCV 2024 : 18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part V / edited by Aleš Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol |
Person(en) |
Leonardis, Aleš (Herausgeber) Ricci, Elisa (Herausgeber) Roth, Stefan (Herausgeber) Russakovsky, Olga (Herausgeber) Sattler, Torsten (Herausgeber) Varol, Gül (Herausgeber) |
Organisation(en) | SpringerLink (Online service) (Sonstige) |
Ausgabe | 1st ed. 2025 |
Verlag | Cham : Springer Nature Switzerland, Imprint: Springer |
Zeitliche Einordnung | Erscheinungsdatum: 2025 |
Umfang/Format | Online-Ressource, LXXXV, 478 p. 173 illus., 169 illus. in color. : online resource. |
Andere Ausgabe(n) |
Printed edition:: ISBN: 978-3-031-72651-4 Printed edition:: ISBN: 978-3-031-72653-8 |
Inhalt | SignAvatars: A Large-scale 3D Sign Language Holistic Motion Dataset and Benchmark -- AttnZero: Efficient Attention Discovery for Vision Transformers -- Auto-GAS: Automated Proxy Discovery for Training-free Generative Architecture Search -- Auto-DAS: Automated Proxy Discovery for Training-free Distillation-aware Architecture Search -- UniDream: Unifying Diffusion Priors for Relightable Text-to-3D Generation -- TimeCraft: Navigate Weakly-Supervised Temporal Grounded Video Question Answering via Bi-directional Reasoning -- Spectral Subsurface Scattering for Material Classification -- nuCraft: Crafting High Resolution 3D Semantic Occupancy for Unified 3D Scene Understanding -- Dynamic Neural Radiance Field From Defocused Monocular Video -- PiTe: Pixel-Temporal Alignment for Large Video-Language Model -- CarFormer: Self-Driving with Learned Object-Centric Representations -- FreeDiff: Progressive Frequency Truncation for Image Editing with Diffusion Models -- Plain-Det: A Plain Multi-Dataset Object Detector -- Alternate Diverse Teaching for Semi-supervised Medical Image Segmentation -- Cs2K: Class-specific and Class-shared Knowledge Guidance for Incremental Semantic Segmentation -- Synchronous Diffusion for Unsupervised Smooth Non-Rigid 3D Shape Matching -- Text-Guided Video Masked Autoencoder -- Diffusion Models for Open-Vocabulary Segmentation -- Textual-Visual Logic Challenge: Understanding and Reasoning in Text-to-Image Generation -- EvSign: Sign Language Recognition and Translation with Streaming Events -- QUAR-VLA: Vision-Language-Action Model for Quadruped Robots -- Zero-shot Object Counting with Good Exemplars -- TextDiffuser-2: Unleashing the Power of Language Models for Text Rendering -- SFPNet: Sparse Focal Point Network for Semantic Segmentation on General LiDAR Point Clouds -- PartSTAD: 2D-to-3D Part Segmentation Task Adaptation -- FutureDepth: Learning to Predict the Future Improves Video Depth Estimation -- LLM as Copilot for Coarse-grained Vision-and-Language Navigation |
Persistent Identifier |
URN: urn:nbn:de:101:1-2410310309257.761370015206 DOI: 10.1007/978-3-031-72652-1 |
URL | https://doi.org/10.1007/978-3-031-72652-1 |
ISBN/Einband/Preis | 978-3-031-72652-1 |
Sprache(n) | Englisch (eng) |
Beziehungen | Lecture Notes in Computer Science ; 15063 |
DDC-Notation | 006.3 (maschinell ermittelte DDC-Kurznotation) |
Sachgruppe(n) | 004 Informatik |
Online-Zugriff | Archivobjekt öffnen |
