Katalog der Deutschen Nationalbibliothek
Ergebnis der Suche nach: "Image"
![]() |
|
Link zu diesem Datensatz | https://d-nb.info/1346427364 |
Art des Inhalts | Konferenzschrift |
Titel | Computer Vision – ECCV 2024 : 18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part VIII / edited by Aleš Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol |
Person(en) |
Leonardis, Aleš (Herausgeber) Ricci, Elisa (Herausgeber) Roth, Stefan (Herausgeber) Russakovsky, Olga (Herausgeber) Sattler, Torsten (Herausgeber) Varol, Gül (Herausgeber) |
Organisation(en) | SpringerLink (Online service) (Sonstige) |
Ausgabe | 1st ed. 2025 |
Verlag | Cham : Springer Nature Switzerland, Imprint: Springer |
Zeitliche Einordnung | Erscheinungsdatum: 2025 |
Umfang/Format | Online-Ressource, LXXXV, 499 p. 187 illus., 186 illus. in color. : online resource. |
Andere Ausgabe(n) |
Printed edition:: ISBN: 978-3-031-73241-6 Printed edition:: ISBN: 978-3-031-73243-0 |
Inhalt | Walker: Self-supervised Multiple Object Tracking by Walking on Temporal Object Appearance Graphs -- Spatio-Temporal Proximity-Aware Dual-Path Model for Panoramic Activity Recognition -- DiffiT: Diffusion Vision Transformers for Image Generation -- WebRPG: Automatic Web Rendering Parameters Generation for Visual Presentation -- GPSFormer: A Global Perception and Local Structure Fitting-based Transformer for Point Cloud Understanding -- FreeMotion: A Unified Framework for Number-free Text-to-Motion Synthesis -- FSD-BEV: Foreground Self-Distillation for Multi-view 3D Object Detection -- SceneGraphLoc: Cross-Modal Coarse Visual Localization on 3D Scene Graphs -- ScanReason: Empowering 3D Visual Grounding with Reasoning Capabilities -- MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems? -- See and Think: Embodied Agent in Virtual Environment -- PISR: Polarimetric Neural Implicit Surface Reconstruction for Textureless and Specular Objects -- Bridging the Gap Between Human Motion and Action Semantics via Kinematics Phrases -- VisFocus: Prompt-Guided Vision Encoders for OCR-Free Dense Document Understanding -- Masked Angle-Aware Autoencoder for Remote Sensing Images -- Infinite-ID: Identity-preserved Personalization via ID-semantics Decoupling Paradigm -- MultiGen: Zero-shot Image Generation from Multi-modal Prompts -- GazeXplain: Learning to Predict Natural Language Explanations of Visual Scanpaths -- Learning Chain of Counterfactual Thought for Bias-Robust Vision-Language Reasoning -- SegGen: Supercharging Segmentation Models with Text2Mask and Mask2Img Synthesis -- Sync from the Sea: Retrieving Alignable Videos from Large-Scale Datasets -- FinePseudo: Improving Pseudo-Labelling through Temporal-Alignablity for Semi-Supervised Fine-Grained Action Recognition -- Elegantly Written: Disentangling Writer and Character Styles for Enhancing Online Chinese Handwriting -- UniCode : Learning a Unified Codebook for Multimodal Large Language Models -- When Do We Not Need Larger Vision Models? -- GVGEN: Text-to-3D Generation with Volumetric Representation -- Bidirectional Stereo Image Compression with Cross-Dimensional Entropy Model |
Persistent Identifier |
URN: urn:nbn:de:101:1-2410290353589.861406067912 DOI: 10.1007/978-3-031-73242-3 |
URL | https://doi.org/10.1007/978-3-031-73242-3 |
ISBN/Einband/Preis | 978-3-031-73242-3 |
Sprache(n) | Englisch (eng) |
Beziehungen | Lecture Notes in Computer Science ; 15066 |
DDC-Notation | 006.3 (maschinell ermittelte DDC-Kurznotation) |
Sachgruppe(n) | 004 Informatik |
Online-Zugriff | Archivobjekt öffnen |
