Catalogue of the German National Library (Deutsche Nationalbibliothek)
Search result for: "Image"
Link to this record | https://d-nb.info/1343659787 |
Content type | Conference proceedings |
Title | Computer Vision – ECCV 2024 : 18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part LVII / edited by Aleš Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol |
Person(s) | Leonardis, Aleš (editor); Ricci, Elisa (editor); Roth, Stefan (editor); Russakovsky, Olga (editor); Sattler, Torsten (editor); Varol, Gül (editor) |
Organization(s) | SpringerLink (Online service) (other) |
Edition | 1st ed. 2025 |
Publisher | Cham : Springer Nature Switzerland, Imprint: Springer |
Date | Publication date: 2025 |
Extent/Format | Online resource, LXXXV, 499 p., 197 illus., 188 illus. in color |
Other edition(s) | Printed edition: ISBN 978-3-031-72997-3; Printed edition: ISBN 978-3-031-72999-7 |
Contents | ST-LLM: Large Language Models Are Effective Temporal Learners -- Exact Diffusion Inversion via Bidirectional Integration Approximation -- Textual Query-Driven Mask Transformer for Domain Generalized Segmentation -- EmoTalk3D: High-Fidelity Free-View Synthesis of Emotional 3D Talking Head -- Arbitrary-Scale Video Super-Resolution with Structural and Textural Priors -- Object-Centric Diffusion for Efficient Video Editing -- Single-Mask Inpainting for Voxel-based Neural Radiance Fields -- McGrids: Monte Carlo-Driven Adaptive Grids for Iso-Surface Extraction -- Freeview Sketching: View-Aware Fine-Grained Sketch-Based Image Retrieval -- Adapt2Reward: Adapting Video-Language Models to Generalizable Robotic Rewards via Failure Prompts -- Diffusion for Natural Image Matting -- Agglomerative Token Clustering -- CMD: A Cross Mechanism Domain Adaptation Dataset for 3D Object Detection -- Unleashing Text-to-Image Diffusion Prior for Zero-Shot Image Captioning -- ClusteringSDF: Self-Organized Neural Implicit Surfaces for 3D Decomposition -- NAMER: Non-Autoregressive Modeling for Handwritten Mathematical Expression Recognition -- GIVT: Generative Infinite-Vocabulary Transformers -- Mismatch Quest: Visual and Textual Feedback for Image-Text Misalignment -- Regulating Model Reliance on Non-Robust Features by Smoothing Input Marginal Density -- Multi-Modal Video Dialog State Tracking in the Wild -- Factorized Diffusion: Perceptual Illusions by Noise Decomposition -- To Generate or Not? Safety-Driven Unlearned Diffusion Models Are Still Easy To Generate Unsafe Images ... For Now -- Dissecting Dissonance: Benchmarking Large Multimodal Models Against Self-Contradictory Instructions -- StereoGlue: Joint Feature Matching and Robust Estimation -- Boosting Transferability in Vision-Language Attacks via Diversification along the Intersection Region of Adversarial Trajectory -- Leveraging Enhanced Queries of Point Sets for Vectorized Map Construction -- Robust Zero-Shot Crowd Counting and Localization with Adaptive Resolution SAM |
Persistent identifier | URN: urn:nbn:de:101:1-2410010523271.720646189416; DOI: 10.1007/978-3-031-72998-0 |
URL | https://doi.org/10.1007/978-3-031-72998-0 |
ISBN/Binding/Price | 978-3-031-72998-0 |
Language(s) | English (eng) |
Relations | Lecture Notes in Computer Science ; 15115 |
DDC notation | 006.3 (machine-assigned DDC short notation) |
Subject group(s) | 004 Computer science |
Online access | Open archive object |
