Catalogue of the Deutsche Nationalbibliothek (German National Library)
Link to this record | https://d-nb.info/1346423709 |
Content type | Conference proceedings |
Title | Computer Vision – ECCV 2024 : 18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part XXXIII / edited by Aleš Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol |
Person(s) |
Leonardis, Aleš (editor); Ricci, Elisa (editor); Roth, Stefan (editor); Russakovsky, Olga (editor); Sattler, Torsten (editor); Varol, Gül (editor) |
Organisation(s) | SpringerLink (Online service) (other) |
Edition | 1st ed. 2025 |
Publisher | Cham : Springer Nature Switzerland, Imprint: Springer |
Time period | Publication date: 2025 |
Extent/Format | Online resource, LXXXV, 493 p. 149 illus., 147 illus. in color. |
Other edition(s) |
Printed edition: ISBN: 978-3-031-73413-7; Printed edition: ISBN: 978-3-031-73415-1 |
Contents | OvSW: Overcoming Silent Weights for Accurate Binary Neural Networks -- Multistain Pretraining for Slide Representation Learning in Pathology -- T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy -- Harmonizing knowledge Transfer in Neural Network with Unified Distillation -- Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data -- Click Prompt Learning with Optimal Transport for Interactive Segmentation -- 3D Human Pose Estimation via Non-Causal Retentive Networks -- OMR: Occlusion-Aware Memory-Based Refinement for Video Lane Detection -- 6DoF Head Pose Estimation through Explicit Bidirectional Interaction with Face Geometry -- Latent Diffusion Prior Enhanced Deep Unfolding for Snapshot Spectral Compressive Imaging -- Multimodal Cross-Domain Few-Shot Learning for Egocentric Action Recognition -- Enhancing Tampered Text Detection through Frequency Feature Fusion and Decomposition -- Modeling Label Correlations with Latent Context for Multi-Label Recognition -- LLM as Dataset Analyst: Subpopulation Structure Discovery with Large Language Model -- Finding a needle in a haystack: A Black-Box Approach to Invisible Watermark Detection -- DynoSurf: Neural Deformation-based Temporally Consistent Dynamic Surface Reconstruction -- MOD-UV: Learning Mobile Object Detectors from Unlabeled Videos -- ARoFace: Alignment Robustness to Improve Low-quality Face Recognition -- Learning Diffusion Models for Multi-View Anomaly Detection -- Clearer Frames, Anytime: Resolving Velocity Ambiguity in Video Frame Interpolation -- Multi-modal Relation Distillation for Unified 3D Representation Learning -- Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization -- Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation -- Distributionally Robust Loss for Long-Tailed Multi-Label Image Classification -- MesonGS: Post-training Compression of 3D Gaussians via Efficient Attribute Transformation -- LongVLM: Efficient Long Video Understanding via Large Language Models -- The All-Seeing Project V2: Towards General Relation Comprehension of the Open World |
Persistent identifier |
URN: urn:nbn:de:101:1-2410290325069.033539599382; DOI: 10.1007/978-3-031-73414-4 |
URL | https://doi.org/10.1007/978-3-031-73414-4 |
ISBN/Binding/Price | 978-3-031-73414-4 |
Language(s) | English (eng) |
Relations | Lecture Notes in Computer Science ; 15091 |
DDC notation | 006.3 (machine-generated DDC short notation) |
Subject group(s) | 004 Computer science |
