Katalog der Deutschen Nationalbibliothek
Ergebnis der Suche nach: "Image"
![]() |
|
Link zu diesem Datensatz | https://d-nb.info/1346770549 |
Art des Inhalts | Konferenzschrift |
Titel | Computer Vision – ECCV 2024 : 18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part LVI / edited by Aleš Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol |
Person(en) |
Leonardis, Aleš (Herausgeber) Ricci, Elisa (Herausgeber) Roth, Stefan (Herausgeber) Russakovsky, Olga (Herausgeber) Sattler, Torsten (Herausgeber) Varol, Gül (Herausgeber) |
Organisation(en) | SpringerLink (Online service) (Sonstige) |
Ausgabe | 1st ed. 2025 |
Verlag | Cham : Springer Nature Switzerland, Imprint: Springer |
Zeitliche Einordnung | Erscheinungsdatum: 2025 |
Umfang/Format | Online-Ressource, LXXXV, 499 p. 181 illus., 177 illus. in color. : online resource. |
Andere Ausgabe(n) |
Printed edition:: ISBN: 978-3-031-72991-1 Printed edition:: ISBN: 978-3-031-72993-5 |
Inhalt | HowToCaption: Prompting LLMs to Transform Video Annotations at Scale -- LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection -- Beyond the Data Imbalance: Employing the Heterogeneous Datasets for Vehicle Maneuver Prediction -- On Pretraining Data Diversity for Self-Supervised Learning -- Look Around and Learn: Self-Training Object Detection by Exploration -- Bayesian Self-Training for Semi-Supervised 3D Segmentation -- Motion and Structure from Event-based Normal Flow -- ParCo: Part-Coordinating Text-to-Motion Synthesis -- Learning to Complement and to Defer to Multiple Users -- Tiny Models are the Computational Saver for Large Models -- DragVideo: Interactive Drag-style Video Editing -- Multi-Sentence Grounding for Long-term Instructional Video -- Do Generalised Classifiers really work on Human Drawn Sketches? -- KMTalk: Speech-Driven 3D Facial Animation with Key Motion Embedding -- Head360: Learning a Parametric 3D Full-Head for Free-View Synthesis in 360° -- MotionDirector: Motion Customization of Text-to-Video Diffusion Models -- Text2LiDAR: Text-guided LiDAR Point Clouds Generation via Equirectangular Transformer -- Enhanced Motion Forecasting with Visual Relation Reasoning -- Rate-Distortion-Cognition Controllable Versatile Neural Image Compression -- Temporal As a Plugin: Unsupervised Video Denoising with Pre-Trained Image Denoisers -- LiDAR-based All-weather 3D Object Detection via Prompting and Distilling 4D Radar -- MM-SafetyBench: A Benchmark for Safety Evaluation of Multimodal Large Language Models -- Post-training Quantization with Progressive Calibration and Activation Relaxing for Text-to-Image Diffusion Models -- Scene Coordinate Reconstruction: Posing of Image Collections via Incremental Learning of a Relocalizer -- Diffusion Models are Geometry Critics: Single Image 3D Editing Using Pre-Trained Diffusion Priors -- Weakly Supervised Co-training with Swapping Assignments for Semantic Segmentation -- StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion |
Persistent Identifier |
URN: urn:nbn:de:101:1-2410310308279.606693140269 DOI: 10.1007/978-3-031-72992-8 |
URL | https://doi.org/10.1007/978-3-031-72992-8 |
ISBN/Einband/Preis | 978-3-031-72992-8 |
Sprache(n) | Englisch (eng) |
Beziehungen | Lecture Notes in Computer Science ; 15114 |
DDC-Notation | 006.3 (maschinell ermittelte DDC-Kurznotation) |
Sachgruppe(n) | 004 Informatik |
Online-Zugriff | Archivobjekt öffnen |
