Catalogue of the Deutsche Nationalbibliothek (German National Library)
Search result for: "Machine Learning"
| Link to this record | https://d-nb.info/1346427364 |
| Content type | Conference proceedings |
| Title | Computer Vision – ECCV 2024 : 18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part VIII / edited by Aleš Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol |
| Person(s) | Leonardis, Aleš (editor); Ricci, Elisa (editor); Roth, Stefan (editor); Russakovsky, Olga (editor); Sattler, Torsten (editor); Varol, Gül (editor) |
| Organization(s) | SpringerLink (Online service) (other) |
| Edition | 1st ed. 2025 |
| Publisher | Cham : Springer Nature Switzerland, Imprint: Springer |
| Time period | Publication date: 2025 |
| Extent/Format | Online resource, LXXXV, 499 p., 187 illus., 186 illus. in color. |
| Other edition(s) | Printed edition: ISBN 978-3-031-73241-6; Printed edition: ISBN 978-3-031-73243-0 |
| Contents | Walker: Self-supervised Multiple Object Tracking by Walking on Temporal Object Appearance Graphs -- Spatio-Temporal Proximity-Aware Dual-Path Model for Panoramic Activity Recognition -- DiffiT: Diffusion Vision Transformers for Image Generation -- WebRPG: Automatic Web Rendering Parameters Generation for Visual Presentation -- GPSFormer: A Global Perception and Local Structure Fitting-based Transformer for Point Cloud Understanding -- FreeMotion: A Unified Framework for Number-free Text-to-Motion Synthesis -- FSD-BEV: Foreground Self-Distillation for Multi-view 3D Object Detection -- SceneGraphLoc: Cross-Modal Coarse Visual Localization on 3D Scene Graphs -- ScanReason: Empowering 3D Visual Grounding with Reasoning Capabilities -- MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems? -- See and Think: Embodied Agent in Virtual Environment -- PISR: Polarimetric Neural Implicit Surface Reconstruction for Textureless and Specular Objects -- Bridging the Gap Between Human Motion and Action Semantics via Kinematics Phrases -- VisFocus: Prompt-Guided Vision Encoders for OCR-Free Dense Document Understanding -- Masked Angle-Aware Autoencoder for Remote Sensing Images -- Infinite-ID: Identity-preserved Personalization via ID-semantics Decoupling Paradigm -- MultiGen: Zero-shot Image Generation from Multi-modal Prompts -- GazeXplain: Learning to Predict Natural Language Explanations of Visual Scanpaths -- Learning Chain of Counterfactual Thought for Bias-Robust Vision-Language Reasoning -- SegGen: Supercharging Segmentation Models with Text2Mask and Mask2Img Synthesis -- Sync from the Sea: Retrieving Alignable Videos from Large-Scale Datasets -- FinePseudo: Improving Pseudo-Labelling through Temporal-Alignablity for Semi-Supervised Fine-Grained Action Recognition -- Elegantly Written: Disentangling Writer and Character Styles for Enhancing Online Chinese Handwriting -- UniCode: Learning a Unified Codebook for Multimodal Large Language Models -- When Do We Not Need Larger Vision Models? -- GVGEN: Text-to-3D Generation with Volumetric Representation -- Bidirectional Stereo Image Compression with Cross-Dimensional Entropy Model |
| Persistent identifier | URN: urn:nbn:de:101:1-2410290353589.861406067912; DOI: 10.1007/978-3-031-73242-3 |
| URL | https://doi.org/10.1007/978-3-031-73242-3 |
| ISBN/Binding/Price | 978-3-031-73242-3 |
| Language(s) | English (eng) |
| Relations | Lecture Notes in Computer Science ; 15066 |
| DDC notation | 006.3 (machine-generated DDC short notation) |
| Subject group(s) | 004 Computer science |