Muutke küpsiste eelistusi

Computer Vision ECCV 2024: 18th European Conference, Milan, Italy, September 29October 4, 2024, Proceedings, Part X 2024 ed. [Pehme köide]

Edited by , Edited by , Edited by , Edited by , Edited by , Edited by
  • Formaat: Paperback / softback, 497 pages, kõrgus x laius: 235x155 mm, 173 Illustrations, color; 6 Illustrations, black and white; LXXXV, 497 p. 179 illus., 173 illus. in color., 1 Paperback / softback
  • Sari: Lecture Notes in Computer Science 15068
  • Ilmumisaeg: 03-Nov-2024
  • Kirjastus: Springer International Publishing AG
  • ISBN-10: 3031726839
  • ISBN-13: 9783031726835
  • Pehme köide
  • Hind: 70,93 €*
  • * hind on lõplik, st. muud allahindlused enam ei rakendu
  • Tavahind: 83,45 €
  • Säästad 15%
  • Raamatu kohalejõudmiseks kirjastusest kulub orienteeruvalt 2-4 nädalat
  • Kogus:
  • Lisa ostukorvi
  • Tasuta tarne
  • Tellimisaeg 2-4 nädalat
  • Lisa soovinimekirja
  • Formaat: Paperback / softback, 497 pages, kõrgus x laius: 235x155 mm, 173 Illustrations, color; 6 Illustrations, black and white; LXXXV, 497 p. 179 illus., 173 illus. in color., 1 Paperback / softback
  • Sari: Lecture Notes in Computer Science 15068
  • Ilmumisaeg: 03-Nov-2024
  • Kirjastus: Springer International Publishing AG
  • ISBN-10: 3031726839
  • ISBN-13: 9783031726835
The multi-volume set of LNCS books with volume numbers 15059 up to 15147 constitutes the refereed proceedings of the 18th European Conference on Computer Vision, ECCV 2024, held in Milan, Italy, during September 29October 4, 2024.





The 2387 papers presented in these proceedings were carefully reviewed and selected from a total of 8585 submissions. They deal with topics such as computer vision; machine learning; deep neural networks; reinforcement learning; object recognition; image classification; image processing; object detection; semantic segmentation; human pose estimation; 3d reconstruction; stereo vision; computational photography; neural networks; image coding; image reconstruction; motion estimation.
Modeling and Driving Human Body Soundfields through Acoustic
Primitives.- m&ms: A Benchmark to Evaluate Tool-Use for multi-step
multi-modal Tasks.- Label-anticipated Event Disentanglement for Audio-Visual
Video Parsing.- High-Fidelity 3D Textured Shapes Generation by Sparse
Encoding and Adversarial Decoding.- Semi-Supervised Video Desnowing Network
via Temporal Decoupling Experts and Distribution-Driven Contrastive
Regularization.- I-MedSAM: Implicit Medical Image Segmentation with Segment
Anything.- ReMamber: Referring Image Segmentation with Mamba Twister.-
TalkingGaussian: Structure-Persistent 3D Talking Head Synthesis via Gaussian
Splatting.- CAT: Enhancing Multimodal Large Language Model to Answer
Questions in Dynamic Audio-Visual Scenarios.- Segmentation-guided Layer-wise
Image Vectorization with Gradient Fills.- Implicit Style-Content Separation
using B-LoRA.- OpenPSG: Open-set Panoptic Scene Graph Generation via Large
Multimodal Models.- ActionVOS: Actions as Prompts for Video Object
Segmentation.- FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot
Performance.- U-COPE: Taking a Further Step to Universal 9D Category-level
Object Pose Estimation.- Integrating Markov Blanket Discovery into Causal
Representation Learning for Domain Generalization.- Rotary Position Embedding
for Vision Transformer.- Local All-Pair Correspondence for Point Tracking.-
MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object
Detection.- ReALFRED: An Embodied Instruction Following Benchmark in
Photo-Realistic Environments.- S^3D-NeRF: Single-Shot Speech-Driven Neural
Radiance Field for High Fidelity Talking Head Synthesis.- ActionSwitch:
Class-agnostic Detection of Simultaneous Actions in Streaming Videos.-
Hierarchically Structured Neural Bones for Reconstructing Animatable Objects
from Casual Videos.- PQ-SAM: Post-training Quantization for Segment Anything
Model.- CPM: Class-conditional Prompting Machine for Audio-visual
Segmentation.- Optimizing Factorized Encoder Models: Time and Memory
Reduction for Scalable and Efficient Action Recognition.- DVLO: Deep
Visual-LiDAR Odometry with Local-to-Global Feature Fusion and Bi-Directional
Structure Alignment.