E-book: MultiMedia Modeling: 31st International Conference on Multimedia Modeling, MMM 2025, Nara, Japan, January 8-10, 2025, Proceedings, Part V

  • Format: PDF+DRM
  • Price: 74,09 €*
  • * the price is final, i.e. no further discounts apply
  • This e-book is intended for personal use only. E-books cannot be returned.

DRM restrictions

  • Copying (copy/paste):

    not allowed

  • Printing:

    not allowed

  • Usage:

    Digital rights management (DRM)
    The publisher has issued this e-book in encrypted form, which means that you must install special software to read it. You must also create an Adobe ID. The e-book can be read by 1 user and downloaded to up to 6 devices (all authorized with the same Adobe ID).

    Required software
    To read on a mobile device (phone or tablet), you must install this free app: PocketBook Reader (iOS / Android)

    To read on a PC or Mac, you must install Adobe Digital Editions (a free application designed specifically for reading e-books; it should not be confused with Adobe Reader, which is probably already installed on your computer).

    This e-book cannot be read on an Amazon Kindle.

This five-volume set LNCS 15520-15524 constitutes the proceedings of the 31st International Conference on Multimedia Modeling, MMM 2025, held in Nara, Japan, January 8–10, 2025.
The 135 full papers and 41 short papers presented in these proceedings were carefully reviewed and selected from 348 submissions. The MMM conference was organized around topics related to multimedia modelling, in particular: audio, image, and video processing, coding, and compression; multimodal analysis for retrieval applications; and multimedia fusion methods.

Special Session on Multimedia Research in Robotics.- Multimodal
Engagement Prediction in Human-Robot Interaction using Transformer Neural
Networks.- What Should Autonomous Robots Verbalize and What Should They
Not?.- Special Session: SpIMA: Special Session on Spatial Intelligence in
Multimedia Analytics.- Counting Unique Objects in Geo-Tagged Street Images: A
Case Study Of Homeless Encampments in Los Angeles.- Special Session on
Simulating Edge Computing and Multimodal AI: A Benchmark for Real-World
Applications.- Correlation-Based Weighted Federated Learning with Multimodal
Sensing and Knowledge Distillation: An Application on a Real-World Benchmark
Dataset.- Leveraging Pruning, Quantization and Multi-Objective Optimization
for an Efficient Deployment of Multi-modal Models.- Demo Papers.- A User
Identification and Reading Style Detection System Based on Eye Movement
Patterns During Reading.- AMDA: Advancing Multimedia Data Annotation for
Human-centric Situations.- An Implementation of Networked
JamSketch.- Badminton Footwork Practice via an Immersive Virtual Reality
System.- Better Image Segmentation with Classification: Guiding Zero-Shot
Models Using Class Activation Maps.- CleverFox: Integrating Visual Mnemonics
with AI for Enhanced Language Learning.- Enhancing User Control in AI-Based
Video Summarization for Social Media.- FencBuddy: Action-aware Depth
Perception Training for Fencing Attacks.- Fingering Prediction for Classical
Guitar: Dataset Creation and Model Development.- KuzushijiFontDiff: Diffusion
Model for Japanese Kuzushiji Font Generation.- Leveraging Latent Diffusion in
3D Gaussian Splatting for Novel View Synthesis.- Movie Retrieval Systems Using
Genre-guided Multimodal Learning Techniques.- Multi-Dimensional Exploration
of Media Collection Metadata.- Multimodal Interoperability with the CLAMS
Platform.- Real-time Visualizer for Turntablist Performance.- RoboDJ: Live
Commentary Robots System Driven by Physical- and Cyber-world
Observations.- SceneTextStyler: Editing Text with Style
Transformation.- SelectSum: Topic-Based Selective Summarization of
Speech-Based Videos.- Smart Driving Assistance with Real-time Risk Assessment
and Personalized Driving Coaching to Enhance Road Safety.- System Demo of
Modeling Smart University Campus Virtual Environments.- Training a
Segmentation-based Visual Anonymization Service for Street
Scenes.- Transformer-Based Audio Generation Conditioned by 2D Latent Maps: A
Demonstration.- Using Language Models to Generate and Forget the Narrative
Memories of an Assistive Robot.- WaveFontStyler: Font Style Transfer Based on
Sound.- Video Browser Showdown.- diveXplore at the Video Browser Showdown
2025.- Exquisitor at the Video Browser Showdown 2025: Unifying Conversational
Search and User Relevance Feedback.- Feature-driven Video Segmentation and
Advanced Querying with vitrivr-engine.- FUSIONISTA: Fusion of 3-D Information
of Video in Retrieval System.- HORUS: Multimodal Large Language Models
Framework for Video Retrieval at VBS 2025.- IMSearch 2.0: Toward User-centric
and Efficient Interactive Multimedia Retrieval System.- Interactive Video
Search with Multi-modal LLM Video Captioning.- MediaMix: Multimedia Retrieval
in Mixed Reality.- NII-UIT at VBS2025: Multimodal Video Retrieval with LLM
Integration and Dynamic Temporal Search.- PraK Tool V3: Enhancing Video Item
Search Using Localized Text and Texture Queries.- Simplified Video Retrieval
in Virtual Reality with vitrivr-VR.- SnapSeek 2.0 at Video Browser Showdown
2025.- VEAGLE: Eye Gaze-Assisted Guidance for Video Browser Showdown.- VERGE
in VBS 2025.- VideoEase at VBS2025: An Interactive Video Retrieval
System.- ViewsInsight2.0: Enhancing Video Retrieval for VBS 2025 with an
Automatic Query Generator Powered by Large Language Models.- ViFi: A Video
Finding System at Video Browser Showdown 2025.