Live
Black Hat USADark ReadingBlack Hat AsiaAI BusinessHow Google's Ad Review Bots Have Evolved in 2026: What Media Buyers Need to KnowDEV CommunityApfel: The Free AI Already Built Into Your MacDEV CommunityOpenClaw SaaS vs Self-Hosting: Which One Should You Choose in 2026?DEV Community7 Best AI Coding Assistant Tools in 2026DEV CommunityHow Is Agentic AI Changing Travel Booking? What Ask Skift Says - SkiftGNews AI agenticWhat is GEO (Generative Engine Optimization)? The 2026 GuideDev.to AI[D] CVPR 2026 Travel Grant/Registration WaiverReddit r/MachineLearningIAPP Global Privacy Summit 2026: State AI Trends, FTC Signals, California’s DROP Build-Out, and the Hard Work of Cookie Compliance - JD SupraGNews AI privacy[D] When to transition from simple heuristics to ML models (e.g., DensityFunction)?Reddit r/MachineLearningQIS for Energy Grids: Why Distributed Renewable Integration Keeps Failing and What Outcome Routing ChangesDev.to AIBig Banks Seeking a Piece of SpaceX’s I.P.O. Must Subscribe to Elon Musk’s GrokNYT TechnologyCan We Fix Political Conversation Online? Joe Kiani's CitizeX Is Betting on Identity Verification, Not AlgorithmsInternational Business TimesBlack Hat USADark ReadingBlack Hat AsiaAI BusinessHow Google's Ad Review Bots Have Evolved in 2026: What Media Buyers Need to KnowDEV CommunityApfel: The Free AI Already Built Into Your MacDEV CommunityOpenClaw SaaS vs Self-Hosting: Which One Should You Choose in 2026?DEV Community7 Best AI Coding Assistant Tools in 2026DEV CommunityHow Is Agentic AI Changing Travel Booking? What Ask Skift Says - SkiftGNews AI agenticWhat is GEO (Generative Engine Optimization)? The 2026 GuideDev.to AI[D] CVPR 2026 Travel Grant/Registration WaiverReddit r/MachineLearningIAPP Global Privacy Summit 2026: State AI Trends, FTC Signals, California’s DROP Build-Out, and the Hard Work of Cookie Compliance - JD SupraGNews AI privacy[D] When to transition from simple heuristics to ML models (e.g., DensityFunction)?Reddit r/MachineLearningQIS for Energy Grids: Why Distributed Renewable Integration Keeps Failing and What Outcome Routing ChangesDev.to AIBig Banks Seeking a Piece of SpaceX’s I.P.O. Must Subscribe to Elon Musk’s GrokNYT TechnologyCan We Fix Political Conversation Online? Joe Kiani's CitizeX Is Betting on Identity Verification, Not AlgorithmsInternational Business Times
AI NEWS HUBbyEIGENVECTOREigenvector

Dynamic Graph Neural Network with Adaptive Features Selection for RGB-D Based Indoor Scene Recognition

arXiv cs.CVby [Submitted on 1 Apr 2026]April 2, 20262 min read2 views
Source Quiz

arXiv:2604.00372v1 Announce Type: new Abstract: Multi-modality of color and depth, i.e., RGB-D, is of great importance in recent research of indoor scene recognition. In this kind of data representation, depth map is able to describe the 3D structure of scenes and geometric relations among objects. Previous works showed that local features of both modalities are vital for promotion of recognition accuracy. However, the problem of adaptive selection and effective exploitation on these key local features remains open in this field. In this paper, a dynamic graph model is proposed with adaptive node selection mechanism to solve the above problem. In this model, a dynamic graph is built up to model the relations among objects and scene, and a method of adaptive node selection is proposed to ta

View PDF HTML (experimental)

Abstract:Multi-modality of color and depth, i.e., RGB-D, is of great importance in recent research of indoor scene recognition. In this kind of data representation, depth map is able to describe the 3D structure of scenes and geometric relations among objects. Previous works showed that local features of both modalities are vital for promotion of recognition accuracy. However, the problem of adaptive selection and effective exploitation on these key local features remains open in this field. In this paper, a dynamic graph model is proposed with adaptive node selection mechanism to solve the above problem. In this model, a dynamic graph is built up to model the relations among objects and scene, and a method of adaptive node selection is proposed to take key local features from both modalities of RGB and depth for graph modeling. After that, these nodes are grouped by three different levels, representing near or far relations among objects. Moreover, the graph model is updated dynamically according to attention weights. Finally, the updated and optimized features of RGB and depth modalities are fused together for indoor scene recognition. Experiments are performed on public datasets including SUN RGB-D and NYU Depth v2. Extensive results demonstrate that our method has superior performance when comparing to state-of-the-arts methods, and show that the proposed method is able to exploit crucial local features from both modalities of RGB and depth.

Subjects:

Computer Vision and Pattern Recognition (cs.CV)

Cite as: arXiv:2604.00372 [cs.CV]

(or arXiv:2604.00372v1 [cs.CV] for this version)

https://doi.org/10.48550/arXiv.2604.00372

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Muyao Peng [view email] [v1] Wed, 1 Apr 2026 01:43:56 UTC (1,452 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

modelneural networkannounce

Knowledge Map

Knowledge Map
TopicsEntitiesSource
Dynamic Gra…modelneural netw…announceupdatefeaturepaperarXiv cs.CV

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 179 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!