On the Transferability and Discriminability of Representation Learning in Unsupervised Domain Adaptation
Forget Me Not: Fighting Local Overfitting With Knowledge Fusion and Distillation
GrowSP++: Growing Superpoints and Primitives for Unsupervised 3D Semantic Segmentation
Learning Deep Tree-Based Retriever for Efficient Recommendation: Theory and Method
You Only Look One Step: Accelerating Backpropagation in Diffusion Sampling With Gradient Shortcuts
DiFaReli++: Diffusion Face Relighting With Consistent Cast Shadows
Iterative Differential Entropy Minimization (IDEM) Method for Fine Rigid Pairwise 3D Point Cloud Registration: A Focus on the Metric
Fast Multi-view Discrete Clustering via Spectral Embedding Fusion
Diffusion-Driven Self-Supervised Learning for Shape Reconstruction and Pose Estimation
Human Motion Prediction via Continual Prior Compensation
Continuous Review and Timely Correction: Enhancing the Resistance to Noisy Labels via Self-Not-True and Class-Wise Distillation
MADTP++: Bridge the Gap Between Token and Weight Pruning for Accelerating VLTs
Unleashing the Power of Text-to-Image Diffusion Models for Category-Agnostic Pose Estimation
Model Lineage Analysis: Determination and Closeness Measurement
A Hierarchical Prior Mining Approach for Non-Local Multi-View Stereo
RealLiFe: Real-Time Light Field Reconstruction via Hierarchical Sparse Gradient Descent
Incremental Online Learning of Randomized Neural Network With Forward Regularization
Robust Distributed Cooperative Classification With Learned Compressed-Feature Diffusion
HGNNv2: Stable Hypergraph Neural Networks
OIF-PCR++: Point Cloud Registration via Progressive Distillation of Conditional Positional Encoding
Instructed Diffuser With Temporal Condition Guidance for Offline Reinforcement Learning
Causal Inference via Style Bias Deconfounding for Domain Generalization
Next Bit Prediction: A Unified Lossless and Lossy Point Cloud Geometry Compression Framework
Hierarchical Context Alignment With Disentangled Geometric and Temporal Modeling for Semantic Occupancy Prediction
CLIP-Powered Domain Generalization and Domain Adaptation: A Comprehensive Survey
Integrating Affordances and Attention Models for Short-Term Object Interaction Anticipation
How to Break It Down for Building It Up? Theory-Guided Graph Decomposition Learning for Spatiotemporal Traffic Prediction
Efficient Exploration for Multi-Agent Diversity With Agent Identity
NuwaDynamics+: A Causality-Aware Generative Framework for Spatio-Temporal Representation Learning
Beyond LLaVA-HD: Diving Into High-Resolution Multimodal Large Language Models
Accelerated Optimization of Large Mixture-of-Experts Models by Density-Aware Multi-Stage Learning
Deep Orientational Representation Learning for Ordinal Regression
Neuron Abandoning Attention Flow: Visual Explanation of Dynamics Inside CNN Models
Lifelong Learning of Large Language Model Based Agents: A Roadmap
Learning Diffusion Priors for Inverse Rendering Under Unknown Illumination
Parse, Align and Aggregate: Graph-Driven Compositional Reasoning for Video Question Answering
Low-Rank Tensor Learning by Generalized Nonconvex Regularization
LRANet++: Low-Rank Approximation Network for Accurate and Efficient Text Spotting
ConsistentID: Portrait Generation With Multimodal Fine-Grained Identity Preserving
IBCB: Efficient Inverse Batched Contextual Bandit for Behavioral Evolution History
Advances in Multimodal Adaptation and Generalization: From Traditional Approaches to Foundation Models
Seeing Through Satellite Images at Street Views
Searching to Modulate for Cold-Start Recommendation
AtomThink: Multimodal Slow Thinking With Atomic Step Reasoning
An Efficient Multi-Estimation-Based Parameter Centroid Decision via Linear Regression Approach
DyDiT++: Diffusion Transformers With Timestep and Spatial Dynamics for Efficient Visual Generation
Reinforced Refinement With Self-Aware Expansion for End-to-End Autonomous Driving
MERBench: A Unified Evaluation Benchmark for Multimodal Emotion Recognition
Non-Gradient Hash Factor Learning for High-Dimensional and Incomplete Data Representation Learning
Single-Photon Imaging in Complex Scenarios via Physics-Informed Deep Neural Networks