Personalized image generation with deep generative models: A decade survey
Gaussian-plus-SDF SLAM: High-fidelity 3D reconstruction at 150+ fps
GarTrans: Transformer-based architecture for dynamic and detailed garment deformation
Learning multi-grained interpretable latent representation for 3D face manipulation
Human pose estimation with general contact
VarGes: Improving variation in co-speech 3D gesture generation via StyleCLIPS
PuzzleSorter: Certainty-aware visual restoration of multiple cultural artifacts
Continuous indexed points for multivariate volume visualization
GRIG: Data-efficient generative residual image inpainting
Adaptive content-aware correction for wide-angle portrait photos
Multi-task gradual inference with a single encoder–decoder network for automatic portrait matting
Heuristic weakly supervised 3D human pose estimation
Remote sensing tuning: A survey
PCAC-GAN: A sparse-tensor-based generative adversarial network for 3D point cloud attribute compression
A discrete microfacet model for transparent glints rendering
FRNeRF: Fusion and regularization fields for dynamic view synthesis
BDA: Bi-directional attention for zero-shot learning
Prediction of scene plausibility
Class incremental learning via feature space calibration
LDSwap: A semantic-related latent code disentangling method in StyleSpace towards high-resolution face swapping
MDFP-Net: A model-driven deep neural network for Fourier ptychography
TransCeption: Enhancing medical image segmentation with an inception-like transformer design for efficient feature fusion
JVCSR+: Adaptively learned video compressive sensing reconstruction with joint in-loop reference enhancement and out-loop super-resolution
Exploring a hierarchical cross-attention transformer for high-speed tracking
Uncertainty aware multiple view stereo network with accurate supervision
Message from Guest Editors of the CVM 2025 Special Issue
Computer-aided layout generation for building design: A review
DS-MAE: Dual-Siamese masked autoencoders for point cloud analysis
NeuS-PIR: Learning relightable neural surface using pre-integrated rendering
TexPro: Text-guided PBR texturing with procedural material modeling
MagicTalk: Implicit and explicit correlation learning for diffusion-based emotional talking face generation
Ultra-high resolution facial texture reconstruction from a single image
Decoupled two-stage talking head generation via Gaussian-landmark-based neural radiance fields
Diff-OSGN: Diffusion-based occlusal surface generation network with geometric constraints
Revitalizing image dehazing in the real world: A high-quality dataset and a customized method
Spatiotemporal fusion transformer for video demoiréing
ImVoxelENet: Image to voxels epipolar transformer for multi-view RGB-based 3D object detection
SAD: Style-aware diffusion adaptation for few-shot style transfer image generation
3D indoor scene geometry estimation from a single omnidirectional image: A comprehensive survey
Swin3D++: Effective multi-source pretraining for 3D indoor scene understanding
FastMAE: Efficient masked autoencoder with offline tokenizer
Point mask transformer for outdoor point cloud semantic segmentation
Sem-iNeRF: Camera pose refinement by inverting neural radiance fields with semantic feature consistency
MMRelief: Modeling multi-human relief from a single photograph
Unified transformed t-SVD using unfolding tensors for visual inpainting
Anchor-regularized GAN priors
Emotion amplification of facial videos using a fine-tuned StyleGAN
Weakly supervised instance action recognition
MA2Net: Multi-scale adaptive mixed attention network for image demoiréing
Neural-Polyptych: Content controllable painting recreation for diverse genres