ICLR2026
Implicit 4D Gaussian Splatting for Fast Motion with Large Inter-Frame Displacements
Seung-gyeom Kim, Areum Kim, Yongjae Yoo, Sukmin Yun
Abstract
Recent 4D Gaussian Splatting (4DGS) methods often fail under fast motion with large inter-frame displacements, where Gaussian attributes are poorly learned during training, and fast-moving objects are often lost from the reconstruction. In this work, we introduce Spatiotemporal Position Implicit Network for 4DGS, coined SPIN-4DGS, which learns Gaussian attributes from explicitly collected spatiotemporal positions rather than modeling temporal displacements, thereby enabling more faithful splatting under fast motions with large inter-frame displacements. To avoid the heavy memory overhead of explicitly optimizing attributes across all spatiotemporal positions, we instead predict them with a lightweight feed-forward network trained under a rasterization-based reconstruction loss. Consequently, SPIN-4DGS learns shared representations across Gaussians, effectively capturing spatiotemporal consistency and enabling stable high-quality Gaussian splatting even under challenging motions. Across extensive experiments, SPIN-4DGS consistently achieves higher fidelity under large displacements, with clear improvements in PSNR and SSIM on challenging sports scenes from the CMU Panoptic dataset. For example, SPIN-4DGS notably outperforms the strongest baseline, D3DGS, by achieving +1.83 higher PSNR on the Basketball scene. 4DGaussian Realtime-4DGS D3DGS Ours Figure 1: Faithful reconstruction of fast motion with large inter-frame displacements. Existing 4DGS approaches often produce blurred or incomplete reconstructions of fast-moving objects. In contrast, ours successfully reconstructs clear and accurate details, such as the basketball in the scene.