|
GeoDiT: Point-Conditioned Diffusion Transformer for Satellite Image Synthesis
Srikumar Sastry,
Dan Cher,
Brian Wei,
Aayush Dhakal,
Subash Khanal,
Dev Gupta,
Nathan Jacobs
arXiv, 2026
GeoDiT is a diffusion transformer for text-to-satellite image generation that replaces pixel-level conditioning with a simpler point-based control scheme and adaptive attention, enabling more flexible and semantically rich control while outperforming prior remote sensing generative models.
|
|
Crossview Registered Multiview Pose Estimation
Alexander Wollam,
Dev Gupta,
Nathan Jacobs
Paper, 2025
The work extends cross-view pose estimation from single images to multi-view inputs by adapting a multi-view 3D reconstruction framework and training on aligned ground-to-aerial datasets to jointly estimate 3DoF poses more robustly in aerial reference frames.
|