Mind the Gap: Quantifying the Domain Gap in Cross-Sensor Diffusion Super-Resolution
arXiv:2606.28039v1 Announce Type: cross Abstract: Demand for high-resolution satellite imagery has increased interest in super-resolution (SR) to bridge the spatial resolution gap between freely available missions such as Sentinel-2 and commercial systems like PlanetScope. Because no sensor...
The Unseen Hurdle in Satellite Super-Resolution
A new paper from arXiv (2606.28039) tackles a practical bottleneck in Earth observation AI: the "domain gap" between different satellite sensors when applying super-resolution (SR) models. The research specifically addresses the challenge of training diffusion-based SR models on one sensor’s data (e.g., Sentinel-2 at 10m resolution) and deploying them on another (e.g., PlanetScope at 3m). While the abstract focuses on the problem statement, the core contribution is a systematic quantification of how sensor-specific characteristics—such as spectral response functions, noise patterns, and spatial sampling—degrade model performance when the source and target domains differ.
This matters because the satellite imagery ecosystem is fragmented. Free, open-access missions like Sentinel-2 provide global coverage but at moderate resolution, while commercial systems like PlanetScope offer higher detail but at a cost and with limited historical archives. The dream of training a single SR model on abundant Sentinel-2 data and applying it to enhance PlanetScope imagery (or vice versa) has been hampered by this domain gap. Previous work often treated SR as a purely spatial problem, ignoring that each sensor "sees" the Earth differently due to optics, calibration, and atmospheric correction pipelines.
Why This Matters for AI Practitioners
For AI teams working in remote sensing, this research underscores a hard lesson: domain adaptation is not optional. The naive approach of training on one sensor and inferring on another will likely fail, even with state-of-the-art diffusion models. The paper’s quantification of the gap provides a diagnostic tool—practitioners can now measure how much their source and target data diverge before investing in model training. This shifts the problem from "can we build a better SR model?" to "how do we align sensor-specific distributions?"
The implications extend beyond satellite imagery. Any domain with multi-sensor data—medical imaging (CT vs. MRI), autonomous driving (different camera sensors), or industrial inspection—faces similar cross-sensor degradation. The diffusion model community, which has focused heavily on unconditional generation and text-to-image, now has a concrete benchmark for conditional SR under domain shift.
Implications for AI Practitioners
- Data preprocessing becomes a first-class concern: Expect to invest in sensor-specific calibration, spectral matching, and noise modeling before training. Off-the-shelf SR models will not transfer.
- Evaluation metrics need revision: Standard SR metrics (PSNR, SSIM) may mislead if they don't account for sensor-specific frequency responses. The paper likely proposes or implies domain-aware metrics.
- Transfer learning strategies are critical: Practitioners should explore fine-tuning on target sensor data (even small amounts) or using adversarial domain alignment. Diffusion models’ iterative denoising may offer natural avenues for domain conditioning.
- Cost-benefit analysis of data acquisition: If the domain gap is too large, it may be cheaper to buy target-sensor training data than to engineer complex domain adaptation.
Key Takeaways
- Cross-sensor super-resolution in satellite imagery suffers from a quantifiable "domain gap" due to differences in sensor optics, noise, and spectral response.
- Training SR models on one sensor and applying them to another without domain adaptation leads to significant performance degradation, even with diffusion models.
- AI practitioners must prioritize sensor-specific preprocessing and domain alignment techniques over model architecture improvements alone.
- The findings generalize to any multi-sensor AI application, making this a cautionary case study for domain transfer in computer vision.