An End-to-End Real-World Camera Imaging Pipeline

Graphical Abstract

Top: Compared with previous methods, our approach takes into account multiple complex distortions..

Abstract

Recent advances in neural camera imaging pipelines have demonstrated notable progress. Nevertheless, the real-world imaging pipeline still faces challenges including the lack of joint optimization in system components, computational redundancies, and optical distortions such as lens shading. In light of this, we propose an end-to-end camera imaging pipeline (RealCamNet) to enhance real-world camera imaging performance. Our methodology diverges from conventional, fragmented multi-stage image signal processing towards end-to-end architecture. This architecture facilitates joint optimization across the full pipeline and the restoration of coordinate-biased distortions. RealCamNet is designed for high-quality conversion from RAW to RGB and compact image compression. Specifically, we deeply analyze coordinate-dependent optical distortions, e.g., vignetting and dark shading, and design a novel Coordinate-Aware Distortion Restoration (CADR) module to restore coordinate-biased distortions. Furthermore, we propose a Coordinate-Independent Mapping Compression (CIMC) module to implement tone mapping and redundant information compression. Existing datasets suffer from misalignment and overly idealized conditions, making them inadequate for training real-world imaging pipelines. Therefore, we collected a real-world imaging dataset. Experiment results show that RealCamNet achieves the best rate-distortion performance with lower inference latency.

Coordinate-Related Distortion

Top: Examples of coordinate-related distortion (vignetting and dark shading).

Network Architecture

Top: The architecture of RealCamNet.

Rate-Distortion Curve Results

Quantitative comparison on Rate-Distortion Curve.

Quantitative results

Quantitative results. We compare with state-of-the-art ISPNet and image compression methods, including learning-based methods: PyNet(CVPR'20), LiteISPNet(ICCV'21), MwISPNet(ECCV'20), MLIC(ACMMM'23), TCM(CVPR'23) and the most advanced traditional image compression method VTM/H.266. We show BD-Rate, BD-PSNR, BD-MSSSIM, BD-δE and BD-LPPHS for all methods. We use PyNet+VTM as anchor.

Visual Comparison Results

Compared with the state-of-the-art methods, our framework, optimized end-to-end, demonstrates significant enhancements in imaging systems' performance. It offers comprehensive benefits, encompassing reduced bit rate, augmented color fidelity, and elevated PSNR.

Acknowledgements

We would like to thank Ma Zijia for his important contribution to the experiment, Xu Li for her constructive suggestions on the thesis design, and He Gang for his support in the process of establishing the thesis topic.

BibTeX

@inproceedings{realcamnet,
  author    = {Kepeng Xu and Zijia Ma and Li Xu and Gang He and Yunsong Li and Wenxin Yu and Taichu Han and Cheng Yang},
  title     = {An End-to-End Real-World Camera Imaging Pipeline},
  booktitle = {ACM MULTIMEDIA 2024},
  year      = {2024},
 }

RealCamNet: An End-to-End Real-World Camera Imaging Pipeline