To read this content please select one of the options below:

Light field image coding using a residual channel attention network–based view synthesis

Faguo Liu (Shanghai Normal University, Shanghai, China)
Qian Zhang (Shanghai Normal University, Shanghai, China)
Tao Yan (Putian University, Putian, China)
Bin Wang (Shanghai Normal University, Shanghai, China)
Ying Gao (Shanghai Normal University, Shanghai, China)
Jiaqi Hou (Shanghai Normal University, Shanghai, China)
Feiniu Yuan (Shanghai Normal University, Shanghai, China)

Data Technologies and Applications

ISSN: 2514-9288

Article publication date: 21 February 2024

Issue publication date: 5 September 2024

103

Abstract

Purpose

Light field images (LFIs) have gained popularity as a technology to increase the field of view (FoV) of plenoptic cameras since they can capture information about light rays with a large FoV. Wide FoV causes light field (LF) data to increase rapidly, which restricts the use of LF imaging in image processing, visual analysis and user interface. Effective LFI coding methods become of paramount importance. This paper aims to eliminate more redundancy by exploring sparsity and correlation in the angular domain of LFIs, as well as mitigate the loss of perceptual quality of LFIs caused by encoding.

Design/methodology/approach

This work proposes a new efficient LF coding framework. On the coding side, a new sampling scheme and a hierarchical prediction structure are used to eliminate redundancy in the LFI's angular and spatial domains. At the decoding side, high-quality dense LF is reconstructed using a view synthesis method based on the residual channel attention network (RCAN).

Findings

In three different LF datasets, our proposed coding framework not only reduces the transmitted bit rate but also maintains a higher view quality than the current more advanced methods.

Originality/value

(1) A new sampling scheme is designed to synthesize high-quality LFIs while better ensuring LF angular domain sparsity. (2) To further eliminate redundancy in the spatial domain, new ranking schemes and hierarchical prediction structures are designed. (3) A synthetic network based on RCAN and a novel loss function is designed to mitigate the perceptual quality loss due to the coding process.

Keywords

Acknowledgements

This work was supported by Program for the Putian Science and Technology Bureau (2021G2001-8), New Century Excellent Talents in Fujian Province University (2018JYTRC (PU) Tao Yan), and Natural Science Foundation of Fujian (Tao Yan), in part by Putian University's Initiation Fee Project for Importing Talents for Scientific Research (2019003), colleges and universities in Hebei Province Science and Technology Research Project (ZC2021006), and National Natural Science Foundation of China under Grant (62301320).

Citation

Liu, F., Zhang, Q., Yan, T., Wang, B., Gao, Y., Hou, J. and Yuan, F. (2024), "Light field image coding using a residual channel attention network–based view synthesis", Data Technologies and Applications, Vol. 58 No. 4, pp. 652-668. https://doi.org/10.1108/DTA-03-2023-0071

Publisher

:

Emerald Publishing Limited

Copyright © 2024, Emerald Publishing Limited

Related articles