Multi-exposure image fusion based on tensor decomposition and convolutional sparse representation
-
Abstract: To address the loss of detail and the color distortion that arise in multi-exposure image fusion, this paper proposes a fusion method based on tensor decomposition and convolutional sparse representation (CSR). Tensor decomposition, a low-rank approximation for high-dimensional data, has great potential for extracting features from multi-exposure images, while CSR performs sparse optimization over the entire image and can therefore preserve image detail to the greatest extent. In addition, to avoid color distortion in the fused image, luminance and chrominance are fused separately. First, the core tensor of each source image is obtained by tensor decomposition, and edge features are extracted from the first sub-band, which contains the most information. Next, the edge feature map is decomposed by convolutional sparse coding, and the L1 norm of the decomposition coefficients gives the activity level of each pixel. Finally, a "winner-take-all" strategy generates the weight maps from which the fused luminance component is computed. Unlike the luminance, the chrominance components are fused by a simple Gaussian weighting scheme, which alleviates color distortion in the fused image to a certain extent. Experimental results show that the proposed method preserves detail well.
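The first two steps named in the abstract (core-tensor extraction and edge features on the first sub-band) can be illustrated with a minimal sketch. This is not the authors' code: the Tucker ranks, the use of the tensorly library, the per-image application, and the Sobel edge extractor are all assumptions made for illustration.

```python
# Hedged sketch: Tucker (core-tensor) decomposition of one source image,
# then gradient-magnitude edge features on the first core sub-band.
# Ranks, library choice, and the Sobel extractor are assumptions.
import numpy as np
import tensorly as tl
from tensorly.decomposition import tucker
from scipy import ndimage

def first_subband_edge_map(img: np.ndarray, ranks=(64, 64, 3)) -> np.ndarray:
    """img: H x W x C source image treated as a 3-way tensor.
    Returns an edge map computed on the first core sub-band."""
    core, factors = tucker(tl.tensor(img.astype(np.float64)), rank=list(ranks))
    # Reconstruct only the first frontal slice of the core tensor back to
    # image space along modes 0 and 1; this slice carries the most energy
    # and stands in for the "first sub-band" mentioned in the abstract.
    sub = tl.to_numpy(factors[0]) @ tl.to_numpy(core)[:, :, 0] \
          @ tl.to_numpy(factors[1]).T
    gx = ndimage.sobel(sub, axis=1)   # horizontal gradient
    gy = ndimage.sobel(sub, axis=0)   # vertical gradient
    return np.hypot(gx, gy)           # per-pixel edge strength
```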
-
Overview: Real scenes typically span a luminance range from 10⁻⁵ cd/m² to 10⁸ cd/m², whereas existing imaging and video devices can capture only a limited dynamic range and therefore cannot retain all the details of the real scene. In recent years, multi-exposure fusion (MEF) has emerged as an effective quality-enhancement technique and has gradually become a hot research topic in the digital media field. MEF combines multiple low dynamic range (LDR) images with different exposures, taken by an ordinary camera, into a single image with rich detail and saturated color. Many MEF algorithms have been proposed, and they achieve good results on image sequences with simple backgrounds. However, when a multi-exposure sequence contains many objects with complex textures, their performance is unsatisfactory, and artifacts such as detail loss and color distortion often appear in the fused images.

To solve this problem, this paper proposes a multi-exposure image fusion method based on tensor decomposition (TD) and convolutional sparse representation (CSR). TD, as a low-rank approximation for high-dimensional data, has great potential for multi-exposure image feature extraction, while CSR performs sparse optimization over the entire image and can thus retain detail to the greatest extent. To avoid color distortion in the fused image, luminance and chrominance are fused separately. First, the core tensor of each source image is obtained through tensor decomposition, and edge features are extracted from the first sub-band, which contains the most information. Second, the edge feature map is sparsely decomposed, and the L1 norm of the decomposition coefficients gives the activity level of each pixel. Finally, a "winner-take-all" strategy generates the weight maps used to obtain the fused luminance component. Unlike the luminance fusion process, the chrominance components are simply fused by Gaussian weighting, in accordance with the characteristics of the color space.

The experiments used two sets of image sequences with complex backgrounds. Compared with five other state-of-the-art MEF algorithms, the image fused by the proposed algorithm not only contains rich detail but also exhibits no large-scale color distortion. In addition, to evaluate the detail-preserving ability of the proposed algorithm more comprehensively, seven multi-exposure image sequences were selected for objective measurement. The experimental results show that the proposed method has a strong ability to preserve edge information.
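The remaining fusion steps can also be sketched. The sketch below uses SPORCO's ConvBPDN solver and its bundled example dictionary as stand-ins for the CSR step (the paper would use its own learned dictionary, refs. [18-19]), and a Mertens-style well-exposedness Gaussian as the chrominance weight, since the exact weight form is not specified above; all of these are assumptions, not the authors' implementation.

```python
# Hedged sketch of the luminance (CSR activity + winner-take-all) and
# chrominance (Gaussian-weighted) fusion steps. The dictionary, solver
# settings, and Gaussian weight form are illustrative assumptions.
import numpy as np
from sporco import util
from sporco.admm import cbpdn

def csr_activity(edge_map: np.ndarray, lmbda: float = 0.01) -> np.ndarray:
    """Per-pixel activity level: L1 norm of the convolutional sparse
    coefficients of the edge feature map."""
    D = util.convdicts()['12x12x36']                # example dictionary
    opt = cbpdn.ConvBPDN.Options({'Verbose': False, 'MaxMainIter': 100})
    X = cbpdn.ConvBPDN(D, edge_map.astype(np.float32), lmbda, opt).solve()
    return np.abs(X).sum(axis=(2, 3, 4))            # (H, W) activity map

def fuse_luminance(lumas, activities):
    """Winner-take-all: each pixel is taken from the exposure whose
    activity level is highest at that location."""
    Y = np.stack(lumas)                              # N x H x W
    idx = np.argmax(np.stack(activities), axis=0)    # winner per pixel
    return np.take_along_axis(Y, idx[None], axis=0)[0]

def fuse_chroma(chromas, lumas, sigma: float = 0.2):
    """Gaussian-weighted chrominance fusion (weight form assumed:
    well-exposedness of the corresponding luminance, values in [0, 1])."""
    C, Y = np.stack(chromas), np.stack(lumas)
    W = np.exp(-((Y - 0.5) ** 2) / (2 * sigma ** 2))
    W /= W.sum(axis=0, keepdims=True) + 1e-12        # normalize per pixel
    return (W * C).sum(axis=0)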
-
Table 1. Comparison of the Q^(AB/F) metric for different algorithms
Sequences    Mertens[9]  Kang[10]  Liu[11]  Ma[12]  Ma[13]  Proposed
Room         0.6629      0.6653    0.6573   0.6263  0.6598  0.6721
House        0.6878      0.6962    0.6910   0.5894  0.6780  0.6997
Forth4       0.6462      0.6477    0.6418   0.6258  0.6331  0.6539
Garage       0.6860      0.6864    0.6837   0.6686  0.6785  0.6956
Cafe         0.6755      0.6842    0.6865   0.6721  0.6665  0.6866
Tower        0.7598      0.7699    0.7793   0.7707  0.7689  0.7695
SwissSunset  0.6331      0.6132    0.6028   0.5971  0.6077  0.6279
Average      0.6788      0.6804    0.6775   0.6500  0.6704  0.6865
-
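For context, the Q^(AB/F) scores above follow the edge-transfer idea of ref. [24]: measure how well the fused image preserves each source's Sobel edge strength and orientation, then average with edge-strength weights. The sketch below is a simplified illustration, not the reference implementation: the sigmoid constants follow values commonly used in public implementations, and the extension from two sources to an N-image sequence is an assumption.

```python
# Hedged, simplified sketch of the Q^(AB/F) edge-preservation metric
# (ref. [24]). Constants and the N-source extension are assumptions.
import numpy as np
from scipy import ndimage

def _edges(img):
    gx = ndimage.sobel(img, axis=1)
    gy = ndimage.sobel(img, axis=0)
    return np.hypot(gx, gy), np.arctan(gy / (gx + 1e-12))

def qabf(sources, fused, L=1.5):
    gF, aF = _edges(fused)
    num = den = 0.0
    for src in sources:
        g, a = _edges(src)
        # Relative edge-strength and orientation preservation
        G = np.where(g > gF, gF / (g + 1e-12), g / (gF + 1e-12))
        A = 1.0 - np.abs(a - aF) / (np.pi / 2)
        # Sigmoid mappings; constants from common implementations
        Qg = 0.9994 / (1 + np.exp(-15.0 * (G - 0.5)))
        Qa = 0.9879 / (1 + np.exp(-22.0 * (A - 0.8)))
        w = g ** L                      # edge-strength weighting
        num += (Qg * Qa * w).sum()
        den += w.sum()
    return num / (den + 1e-12)          # higher is better, in [0, 1]
```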
[1] Artusi A, Richter T, Ebrahimi T, et al. High dynamic range imaging technology[J]. IEEE Signal Processing Magazine, 2017, 34(5): 165-172. doi: 10.1109/MSP.2017.2716957
[2] Chiang J C, Kao P H, Chen Y S, et al. High-dynamic-range image generation and coding for multi-exposure multi-view images[J]. Circuits, Systems, and Signal Processing, 2017, 36(7): 2786-2814. doi: 10.1007/s00034-016-0437-x
[3] Du L, Sun H Y, Wang S, et al. High dynamic range image fusion algorithm for moving targets[J]. Acta Optica Sinica, 2017, 37(4): 101-109. doi: 10.3788/aos201737.0410001
[4] Li S T, Kang X D, Fang L Y, et al. Pixel-level image fusion: a survey of the state of the art[J]. Information Fusion, 2017, 33: 100-112. doi: 10.1016/j.inffus.2016.05.004
[5] Zhao C H, Guo Y T, Wang Y L. A fast fusion scheme for infrared and visible light images in NSCT domain[J]. Infrared Physics & Technology, 2015, 72: 266-275. doi: 10.1016/j.infrared.2015.07.026
[6] Chen C, Li Y Q, Liu W, et al. Image fusion with local spectral consistency and dynamic gradient sparsity[C]//Proceedings of 2014 IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA, 2014: 2760-2765.
[7] Sun J, Zhu H Y, Xu Z B, et al. Poisson image fusion based on Markov random field fusion model[J]. Information Fusion, 2013, 14(3): 241-254. doi: 10.1016/j.inffus.2012.07.003
[8] Liu Y, Liu S P, Wang Z F. A general framework for image fusion based on multi-scale transform and sparse representation[J]. Information Fusion, 2015, 24: 147-164. doi: 10.1016/j.inffus.2014.09.004
[9] Mertens T, Kautz J, van Reeth F. Exposure fusion: a simple and practical alternative to high dynamic range photography[J]. Computer Graphics Forum, 2009, 28(1): 161-171. doi: 10.1111/j.1467-8659.2008.01171.x
[10] Li S T, Kang X D, Hu J W. Image fusion with guided filtering[J]. IEEE Transactions on Image Processing, 2013, 22(7): 2864-2875. doi: 10.1109/TIP.2013.2244222
[11] Liu Y, Wang Z F. Dense SIFT for ghost-free multi-exposure fusion[J]. Journal of Visual Communication and Image Representation, 2015, 31: 208-224. doi: 10.1016/j.jvcir.2015.06.021
[12] Ma K D, Li H, Yong H W, et al. Robust multi-exposure image fusion: a structural patch decomposition approach[J]. IEEE Transactions on Image Processing, 2017, 26(5): 2519-2532. doi: 10.1109/TIP.2017.2671921
[13] Ma K D, Duanmu Z F, Yeganeh H, et al. Multi-exposure image fusion by optimizing a structural similarity index[J]. IEEE Transactions on Computational Imaging, 2018, 4(1): 60-72. doi: 10.1109/TCI.2017.2786138
[14] Kolda T G, Bader B W. Tensor decompositions and applications[J]. SIAM Review, 2009, 51(3): 455-500. doi: 10.1137/07070111X
[15] Wang H Z, Ahuja N. A tensor approximation approach to dimensionality reduction[J]. International Journal of Computer Vision, 2008, 76(3): 217-229. doi: 10.1007/s11263-007-0053-0
[16] Zeiler M D, Krishnan D, Taylor G W, et al. Deconvolutional networks[C]//Proceedings of 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, San Francisco, CA, USA, 2010: 2528-2535.
[17] Chen S S, Donoho D L, Saunders M A. Atomic decomposition by basis pursuit[J]. SIAM Journal on Scientific Computing, 1998, 20(1): 33-61. doi: 10.1137/S1064827596304010
[18] Wohlberg B. Efficient algorithms for convolutional sparse representations[J]. IEEE Transactions on Image Processing, 2016, 25(1): 301-315. doi: 10.1109/TIP.2015.2495260
[19] Liu J L, Garcia-Cardona C, Wohlberg B, et al. Online convolutional dictionary learning[C]//Proceedings of 2017 IEEE International Conference on Image Processing, Beijing, China, 2017.
[20] Liu Y, Chen X, Ward R K, et al. Image fusion with convolutional sparse representation[J]. IEEE Signal Processing Letters, 2016, 23(12): 1882-1886. doi: 10.1109/LSP.2016.2618776
[21] Paul S, Sevcenco I S, Agathoklis P. Multi-exposure and multi-focus image fusion in gradient domain[J]. Journal of Circuits, Systems, and Computers, 2016, 25(10): 1650123. doi: 10.1142/S0218126616501231
[22] Banterle F, Artusi A, Debattista K, et al. Advanced High Dynamic Range Imaging: Theory and Practice[M]. Natick, MA: A K Peters, 2011.
[23] Ma K D. Multi-Exposure Image Fusion by Optimizing A Structural Similarity Index[DB/OL]. https://ece.uwaterloo.ca/~k29ma/dataset/MEFOpt_Database, 2018.
[24] Xydeas C S, Petrovic V. Objective image fusion performance measure[J]. Electronics Letters, 2000, 36(4): 308-309. doi: 10.1049/el:20000267