Using Kalman filter in the frequency domain for multi-frame scalable super resolution

Accepted Manuscript Using Kalman filter in the frequency domain for multi-frame scalable super resolution Akbar Rahimi , Payman Moallem , Kamal Shaht...

Download PDF

4MB Sizes 0 Downloads 22 Views

Report

Full Text

Accepted Manuscript

Using Kalman filter in the frequency domain for multi-frame scalable super resolution Akbar Rahimi , Payman Moallem , Kamal Shahtalebi , Mehdi Momeni PII: DOI: Reference:

S0165-1684(18)30296-2 https://doi.org/10.1016/j.sigpro.2018.09.012 SIGPRO 6922

To appear in:

Signal Processing

Received date: Accepted date:

14 August 2018 8 September 2018

Please cite this article as: Akbar Rahimi , Payman Moallem , Kamal Shahtalebi , Mehdi Momeni , Using Kalman filter in the frequency domain for multi-frame scalable super resolution, Signal Processing (2018), doi: https://doi.org/10.1016/j.sigpro.2018.09.012

This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

ACCEPTED MANUSCRIPT

Using Kalman filter in the frequency domain for multi-frame scalable super resolution Akbar Rahimi1, Payman Moallem1*, Kamal Shahtalebi1, Mehdi Momeni2 1- Department of Electrical Engineering, Faculty of Engineering, University of Isfahan, Isfahan, Iran

*Corresponding Author

CR IP T

2- Department of Surveying Engineering, Faculty of Engineering, University of Isfahan, Isfahan, Iran

Abstract: Kalman filter (KF) as a linear estimator which is used in super-resolution (SR)

AN US

problems, suffers from high computational costs and storage requirements. To gain appreciable success in the elimination of these two challenges, this paper advances a SR framework employing KF in the frequency domain, while no resort is made to any approximations or extra assumptions in the dynamic system modeling and statistical matrices.

M

Generally, previous KF-based SR methods organized the system with huge-sized matrices in the spatial domain, following which they tried to reduce the system dimension using

ED

approximation and/or limitation on point spread function (PSF). In this study, first, several

PT

small-dimension dynamic systems are separately made in the frequency domain supporting space-invariant PSFs of an arbitrary form and size. Then, the acquired small-dimension KF

CE

estimators are applied rather than the traditional huge-dimension one. These will greatly reduce computational complexity, decrease storage requirements allowing parallel

AC

implementation as well. Furthermore, our proposed SR framework can be used to produce high resolution image of an expedient size, that is, a scalable SR. Experimental results with both simulated and real world sequences indicate that our proposed framework works more effectively than the other compared methods, especially in fine details restoration.

Keywords: Super-resolution, Kalman filter, frequency domain 1

ACCEPTED MANUSCRIPT

1-Introduction The higher the resolution of an image, the more the details that emerge in observation. So a digital image with higher resolution is always more desirable in such wide applications as remote sensing, military, medical diagnosis, traffic monitoring than traditional vision imaging [1]. However, the available array imaging sensor is not dense enough due to the physical

CR IP T

limitations and forbidding cost constraints. The multi-frame super-resolution (SR) reconstruction can greatly assist in extracting the requisite information from a set of low resolution (LR) degraded images by creating a high resolution (HR) image [2]. The LR images are captured at different perspectives of the same scene. Generally, at least one of

AN US

sub-pixel translations among the LR frames is considered a good assumption in the very area of real applications.

The multi-frame image super resolution framework was first introduced by Tsai and Huang

M

[3] in the frequency domain. They derived a linear system equation in accordance with the

ED

relation between LR images and the desired HR image. They also assumed that the frames were band-limited having sub-pixel translations among them. Then, Tekalp et al. [4]

PT

considered observation noise and spatial blurring in developing their modeling formulation. Later, Kim et al. [5] provided another extension of the frequency domain approaches based

CE

on weighted least squares formulation and Tikhonov regularization. To reduce the memory

AC

requirements and the computational costs, a discrete cosine transforms (DCT)-based method was proposed by Rhee and Kang [6]. Frequency domain algorithms typically establish use of the relationship between the desired HR image and the observed LR images based on a simple theory, requiring short computation time [1]. On the other hand, a variety of spatial domain methods have been developed incorporating non-uniform interpolation [7], iteration back-projection (IBP) [8], projection onto convex sets (POCS) [9], HR reconstruction without primary registration [10], Kalman filter-based 2

ACCEPTED MANUSCRIPT methods, and regularized methods. It should be noted that most of the recent research works in the spatial-based SR problems focus on the regularized frameworks because of their effectiveness and flexibility [1]. The regularized methods estimate the desired HR image by minimizing an objective function which includes two main terms: Data fidelity and regularization. While the first one measures the closeness between the reconstructed HR

resulting in a robust solution [11,12,13,14,15,16,17,18].

CR IP T

image and the captured LR images, the second one is employed to regularize the problem

Generally, tuning regularized sensitive parameters is a notorious challenge in regularized SR

AN US

methods. In comparison to the spatial domain methods, frequency domain approaches might have difficultly supporting complicated motion conditions and being sensitive to model errors. Nonetheless, they enjoy several major advantages including theoretical simplicity,

M

facility for parallel implementation, and needing no sensitive parameters tuning. Kalman filtering theory is widely known as an optimal estimation theory under linear

ED

constraints and Gaussian noise [19]. Some SR reconstruction methods are based on the Kalman filter theory, which methods try to estimate the HR image from a set of observed

PT

aliased noisy LR images with regard to a linear observation model. On the whole, the

CE

traditional Kalman filter-based SR method suffer from high computational complexity and huge storage requirements, which emanate from the storage of high dimension matrices and

AC

special type of operations like matrix inversion during updating procedures. Some researchers have tried to meet the major challenges inherent in Kalman filter-based SR methods. Dellaert et al. [20] proposed a simplified Kalman filter-based algorithm which neglects off-diagonal elements of covariance matrix for the computational feasibility. Farsiu and Milanfar [21] carried out a two-step SR algorithm. In the first step, they used the Kalman filter framework for fusing the data, and in the second one, they applied the de-blurring technique. Newland et al. [22] developed a modified Kalman filter using the steady state Kalman gain matrix and 3

ACCEPTED MANUSCRIPT square-form filter for imaging point spread function (PSF). Recently, Wei et al. [23] proposed point-wise Kalman filter-based method by introducing a new system of modeling and changing covariance matrix to modified block diagonal covariance matrix in order to solve the computation and storage problems. Although the developed Kalman filter-based approaches have overcome some of the existing disadvantages of the Kalman filters,

degrade the quality of the reconstructed HR image.

CR IP T

limitations on the PSF type and size, and the approximations on covariance matrix still

The present study introduces a novel Kalman filter-based SR method without any recourse to

AN US

approximations to achieve better performance. Also the method we have proposed requires no tuning parameters, supports the space-invariant PSFs of any form and size, and is capable of reconstructing the HR image with an arbitrary size and scale. Further, it reduces the computational complexity and the required storage simultaneously. As a first step, an

M

observation model is developed relating the LR images to the continuous scene. Then the

ED

discrete Fourier transform (DFT) is evolved to obtain the frequency domain observation model. Next, the low-dimension dynamic system equations are organized based on the

PT

spectrum observation relation. Finally, the Kalman filter is employed to estimate the system

CE

states of constituting the frequency components of the desired HR image. In fact, the frequency domain helps to develop separate low-dimension dynamic systems. In

AC

other words, all pixels of HR image in the spatial domain relate to all pixels of each LR image while in the spectrum domain, a few frequency components of HR image relate to each frequency component of LR image causing a separation of several dynamic systems and leading to several low-dimension Kalman filters. After this general overview, it is in order to present the format of what appears in the following parts. In Section 2, an observation model is developed in the spectrum domain. In

4

ACCEPTED MANUSCRIPT Section 3, we organize the dynamic system equations, derive corresponding Kalman filter estimators, and describe our proposed SR algorithm. The experimental results are presented in Section 4. And the final Section 5 brings this study to an end by enumerating the

CR IP T

outstanding characteristic achievements of the method we have proposed here.

2-Image observation model

An observation model which simulates the relation between the desired HR image and each of the observed LR images is an essential step in the SR image procedure. Supposing the

AN US

imaging model which universally involves warping, blurring, down sampling, and noise adding, relates each observed LR image to a desired HR image.

Similar to the most represented SR methods, only global sub-pixel translations from among

M

LR images are considered, i.e., those which adapt to the most real successive videos. Let us suppose scene which is considered as a two dimensional (2-D) continuous signal being

(

). To obtain each observed LR image for the continuous scene, as a

) is translated by

and

coming from camera shifting to a reference

PT

first step,

(

ED

represented as

point in -direction and -direction, respectively. Then the shifted continuous signal

(

CE

), assumed to be band limited, is sampled at or above Nyquist rate to define a new

version of the desired HR image. Blurring, down-sampling, and noise adding are the other

AC

steps in the observation modeling as are depicted in Fig.1. Further, we can express the image observation as: ( (

))

(1)

5

ACCEPTED MANUSCRIPT Where

are rth observed LR image and the added measurement noise,

and

respectively.

is the imaging system PSF.

and

are the sampling and down-sampling

operations, respectively. * is the 2-D discrete convolution operator. of

in -direction and -direction, respectively. Besides,

and

are also the size

is the number of available

CR IP T

frames.

rth Warped HR image Continues scene

rth Observed LR image

Optical or/and Motion blur and imaging system PSF

Down Sampling by factor d

AN US

Sampling at or above Nyquist rate

Continues translation

Noise

M

Fig.1, Observation model relating LR images to HR image in spatial domain.

ED

Eq. (1) formulates the observation model in the spatial domain for the SR problem to be expressed in the frequency domain. Accordingly, step by step, we transform the obtained

PT

spatial-based model to the spectrum domain. (

CE

As a first move, let

(FT) of continuous signal

AC

respectively while

( (

(

) be the Fourier transform

) and the corresponding shifted version represent frequency in

(

-direction and

), -direction,

. Based on the sampling theory, the DFT of the signal

)) is given by: (

Where

and √

respectively. Also

(

) and

)

(2)

;

is the DFT of the desired HR image

6

.

and

are the pixel sizes

ACCEPTED MANUSCRIPT in -direction and -direction, respectively. represent discrete frequency in

is also well-known as the SR factor(srf).

and

(

) is

-direction and

-direction, respectively.

assumed band limited. For the shifted continuous signal sample

( (

)) which is rth HR image, DFT

( (

))

↔

(

)

Where

and

, while

and

CR IP T

can be given as follow:

(3)

represent the pixel plus sub-pixel

AN US

shifting of the rth HR image in -direction and -direction, respectively. By comparing Eq.(3) relating DFT of the rth HR image to that of the

and Eq.(2), the translation function

reference HR image can be defined as follows:

(4)

M

,

ED

Now, assume that the PSF of the observing system is space invariant. Then we can write: ( (

)) ↔

is the DFT of PSF . For simplicity of notations, we set

CE

PT

Where

(5)

.

(6)

Finally, if we apply the DFT of the down sampled 2-D signal to factor , the observation

AC

equations can be completely expressed in the spectrum domain as ∑

Where

(7)

is the measurement noise at

DFT of the noise variance

∑

,

. It is easy to show that if

frequency component equaling the is a white Gaussian noise with

will also be a white Gaussian noise with variance 7

(cf.

ACCEPTED MANUSCRIPT Appendix A). Moreover, in cases where

is non-Gaussian, based on the central limit

theorem (CLT), when independent random variables are added, their proper sum tends toward a Gaussian distribution [24], which means each component of

can

be approximated by a Gaussian noise.

AN US

and which is to be exploited in our SR reconstruction method.

CR IP T

Fig. 2 illuminates our proposed observation model developed out of the previous discussion

M

Fig. 2, Observation model relating frequency components of rth LR image to that of HR

ED

image.

CE

algorithm

PT

3- Developing the dynamic systems, Kalman filters, and the proposed

It can be seen from Eq. (7) that the frequency domain observation model of the super

AC

resolution is a linear measurement. Hence, the Kalman filter estimator can be used to solve super resolution problems. To attain this objective, first the proper dynamic systems must be organized. 3-1-Dynamic systems

8

ACCEPTED MANUSCRIPT Essentially, every dynamic system can be set up through a state vector determination and establishing two equations: system and measurement equations. Based on Eq. (7), the states vector

can be defined as:

[

(

)

(

)

]

And, the measurement matrix is also defined as (

CR IP T

(8)

)

(

)

AN US

(9)

Now, based on the linear relation (7) and the defined matrix, the dynamic system equations are realized as follows:

(10)

ED

M

{

Where

represents the system modeling error assumed to be zero,

.

PT

represents the measurement error described above, with the covariance matrix for all , while

is corresponding to the FT of

AC

CE

the desired HR image at the reference position.

3-2-Kalman filter and the proposed SR algorithm Based on the developed dynamic systems through Eq.(10), Kalman filters can be derived to estimate the states being frequency components of desired HR image. There are two steps to the Kalman filter process. The first step is prediction, the second one is updating. For the

9

ACCEPTED MANUSCRIPT represented dynamic systems, these two steps can be joined and the corresponding Kalman filter should be derived as follows [19]: ( ̂

)

̂

̂

(

)

(11) (12)

and

are the Kalman gain and the covariance matrix,

respectively. Superscript

represents the matrix Hilbert transpose.

Eq. (11) provides the gain matrix at utilizing

gain

th

matrix ̂

observation and Eq. (12) updates the system state ̂ and

the

measurement

residual

AN US

Where

CR IP T

(13)

. Additionally, Eq. (13) updates the system covariance

matrix. Using these equations, the Kalman filtering process can be formulated and renewed . Afterwards, they are repeated iteratively.

M

for

Finally, the SR procedure is completed by placing each final version of ̂ to Eq.(8), and applying the inverse DFT of ̂

according

ED

to obtain the desired HR image.

PT

To constrain the required number of iterations in a way that the algorithm converges, we

‖̃

CE

consider that the latter algorithm is terminated if the following condition is satisfied: ̃

‖ ‖̃

‖

(14)

AC

Where ̃ is the estimate of the DFT of the desired HR image at the tth iteration, ‖ ‖ and denote norm L2 and the terminated threshold, respectively. Our proposed SR method can be summarized in the following algorithm scheme. It is noted that in our research the same initial value matrices

where

is considered for all covariance

is a presetting scalar.

10

ACCEPTED MANUSCRIPT

Algorithm

AN US

CR IP T

Predetermine values: noise variance, for all ̂ Initial values; , I (I is identity matrix), for all , ( ) Input frames ; Calculate DFT of the frames as 1- Repeat while terminated condition Eq.(14) not fulfilled 2-Repeat for to 3- Repeat for =0 to 4- Repeat for to Calculate Kalman gain matrix according to Eq.(11) Estimate ̂ according to Eq.(12) Update covariance matrix according to Eq.(13) End of repeat 4 End of repeat 3 End of repeat 2 Set End of repeat 1 Calculate inverse DFT of final ̂ as the desired HR image

M

4-Experimental Results

ED

In this section, we make use of both artificially created LR images and real frames to test our proposed SR algorithm, so we can evaluate its performance in comparison with other existing

PT

SR methods. Three metrics, PSNR (peak signal to noise ratio), SSIM [25] (structural similarity index measure) and FSIM [26] (feature similarity index measure) are employed to

CE

verify the quality of the reconstructed HR images in simulation cases. Then real world frames

AC

are applied to visually evaluate and compare our SR approach with other related methods presented in application. In our experiments, different PSFs selected from Table 1 are employed. Moreover, in the final experiments, the scalable ability of our proposed SR method to reconstruct HR images of different sizes and scales is represented. In order to have a fair validation of our proposed algorithm, we employ five SR algorithms, namely, two methods based on the Kalman filter, that is, video-to-video dynamic system SR method (VTV) [21] and the Kalman filter-based method (KFB) [23], and three regularized 11

ACCEPTED MANUSCRIPT SR methods which are L1 norm with BTV method (L1+BTV)[11], adaptive norms with modified BTV method (AN+MBTV) [12], and regional, spatially adaptive total variation method (RSATV) [13]. In all experiments, initial values are provided in the manner presented in Table 1. In addition,

CR IP T

regularization parameters are adjusted by several rounds of running in algorithms- VTV [21], L1+BTV [11], AN+MBTV [12], and RSATV [13]- until the best HR results are achieved. Additionally for each method, the number of iterations used is provided in Table 1. Table 1

AN US

Some details of the five SR algorithms investigated and our proposed method SR method

Iteration number

predetermined parameters

Tuning

VTV[21]

Step1 Step2 20 15 10 50

α=0.9

regularized parameters ---regularized parameters regularized parameters

RSATV[13]

M

α=0.8,q=2, c=1, α=0.8,β=0.05,τ=20 Noise variance=σ2

----

̂

bi-cubic interpolation of the first frame

̂ For all

CE

Proposed method

80(srf=2) 120(srf=4) 80(srf=2) 120(srf=4) Automatic determination

ED

AN+MBTV[12]

σ2=10,ε2=0.5 q=2

PT

KFB[23] L1+BTV[11]

Initial values

AC

4-1- Results of the simulated frames 4-1-1 Effectiveness of the proposed method in the fine details restoring In this section, to validate the ability of the proposed method in details restoring, an experiment is designed conducted to a test pattern as the HR image containing different frequencies in spatial domain (Fig.3(a)). The process of the observation model is carried out in the following steps. At first, the original HR image is shifted in horizontal and vertical 12

ACCEPTED MANUSCRIPT planes while the translations are chosen randomly from the ranges between (0-1). Then, the shifted images are blurred with a low pass filter as imaging system PSF listed in Table 2 (srf=2). In the third step, the blurred image is down sampled by factor 2. Finally, zero mean Gaussian noise with variance σ2=4 is added to obtain the LR image. In this experiment, 12

Table 2 Characteristics of three types of PSFs used in our experiments

srf=4

size Strength(∑

)

size Strength(∑

PSF2 Gaussian 3×3, σ=1 0.1256 5×5, σ=1 0.8250

AN US

srf=2

PSF1 Square 2×2 0.2500 4×4 0.6250

CR IP T

LR images are provided.

)

PSF3 Gaussian 3×3, σ=2 0.1119 5×5, σ=2 0.4330

The experimental quantity results are reported in Table 3 including PSNR, SSIM [25] and

M

FSIM [26] of VTV [21], KFB [23], L1+BTV [11], AN+MBTV [12], and RSATV [13] methods. Added to which is our proposed method with different PSFs. Moreover, some

ED

reconstructed HR images are given in Figs. 3 through 5.

PT

It can be seen that for the cropped regions, the proposed method achieves higher PSNR, SSIM and FSIM than that of other methods. In the other hand, the results obtained by using

CE

the proposed method are more appealing in preserving the fine details containing thin lines and corners in the test pattern. Moreover, for the total reconstructed test pattern, our proposed

AC

method restores higher quality images based on SSIM and FSIM. Fig.4 and 5 show the corresponding visual effects.

13

ACCEPTED MANUSCRIPT Table 3 PSNR, SSIM, and FSIM of reconstructed test pattern (cropped region) by VTV [21], KFB [23], L1+BTV [11], AN+MBTV[12], RSATV[13], and our proposed methods with different PSFs Assessment

VTV [21]

L1+BTV [11]

KFB [23]

AN+MBTV[12]

Proposed

RSATV[13]

20.54(12.70)

26.53(17.78)

19.32(15.89)

30.23(21.59)

29.85(23.98)

30.80(29.62)

SSIM

0.9792(0.7096)

0.9732(0.9042)

0.9066(0.8793)

0.9748(0.9663)

0.9934(0.9875)

0.9978(0.9902)

FSIM

0.9182(0.7418)

0.9869(0.9541)

0.9636(0.8527)

0.9910(0.9813)

0.9973(0.9757)

0.9999(0.9969)

PSNR

17.08(7.93)

24.18(13.66)

--

22.64(12.70)

24.05(12.39)

22.41(16.81)

SSIM

0.9313(0.0785)

0.9676(0.7546)

---

0.9651(0.6871)

0.9651(0.6833)

0.9790(09117)

FSIM

0.8956-0.2446

0.9824-0.8016

---

0.9795-0.7365

0.9844-0.7547

0.9945-0.9035

PSNR

16.42(7.83)

21.57(10.15)

---

21.91(10.86)

22.84(10.83)

21.64(14.04)

SSIM

0.9127(0.0605)

0.9481(0.4697)

---

0.9732(0.5476)

0.9535(0.5729)

0.9538(0.7117)

FSIM

0.8945(0.2325)

0.9727(0.4470)

---

0.9811(0.5081)

0.9769(0.4936)

0.9853(0.8182)

ED

M

AN US

CR IP T

PSNR

(b)

(c)

(e)

(f)

CE

PT

(a)

AC

PSF3

PSF2

PSF1

method

(d)

Fig.3, Experimental results of the test pattern image by using PSF1: (a) Original HR image. Reconstructed images of the: (b) VTV[21], (c) L1+BTV[11],(d) AN+MBTV[12], (e) RSATV[13], and (f) The proposed SR methods.

14

ACCEPTED MANUSCRIPT

(b)

(d)

(e)

(c)

CR IP T

(a)

(f)

(b)

(c)

(d)

(e)

(c)

(d)

(e)

CE

(a)

PT

ED

(a)

M

AN US

Fig.4, Detailed region cropped from Fig.3.

(b)

AC

Fig.5, Detailed region cropped from reconstructed HR images obtained by: (a) VTV [21] ,(b) KFB[23],(c) L1+ BTV[11],(d) AN+MBTV[12], (e) RSATV[13], and (f) The proposed SR methods. First row and second row corresponding to PSF2 and PSF3, respectively.

4-1-2- Testing our proposed method conducted to traditional images In this part of simulation experiments, three popular images are chosen, Boat, Zebra, and Barbara of a 400 400 size as HR images having proper details. The process of the observation model is carried out in the same manner which is described in the previous

15

ACCEPTED MANUSCRIPT section in the cases where srf=2. For the cases where srf=4, the translations are chosen randomly from ranges between (0-4), imaging system PSF is selected from the second row of the Table 2, and the blurred image is down sampled by factor 4. 30 LR images are generated in the cases where srf=4.

CR IP T

The experimental quantity results are reported in Table 4 and 5 including PSNR, SSIM [25] and FSIM [26] of VTV [21], KFB [23], L1+BTV [11], AN+MBTV [12],and RSATV [13] methods. Added to which is our proposed method with different PSFs where the SR factors (srf) are considered 2 and 4. Moreover, some reconstructed HR images, while cropped

AN US

regions, and corresponding metric indexes are given in Figs. 6 through 8 in the case where srf=2 and, in Figs. 9 through 11 in the case where srf=4, respectively. The following points deserve attention:

Based on the three metrics PSNR, SSIM, and FSIM, for most cases, the obtained

M



images by our proposed method exhibit higher quality measures computed with those

The edges, the characters, and the lines in the reconstructed images accomplished via

PT



ED

of other methods, regardless of the srf, the image, and the PSF.

the proposed method are much more vivid and distinct compared with those of other

CE

methods. These are reflected in the shown cropped regions and corresponding metrics.

The output quality of the VTV [21] method is completely damaged in case where

AC



srf=4.



For the image containing more details such as Zebra and Barbara, the proposed method is more effective i.e. the details are better reconstructed through our method.

16

ACCEPTED MANUSCRIPT 

Based on the obtained results of the quantity and visual dimensions, our reconstructed output quality- corresponding to PSF1and PSF2 in the cases where srf=2 and srf=4 respectively- surpasses all other considered cases.



The order of the outputs qualities corresponding to the PSFs are as follows: PSF1>PSF2>PSF3 and PSF2>PSF1>PSF3 in the cases where srf=2 and srf=4,

CR IP T

respectively which are compatible with strength ordering of the PSF filters. It is noted that the strength of the PSF filters (∑

) are presented in Table 2.

In general, based on the reported metrics in Table 4 ,Table 5 and the images displayed in

AN US

Figs. 6 through 11, comparatively, the reconstructed HR images and the restored details produced by our proposed method are superior to those obtained through the five other methods in cases where srf=2 and srf=4.

M

In addition, one of the most important advantages of the proposed method is that it does not require any parameter tuning, while generally speaking, in the case of the regularized SR

ED

methods like the ones utilized in our experiments, it is necessary to adjust the tuning parameters-dependent upon the image properties-effected by several rounds of running until

PT

the acceptable output quality is attained. Moreover, the KFB [23] method is limited by the

AC

CE

size of PSF while our proposed method supports PSFs of whatever size and form.

17

ACCEPTED MANUSCRIPT Table 4 PSNR, SSIM, and FSIM of reconstructed HR images by VTV [21], KFB [23], L1+BTV [11], AN+MBTV[12], RSATV[13], and our proposed methods with different PSFs in the case where srf=2 Zebra

Barbara

PSF3

PSF1

PSF2

PSF3

PSF1

PSF2

PSF3

VTV [21]

34.23

32.38

31.51

30.62

27.94

26.51

31.23

27.14

26.00

KFB[ 23]

33.82

---

---

30.46

---

---

31.50

---

---

L1+BTV [11]

34.07

32.34

31.99

30.53

28.09

27.08

31.04

27.73

26.94

AN+MBTV [12]

36.32

33.87

33.88

33.47

29.64

RSATV [13]

35.44

33.02

32.87

32.47

29.21

Proposed Method

35.92

33.11

33.00

35.27

29.99

CR IP T

PSF2

28.70

33.69

29.08

28.58

28.65

32.23

28.76

28.29

29.04

35.74

29.15

28.65

VTV [21]

0.9952 0.9890 0.9835 0.9956 0.9858 0.9753 0.9898 0.9542 0.9268

KFB [23]

0.9930

---

---

0.9888

---

---

0.9882

---

---

0.9873 0.9847 0.9806 0.9877 0.9767 0.9708 0.9781 0.9523 0.9321

AN+MBTV [12]

0.9957 0.9923 0.9902 0.9969 0.9920 0.9851 0.9927 0.9757 0.9621

RSATV [13]

0.9937 0.9877 0.9844 0.9958 0.9907 0.9856 0.9906 0.9698 0.9555

Proposed Method

0.9960 0.9905 0.9866 0.9986 0.9940 0.9879 0.9958 0.9781 0.9616

VTV[21]

0.9967 0.9927 0.9914 0.9959 0.9892 0.9846 0.9943 0.9787 0.9723

KFB [23]

0.9950

---

AN US

L1+BTV [11]

---

0.9908

---

---

0.9928

---

---

0.9913 0.9907 0.9893 0.9914 0.9839 0.9822 0.9880 0.9758 0.9693

AN+MBTV [12]

0.9980 0.9962 0.9954 0.9970 0.9929 0.9916 0.9960 0.9884 0.9841

RSATV [13]]

0.9972 0.9946 0.9932 0.9951 0.9904 0.9895 0.9953 0.9864 0.9817

Proposed Method

0.9986 0.9959 0.9940 0.9989 0.9941 0.9878 0.9980 0.9885 0.9806

ED

M

L1+BTV [11]

AC

CE

PT

FSIM

SSIM

PSNR

Boat PSF1

18

ACCEPTED MANUSCRIPT

PSNR=32.52 SSIM=0.9486 FSIM=0.9764

PSNR=32.34 SSIM=0.9430 FSIM=0.9776

(b)

M

(c)

PSNR=35.03, SSIM=0.9663 FSIM=0.9869

AN US

PSNR=32.52 SSIM=0.9364 FSIM=0.9772

CR IP T

(a)

(e)

ED

PSNR=34.22, SSIM=0.9606 FSIM=0.9846

(d)

PSNR=35.32, SSIM=0.9516 FSIM=0.9914

(f)

AC

CE

PT

Fig.6, Reconstructed images , regional details, and corresponding metric indexes of Zebra by using: (a) VTV [21] ,(b) KFB[23], (c) L1+BTV[11],(d) AN+MBTV[12], (e) RSATV[13], and (f) The proposed SR methods while considering PSF1 in the case where srf=2.

19

ACCEPTED MANUSCRIPT

PSNR=23.86, SSIM=0.4420, FSIM=0.7027

PSNR=24.53, SSIM=0.5169, FSIM=0.7490

(b)

(d)

ED

M

(c)

PSNR=25.29, SSIM=0.5643, FSIM=0.7993

AN US

PSNR=25.19, SSIM=0.5526, FSIM=0.7759

CR IP T

(a)

PSNR=25.71, SSIM=0.6040, FSIM=0.8031

(e)

AC

CE

PT

Fig.7, Reconstructed images , regional details, and corresponding metric indexes of Barbara by using: (a) VTV [21] , (b) L1+BTV[11],(c) AN+MBTV[12], (d) RSATV[13], and (e) The proposed SR methods while considering PSF2 in the case where srf=2.

20

ACCEPTED MANUSCRIPT

PSNR=28.31, SSIM=0.7903, FSIM=0.8778

PSNR=23.86, SSIM=0.4420, FSIM=0.7027

(b)

CR IP T

(a)

PSNR=29.24, SSIM=0.8308, FSIM=0.9053

AN US

PSNR=29.93, SSIM=0.8434, FSIM=0.9009

(d)

ED

M

(c)

PSNR=30.44, SSIM=0.8565, FSIM=0.9158

(e)

AC

CE

PT

Fig.8, Reconstructed images, regional details, and corresponding metric indexes of Boat by using: (a) VTV [21], (b) L1+BTV[11], (c) AN+MBTV[12], (d) RSATV[13], and (e) The proposed SR methods while considering PSF3 in the case where srf=2.

21

ACCEPTED MANUSCRIPT Table 5 PSNR, SSIM, and FSIM of reconstructed HR images by VTV [21], KFB [23], L1+BTV [11], AN+MBTV [12], RSATV [13], and our proposed methods with different PSFs in the case where srf=4 Boat Zebra Barbara PSF3

PSF1

PSF2

PSF3

PSF1

PSF2

PSF3

29.185

---

---

23.807

---

---

24.973

---

---

29.04

29.48

28.21

23.58

24.73

22.72

24.61

25.25

23.81

AN+MBTV [12] 31.67

32.45

30.26

26.30

28.19

24.78

26.15

27.19

24.99

31.14

31.79

29.92

25.77

27.51

24.24

25.99

26.33

24.81

Proposed Method 30.30

31.013

29.21

25.58

27.76

24.08

25.79

27.35

24.62

CR IP T

KFB [ 23] L1+BTV [11] RSATV [13]

SSIM

PSF2

25.182 25.340 25.476 20.452 20.860 20.371 22.910 22.810 22.808

VTV [21]

0.8685 0.8859 0.8919 0.7906 0.7744 0.7897 0.7980 0.7915 0.7992

KFB [23]

0.9558

L1+BTV [11]

---

---

0.9375

---

---

0.8929

---

---

0.9542 0.9618 0.9450 0.9188 0.9521 0.8884 0.8780 0.9084 0.8478

AN+MBTV [12] 0.9768 0.9864 0.9646 0.9591 0.9840 0.9373 0.9181 0.9477 0.8828 RSATV [13]

0.9678 0.9798 0.9558 0.9541 0.9811 0.9307 0.9048 0.9353 0.8717

AN US

PSNR

PSF1 VTV [21]

FSIM

Proposed Method 0.9639 0.9591 0.9488 0.9529 0.9838 0.9340 0.9072 0.9465 0.8741 VTV [21]

0.9222 0.9303 0.9378 0..8863 0.9012 0.8956 0.9309 0.9176 0.9296

KFB [23]

0.9843

L1+BTV [11]

---

---

0.9788

---

---

0.9732

---

---

0.9807 0.9798 0.9763 0.9720 0.9713 0.9579 0.9653 0.9596 0.9555

AN+MBTV [12] 0.9933 0.9940 0.9882 0.9902 0.9918 0.9812 0.9812 0.9792 0.9688 RSATV [13]

0.9917 0.9925 0.9861 0.9865 0.9887 0.9750 0.9775 0.9762 0.9656

AC

CE

PT

ED

M

Proposed Method 0.9905 0.9924 0.9856 0.9876 0.9902 0.9777 0.9790 0.9792 0.9668

22

ACCEPTED MANUSCRIPT

PSNR=20.12, SSIM=0.2058, FSIM=0.5666

PSNR=20.42, SSIM=0.2543, FSIM=0.6112

(b)

(d)

ED

M

(c)

PSNR=20.75, SSIM=0.3100, FSIM=0.6456

AN US

PSNR=20.13, SSIM=0.2201, FSIM=0.5628

CR IP T

(a)

PSNR=20.81, SSIM=0.3274, FSIM=0.6806

(e)

PSNR=21.16, SSIM=0.3689, FSIM=0.6872

(f)

AC

CE

PT

Fig.9, Reconstructed images , regional details, and corresponding metric indexes of Barbara by using: (a) VTV [21] ,(b) KFB[23], (c) L1+BTV[11],(d) AN+MBTV[12], (e) RSATV[13], and (f) The proposed SR methods while considering PSF1 in the case where srf=4.

23

ACCEPTED MANUSCRIPT

PSNR=24.00, SSIM=0.6855, FSIM=0.8080

PSNR=26.74, SSIM=0.7706, FSIM=0.8487

(b)

CR IP T

(a)

PSNR=28.30, SSIM=0.7955, FSIM=0.8742

AN US

PSNR=28.54, SSIM=0.8087, FSIM=0.8740

(d)

ED

M

(c)

PSNR=28.55, SSIM=0.7771, FSIM=0.8905

(e)

AC

CE

PT

Fig.10, Reconstructed images , regional details, and corresponding metric indexes of Zebra by using: (a) VTV [21] , (b) L1+BTV[11],(c) AN+MBTV[12], (d) RSATV[13], and (e) The proposed SR methods while considering PSF2 in the case where srf=4.

24

ACCEPTED MANUSCRIPT

PSNR=25.26, SSIM=0.6837, FSIM=0.8176

PSNR=22.79, SSIM=0.5848, FSIM=0.7794

(b)

CR IP T

(a)

PSNR=26.98, SSIM=0.7617, FSIM=0.8718

AN US

PSNR=26.67, SSIM=0.7510, FSIM=0.8581

(d)

ED

M

(c)

PSNR=30.44, SSIM=0.8565, FSIM=0.9158

(e)

CE

PT

Fig.11, Reconstructed images , regional details, and corresponding metric indexes of Boat by using: (a) VTV [21] , (b) L1+BTV[11], (c) AN+MBTV[12], (d) RSATV[13], and (e) The proposed SR methods while considering PSF3 in the case where srf=4.

AC

4-2- Testing the proposed method under different noise intensities In order to evaluate the effectiveness of our proposed SR method under different conditions of noise intensity, we carry out experiments like the ones we did in the previous section except for the fact that this time different noise values are added. Firstly, simulated datasets of Zebra and Barbara images are generated while adding noise under different variances 4, 10, 16, 20 and 28. Then we apply the SR methods including our proposed method and two SR methods AN+MBTV [12] and RSATV[13] in the case where srf=2 using PSF1. The 25

ACCEPTED MANUSCRIPT results obtained from the quantitative evaluation results for reconstructed HR images and cropped regions are shown in Tables 6 and 7 while some images are shown in Figs.12 and 13. It can be deduced that: 

In most cases, based on the three reported metrics, the proposed method generates a



CR IP T

higher quality HR image than that of the other ones. When there is an increase in noise, in all cases, the three index of the cropped regions of the HR images -obtained through the proposed method- is higher than that of other methods even in the case of AN+MBTV [12] method. This means that the features

AN US

and the details of the HR images are better restored through our proposed method, a fact reflected by the FSIM metric in the high noisy cases.

PSNR SSIM FSIM PSNR SSIM FSIM PSNR SSIM FSIM PSNR SSIM FSIM PSNR SSIM FSIM

PT

4

Index

10

CE

16

AC

20

28

AN+MBTV[12]

ED

Noise Variance

M

Table 6 Quantitative evaluation results of different noise intensities for Zebra image (cropped region) experiment

33.66(32.97) 0.9972(0.9087) 0.9972(0.9476) 33.04(31.99) 0.9959(0.8881) 0.9964(0.9381) 32.57(31.57) 0.9946(0.8823) 0.9957(0.9317) 32.29(30.96) 0.9936(0.8679) 0.9951(0.9256) 31.79(31.11) 0.9921(0.8677) 0.9943(0.9284)

26

RSATV[13]

Proposed method

32.47(32.30) 0.9958(0.8964) 0.9951(0.9478) 31.84(31.68) 0.9938(0.8848) 0.9937(0.9440) 31.35(30.86) 0.9923(0.8683) 0.9925(0.9347) 31.11(30.52) 0.9913(0.8552) 0.9921(0.9326) 30.65(30.25) 0.9891(0.8536) 0.9906(0.9285)

35.19(34.30) 0.9982(0.9206) 0.9986(0.9569) 33.19(32.87) 0.9959(0.8969) 0.9969(0.9502) 32.08(31.75) 0.9937(0.8737) 0.9955(0.9380) 31.54(30.99) 0.9926(0.8491) 0.9946(0.9324) 30.77(30.40) 0.9899(0.8479) 0.9931(0.9289)

ACCEPTED MANUSCRIPT Table 7 Quantitative evaluation results of different noise intensities for Barbara image (cropped region) experiment

10

16

20

RSATV[13]

Proposed method

PSNR SSIM FSIM PSNR SSIM FSIM PSNR SSIM FSIM PSNR SSIM FSIM PSNR SSIM FSIM

33.69(31.20) 0.9927(0.9382) 0.9960(0.9605 33.10(30.47) 0.9898(0.9340) 0.9948(0.9547) 32.50(30.38) 09857(0.9288 0.9929(0.9539) 32.03(29.66) 0.9832(0.9065) 0.9915(0.9416) 31.56(29.38) 0.9793(0.9018) 0.9889(0.9389)

33.24(30.59) 0.9906(0.9412) 0.9953(0.9614) 32.35(30.47) 0.9839(0.9363) 0.9924(0.9581) 31.74(30.36) 0.9775(0.9320) 0.9899(0.9564) 31.32(30.06) 0.9735(0.9271) 0.9882(0.9539) 30.85(29.87) 0.9669(0.9175) 0.9855(0.9475)

35.70(35.31) 0.9947(0.9725) 0.9975(0.9776) 33.50(33.08) 0.9881(0.9572) 0.9946(0.9684) 32.33(32.08) 0.9825(0.9481) 0.9920(0.9619) 31.83(31.16) 0.9791(0.9382) 0.9906(0.9555) 31.00(30.42) 0.9724(0.9247) 0.9877(0.9500)

AC

CE

PT

ED

M

28

AN+MBTV[12]

CR IP T

4

Index

AN US

Noise Variance

(a)

(b)

(c)

Fig.12, One of the generated LR sequences under different noise intensities. Noise variance equals to: (a) 4, (b) 16, and (c) 28. 27

AN US

CR IP T

ACCEPTED MANUSCRIPT

(a)

(b)

(c)

Fig.13, Experimental results of the Zebra image by using PSF1 in the case where srf=2 and noise variance=16. Reconstructed images (first row) and regional details (second row) of: (a) AN+MBTV[12], (b) RSATV[13], and (c) The proposed SR methods.

M

4-3- Effects of the initial and the setting values on the performance of the proposed method

ED

The Kalman filter developed through Eqs.(11), (12), and (13) requires only one setting value

considered as

PT

for the noise variance and one initial value for the covariance matrix, which is generally where

and

are the identity matrix and the

CE

initial scalar value respectively. First, to analyze the sensitivity of the performance to parameter

, in both cases where srf=2 using PSF1 and where srf=4 using PSF2, for Barbara

AC

datasets, the algorithm is repeated with different values of; subsequently, the obtained PSNRs versus the iteration schemes are plotted in Fig.14(a). It can be observed that by considering a lower value for

, only the required number of iterations increases while the algorithm

converges to the same final PSNR in all cases. This confirms that the robustness of our proposed algorithm to initialization. Based on the obtained results, in all experiments and

are considered in the cases where srf=2 and srf=4, respectively.

28

ACCEPTED MANUSCRIPT Second, to verify the robustness of the performance of proposed SR method to setting noise variance, the algorithm is repeated as in the previous paragraph except that the setting noise variance is considered differently. Subsequently the obtained PSNRs versus iteration number are plotted in Fig.14(b). It can be seen that when setting noise variance is changed, only the

AN US

CR IP T

required iteration number is changed while the final PSNR remains constant.

CE

PT

ED

M

(a)

(b)

Fig.14, PSNR of reconstructed Barbara image versus iteration number considering different:

AC

(a) Initial values for

: (b) Setting values for noise variance. Left and right columns

corresponding to cases where srf=2 using PSF1 and srf=4 using PSF2, respectively. 4-4-Real-world frames To verify the effectiveness of our algorithm in practical applications, we also conducted some experiments with two real-world images, including Alpaca and Emily sequences [27]. These sequences have adequate details which can visually differentiate the results of our proposed

29

ACCEPTED MANUSCRIPT method against the five other methods compared. At the beginning, 30 frames of each sequence (16-45 from Alpaca and 26-55 from Emily) are selected as the LR with a specified size of 96 128. The well-known block-based registration algorithm is also used to estimate the relevant translations with sub-pixel accuracy [28].

CR IP T

Some reconstructed HR images and their relevant zoomed regional details are also provided in Figs. 15 through 22 in the case where srf=2 and in Figs. 23 through 30 in the case where srf=4, respectively. To exactly show the restored details, we crop two regions for every reconstructed HR image containing middle and finer details. With regard to Figs. 15 through



AN US

30, the following can be deduced:

In the case where srf=2, the details are clearly restored by our proposed method as far as PSF1 is concerned. To take an example, two characters 'i' and 'e' are noticeable in



M

Fig. 16.

The characters and the numbers in the zoomed regional details restored by

ED

AN+MBTV [12],RSATV [13] and our proposed method are very close to each other in the case where srf=2 when PSF2 and PSF3 are selected. As in the case where srf=4, when PSF1 is used, the reconstructed HR image restored

PT



CE

by RSATV [13] and our proposed method are very close to each other and better than that produced via other methods. The restored lines, the edges, characters, numbers and other middle details by

AC



AN+MBTV[12] ,RSATV[13] and our proposed method are clear with minimum dissimilarity as the two connect for the case where srf=4 and PSF2 is considered.

Altogether, the middle and the finer details of the HR images are better restored in respect of resolution by our proposed method when PSF1 and PSF2 are selected in cases where srf=2 and srf=4, respectively. (cf. Figs. 16, 22, 26, and 30). Further, these cases are compatible with 30

ACCEPTED MANUSCRIPT the simulation results acquired in the previous section. This means that the SR reconstruction

(b)

(c)

AN US

(a)

CR IP T

performance of our proposed method is far superior.

(d)

(e)

(f)

AC

CE

PT

ED

M

Fig.15, Experimental results of the real-world Alpaca images using PSF1. Reconstructed HR image by: (a) VTV [21], (b) KFB[23], (c) L1+BTV[11],(d) AN+MBTV[12], (e) RSATV[13], and (f) The proposed methods in the case where srf=2.

(a)

(b)

(c)

(d)

(e)

(f)

Fig.16, Two sets of detailed regions cropped from Fig.15. First row Alp-Region1, second row Alp-Region2.

31

ACCEPTED MANUSCRIPT

(a)

(c)

AN US

CR IP T

(b)

(d)

(e)

AC

CE

PT

ED

M

Fig.17, Experimental results of the real-world Alpaca images using PSF2. Reconstructed HR image by: (a) VTV[21], (b) L1+BTV[11], (c) AN+MBTV[12], (d) RSATV[13], and (e) The proposed methods in the case where srf=2.

(a)

(b)

(c)

(d)

(e)

Fig.18, Two sets of detailed regions cropped from Fig.17. First row Alp-Region1, second row Alp-Region2.

32

ACCEPTED MANUSCRIPT

(b)

(c)

CR IP T

(a)

(e)

(d)

AC

CE

PT

ED

M

AN US

Fig.19, Experimental results of the real-world Alpaca images using PSF3. Reconstructed HR image by: (a) VTV[21],(b) L1+BTV[11],(c) AN+MBTV[12], (d) RSATV[13], and (e) The proposed methods in the case where srf=2.

(a)

(b)

(c)

(d)

(e)

Fig.20, Two sets of detailed regions cropped from Fig.19. First row Alp-Region1, second row Alp-Region2.

33

ACCEPTED MANUSCRIPT

(b)

(d)

(e)

(c)

CR IP T

(a)

(f)

CE

PT

ED

M

AN US

Fig.21, Experimental results of the real-world Emily images using PSF1. Reconstructed HR image by: (a) VTV [21], (b) KFB[23], (c) L1+BTV[11](d) AN+MBTV[12], (e) RSATV[13], and (f) The proposed methods in the case where srf=2.

(a)

(b)

(d)

(c)

(e)

(f)

AC

Fig.22, Two sets of detailed regions cropped from Fig.21. First row Emil-Region1, second row Emil-Region2.

34

ACCEPTED MANUSCRIPT

(b)

(d)

(e)

(c)

CR IP T

(a)

(f)

AC

CE

PT

ED

M

AN US

Fig.23, Experimental results of the real-world Alpaca images using PSF1.Reconstructed HR image by: (a) VTV[21] , (b) KFB[23], (c) L1+BTV[11], (d) AN+MBTV[12], (e) RSATV[13], and(f) The proposed methods in the case where srf=4.

(a)

(b)

(d)

(c)

(e)

(f)

Fig.24, Two sets of detailed regions cropped from Fig.23. First row Alp-Region1, second row Alp-Region2.

35

ACCEPTED MANUSCRIPT

(a)

(c)

CR IP T

(b)

(d)

(e)

AC

CE

PT

ED

M

AN US

Fig.25, Experimental results of the real-world Alpaca images using PSF2. Reconstructed HR image by: (a) VTV[21], (b) L1+BTV[11],(c) AN+MBTV[12], (d) RSATV[13] , and (e) The proposed methods in the case where srf=4.

(a)

(b)

(c)

(d)

(e)

Fig.26, Two sets of detailed regions cropped from Fig.25. First row Alp-Region1, second row Alp-Region2.

36

ACCEPTED MANUSCRIPT

(a)

(c)

CR IP T

(b)

(d)

(e)

CE

PT

ED

M

AN US

Fig.27, Experimental results of the real-world Alpaca images using PSF3. Reconstructed HR image by: (a) VTV [21], (b) L1+BTV[11], (c) AN+MBTV[12], (d) RSATV[13], and (e) The proposed methods in the case where srf=4.

(a)

(b)

(c)

(d)

(e)

AC

Fig.28, Two sets of detailed regions cropped from Fig.27. First row Alp-Region1, second row Alp-Region2.

37

ACCEPTED MANUSCRIPT

(b)

(c)

CR IP T

(a)

(d)

(e)

AC

CE

PT

ED

M

AN US

Fig.29, Experimental results of the real-world Emily images using PSF2.Reconstructed HR image by: (a) VTV[21], (b) L1+BTV[11], (c) AN+MBTV[12], (d) RSATV[13], and (e) The proposed methods in the case where srf=4.

(a)

(b)

(c)

(d)

(e)

Fig.30, Two sets of detailed regions cropped from Fig.29. First row Emil-Region1, second row Emil-Region2.

4-5-Comparing the running time of different methods In order to fairly compare the computational complexity of our proposed method with that of other five SR methods, the measured running times for a real sequence Alpaca in two cases

38

ACCEPTED MANUSCRIPT where srf=2 and srf=4 are summarized in Table 8. It should be noted that the experiments are implemented under Matlab R2016 environment, exploiting Core-i7 2.2 GHz CPU. It is seen that: 

Our proposed method converges toward the best results in the reasonable running



CR IP T

time being lower than that of the KFB[23] method. Although the proposed method displays higher running time in comparison with the VTV[21], L1+BTV[11], and RSATV[13] methods, based on the previous results obtained and reported in Tables 3 through 7,our proposed method achieves higher

AN US

quality HR images.

It is noted that our proposed method can be accelerated by parallel implementation process

M

while the other methods are unfeasible.

ED

Table 8 Execution time (in seconds) of our proposed and other five SR methods

4.25

94.123

15.87

52.12

27.29

54.31

CE

srf=4

PT

Case VTV[21] KFB[23] L1+BTV[11] AN+MBTV[12] RSATV[13] Proposed method srf=2 0.63 50.40 3.746 13.74 6.85 28.11

AC

4-6- Scalable SR by partial spectrum rejection technique In some of the SR applications, the desired HR image size is quite fixed, that is, impossible to change. In other words, the size of the required HR image is predetermined by the resolution and the aspect ratio of the monitoring system. Employing the proposed SR algorithm, we can reconstruct the HR image of any size

,

in a very simple manner, while

other traditional SR methods are unfeasible with regard to the HR image size and scale. In

39

ACCEPTED MANUSCRIPT this respect, considering our proposed algorithm, it is sufficient to reject some of the end spectrum of ̂ ̂

as follows (cf. Appendix B): ̂

(15)

;

Fig.31 illuminates how the end of spectrum of the HR image implements the rejection

M

AN US

CR IP T

process before using inverse DFT.

ED

Fig.31, End of spectrum rejection to obtain HR image with an arbitrary size and scale.

In general, in order to obtain HR images with any size and scale, a well-known technique is

PT

to conduct one of the traditional resampling methods after SR procedure. In order to evaluate

CE

the proposed scalable SR method, two traditional resampling methods employed executed by imresize function in Matlab and an HR image the same size as that of the proposed method is

AC

obtained.Figs.32 and 33 present the reconstructed HR images of the two previous real world sequences of three different sizes and scales obtained via both partial spectrum rejection technique and resampling method which drawn upon nearest and bicubic interpolation. As is shown in the two figures, the images reconstructed by the scalable SR method exhibit lower noises because of rejecting high frequency components containing major noise elements. In addition, the time is saved through our proposed technique since no resampling process is resorted. 40

ACCEPTED MANUSCRIPT Size: 370 500 (real scale)

Size: 250 500

PT

ED

M

AN US

CR IP T

Size: 350 350

AC

CE

Fig. 32, Reconstructed images of Alpaca displaying different sizes and scales using partial spectrum rejection technique (first row), resampling within nearest interpolation method (second row) ,and resampling within bicubic interpolation(third row) .

41

ACCEPTED MANUSCRIPT Size: 370 500 (real scale)

Size: 250 500

ED

M

AN US

CR IP T

Size: 350 350

CE

PT

Fig.33, Reconstructed images of Emily displaying different sizes and scales using partial spectrum rejection technique (first row), resampling within nearest interpolation method (second row) , and resampling within bicubic interpolation(third row) .

AC

5- Conclusion

Applying the Kalman filter to the frequency domain, we have introduced a multi-frame super-resolution algorithm. Unlike previous studies, exploiting similar methods, we have considered a method which solves two major problems of the conventional Kalman filtering technique requiring neither approximations nor limitations on space-invariant PSF of any type, whatsoever. Further, parallel implementation enables us to speed up our proposed

42

ACCEPTED MANUSCRIPT method making it feasible to set PSF of any form and size, which is a great help in attaining higher quality HR images. The presented technique requires no numerical manual parameters or tuning. Experiments are conducted on both simulated and real data. Because the proposed method emphasizes restoring details which is the main task of all SR techniques, it is more effective to reconstruct images containing finer particulars. The obtained results are also a

CR IP T

confirmation of the fact that the algorithm offered here outperforms those developed by other approaches. Additionally, this algorithm promises better results as regards quantity and vision compared with those of similar methods. Further, a scalable SR algorithm is advanced through partial spectrum rejection technique, as a result of which an HR image of an arbitrary

AN US

size and scale is obtained.

Although, the SR method set forth in this research is applicable to a wide swath of real data, developing the latter method so that it incorporates such more complicated motion conditions

ED

M

as rotation and zoom is a part of our future research project.

PT

Appendix A

To simplify, we consider one dimensional signal

CE

Gaussian noise and ∑

, as a white

as its FT in the following relation:

(A1)

It is not difficult to show that

is a zero mean Gaussian noise. Now, let us calculate the

AC

,

relevant autocorrelation:

̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅ ∑

∑

̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅ ∑ ∑

̅̅̅̅̅̅̅̅̅̅̅̅̅̅̅

(

)

(A2) 43

ACCEPTED MANUSCRIPT

∑

(

∑

Where * is a conjugate,

)

(

∑

represents power of noise

function. Relation A2 shows that

)

and

stand for impulse

is a white Gaussian noise.

Appendix B

is assumed to be sampled at or above Nyquist rate.

)

,

(B1)

( Where

)

AN US

(

( ) which

CR IP T

( ) is a one dimensional continuous band limited signal with FT

Supposing

,

is the sampling period. Now, let us cut a part of spectrum of

as follows:

(B3)

M

,

(B2)

its corresponding signal is obtained by inverse DFT as is borne out by the

ED

Where

∑

(

)

(B4)

AC

CE

∑

PT

following:

References

[1] L. Yue, H. Shen, L. Li, Q. Yuan, H. Zhang, Image super-resolution: the techniques, application, and future, Signal Process. 128 (2016) 389-408.

[2] A. Rahimi, P. Moallem, K. Shahtalebi, M. Momeni, Preserving quality in minimum frame selection within multi-frame super-resolution, Digital Signal Process. 72 (2018) 19-43. [3] T. S. Huang, R. Y. Tsai, Multi-frame image restoration and registration, Advances in Computer Vision and Image Process. 1, JAI Press Inc., Greenwich, CT (1984) 317–339.

44

ACCEPTED MANUSCRIPT [4] A. M. Tekalp, M. K. Ozka, and M. I. Sezan, High-resolution image reconstruction from lowerresolution image sequences and space-varying image restoration, in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, San Francisco, California, 3(1992)169-172. [5] S. P. Kim, N. K. Bose, and H. M Valenzuela, Recursive reconstruction of high resolution image from noisy under sampled multiframes, IEEE Trans. on Speech and Signal Process. 38 (1990) 1013–027.

reconstruction algorithm, Opt. Eng. 24 (2003) 1408-1432.

CR IP T

[6] S. Rhee, M G. Kang, Discrete cosine transform based regularized high-resolution image

[7] N. Nguyen, P. Milanfar, G. Colub, A computationally efficient superresolution image reconstruction algorithm, IEEE Trans. Image Process. 10(2001) 573-583.

[8] M. Irani, S. Peleg, Improving resolution by image registration, CVGIP: Graph. Model, Image

AN US

Process. 53(1991) 231-239.

[9] A.M. Tekalp, M.K. Ozkan, M.I. Sezan, High-resolution image reconstruction from lowerresolution image sequences and space-varying image restoration, in: Proceeding of the IEEE International Conference on Acoustics, Speech, and Signal Processing. 1992. pp. 169-172. [10] H. Shen, I. ZHANG, B. Huang, et al., A MAP approach for joint motion estimation,

M

segmentation, and super-resolution IEEE Trans. Image Process. 16(2007) 479-490. [11] S. Farsiu, M.D. Robinson, M. Elad, P. Milanfar, Fast and robust multi-frame super resolution,

ED

IEEE Trans. Image Process. 13 (2004) 1327–1344. [12] X. Zeng, L. Yang, A robust multiframe super-resolution algorithm based on half-quadratic

PT

estimation with modified BTV regularization, Digital Signal Process. 23 (2013) 98-109. [13] Q. Yuan, L. Zhang, H. Shen, and P. Li, Regional spatially adaptive total variation super-

CE

resolution with spatial information filtering and clustering, IEEE Trans. Image process. 22 (2013) 2327-2342.

AC

[14] L. Yue, H. Shen, Q. Yuan, and L. Zhang, A locally adaptive L1-L2 norm for multi-frame superresolution of images with mixed noise and outliers, Signal process. 105 (2014) 156-174.

[15] X. Yang, T. Liu,D. Zhou, A multi-frame adaptive super-resolution method using double channel and regional pixel information, Optik 126 (2015) 5850-5858.

[16] H. Shen, L. Peng, L. Yue, and Q. Yuan, Adaptive norm selection for regularized image restoration and super-resolution, IEEE Trans. Cyber. 46 (2016) 1388-1399. [17] C. Miao and H. Y. Yu, A General Thresholding Solution for lp (0
45

ACCEPTED MANUSCRIPT [18] T. KÖhler, X. Huang, F. Schebesch, A. Aichert, A. Maier, and J. Hornegger, Robust Multiframe super-resolution employing iteratively Re-weighted minimization, IEEE Trans. Comp. Imaging, 2 (2016) 42-58. [19] B. D. O. Anderson, J. B. Moore, Optimal Filtering, Chap. 3, Prentice-Hall, Inc. Cliffs, New Jersey, 2005. [20] F. Dellaert, C. Tborpe, and S. Thrun, Jacobian image of super-resolved texture maps for model-

Vision, New Jersey,(1998)2-7.

CR IP T

based motion estimation and tracking, in Fourth IEEE Workshop on Application of Computer

[21] S. Farsiu, M. Elad, and P. Milanfar, Video-to-video dynamic superresolution for grayscale and color sequences, EURASIP J. Appl. Signal Process. (2006) 1-15.

[22] C. Newland, D. Gray, and D. Gibbins, Modified Kalman filtering for image super-resolution, in

AN US

Proc. Of the Conf. (IVCNZ06), New Zealand (2006) 79-84.

[23] Z. Wei, F. Tao, and W. Jun, Kalman filter-based method for image superresolution using a sequence of low-resolution images, J. Elec. Imaging, 23(1) (2014) 1-12. [24] V.V. Petrov, Limit Theorems of Probability Theory: Sequences of Independent Random Variables, Clarendon Press, 1995.

M

[25] Z. Wang, Bovik, A.C. Sheikh, E. P., Simoncelli, Image quality assessment: from error visibility to structural similarity, IEEE Trans. on Image Process. 13( 2004) 600-612.

ED

[26] L. Zhang, L. Zhang, X. Mou, and D. Zhang, FSIM: a feature similarity index for image qualtiy assessment, IEEE Trans. on Image Process. 20 (2011) 2378-2386. of

California,

PT

[27]University

Santa

Cruz,

Milanfar

Software

[Online]

Available:

http://users.soe.ucsc.edu/-milanfar/software/sr-datasets.html

CE

[28] S. Zhu , K. Ma, A new diamond search algorithm for fast block matching motion estimation,

AC

IEEE Trans. Image Process. 9 (2000) 287–290.

46

Using Kalman filter in the frequency domain for multi-frame scalable super resolution

Using Kalman filter in the frequency domain for multi-frame scalable super resolution

Recommend Documents