Open Access
1 October 2005 Image resolution upscaling in the wavelet domain using directional cycle spinning
A. Temizel, Theodore Vlachos
Author Affiliations +
Abstract
Cycle spinning, a technique mainly used for wavelet denoising, has also been shown to be successful toward image resolution upscaling in the wavelet domain. We propose a directional variant of the cycle spinning methodology. We obtain estimates of local edge orientation from a wavelet decomposition of the available low-resolution image and use this information to influence the choice of cycle spinning parameters that are employed for resolution upscaling. Our experimental results show that the proposed method outperforms competing methods for a wide range of images offering modest but consistent improvements both in objective as well as subjective terms. Lower computational complexity compared to the conventional cycle spinning is also demonstrated.

1.

Introduction

A common feature of wavelet domain image resolution upscaling algorithms is the assumption that the image to be enhanced is the low-pass filtered subband of a wavelet-transformed high-resolution (HR) image. Then the detail wavelet coefficients in subbands containing high-pass spatial frequency information are estimated and the HR image is obtained by inverse wavelet transform. In Ref. 1, detail coefficients are estimated using the evolution of wavelet transform extrema in coarser subbands. A similar but less computationally expensive approach is advocated in Ref. 2. In Ref. 3 a technique is proposed that takes into account the hidden Markov tree (HMT) approach. The HMT-based methods have been further developed not to require any training data set.4

The decimated wavelet transform is not shift-invariant and, as a result, inaccurate representation of wavelet coefficients—for example, due to quantization or non-exact estimation—introduces cyclostationarity into the image, which manifests itself as ringing in the neighborhood of discontinuities. Cycle-spinning (CS), introduced in Ref. 5, has been shown to be an effective method against ringing when used for denoising purposes. In Refs. 6, 7, it was shown that CS, applied as a postprocessing operation, yields improvements in the framework of JPEG and JPEG2000. Recently, it has been shown that it is also a powerful alternative to competing methods, when applied to image resolution upscaling problems in the wavelet domain.8

Since its introduction, the basic CS algorithm has hardly evolved apart from an iterative variant in Ref. 9. This involves iterated applications of a processing chain consisting of forward translation, wavelet denoising, and backward translation. Although this scheme was shown to improve upon the basic CS in wavelet denoising problems, it doesn’t lend itself to wavelet domain resolution upscaling. In the resolution upscaling context, the upscaled image, which is the output of the previous iteration, would need to be downsampled before being fed as input into the next, nullifying the effects of the preceding iteration.

We introduce the concept of directional cycle spinning. In particular we take into account local edge orientation information derived from a wavelet decomposition of the available low-resolution (LR) image to influence key parameters of the CS algorithm.

2.

Image Resolution Upscaling Using Cycle Spinning

Image resolution upscaling involving CS typically consists of two steps, namely, wavelet-domain zero padding (WZP) followed by the application of a CS operator.

In the first step, an initial approximation to the unknown HR image is generated using WZP: Using the available LR image x of size m×n , the unknown HR image y is reconstructed by using zero padding of high-frequency (HF) subbands followed by inverse wavelet transform:

Eq. 1

ŷ0=W1[x0m,n0m,n0m,n]
where 0m,n is an all-zero matrix of size m×n and W1 is the inverse discrete wavelet transform. This implies a simplified image formation process whereby the point spread function is associated with the low-pass wavelet filter kernel used for the above transformation. The HR approximation obtained as above commonly exhibits artifacts such as smoothing and ringing. The ringing emerges at the vicinity of discontinuities as alternating undershoots and overshoots of the intensity level. CS has been shown to be an effective means to reduce these artifacts by averaging out the translation dependence.5 For the image resolution upscaling purposes, a variant of the CS methodology is applied as a second step as follows:8 First a number of LR images x̂i,j are generated from ŷ0 by (1) spatial shifting, (2) wavelet transformation, and (3) discarding the HF coefficients: x̂i,j=DWSi,jŷ0 where D represents discarding of HF coefficients, W denotes wavelet transform, and Si,j is a shift operator applying horizontal and vertical shifts of (i,j) in the range i,j{k,k+1,,k1,k} . Then, Eq. 1 is applied to all x̂i,j yielding N ŷi,j images, where N=(2k+1)(2k+1) . Finally, these intermediate HR images are re-aligned and averaged to give the final HR reconstructed image: ŷ=(1N)i=kkj=kkSi,j1ŷi,j where Si,j1 is the inverse of the shifting operator Si,j .

3.

Directional Cycle Spinning

In conventional CS as described above, all possible shifts within a range that defines a local neighborhood are used. Nevertheless, a closer inspection of artifacts in a typical HR image generated using WZP reveals that ringing artifacts not only occur in the vicinity of strong edges but, more importantly, they are strongly correlated with the orientation of those edges. In particular, for an edge of a given orientation, ringing is more pronounced in the normal direction. This is due to the fact that ringing artifacts materialize as a result of undershoots and overshoots of sharp intensity level changes. This observation suggests that CS should be predominantly applied across edges (i.e., vertically in the vicinity of a horizontal edge and vice versa) avoiding other orientations and thus preventing unnecessary smoothing of image detail.

To test this conjecture, we have used the synthetic image shown in Fig. 1 . We consider this synthetic image as the ground truth HR image we seek to reconstruct. Then we generate an LR version of it by low-pass filtering followed by downsampling. Finally we obtain approximations to the original HR image using WZP followed by CS applied at different orientations: nondirectional (NCS), horizontal (HCS), vertical (VCS), or diagonal (DCS). Figure 2 indicates the patterns used for CS (shift locations used are shown as shaded elements) with the no shift location occupying (0,0). For example HR reconstructions using HCS and VCS are obtained respectively using

Eq. 2

Ch=i=kkŷi,02k+1andCν=j=kkŷ0,j2k+1
where ŷi,j are the intermediate HR images corresponding to a shift of (i,j) as explained in Sec. 2.

Fig. 1

PSNR (dB) values for the different quadrants of the synthetic image using different CS directions.

040501_1_1.jpg

Fig. 2

Shifts used for different directional CS types.

040501_1_2.jpg

PSNR values of the HR reconstruction for each quadrant and for each CS orientation are shown in Fig. 1 while the best value is shown in boldface. Our results confirm that the best reconstruction always occurs by CS in the direction of the normal to dominant edges while in the absence of such edges nondirectional CS is preferable. The gain is more pronounced in HCS and VCS, while DCS brings a more modest improvement.

4.

Algorithm Description

Our approach uses a local estimate of image directional activity to influence the orientation in which CS is applied. This estimate is obtained by using a wavelet decomposition of the available LR image x .

Let LH0 and HL0 denote respectively the HF horizontal and vertical detail subbands of x and whr,s and wνr,s denote the wavelet coefficients at position (r,s) of LH0 and HL0 respectively. We partition x into nonoverlapping blocks and estimate horizontal and vertical activity measures for the p ’th block as Ahp=r,swhr,s and Aνp=r,swνr,s respectively; summation is over all wavelet coefficients contained in block p . Finally, block p in the HR image is reconstructed by using these activity values as weights to the corresponding reconstructions generated using Eq. 2:

Eq. 3

ŷp=AhpCνp+AνpChpAhp+Aνp.
It has been found that the extra computational complexity brought by application of diagonal CS hardly justifies the relatively insignificant gain it provides and hence excluded from the algorithm.

5.

Experimental Results

The proposed method has been tested on a number of well-known test images. An HR version of these images (512×512) was used as ground truth for performance evaluation purposes. The wavelet transform was implemented using the well-known Daubechies 97 filters, the maximum shift k was set to 5, and block sizes were 8×8 . The proposed method has been compared with other wavelet based methods as well as an edge directed10 and a fractal-based11 method. PSNR values are tabulated in Table 1 for both 2× and 4× enlargement factors. Figures 3 and 4 respectively show the enlarged images and amplified residual images for subjective comparisons with bilinear interpolation and WZP. Figure 5 shows a cross section of an edge (taken from Fig. 1) and reconstructed versions as well as the absolute error of these reconstructions.

Fig. 3

Extract from original Lena (a) and images for 4× reconstructions: bilinear interpolation (b), WZP (c), and directional CS (d).

040501_1_3.jpg

Fig. 4

Extract from original Lena (a) and amplified error images for 4× reconstructions: bilinear interpolation (b), WZP (c), and directional CS (d).

040501_1_4.jpg

Fig. 5

Cross section of an edge and its reconstructed versions using various methods (a). Absolute error of the reconstructed edges (b).

040501_1_5.jpg

Table 1

PSNR (dB) values for 2× (256×256to512×512) and 4× (128×128to512×512) enlargement.

Image/MethodLenaElaineBaboonPeppers
Bilinear30.1324.0630.6025.3822.8520.4330.0124.37
Bicubic31.3426.7631.1728.9322.9821.0230.2826.86
Genuine Fractals1133.6528.0132.7129.5823.7821.1033.7928.55
NEDI1034.1028.8132.8929.9723.8721.1833.5428.52
WZP34.4528.8433.2630.4424.2221.4733.9429.57
HMM334.5228.8633.3130.4624.2421.4734.0429.58
Regularity Preserving234.4828.8133.2930.4224.2421.4734.0329.57
WZP and CS834.9329.2733.5630.7824.2821.5434.3229.87
WZP and Directional CS35.0929.5533.7330.9824.3721.6734.5030.14

It can be seen that directional CS adapts better to edge orientation and avoids jagged edge (staircase) artifacts. Overall our results show that the directional CS outperforms the competing methods and offers modest but consistent improvements over baseline CS. Additionally, the proposed scheme offers a reduction in computational complexity relative to conventional CS because only shifts in the normal to salient edges are required, involving 2k WZP calculations compared with k2 in the conventional method.

6.

Conclusion

A directional variant of the CS methodology was considered for image resolution upscaling in the wavelet domain. The proposed method estimates local edge orientation from a wavelet decomposition of the available LR image and uses this information to influence CS parameters. Our experimental results confirm that the proposed method outperforms competing methods for a wide range of images both in objective and subjective terms.

references

1. 

S. G. Chang, Z. Cvetkovic, and M. Vetterli, “Resolution enhancement of images using wavelet transform extrema extrapolation,” 2379 –2382 (1995). Google Scholar

2. 

W. K. Carey, D. B. Chuang, and S. S. Hemami, “Regularity-preserving image interpolation,” IEEE Trans. Image Process., 8 (9), 1295 –1297 (1999). Google Scholar

3. 

K. Kinebuchi, D. D. Muresan, and T. W. Parks, “Image interpolation using wavelet-based hidden Markov trees,” 7 –11 (2001). Google Scholar

4. 

D. H. Woo, I. K. Eom, and Y. S. Kim, “Image interpolation based on inter-scale dependency in wavelet domain,” 1687 –1690 (2004). Google Scholar

5. 

R. R. Coifman and D. L. Donoho, “Translation-invariant denoising,” Lecture Notes in Statistics: Wavelets and Statistics, 125 –150 Springer Verlag, New York (1995). Google Scholar

6. 

A. Nosratinia, “Denoising of JPEG images by re-application of JPEG,” J. VLSI Signal Proc. Syst. Signal, Image, Video Technol., 27 (1), 69 –79 (2001). https://doi.org/10.1023/A:1008167430544 Google Scholar

7. 

A. Nosratinia, “Post-processing of JPEG-2000 images to remove compression artifacts,” IEEE Signal Process. Lett., 10 (10), 296 –299 (2003). https://doi.org/10.1109/LSP.2003.817179 Google Scholar

8. 

A. Temizel and T. Vlachos, “Wavelet domain image resolution enhancement using cycle-spinning,” Electron. Lett., 41 (3), 119 –121 (2005). https://doi.org/10.1049/el:20057150 Google Scholar

9. 

A. K. Fletcher, K. Ramchandran, and V. K. Goyal, “Wavelet denoising by recursive cycle spinning,” 873 –876 (2002). Google Scholar

10. 

X. Li and M. T. Orchard, “New edge-directed interpolation,” IEEE Trans. Image Process., 10 (10), 1521 –1527 (2001). https://doi.org/10.1109/83.951537 Google Scholar

11. 

LizardTech, “Genuine Fractals Print Pro Trial Version, ver. 3.5.1 build 787, 2004,” (2004) http://www.genuinefractals.com/products/gf/ Google Scholar
©(2005) Society of Photo-Optical Instrumentation Engineers (SPIE)
A. Temizel and Theodore Vlachos "Image resolution upscaling in the wavelet domain using directional cycle spinning," Journal of Electronic Imaging 14(4), 040501 (1 October 2005). https://doi.org/10.1117/1.2061247
Published: 1 October 2005
Lens.org Logo
CITATIONS
Cited by 58 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Wavelets

Image resolution

Lawrencium

Wavelet transforms

Denoising

Image enhancement

Linear filtering

Back to Top