Shadow removal using sparse representation over local dictionaries

ARTICLE IN PRESS Engineering Science and Technology, an International Journal ■■ (2016) ■■–■■ Contents lists available at ScienceDirect Engineering ...

Download PDF

3MB Sizes 8 Downloads 151 Views

Report

Full Text

ARTICLE IN PRESS Engineering Science and Technology, an International Journal ■■ (2016) ■■–■■

Contents lists available at ScienceDirect

Engineering Science and Technology, an International Journal j o u r n a l h o m e p a g e : h t t p : / / w w w. e l s e v i e r. c o m / l o c a t e / j e s t c h

Press: Karabuk University, Press Unit ISSN (Printed) : 1302-0056 ISSN (Online) : 2215-0986 ISSN (E-Mail) : 1308-2043

H O S T E D BY

Available online at www.sciencedirect.com

ScienceDirect

Full Length Article

Shadow removal using sparse representation over local dictionaries Remya K. Sasi a,*, V.K. Govindan b a b

NIT Calicut, Kerala, India IIIT, Kerala, India

A R T I C L E

I N F O

Article history: Received 24 September 2015 Received in revised form 29 November 2015 Accepted 1 January 2016 Available online Keywords: Shadow Dictionary learning Shadow removal Sparse coding MCA

A B S T R A C T

The presence of shadow in an image is a major problem associated with various visual processing applications such as object recognition, traﬃc surveillance and segmentation. In this paper, we introduce a method to remove the shadow from a real image using the morphological diversities of shadows and sparse representation. The proposed approach ﬁrst generates an invariant image and further processing is applied to the invariant image. Here, shadow removal is formulated as a decomposition problem that uses separate local dictionaries for shadow and nonshadow parts, without using single global or ﬁxed generic dictionary. These local dictionaries are constructed from the patches extracted from the residual of the image obtained after invariant image formation. Finally, non-iterative Morphological Component Analysis-based image decomposition using local dictionaries is performed to add the geometric component to the non-shadow part of the image so as to obtain shadow free version of the input image. The proposed approach of shadow removal works well for indoor and outdoor images, and the performance has been compared with previous methods and found to be better in terms of RMSE. Copyright © 2016, The Authors. Production and hosting by Elsevier B.V. on behalf of Karabuk University. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

1. Introduction Shadow detection and removal process is widely used as a preprocessing operation in various image processing applications for the removal of undesirable noise and objects. For example, applications such as video surveillance [1], scene interpretation [2] and object recognition [3] require shadow removal as an initial step to eliminate the undesirable effect of noise on the performance of such systems. Once detected, shadows in images are used for applications such as detection of object shape and size in aerial images, detection of movement of objects in video surveillance system, and ﬁnding the number of light sources and illumination conditions in natural images. In digital photography, removal of shadows can help to improve the visual quality of photographs. Ignoring the existence of shadows in images can, in general, degrade the output quality. Shadow detection and removal is an active research area for the last two decades. Several algorithms have been proposed based on learning [4], color models [5], region [6,7] and invariant image models [8,9] for shadow detection and removal in image as well as in videos. A major work by Lalonde et al. [10] mainly focuses on the shadows cast by objects onto the ground plane. Other notable works are based on assumptions of Lambertian reﬂectance and

* Corresponding author. Tel.: +919496812035. E-mail address: [email protected] (R.K. Sasi). Peer review under responsibility of Karabuk University.

Planckian lighting [11]. Interested readers can see a review article by Sasi and Govindan [12] to get a more comprehensive report of the methodologies reported in the ﬁeld of shadow detection and removal during the last decade. Though shadow removal involving multiple images [13] and interactive methodologies [14–16] provides superior performance, fully automatic approaches available for single image shadow removal stand behind them in terms of performance. This is because of the fact that indoor and outdoor shadows are much affected by the direction, intensity of light source, as well as geometry and texture of the objects where shadow is cast. The review carried out reveals that the research work reported in shadow detection and removal works satisfactorily in the case of user interaction [14–16] and for multiple images [13]. The automatic approaches available for single shadow removal are more complex to implement and set much restriction in the class of images under consideration [10,11]. Reintegration methods and local area processing are time intensive. Also, many methods cannot distinguish between near dark objects and shadows. In the case of smallpatch regions, image in-painting method is more suitable, whereas in-painting large patch holes involve huge computation. Thus, the topic of single image shadow detection and removal requires a good amount of further research to develop an approach that provides satisfactory performances. This paper proposes a method to remove shadow from a real image using sparse representation and a variation of MCA. Sparsity is a powerful way to approximate signals and images by using a sparse

http://dx.doi.org/10.1016/j.jestch.2016.01.001 2215-0986/Copyright © 2016, The Authors. Production and hosting by Elsevier B.V. on behalf of Karabuk University. This is an open access article under the CC BY-NCND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

Please cite this article in press as: Remya K. Sasi, V.K. Govindan, Shadow removal using sparse representation over local dictionaries, Engineering Science and Technology, an International Journal (2016), doi: 10.1016/j.jestch.2016.01.001

ARTICLE IN PRESS R.K. Sasi, V.K. Govindan/Engineering Science and Technology, an International Journal ■■ (2016) ■■–■■

2

linear combination of atoms from an over-complete dictionary. Sparse representation is being used in signal and image processing applications such as de-noising [17], super-resolution [18], in-painting [19], deblurring [20], segmentation [21], and compression [22], demonstrating that sparse models are well suited to natural images as well. Starck et al. developed the idea of MCA in a series of papers and it was used in separating the texture from the geometric component [23–25]. MCA algorithm works by decomposing the image into edges and textures, using the morphological differences in these features [23,24]. Each of the shape features is related to a ﬁxed dictionary of atoms such as wavelets or Discrete Cosine Transforms. Typical MCA is an iterative thresholding scheme in which the threshold linearly decreases to zero. Another similar work in the area is “Learning the Morphological Diversity” by Peyré et al. [26]. They introduced an adaptive MCA scheme by learning the morphologies of image layers. A combination of adaptive local dictionaries and ﬁxed generic dictionaries using wavelets and curvelets is used for decomposition. The main deﬁciency of these models is the existence of similar atoms corresponding to cartoon and texture dictionaries that produce coherence. So, to get the expected solution, proper manual initialization is essential. Apart from sparse representation, we are also using invariant image formation as a base step in the proposed shadow removal method. Many of the works in the area of shadow removal using invariant image formation were authored by Finlayson and his students [8,9,27–31]. In general, his methods are based on forming an invariant image, in which shadows do not appear followed by reconstructing the required missing components using reintegration. Invariant image formation results in the loss of photo quality of image. To bring back the ﬁne details lost in the invariant image formation, reintegration is performed using Poisson equation by averaging over retinex paths [8] or Hamiltonian paths [27]. In most of the methods missing information after shadow removal is interpolated using image inpainting methods. Finlayson also limits his work to images that follow Lambertian model where Planckian illumination lights the scenes. However, real scenes need not satisfy Lambertian assumption. The remaining part of this research report is put forth as in the following: The basics of sparse coding, dictionary learning, MCA and invariant image formation required for better readability of the paper is presented in Section 2. The proposed method of shadow removal using sparse representation is discussed in Section 3. The dataset used, results obtained and further discussions of the proposed work are given in Section 4. The paper is concluded in Section 5 highlighting the approach used and performance gain achieved.

Solution to the above problem is NP hard; however, many convex [32], non-convex [33] optimization and greedy approximation algorithms [34,35] exist in literature to deal with problems having the above formulation. Since norm 2 minimization is equivalent to norm 1, L1 regularized LS also gives solution for the above problem [32,36,37]. 2.2. Dictionary learning In dictionary learning, the algorithm is given samples of the form y = Aα, where α ∈ℜm is an unknown random sparse vector and A is an unknown dictionary matrix in ℜn∗m . The goal is to learn A and α from given y such that 1. A should be over-complete ie m > n 2. Atoms in A are linearly dependent 3. Representation error, E = y − Aα, is minimized Dictionary learning can be formulated as given in (3). For ﬁxed dictionary solve system of equations subject to α is sparse. Then, for ﬁxed α, update A.

arg min y − Aα α ,A

2 2

s.t .

α

0

<= k

(3)

where k denotes sparsity. Dictionaries can be ﬁxed, global or local. Global dictionaries are built from clean patches of selected images of a database. Earlier, in the sparse coding area, focus was mostly given to ﬁxed overcomplete dictionaries, such as wavelets and discrete cosine transform [38]. These approaches are called generic since the dictionary is predeﬁned. Local dictionaries are learned online and hence more adapted to the input image. Different methodologies exist in dictionary learning literature, starting from ﬁxed dictionaries to online dictionaries [39]. Fixed dictionaries ﬁnd sparse approximations of the set of training signals for ﬁxed dictionary, whereas optimized dictionaries keep sparse signals ﬁxed and build optimized dictionaries [40,41]. MOD, K-SVD [42], and online dictionary learning [39,43] are the popular algorithms in the area. 2.3. MCA Morphological Component Analysis is used for the separation of the components of an image having different morphologies. MCA and Basis Pursuit are based on sparsity, but MCA is much faster and is capable of handling large data sets. Consider an image y having ‘s’ morphological components, such that S

y = ∑ yk , where yk denotes the kth geometric or textural compo-

2. Preliminaries

k =1

This section brieﬂy reviews the theory behind sparse coding (SC), dictionary learning, morphological component analysis (MCA) and intrinsic image formation to better understand the proposed shadow removal methodology using MCA.

nent of y. To decompose the image y into { yk }k=1 , the MCA algorithm ﬁnds the sparsest solution over the dictionaries Ak such that S

s

{α1,.α s } k =1

2.1. Sparse coding Using sparse coding, an image y can be expressed as a set of few elementary signals taken from an over-complete dictionary A, subject to α should be sparse.

min α α

0

subject to

y = Aα

(1)

Considering noise and sparsity constraint, we can add a regularization parameter λ and reformulate (1) as 2

min y − Aα 2 + λ α α

0

(2)

s

2

k =1

2

{α1, .. α S } = arg min ∑ α k 1 + λ y − ∑ Akα k

(4)

where αk denotes the kth sparse solution MCA algorithm uses ﬁxed generic dictionaries such as wavelets and curvelets in representing geometric component and DCT for textural components. A major step in this algorithm is the selection of dictionaries that are mutually incoherent. Further details about the MCA algorithm can be found in Reference [24]. 2.4. Invariant/Intrinsic image formation An invariant image is invariant to illumination, color and intensity. Illumination invariant image is invariant to illumination. Shadow is caused by illumination and hence illumination invariant image

Please cite this article in press as: Remya K. Sasi, V.K. Govindan, Shadow removal using sparse representation over local dictionaries, Engineering Science and Technology, an International Journal (2016), doi: 10.1016/j.jestch.2016.01.001

ARTICLE IN PRESS R.K. Sasi, V.K. Govindan/Engineering Science and Technology, an International Journal ■■ (2016) ■■–■■

is free from shadows. However, invariant image formation degrades photo quality of image. The proposed method of shadow removal uses invariant image formation proposed by Finlayson et al. [28]. Finlayson et al. say that the correct angle of projection is one with minimum entropy in the resulting invariant image. The result is a gray scale image that is independent of shadows. 3. Proposed work In the proposed approach, shadow removal is formulated as a variation of image decomposition problem using MCA, which uses separate local dictionaries for shadow and nonshadow parts. The input image is initially used to generate an illumination invariant image [28] in which shadows are absent. The formation of invariant image also results in the loss of photo quality of image; hence, texture and edge information is lost and we need to bring back these ﬁne details of the image to get shadow-free resultant image. Invariant image is then subtracted from input image to get the residual image that consists of shadow and geometric information of the input image. Sparse coding using orthogonal Matching Pursuit is applied using locally learned dictionaries. Finally, missing geometric components are reintegrated to the invariant image to get the resultant shadow-free image. The major steps during the shadow removal process is given in Fig. 1(a)–(f). Major differences between the proposed approach and MCA are 1. MCA algorithms are iterative, whereas the proposed approach is non-iterative. 2. MCA algorithm for image decomposition directly applies to the input image, whereas in the proposed approach MCA is applied to the resultant image obtained after subtraction of the input image from the invariant image. 3. Instead of using a predeﬁned generic dictionary based on curvelets, wavelets or DCT, we construct separate local dictionaries for shadow part and geometric part as shadow falls on diverse texture of an image.

3

3.1. Problem formulation Input shadow image I is initially used to generate an illumination invariant image, IN, in which shadows and relevant geometric information are absent. Difference between input image I and illumination invariant image IN generates a residual image IS where the shadows and the other geometric information are retained. Thus, I is now split into two images; the shadow image and the nonshadow image, I = IN + IS. Dictionary learning generates a dictionary AIS using the training patches extracted from IS. AIS is further divided into two sub-dictionaries, which represent the shadow and the geometric components of IS, respectively. The problem of shadow removal using the proposed approach is formulated as given in (5). Sparse coding is performed using Mallat and Zhang’s Orthogonal matching pursuit [34]. k k min yIS − AIS α IS

α Ik ε Rm

2 2

s.t .

α IkS

0

≤L

(5)

S

where yIkS ε Rn denotes the kth patch extracted from IS. α IkS ε Rm are the sparse coeﬃcients of yIkS with respect to AIS ε Rn×m and L denotes the maximum number of nonzero coeﬃcients of α IkS . 3.2. Dictionary learning The proposed approach of shadow removal uses separate local dictionaries for shadow and nonshadow parts, instead of using single global dictionary. Even though shadows are affected globally, we perform decomposition and reconstruction locally, using local dictionary. Hence, using local dictionaries can contribute more results than using global dictionary. The proposed work mainly focuses on the use of a local dictionary or an adaptive strategy that builds the dictionary from the exemplar patches extracted from the shadow and the nonshadow regions of residual image (IS). The reasons for not selecting a single global dictionary are 1. Shadow regions and geometric components are highly mixed in some parts of the image.

(a)

(b)

(c)

(d)

(e)

(f)

Fig. 1. (a) Input image I; (b) shadow free invariant image IN of input I; (c) IS = I − IN; (d) geometric component; (e) output of the proposed approach; and (f) ground truth.

Please cite this article in press as: Remya K. Sasi, V.K. Govindan, Shadow removal using sparse representation over local dictionaries, Engineering Science and Technology, an International Journal (2016), doi: 10.1016/j.jestch.2016.01.001

ARTICLE IN PRESS R.K. Sasi, V.K. Govindan/Engineering Science and Technology, an International Journal ■■ (2016) ■■–■■

4

Fig. 2. (a) Input image I, (b) invariant image; (c) IS = IN − I; (d) initial dictionary learned from Is (e) Local Shadow dictionary; (f) Local Geometric dictionary.

2. Shadow regions often exhibit different characteristics in various regions of an image 3. Local exemplar based dictionary learning of shadow regions better represent shadow than a global dictionary does. Initial dictionary AIS is built from a set of overlapping patches from an image having shadow part IS. Dictionary learning for shadow detection problem can be formulated as follows

min

AIk ε Rn×m,αε Rm S

⎛1 k k ⎜⎝ yIS − AIS α IS 2

2 2

⎞ + λ α IkS 1 ⎟ ⎠

(6)

where αk denotes the sparse representation coeﬃcients of yk based on AIS , and λ represents regularization parameter. To solve (6), we used an online dictionary learning algorithm proposed by Mairal et al. [43]. The atoms constituting AIS are further divided into two classes representing the geometric ( AIS _G ) and the shadow components ( AIS _S ). Figure 2(a)–(f) illustrates the steps involved in partition. Dictionary partition is performed by computing texton histogram followed by k-means clustering. The sum of the norm of each atom’s texton histogram in a cluster is computed and the one giving the minimum norm belongs to a shadow cluster and the other cluster is a geometric component. This is based on the observation that norm of shadow cluster when ploted always gives less area under the curve, whereas geometric cluster gives higher value for the same. Thus

AIS = [ AIS _G

AIS _S ]

(7)

The MCA algorithms distinguish between the morphological diversities of geometric, AIS _G , and shadow, AIS _S , sub-dictionaries by using mutual incoherence between them. The mutual coherence μ ( AIS _G , AIS _S ) of AIS _G and AIS _S is deﬁned as

μ ( AIS _G , AIS _S ) =

max

dGiε AI S _G,dSj ε AI S _S

dGi , dSj

(8)

where dGi , dSj stand for the ith and jth atoms in AIS _G and AIS _S , respectively and dSi , dGj denotes the inner product of dSi , dGj . When each atom is normalized to have a unit l2 norm, the range of μ(A1, A2) is [0, 1]. The mutual incoherence of dGi , dSj is 1 − μ ( A1, A2 ). A smaller value of mutual coherence indicates that there are much diversities in the two sub dictionaries and decomposition will be better. 3.3. Reconstruction of shadow free image OMP algorithm is applied to each yIkS extracted from IS via minimization of (5) based on the two dictionaries AIS _S and AIS _G to ﬁnd its sparse coeﬃcients α IkS . To recover geometric component I SG of IS we use reconstructed patch yIkS as follows. 1. Coeﬃcients corresponding to AIS _G in α IkS is set to zeros to obtain α IkS S . Similarly, the corresponding coeﬃcients of AIS _S in α IkS are set to zero to obtain α IkSG . 2. Each patch yIkS can be used to recover I SG by averaging the pixel values in overlapping regions. yIkS can be re-expressed as geometric and shadow components as follows.

yIkSG = AIS _G × α IkSG

or

yIkS S = AIS _S × α IkS S

Finally, the shadow-removed image can be obtained via I shadow_free = I N + I SG . Instead of iteratively performing sparse coding, here, MCA based image decomposition is performed only one time for each patch yIkS with respect to AIS = [ AIS _G AIS _S ] The entire algorithm is summarized in Algorithm 1: Noniterative MCA based shadow removal. 4. Experimental results and discussion This section gives detailed description about the implementation details, the dataset used, evaluation metrics and performance of the proposed shadow removal methodology.

Please cite this article in press as: Remya K. Sasi, V.K. Govindan, Shadow removal using sparse representation over local dictionaries, Engineering Science and Technology, an International Journal (2016), doi: 10.1016/j.jestch.2016.01.001

ARTICLE IN PRESS R.K. Sasi, V.K. Govindan/Engineering Science and Technology, an International Journal ■■ (2016) ■■–■■

5

4.1. UIUC shadow datasets

4.3. Experiment

Guo et al. [6] provide datasets for shadow detection as well as removal which are available in the University of Illinois at UrbanaChampaign database (UIUC).1 This dataset consists of 108 indoor as well as outdoor image pairs, photographed under a variety of illumination conditions. As this dataset also provides shadow-free image and ground truth of shadow region, it is useful for the evaluation of detection and removal.

Experimental evaluation was performed on both indoor and outdoor images from UIUC database [6]. The images used in this paper include human subjects and natural objects in various environments comprising different background materials and textured surfaces. Figure 3 gives the sample result of few images using the proposed method. The procedure and implementation details of the proposed method are described as follows. The implementation of the intrinsic image is adopted from Finlayson et al. [28]. Successful removal of shadow depends to a great extent on this ﬁrst phase. This method is simple and works by ﬁnding the invariant direction followed by generating a gray scale image. Finally, an L1, a chromatic intrinsic image that is free of shadows, is formed. The method does not need any sort of camera calibration or prior knowledge about the image. Invariant image results in the loss of ﬁne details, namely edges and geometric features. The proposed method tries to bring back these ﬁne details into this invariant image to get the ﬁnal shadow-free image. A plot given in Fig. 4 shows the difference in RMSE for invariant image and resultant image using proposed approach for the entire set of 108 images from the UIUC dataset. It can be observed that the RMSE of the proposed approach always remains better than that of the invariant image. In the dictionary learning step, we used online dictionary learning proposed by Mairal et al. [43] to solve (6) with the suggested regularization parameter λ set to 0.15. Initial dictionary AIS is built from a set of overlapping patches from residual image IS (see Fig. 2(c) and (d)). Size of dictionary is set as 1024 and the dictionary learning iteration is ﬁxed as 100. For dictionary learning, residual image of size 200 × 200 is used, from which patches of size 18 × 18 are extracted. The criteria for selecting patch size is further explained in this section. K-means clustering is used to classify initial dictionary into shadow and geometric dictionary as the proposed approach uses separate local dictionaries for shadow and geometric parts. Figure 2(e) and (f) shows a sample of shadow and geometric dictionaries. Sparse coding was performed using Mallat and Zhang’s Orthogonal Matching pursuit [34]. We measure the diversities of two sub dictionaries, shadow and geometric ( AIS _G and AIS _S ), using mutual coherence μ deﬁned in (8). Smaller value of mutual coherence leads to much diverse subdictionaries and improves the decomposition based on the two subdictionaries. As the dictionary atom size increases, mutual coherence also increases. Size of dictionary atoms inﬂuences the mutual coherence of the sub-dictionaries. We tried to plot the dictionary atom size versus the mutual coherence of local, global and generic dictionaries in Fig. 5. For generic dictionaries such as Haar wavelet, DCT and Gaussian, mutual coherence decreases as atom size increases, whereas for local and global dictionaries, mutual coherence increases along with atom size. Also, μ of local dictionaries is less compared to that of global dictionaries. From Fig. 5 its clear that as atom size increases, mutual coherence of local and global dictionaries also increases, whereas decreasing atom size affects the computation time. So, we try to keep atom size as small as possible so that it always maintains a reasonable computation time during the online dictionary learning process. Hence, we ﬁxed the patch size as 18 × 18, which in turn is equal to atom size 324. Even though μ of ﬁxed generic dictionaries is less, the resultant shadow-free image using local dictionaries has less RMSE than the one using generic dictionaries. The major difference between the proposed method and MCA was explained in Section 3. MCA algorithms are iterative, whereas the proposed MCA approach is non-iterative. Even if the algorithm iterates for a few number of times, the result somewhat remains the same and there is no notable difference in the RMSE of the resultant image. This is clear if we observe the plot given in

4.2. Evaluation metrics Evaluation of the proposed method is performed using RMSE (Root mean squared error). Most of the shadow removal works reported in still images have not performed quantitative evaluation of the result. Guo et al. [6], Miyazaki et al. [14] Gong and Cosker [15], and Gryka et al. [16] are the major works that provide quantitative evaluation. Other notable works provide only visual comparison.

1

http://aqua.cs.uiuc.edu/site/projects/shadow.html.

Please cite this article in press as: Remya K. Sasi, V.K. Govindan, Shadow removal using sparse representation over local dictionaries, Engineering Science and Technology, an International Journal (2016), doi: 10.1016/j.jestch.2016.01.001

ARTICLE IN PRESS 6

R.K. Sasi, V.K. Govindan/Engineering Science and Technology, an International Journal ■■ (2016) ■■–■■

Fig. 3. Results of the proposed shadow removal process: Left – input image I; Middle – output of the proposed approach; Right – Ground truth.

Fig. 6, where RMSE of the proposed approach for image in Fig. 2 over 100 iteration is given. The intention of Fig. 6 is just to clarify that RMSE value never changes over iteration. Another difference from MCA is that the proposed approach uses local sub-dictionaries, whereas the MCA approach uses ﬁxed generic dictionary. Plot given in Fig. 6 also gives the difference in performance using RMSE value of the resultant image using ﬁxed dictionary, local subdictionary and after invariant image formation. The proposed

approach using local dictionary substantially improves the performance. Quantitative evaluation of the proposed method has been performed using RMSE. Table 1 gives the performance of the proposed approach in terms of RMSE using generic, local and global dictionaries. Performance is comparatively better when using local dictionary. Even though μ of generic dictionaries is comparatively lower, RMSE of shadow removal using the proposed approach is

Please cite this article in press as: Remya K. Sasi, V.K. Govindan, Shadow removal using sparse representation over local dictionaries, Engineering Science and Technology, an International Journal (2016), doi: 10.1016/j.jestch.2016.01.001

ARTICLE IN PRESS R.K. Sasi, V.K. Govindan/Engineering Science and Technology, an International Journal ■■ (2016) ■■–■■ 20

7

12.252 Proposed Invariant

19

Proposed Fixed dictionary Invariant

12.25

18

12.248

17

RMSE

RMSE

12.246

16

12.244

15 12.242

14 12.24

13 12.238

12 12.236 0

11 0

20

40

60

80

100

10

20

30

40

50

60

70

80

90

100

Iteration

120

Dataset

Fig. 4. RMSE for the invariant image and the resultant image for the entire dataset. It can be observed that RMSE of the proposed approach always remain less than that of the invariant image.

Fig. 6. RMSE vs Iteration for image in Fig. 2 using generic dictionary, local dictionary and invariant image. Proposed approach is non-iterative and even if the algorithm iterates for 100 times, RMSE value of the resultant image remains almost constant.

Table 1 Average RMSE of image from UIUC dataset using generic, global and proposed local dictionary based shadow detection. Generic dictionary 1a

2b

3c

Global Dictionary

Global + Local Dictionary [26]

Proposed Approach

14.67

14.85

13.93

14.10

13.5

12.23

a b c

Haar wavelet packet and DCT dictionary. DCT and shifted Kronecker delta. iid Gaussian entries with zero mean.

Table 2 Quantitative evaluation-RMSE between result of shadow removal and the ground truth shadow-free images using UIUC [6] dataset.

Fig. 5. Dictionary atom size versus Mutual coherence (μ) of generic, local and global dictionaries.

better. The method has been quantitatively compared with Guo et al. [6] and Gryka et al. [16] and found to be better in terms of RMSE. Details of evaluation using images from UIUC dataset are given in Table 2. The proposed method has been visually compared using results from Gong and Cosker [15] and is given in Fig. 7. Table 3 gives a qualitative evaluation of six major shadow removal algorithms – Miyazaki et al. [14], Lalonde et al. [10], Guo et al. [6], Gong and Cosker [15], Zhu et al. 2015 [11] and Gryka et al. [16] – to compare their performance with the proposed approach using properties such as computational load and preservation of textures. Average running time of an image from UIUC dataset [6] is 78.34 ± 18 s/image, out of which 69.6 ± 10 is for dictionary learning. In comparison, for the same dataset, the method by Guo et al. [6] takes 104.718 s/image for shadow removal. Computational time of the proposed method is better compared with state of the art methods.

Methodology

RMSE

Guo et al. 2013 [6] Gryka et al. 2015 [16] Invariant image [28] Proposed approach

19.85 13.83 15.10 12.23

Table 3 Qualitative evaluation. Author

Method

Preservation of Texture

Computational load

Miyazaki et al. [14] Lalonde et al. [10] Guo et al. [6] Gong and Cosker [15] Zhu et al. [11] Gryka et al. [16] Proposed

Interactive Automatic Automatic Interactive Automatic Interactive Automatic

Excellent Good Excellent Good Average Excellent Excellent

Low Low Medium Low Medium Low Low

5. Conclusion We have presented a method to remove shadow from an image using the morphological diversities of shadow coupled with invariant image formation. In the proposed approach, shadow removal is formulated as a decomposition problem that uses separate local dictionaries for shadow and nonshadow parts, without using single global dictionary. Local dictionaries used in the proposed approach are learned from the patches extracted from the residual of

Please cite this article in press as: Remya K. Sasi, V.K. Govindan, Shadow removal using sparse representation over local dictionaries, Engineering Science and Technology, an International Journal (2016), doi: 10.1016/j.jestch.2016.01.001

ARTICLE IN PRESS R.K. Sasi, V.K. Govindan/Engineering Science and Technology, an International Journal ■■ (2016) ■■–■■

8

Fig. 7. Result of the proposed shadow removal process and Gong and Cosker [15]: Left – Input image I; Middle – output of the proposed approach; Right – Result of Gong and Cosker [15].

image obtained after invariant image formation. Finally, a variation of MCA-based image decomposition using local dictionaries is performed to add the geometric component to the non-shadow part of the image to obtain the shadow-removed version of input image. The proposed approach of shadow removal works well for indoor and outdoor images, and the method has been compared with previous approaches and was found to be better in terms of RMSE. References [1] V. Pascual, C. Cuevas, N. Garcia, A method for shadow and highlight removal in nonparametric moving object detection strategies, in: Consumer Electronics (ICCE), 2015 IEEE International Conference on, IEEE, 2015, pp. 172–173. [2] J.-H. Shim, Y.-I. Cho, A shadow removal method for a mobile robot localization using external surveillance cameras, Proced. Comput. Sci. 56 (2015) 150–155. [3] M. Russell, J. Zou, G. Fang, Real-time vehicle shadow detection, Electron. Lett. 51 (16) (2015) 1253–1255. [4] J. Zhu, K.G. Samuel, S.Z. Masood, M.F. Tappen, Learning to recognize shadows in monochromatic natural images, in: Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on, IEEE, 2010, pp. 223–230.

[5] S. Murali, V.K. Govindan, Shadow detection and removal from a single image using lab color space, Cybernet. Inform. Technol. 13 (1) (2013) 95–103. [6] R. Guo, Q. Dai, D. Hoiem, Paired regions for shadow detection and removal, Pattern Anal. Mach, Intell., IEEE Trans. 35 (12) (2013) 2956–2967. [7] R.K. Sasi, V. Govindan, Fuzzy split and merge for shadow detection, Egypt. Inform. J. 16 (1) (2015) 29–35. [8] G.D. Finlayson, S.D. Hordley, M.S. Drew, Removing shadows from images using retinex, in: Color and Imaging Conference, no. 1, Society for Imaging Science and Technology, 2002, pp. 73–79. [9] G.D. Finlayson, M.S. Drew, C. Lu, Entropy minimization for shadow removal, Int. J. Comput. Vision 85 (1) (2009) 35–57. [10] J. Lalonde, A. Efros, S. Narasimhan, Detecting ground shadows in outdoor consumer photographs, Computer Vision-ECCV 2010, pp. 1–14, 2010. [11] X. Zhu, R. Chen, H. Xia, P. Zhang, Shadow removal based on YCbCr color space, Neurocomputing 151 (2015) 252–258. [12] R.K. Sasi, V.K. Govindan, Shadow detection and removal from real images: state of art, in: Proceedings of the Third International Symposium on Women in Computing and Informatics, WCI ’15, ACM, New York, NY, USA, 2015, pp. 309–317, doi:10.1145/2791405.2791450. [13] S. Duchêne, C. Riant, G. Chaurasia, J. Lopez-Moreno, P.-Y. Laffont, S. Popov, et al., Multi-view intrinsic images of outdoors scenes with an application to relighting, ACM Trans. Graph. 16 (2015). [14] D. Miyazaki, Y. Matsushita, K. Ikeuchi, Interactive shadow removal from a single image using hierarchical graph cut, in: Computer Vision–ACCV 2009, Springer, 2010, pp. 234–245.

Please cite this article in press as: Remya K. Sasi, V.K. Govindan, Shadow removal using sparse representation over local dictionaries, Engineering Science and Technology, an International Journal (2016), doi: 10.1016/j.jestch.2016.01.001

ARTICLE IN PRESS R.K. Sasi, V.K. Govindan/Engineering Science and Technology, an International Journal ■■ (2016) ■■–■■

[15] H. Gong, D. Cosker, Interactive shadow editing from single images, in: Computer Vision-ACCV 2014 Workshops, Springer, 2014, pp. 243–252. [16] M. Gryka, M. Terry, G.J. Brostow, Learning to remove soft shadows, ACM Transactions on Graphics 2015, pp. 153, October. [17] S. Liu, Y. Chen, K. Zhao, L. Zhao, Image denoising method of edge-preserving based sparse representations, in: Multimedia, Communication and Computing Application: Proceedings of the, 2014 International Conference on Multimedia, Communication and Computing Application (MCCA 2014), Xiamen, China, CRC Press, 2015, p. 237. October 16–17, 2014. [18] F. Yeganli, M. Nazzal, H. Ozkaramanli, Selective superresolution via sparse representations of sharp image patches using multiple dictionaries and bicubic interpolation, in: Signal Processing and Communications Applications Conference (SIU), IEEE, 2015, pp. 1957–1960. 2015 23th. [19] T. Rao, M. Rao, T. Aswini, Image inpainting with group based sparse representation using self adaptive dictionary learning, in: Signal Processing and Communication Engineering Systems (SPACES), 2015 International Conference on, IEEE, 2015, pp. 301–305. [20] L. Feng, Q. Huang, T. Xu, S. Li, Blind image deblurring based on trained dictionary and curvelet using sparse representation, in: Selected Proceedings of the Photoelectronic Technology Committee Conferences held August–October 2014, International Society for Optics and Photonics, 2015, p. 95222G. [21] A. Bjorholm, V.A. Dahl, Dictionary based image segmentation, in: Image Analysis, Springer, 2015, pp. 26–37. [22] N. Pati, A. Pradhan, L.K. Kanoje, T.K. Das, An approach to image compression by using sparse approximation technique, Proced. Comput.Sci. 48 (2015) 770–776. [23] J.-L. Starck, M. Elad, D. Donoho, Redundant multiscale transforms and their application for morphological component separation, Adv. Imag. Electron Phys. 132 (2004) 287–348. [24] J.-L. Starck, M. Elad, D.L. Donoho, Image decomposition via the combination of sparse representations and a variational approach, Image Process., IEEE Trans. 14 (10) (2005) 1570–1582. [25] M. Elad, J.-L. Starck, P. Querre, D.L. Donoho, Simultaneous cartoon and texture image inpainting using morphological component analysis (MCA), Appl. Comput. Harmon. Anal. 19 (3) (2005) 340–358. [26] G. Peyré, J. Fadili, J.-L. Starck, Learning the morphological diversity, SIAM J. Imag. Sci. 3 (3) (2010) 646–669. [27] G.D. Finlayson, C. Fredembach, Fast re-integration of shadow free images, in: Color and Imaging Conference, vol. 2004, Society for Imaging Science and Technology, 2004, pp. 117–122.

9

[28] G.D. Finlayson, M.S. Drew, C. Lu, Intrinsic images by entropy minimization, in: Computer Vision—ECCV 2004, Springer, 2004, pp. 582–595. [29] C. Fredembach, G. Finlayson, Hamiltonian path-based shadow removal, in: Proc. of the 16th British Machine Vision Conference (BMVC), vol. 2, 2005, pp. 502– 511. [30] G.D. Finlayson, S.D. Hordley, C. Lu, M.S. Drew, On the removal of shadows from images, Pattern Anal. Machine Intell., IEEE Trans. 28 (1) (2006) 59–68. [31] C. Fredembach, G. Finlayson, Simple shadow removal, in: 18th International Conference on Pattern Recognition, 2006. ICPR 2006, vol. 1, IEEE, 2006, pp. 832–835. [32] S.S. Chen, D.L. Donoho, M.A. Saunders, Atomic decomposition by basis pursuit, SIAM J. Sci. Comput. 20 (1) (1998) 33–61. [33] R. Chartrand, Exact reconstruction of sparse signals via nonconvex minimization, Signal Process. Lett., IEEE 14 (10) (2007) 707–710. [34] S.G. Mallat, Z. Zhang, Matching pursuits with time-frequency dictionaries, Signal Process., IEEE Trans. 41 (12) (1993) 3397–3415. [35] Y.C. Pati, R. Rezaiifar, P. Krishnaprasad, Orthogonal matching pursuit: recursive function approximation with applications to wavelet decomposition, in: Conference Record of The Twenty-Seventh Asilomar Conference on Signals, Systems and Computers, 1993, IEEE, 1993, pp. 40–44. [36] R. Tibshirani, Regression shrinkage and selection via the lasso, J. Roy. Stat. Soc. B (1996) 267–288. [37] I.F. Gorodnitsky, B.D. Rao, Sparse signal reconstruction from limited data using FOCUSS: a re-weighted minimum norm algorithm, Signal Process., IEEE Trans. 45 (3) (1997) 600–616. [38] M. Elad, M. Aharon, Image denoising via sparse and redundant representations over learned dictionaries, Signal Process., IEEE Trans. 15 (12) (2006) 3736–3745. [39] J. Mairal, F. Bach, J. Ponce, G. Sapiro, Online dictionary learning for sparse coding, in: Proceedings of the 26th Annual International Conference on Machine Learning, ACM, 2009, pp. 689–696. [40] J. Mairal, J. Ponce, G. Sapiro, A. Zisserman, F.R. Bach, Supervised dictionary learning, in: Advances in Neural Information Processing Systems, 2009, pp. 1033–1040. [41] R. Rubinstein, T. Peleg, M. Elad, Analysis K-SVD: a dictionary-learning algorithm for the analysis sparse model, Signal Process., IEEE Trans. 61 (3) (2013) 661–677. [42] M. Aharon, M. Elad, A. Bruckstein, K-SVD: an algorithm for designing overcomplete dictionaries for sparse representation, IEEE Trans. Signal Process. 54 (11) (2006) 4311–4322. [43] J. Mairal, F. Bach, J. Ponce, G. Sapiro, Online learning for matrix factorization and sparse coding, J. Mach. Learn. Res. 11 (2010) 19–60.

Please cite this article in press as: Remya K. Sasi, V.K. Govindan, Shadow removal using sparse representation over local dictionaries, Engineering Science and Technology, an International Journal (2016), doi: 10.1016/j.jestch.2016.01.001

Shadow removal using sparse representation over local dictionaries

Shadow removal using sparse representation over local dictionaries

Recommend Documents