A note on “A globally convergent BFGS method with nonmonotone line search for non-convex minimization”

A note on “A globally convergent BFGS method with nonmonotone line search for non-convex minimization”

Applied Mathematics and Computation 219 (2012) 764–766 Contents lists available at SciVerse ScienceDirect Applied Mathematics and Computation journa...

139KB Sizes 0 Downloads 25 Views

Applied Mathematics and Computation 219 (2012) 764–766

Contents lists available at SciVerse ScienceDirect

Applied Mathematics and Computation journal homepage: www.elsevier.com/locate/amc

A note on ‘‘A globally convergent BFGS method with nonmonotone line search for non-convex minimization’’ q Li Zhang ⇑, Xinlong Chen Department of Mathematics, Changsha University of Science and Technology, Changsha 410004, China

a r t i c l e

i n f o

Keywords: BFGS method Nonmonotone line search Global convergence

a b s t r a c t In this note, we point out that Assumptions 3.1, 3.2 in [Y. Xiao, H. Sun, Z. Wang, A globally convergent BFGS method with nonmonotone line search for non-convex minimization, J. Comput. Appl. Math. 230 (2009) 95–106] are not enough to guarantee the global convergence result Theorem 3.1 in [1]. We present a new assumption and show that the proposed method in [1] still converges globally under these three assumptions. Ó 2012 Elsevier Inc. All rights reserved.

1. Introduction In this note we consider the following unconstrained optimization problem:

min f ðxÞ; n

ð1:1Þ

x2R

where f : Rn ! R is continuously differentiable. In [1], Xiao et al. proposed a nonmonotone BFGS method for solving (1.1) and obtained some results as follows. Lemma 1.1 ([1], Lemma 3.2). Let the sequence fxk g be generated by Algorithm 2.1 in [1]. If kg k k P  holds for all k with some constant  > 0, then there are positive constants bj ; j ¼ 1; 2; 3, such that, for any k, the inequalities

kBi si k 6 b1 ksi k and b2 ksi k2 6 sTi Bi si 6 b3 ksi k2

ð1:2Þ

hold for at least a half of the indices i 2 f1; 2; . . . ; kg, where Bk is the iterative matrix and sk ¼ xkþ1  xk ¼ ak dk . Lemma 1.2 ([1], Lemma 3.4). If kg k k P  holds for all k with some constant  for all i 2 fij ð1:2Þ holdsg. that ai P a

 > 0, then there exists a positive constant a such

Theorem 1.1 ([1], Theorem 3.1). Let Assumptions 3.1,3.2 in [1] hold. Then the sequence fxk g generated by Algorithm 2.1 in [1] converges globally in the sense that

lim inf k!1 kg k k ¼ 0: The authors in [1] obtained Theorem 1.1 under the following assumptions:

q This work was supported by the NSF foundation (10901026) of China and The Open Fund Project of Key Research Institute of Philosophies and Social Sciences in Hunan Universities. ⇑ Corresponding author. E-mail address: [email protected] (L. Zhang).

0096-3003/$ - see front matter Ó 2012 Elsevier Inc. All rights reserved. http://dx.doi.org/10.1016/j.amc.2012.06.073

L. Zhang, X. Chen / Applied Mathematics and Computation 219 (2012) 764–766

765

Assumption 3.1 [1]. The level set L0 ¼ fxjf ðxÞ 6 f ðx0 Þg is bounded. Assumption 3.2 [1]. Function f is continuously differentiable on L0 , and there exists a constant L > 0 such that

8x; y 2 L0 ;

kgðxÞ  gðyÞk 6 kx  yk;

where gðxÞ is the gradient of f at x. In this paper, we first point out that Assumptions 3.1,3.2 are not enough to guarantee the global convergence result Theorem 1.1 by an example. Then we present a new assumption and give a simpler proof to show that the proposed method in [1] still converges globally under these three assumptions.

2. A new assumption and global convergence In this paper, we use the same notations as those of [1]. For simplicity, we first introduce some notations as follows:

f ðxhðkÞ Þ ¼ max f ðxkj Þ; 06j6M0

k  M0 6 hðkÞ 6 k;

where M 0 is a given positive integer and

J ¼ fij ð1:2Þ holdsg; J ¼ frj

X

ðg Tr sr Þ < 1g;

r

where J is only a subset of f0; 1; 2;   g; g r ¼ gðxr Þ and sr ¼ ar dr . In the proof of Theorem 3.1 in [1], the authors used the condition J # J which is the key in their proof. However, this is not proved in [1]. In fact it is not easy to prove this condition in theory. The main reason is that we can not control the behavior of the iterative matrix Bk . And hence we do not know when the matrix Bk generates a so-called good iteration which satisfies (1.2). Hence there is some possibility that J \ J may be an empty set, for example: Assume that

J ¼ f3mg1 m¼1 and

J ¼ f3m  2; 3m  1g1 m¼1 : Then for all k > 0, the number of J is 2=3 times k, which satisfies Lemma 1.1. It is clear that J \ J ¼ /, i.e., an empty set. To guarantee that the main results of [1] still hold, we give a new assumption as follows. Assumption A. G,K \ J is infinite, where K ¼ fhðkÞ  1g. Now we present the following global convergence result under this additional condition. Theorem 2.1. Let Assumptions 3.1,3.2 in [1] and Assumption A hold. Then the sequence fxk g generated by Algorithm 2.1 in [1] converges globally, that is,

lim inf k!1 kg k k ¼ 0: Proof. We prove this theorem by contradiction. Assume that the conclusion is not true, then there exists a positive constant e such that

kg k k P e;

8k P 0:

ð2:1Þ

From Lemma 3.3 in [1], we have that the sequence ff ðxhðkÞ Þg is monotonically nonincreasing and fxk g  L0 . By the line search (2.3) in [1], we have

f ðxhðkþ1Þ Þ 6 f ðxhðkÞ Þ ¼ f ðxhðkÞ1 þ ahðkÞ1 dhðkÞ1 Þ 6 max f ðxhðkÞ1j Þ þ dahðkÞ1 g ThðkÞ1 dhðkÞ1 06j6M0

T hðkÞ1 g hðkÞ1 dhðkÞ1 :

¼ f ðxhðhðkÞ1Þ Þ þ da

Let k ! 1 in the above inequality, we have

ahðkÞ1 g ThðkÞ1 dhðkÞ1 ! 0; where we use the fact g ThðkÞ1 dhðkÞ1 < 0. This and Lemma 1.2 imply

lim g Tk dk ¼ 0:

k2G;k!1

ð2:2Þ

766

L. Zhang, X. Chen / Applied Mathematics and Computation 219 (2012) 764–766

Moreover, from Lemma 1.1 and sk ¼ ak dk , we have for all k 2 G, T

kBk dk k 6 b1 kdk k and b2 kdk k2 6 dk Bk dk 6 b3 kdk k2 ; which together with Bk dk þ g k ¼ 0 yields that

kg k k 6 b1 kdk k and b2 kdk k2 6 g Tk dk 6 b3 kdk k2 :

ð2:3Þ

By (2.2) and (2.3), we have that

limk2G;k!1 kg k k ¼ 0; which contradicts with (2.1). This completes the proof. h Reference [1] Y. Xiao, H. Sun, Z. Wang, A globally convergent BFGS method with nonmonotone line search for non-convex minimization, J. Comput. Appl. Math. 230 (2009) 95–106.