Physics LettersA 174 (1993) 289-292 North-Holland
PHYSICS L E T T E R S A
Learning rules for an oscillator network H i d e t s u g u Sakaguchi Department of Physics, College of General Education, Kyushu University, Fukuoka 81 O, Japan Received 7 October 1992; revised manuscript received 11 December 1992; acceptedfor publication 18 December 1992 Communicatedby A.R. Bishop
Two learning rules are studied for a simple oscillator network. One is a Boltzmann-machine-typeof supervised learning rule and the other is a generalized Hebbian rule for the self-organizationof connections. Synchronization is one of the peculiar phenomena observed in coupled oscillator systems [ 1,2]. Recently Eckhorn et al. and Gray and Singer found that neurons in the primary visual cortex of a cat can exhibit oscillatory responses and that the oscillation is synchronized over relatively large distances [ 3,4]. Sompolinsky et al. used a smooth phase oscillator or an X Y spin as a dynamical unit and analyzed the phase coherence of the oscillation [ 5 ]. Chawanya et al. used an integrate-and-fire-type element and studied mutual synchronization among the neurons [ 6 ]. Abbott used a Fitzhugh-Nagumo type element and proposed phase locking memories [7]. Synaptic connections are fixed in their oscillator networks. However, modification of synaptic connections is characteristic of a neural system. We want to consider a learning process in an oscillator network and study the interplay of mutual synchronization and learning. In this Letter we use a simple phase oscillator model and propose simple learning rules, since such phase models are expected to retain the essence of the interplay of learning and synchronization and are mathematically tractable. A different kind of learning rule that may occur in a network of fireflies is discussed by Ermentrout [ 8 ]. We assume that each oscillator can be described with its phase and the phase oscillators obey the coupled phase equation [ 2,5,9 ] d~i d-T = to~ - ~ W~j sin ( g), - q)j) + ~ ( t ) ,
dc/~ dt - -
OH 0-~ + ~ ( t ) ,
i= l, ..., N ,
(2)
where H = - E~j W~j cos ( ¥~- C/j). Equation (2) is the Langevin equation for C/~. The equilibrium distribution P_ ((C/,-)) is written as e--H/r P - ({C/")) = f2o~ . . . f ~ d C / l ...dC/~e - z / r "
(3)
This equilibrium distribution is equivalent to the equilibrium distribution of the X Y spin system with temperature T. The connections W,-j can be modified slowly by a learning rule in a neural system. We propose a learning rule which is a simple extension of the Boltzmann machine learning [ 10,11 ]. In the Boltzmann machine learning the connections are changed as the Kullback divergence is decreased. The Kullbach divergence G is defined as 2n
2n
G= ~...~dC/,...dC/jvP+((C/i))In(P+/P_),(4)
J
i = 1,...,N,
where ~ is the phase of the ith oscillator, to~ is its natural frequency, W~j represents the synaptic strength between the ith and jth oscillators, and ~ represents noise. The noise is taken to be Gaussian white noise with ( ~ i ( t ) ~ ( t ' ) ) = 2 T ~ i f l ( t - t ' ) . The connection is assumed to be symmetric: Wij= Wj.~, and Wi,~= 0. I f each oscillator has a constant natural frequency too, ¥~= ~J-toot obeys the coupled equations
(1)
O
O
0375-9601/93/$ 06.00 © 1993 Elsevier Science Publishers B.V. All fights reserved.
289
Volume 174, number 4
PHYSICS LETTERS A
where P+ is a desirable probability distribution and P_ is the equilibrium distribution (3). The Kullback divergence G is generally positive and G = 0 only when P+ = P_. It represents a distance between P+ and P_. When the connections are changed as the Kullback divergence is decreased, the equilibrium distribution P_ is expected to approach the desirable distribution P+. The learning rule is written as
dW~,j OG r dt =--Tow~,j 2x
=T 0 2n
dy,...dyNp
({y./),
0
W~a = - 0.11
a wi,j
2~
= f ... f d y l . . . d y N C O S ( y i - Y j ) o o × [P+ ({Y~})-P_ ({y~}) ]
=+-_,
(5)
where z is the time constant for the learning and < > + and < > _ represent respectively the average with respect to P+ and P_. In the original Boltzmann machine learning cos ( y~- YJ) is replaced by S:g where s~ and sj represent the firing state of the ith and jth neuron and take the value + 1 or - 1. The quantity cos(y~-yg) describes the phase coherence between the ith andjth oscillators and it is a quantity similar to s~sjin the Ising system, since sisj also describes the spin correlation between the ith andjth spins. Equation (5) says that the connections are modified as the average phase coherence _ approaches the desirable phase coherence < c0s ( y ~ - yj) > +. In the original Boltzmann machine each spin variable can take only two values, + 1 or - 1. But in our oscillator network each oscillator takes a continous value between 0 and 2n. We can therefore approximate the probability distribution P_ to a larger class of the probability distribution P+ with our oscillator network. As an example we have carried out a numerical simulation of eqs. ( 1 ) and (5). We consider a system of six oscillators and the desirable phase coherences < c o s ( y ~ - y j ) > + are assumed to be + = < c o s ( y 3 - y , ) > + : + =0.95
290
and < c o s ( y ~ - y j ) > = - 0 . 4 for the other pairs. Namely, the six oscillators are classified into three groups: ( 1, 2), (3, 4) and (5, 6); the pair in each group is attractive and the three groups are repulsive. The temperature T is 0.1 and the thermal average < c o s ( y ~ - y j ) ) _ is replaced by a long time average of cos [ Yi(t) - yg(t) ]. Initially all connections W~j are assumed to be 0. The result of the simulation is W1, 2 = W3, 4 = W5, 6 =0.87,
2~x
...
15 March 1993
for the other pairs.
(6)
As is expected the connections in each group are positive and the connections between two groups are negative. Figure 1 shows an equilibrium configuration of the six oscillators in the plane (x, y ) = (cos(y), s i n ( y ) ) for the above values of the connections and T = 0. Two oscillators in each group are completely synchronized and each group is -~n away from the other two groups. The above learning rule is a kind of supervised learning rule, since a desirable phase configuration is given to the system. Next we propose an unsupervised learning rule. We consider a system of N oscillators (1) where the natural frequencies are generally different. As a learning rule we consider a generalized Hebbian rule:
dWi,j z dt = - D W , j+g(Oi-Oj).
(7)
The first term represents a simple damping term and the connection is strengthened by the second term.
34 6 o
12 I
i
1
O0
-T O. O0
J
I 1.00
X
Fig. 1. Equilibrium configuration for the six oscillators with the connections (6) and T = 0.
Volume 174, number 4
PHYSICSLETTERSA
15 March 1993
In the neural network composed of the Ising spin si, like the Hopfield model, the Hebbian rule is written as d W~,j/dt ocs~sj [12]. As a simple generalization of the Hebbian rule we assume g ( ~ t - ¢j) =cos(Oi-Oj), since c o s ( ~ ; - ¢ j ) represents a similar correlation to s~sj in the Ising system. Since the time constant z is very large, cos ( ~ - ~j-) can be replaced by its average
0
(cos(¢~-~j))_. We consider at first a system of two oscillators and study theoretically the generalized Hebbian learning rule. The phase difference ~ u = ~ - ~ 2 obeys d~/ dt - Aco--2Wsin(¢,) + ~ ( t ) ,
(8)
~;o. oo
'
0. 2 0
'
o.',o
'
o.',o
'
o.',o
'
~.'oo
W
Fig. 2. Phase coherence (cos(~=-~2))_ for Ato=0, 0.2 and T= 0.1 as a functionof W.
where Aco=col--092, W= I4II,2 and ~=~l-~2, which satisfies ( ~ ( t ) ~ ( t ' ) ) = 4 T J ( t - t'). The probability distribution P_ (~u) obeys
OP_ O Ot - -- ~ { [Aog-EWsin(~,) ]P_}
7.
+ 2 T o2P-
(9)
0~¢2 "
The stationary solution of eq. (9) satisfying the periodic boundary condition P (~) = P (~u+ 2n) is given by P - (~u) = e x p ( - - 2 W + Aco¢¢+ ~-~ 2 W cos ( ¥ ) ) r n (0)
X ( I + [exp( - 2nAog/2T) - 1 ] X f~ d~'exp{ [ -Ato~u- 2 Wcos(~u) ]/2T} fo2x d~ exp{ [ - A t o ¥ - 2 Wcos(~,) ] / 2 T } ] '
(10) where P _ ( 0 ) is determined by the normalization condition fo2x d y P _ (~) = 1. Substitution of (10) into eq. (7) yields a learning equation for W:
dW
z--~ =-DW+
2x
j- d ~ c o s ( ~ ) P _ ( ~ ) .
(11)
0
Figure 2 shows the second term of (11 ), i.e., c = ( c o s ( ~ , ) ) _ as a function of W for T=0.1, and A~o=0 and 0.2. The stationary solutions ofeq. ( 11 ) are obtained as the intersection of the curve of fig. 2 and the straight line c=DW. If the damping con-
0. 00
t. 00
2, 00
3. 00
4. O0
5. 00
D
Fig. 3. Stationary solutions o f the learning equation ( i 1 ) for Ato=0 and 0.2.
stant D is large, the only solution is c = W = 0 . Namely, the connection cannot be generated. But if D is smaller than a critical value, the zero solution becomes unstable and a nontrivial connection appears. The phase coherence appears owing to the connection and the connection is strengthened by the generalized Hebbian rule. This positive feedback is the origin of the self-organization of the mutual connection. If W is small enough, the phase coherence c= (cos(~u))_ can be expanded in powers of W / T and then the stationary solution W satisfies the equation
DW=
2T
AO)2+4T2
1 W - -4-T-~ gW
3
(12)
where g = 4 / ( A 0 9 2 + 16T 2) --8TAto/(Ao)2+4T2) 2. The critical damping constant Dc = 2 T~ (Ato 2 + 4 T 2 ) and when g is positive (negative), the transition is supercritical (subcritical). Figure 3 shows the stationary solutions of eq. ( I 1 ) as a function of D for 291
Volume 174, number 4
PHYSICS LETTERS A
T = 0 . 1 a n d Ao9=0 a n d 0.2. W h e n Aog=0, the transition occurs at D = 5 a n d it is supercritical. On the other h a n d when Ao9=0.2, the transition occurs at D = 2.5 and it is subcritical. Between D = 2.5 a n d 2.7 the system is bistable. The critical d a m p i n g constant Dc becomes small when the difference from the natural frequency is large. We have carried out a numerical simulation o f six oscillators. T h e natural frequencies are o91= 1.1, 092=0.5, o93=0.1, o 9 4 = - 0 . 1 , o 9 5 = - 0 . 5 a n d 096= - 1.1. Initially W~j=0.5, D = 1.5 a n d T = 0 . 1 a n d the d a m p i n g constant D is increased gradually. Figure 4 shows the connections WL,2, W2,3 a n d W3.4 as a function o l D . At D = 1.5 all connections are not zero. The critical d a m p i n g constant Dc for the self-organization is smaller for the larger frequency difference. So at first the connections Wt,2 a n d I415,6 b e c o m e zero
15 March 1993
discontinuously at D ~ 1.85 a n d the first and sixth oscillators are isolated from the oscillator network. Next, the second and fifth oscillator are isolated from the oscillator network at D ~ 2.15 and finally all connections become zero at D ~ 2.75. In s u m m a r y we have investigated two learning rules for the connections o f a coupled oscillator system. One is a simple extension o f the Boltzmann machine learning. A desirable phase coherence can be realized by the learning rule. The learning rule can be easily extended to include higher h a r m o n i c interaction like W,j sin [ n ( ~ - ~j) ]. The other is a generalized H e b b i a n rule, for which the connections can be self-organized in a certain p a r a m e t e r region. The self-organization o f connections is easier for the oscillator pair whose frequency difference is small. The models considered in this Letter seem to be too simple and we would like to study m o r e biologically realistic models in the future.
References
,=::;
'
t
40
i.
80
I
/ ~ 2. 20
I
' 2, 50
3.00
D
Fig. 4. Connection strength W~.2, 1412.3and W3.4as the damping constant D is gradually increased. The natural frequencies of the six oscillators are cot=l.l, co2=0.5, ca3=0.1, co4=-0.1, o~n= -0.5 and co6= - 1.1.
292
[ 1] A.T. Winfree, The geometry of biological time (Springer, Berlin, 1980). [ 2 ] Y. Kuramoto, Chemical oscillation, waves, and turbulence (Springer, Berlin, 1984). [3] R. Eckhorn, R. Bauer, W. Jordan, M. Brosch, W. Kruse, M. Munk and R.J. Reitboeck, Biol. Cybern. 60 ( 1988 ) 12 I. [4 ] C.M. Gray and W. Singer, Proc. Natl. Acad. Sci. 86 ( 1988 ) 1698. [ 5 ] H. Sompolinsky, D. Golomb and D. Kleinfeld, Phys. Rev. A43 (1991) 6990. [ 6 ] T. Chawanya, T. Aoyagi, I. Nishikawa, K. Okuda and Y. Kuramoto, to appear in Biol. Cybern. [7] L.F. Abbott, J. Phys. A 23 (1990) 3835. [8] B. Ermentrout, J. Math. Biol. 29 ( 1991 ) 571. [9] H. Sakaguchi, Prog. Theor. Phys. 79 (1988) 39. [ 10] D.H. Ackley, G.E. Hinton and T.J. Sejnowski, Cogn. Sci. 9 (1985) 147. [ 11 ] H. Sakaguchi, Prog. Theor. Phys. 83 (1990) 693. [ 12 ] J.J. Hopfield, Proc. Natl. Acad. Sci. 79 ( 1982 ) 2254.