Directions in matrix theory, Auburn 1990, conference report

Directions in matrix theory, Auburn 1990, conference report

REPORT Directions in Matrix Theory, Auburn 1990, Conference Report Frank Uhlig and Tin-Yau Tam Mathematics Department Auburn University Auburn, Al...

4MB Sizes 0 Downloads 90 Views

REPORT

Directions

in Matrix Theory, Auburn 1990, Conference

Report

Frank Uhlig and Tin-Yau Tam Mathematics Department Auburn University Auburn, Alabama 36849-5307 and David Carlson Mathematical Sciences Department San Diego State University San Diego, Calgornia

92182

Submitted by Richard A. Brualdi

1. INTRODUCTION

The

idea for the

conceived There,

a large number

and common however,

fourth

linear

algebra

conference

of researchers

language

that as wonderful

as these

was clearly

conferences

are,

or the direction

as discontinuous

apparent,

they give only a momentary But

will not yield the derivative f

And a direction

as research

over

that our area is experiencing.

of many discrete values off

of f. One has to work harder.

assess for something

was

and the vitality

It became

As one attends such conferences

the years, one might get a feeling for the development in calculus-knowledge

evident.

University

(see LAA, Vol. 121).

from many lands had gathered,

of our research

glimpse of the state of the art in linear algebra. -as

at Auburn

by Frank Uhlig during the 1987 Valencia Conference

developments

is especially difficult to in a vast area such as

linear algebra. So the idea was conceived

of inviting many experts in linear algebra to a conference

and asking them to present their visions of the forces from the past and those leading into the future as regards linear algebra. Frank talked with several of the participants

in

Valencia and wrote many letters from Coimbra in the fall in order to plant the seeds for this conference. “Hilbert’s

The responses were encouraging.

shoes”

would

speaker speak from his/her research:

“What

not fit anyone.

There was a general realization that

But Frank

kept on, suggesting

that every

own standpoint and develop the vision that we need for our

are the important problems?”

LINEAR ALGEBRA AND ITS APPLICATIONS

“Why are they important?’ 162-164:711-797

“Where

(1992)

0 Elsevier Science Publishing Co., Inc., 1992 655 Avenue of the Americas, New York, NY 10010

0024~3795/92/$5.00

did 711

712

FRANK

UHLIG,

TIN-YAU

TAM, AND DAVID

CARLSON

they come from?’

“Where might they lead?” “ What is linear algebra as a discipline capable of solving now or in the near future?” “What lies ahead?” “How are past results pointing us in certain directions?” “ What direction is matrix theory taking?’ Thus the title for our conference. number

In this Special

Issue you will find an unparalleled

of deeply visionary papers on the affairs of linear algebra.

a year after the conference,

it is apparent

that “Hilbert’s

And now, more than

shoes” are not really needed.

We can each wear our own. One important ence.

The

Meetings

new direction

Undergraduate in Louisville

Undergraduate renewed

Algebra

in January

1990,

Conferences

in the teaching

Thompson

came

lectures

conference

aboard

and feature

all active

much needed

Together

research

of 1988.

With him he brought

in TEX and word processing. There

were

33 invited

given, with a total attendance From the beginning to support

of linear

hoped-for camellias

support

in lieu of speakers’

Our financial Auburn The

faltered, support

Uniuersity

We

specifically

Auburn University

to give

in the fall

speakers

for the success

65 contributed

of the

talks were

from 25 countries.

to use the funds that might become

according

our senior

and others

and helped us greatly with his

In addition,

of 135 matricians

speakers

algebra.

Tam joined

speakers.

the

the program of the

Tin-Yau

talents

a

issues.

during

activities,

we had planned

our invited

of 1988

on regional

Most of all we have to thank our group of invited conference.

also with teaching

we tried to balance

his extraordinary

on the

Matrix Theory

to report

surveys of some areas.

expertise

that future

in the summer

areas

at the Joint

in August 1990, represent

will involve themselves

at Johns Hopkins.

at our confer-

Discussion

and Mary Workshop

We expect

as a coorganizer

wanted to offer some minisymposia

Panel

in Williamsburg

of our subject.

at Auburn and elsewhere Carlson

too late to be represented Curriculum

and the William

Linear Algebra Curriculum

interest

Dave

appeared

Linear

to their

speakers

need.

came

And when

through

one

available agency’s

for us by accepting

fees. What a treat it is to work with matricians!

came from four sources:

and its Vice-President

Oak Ridge Associated

of Research (ORAU)

Unioersities

supported

supported

us generously.

us with

unrestricted

funds, a fact that was very helpful in the initial stages when we had to build our logistics base here and had to print and mail out the announcements

etc. in order to even have

the conference. The

National Security Agency

supplied

adequate

funds for our speakers’

expenses

where needed. Finally we are grateful to the Linear Algebra Group of SlAM and to ZLAS, who each helped us out with mailing labels and general And, last but not least, the participants Our sincere

thanks to all that were involved for making this conference

Let us conclude linear algebra and

Linear

Figure

support. who came to attend the show.

our preface

to the conference

over the last 20 years. and Multilinear

Algebra

1 shows graphs of the bookshelf

year intervals,

and with

the number

The journals were

report

Linear Algebra

founded

in 1968

space occupied of volumes

with a factual

possible. assessment

of

and Its Applications

and 1973

respectively.

by each marked on the left in 5

per period

marked

on the

right.

AUBURN

1990 CONFERENCE

ON MATRIX

713

THEORY

50

100 90

Shelfspace in cm

80

40

60 -

30

Number

50 40

-

30

-

of

volumes 20

10

20 10-

1

O-

1968 1972

1982

1 977

0

19t

(a)

30

-

25

-

- 15

10

20 -

Shelfspace in cm

15

-

-

Number of volumes

t

10 5 1 OJ

1973

31977

1982

1987

‘90

(b) FIG.

Cumulative

1.

Linear

and Multilinear

These

graphs

seem

the two journals numbers

totals

by period:

Algebra,

1973-1990.

to suggest

doubles

of participants,

that

about every

These progressions

A doubling

the

This is corroborated

1968-1990;

of papers

in each

and talks given at the Auburn growth pattern

for a constant

a = (In2)/10

of

Conferences

in linear algebra:

. volume of papers(year

(b)

by data on the

2 below.

t) = (1 + tr)

every 10 years is achieved

volume

Appl.,

10 years.

suggest an exponential

volume of papers(year

Algebra

currently

home countries,

from 1970 to 1990, as shown in Figure

(a) Linear

(t = 0.07.

1)).

FRANK

150

UHLIG,

TIN-YAU

TAM, AND DAVID

CARLSON

100I-

25

90I120

80

20

90

60

15

a

lo

ii :: t

50 60

40

r

30 30

20

0

1970

Note that Bob Thompson, an exponential

was obtained

19’80

Auburn Conferences,

in the introduction

19’S;

0

19;o

1970-1990.

to his address for this conference

growth rate for linear algebra with d = 0.04.

by counting

s

&‘--

FIG. 2.

established

::

5

A

__--

10

0

rf IFi

70

the number

of reviews

in the 15Xxx section

also

His rate figure in MathernaticaZ

Reoiews each year from 1940 on. It stands to reason that, given an overall yearly growth of 4%, a much larger growth rate of 7% is obtained Linear

Algebra,

as these were founded

only recently,

The main part of this report contains abstracts

for the two journals

specializing

in

in 1968 and 1973.

the following:

of invited talks, titles of contributed

talks, and synopsis of both.

Of course, we do not repeat those abstracts

or titles of talks that evolved into papers

in this special issue.

2. ABSTRACTS Generalized

OF

INVITED

TALKS’

inverse Invariances

by Jerry K. Baksalary.2 A survey is given of criteria

for the concepts

singular values, etc. to be invariant

with respect

such as range, rank, trace, eigenvalues, to the choice

of a generalized

inverse

‘Only those abstracts are given here that did not result in a paper in this issue. “Department PL-65-069 SF-33101

Zielona Tampere

of Mathematics G&a,

Poland,

10, Finland.

and Statistics,

Tadeusz

and Department

Kotarbiliski

of Mathematics,

Pedagogical University

University, of Tampere,

AUBURN

1990

B-,

they

when

type

CONFERENCE are referred

are relevant

estimability,

ON MATRIX

to in the product

to several

statistical

characterizations

and properties

AB-C.

problems,

of the

of canonical

715

THEORY The invariance

for instance

minimum-dispersion

correlations

in linear

properties

to criteria

linear

of this

for unbiased

unbiased

estimators,

models.

LAP&X-a Portable High-Performance Linear-Algebra Library by Javes Demmel.3 The goal of the LAPACK project library

for efficient

based

on the

eigenvalue including

used

problems,

and

recent

SVD algorithm tion ric

in the library.

First,

and the grading which

thought

positive

contain

definite

small

significantly

singular

improve

using

criterion) and

SVD.

we

show

that

work

with

overview

errors.

Third, more

In fact, even

much

project,

algorithms which

respects

a new bidiagonal

more

accurately

than

differential

we show accurate

as long

using

equa-

that

Jacobi’s

for the symmet-

as the

infinite

is

solving,

of the

solver

a Hamiltonian

is uniformly

library

high-accuracy

we present

and vectors

of roundoff

stopping

a brief on new

Second,

involves

linear-algebra The

for linear-equation

a linear-equation

values

The proof

errors,

After

we discuss

eigenproblem

relative

squares.

we concentrate

propagation

(with a modified

a portable computers.

EISPACK packages

of the problem.

computes

possible. the

and

least

results,

to understand

method

linear

and implement

of high-performance

LINPACK

benchmark

the sparsity

previously

is to design

on a variety

widely

to be included both

use

matrix

precision

entries will

not

on Jacobi.

This talk represents Deift,

J. Dongarra,

L.-C.

Li, A. McKenney,

joint

J. Du

Croz,

E. Anderson,

I. Duff,

D. Sorensen,

M. Arioli,

A. Greenbaum,

C. Tomei,

Z. Bai, J. Barlow,

S. Hammarling,

W.

P.

Kahan,

and K. Veselic.

Structured Linear Algebra Problems in Signal Processing and Control4 by Paul van Dooren. We give a survey processing though

and

the problems

no longer

make

3Department Current address: 94720.

of a number

control, use

where

one wants here

of linear-algebra the

structure

to solve for these

of standard

problems

of the

matrices

linear-algebra

occurring

matrices tools,

involved

are rather since

the

in digital

signal

is crucial. classical, structure

Al-

one can of the

of Mathematics, Courant Institute, 251 Mercer St., New York, NY 10012. Division of Computer Science, University of California, Berkeley, CA

4This paper appeared in Numerical Lirwar Algebra, Digital Signal Processing and Pam&l Algorithms (G. Golub and P. van Dooren, Eds.), NATO ASI Ser. 70, 1991, pp. 361-384. 5Philips Research Laboratory, Ave. Albert Einstein, gium; current address: Coordinated Science Laboratory, Champaign, Urbana, IL 61801.

4, B-1348 Louvain-la-Neuve, BelUniversity of Illinois at Urbana-

716

FRANK

matrices

UHLIG,

has to be taken into account.

how structure

affects the sensitivity

TIN-YAU

We discuss

Structured

matrices

around

for a long time

algebra

several

fields.

dealing

with

matrices,

structure

such

of the matrices

but one is then

from loss of accuracy divergence

of the algorithm

The importance gebra problems recognized occur.

When

them.

We

sensitivity

there

then

are fast algorithms

analyze

of a problem

if the

may have

exploited

in general.

LAPACK:

A Linear-Algebra

by Jeremy

on the

is planned

squares problems, The library

systems

common linear-algebra This library,

packages

squares.

LINPACK

These algorithms.

problems

and how

EISPACK

architectures

talks describe routines,

they

discuss

may affect

structure

the that

could

be

called

LAPACK,

for the analysis

equations,

linear

and least-

problems. a uniform

set of subroutines

solving,

networks,

among machines tools

scheme

not only will ease

of different

for evaluating

and widely used

eigenvalue an important

but they were not designed now becoming

to solve the most

on a wide range of architectures.

via computer

have provided

notes

package

algebraic

also will provide

the naming

and contain

we briefly

Computers

77 subroutines

linear

more portable but

of structured

and control

also look at the effect

computational

FORTRAN

and to run efficiently

for linear-equation

on serial machines,

and vector

proposed

to provide

efficiency,

and

a number

on a matrix

We

The library will be based on the well-known

EISPACK

computing

of

of simultaneous

make codes

and increase

performance.

the proposed

which will be freely accessible

code development,

of structure

linear-al-

and Z. Bai.’

and matrix eigenvalue

is intended

of structured

for these problems,

of an algorithm

to be a collection

of various

suffer

dealing with them is being

Library for High-Performance

outlines

fast algorithms

of the sensitivity

for such a matrix.

Ducroz, Ed Anderson,

This minisymposium

parallel

so-called

the

or in other

which then results in complete

in algorithms

available

stability

for

with exploiting

of the problem,

of digital signal processing

constraint

defined

in

derived

answer.

understanding

in which problem

structure

tures,

of these

and of the error propagation

and indicate

solution

concerned

execution,

from the correct

of a correct

have been

more and more these days. Here we first consider

matrices

which

mainly

Yet several

during their (real-time)

and are encountered

algorithms

in order to improve the complexity

words to speed up the algorithm.

should

constraint.

In linear

application

and show

at hand and how algorithms

have been

various

CARLSON

some of these problems

of the problem

be adapted in order to cope with the structure

TAM, AND DAVID

problems,

LINPACK

and

and linear least

infrastructure to exploit

architeccomputer

for scientific

the profusion

of

available. for the routines,

on the structure

give listings

of the routines

for a few

and choice

of

In addition, a discussion of the aspects of software design is given.

‘Department of Computer Science, University of Tennessee, Knoxville, TN 37996, Oak Ridge National Laboratory, Mathematics Science Section, Oak Ridge, TN 37831.

and

717

AUBURN 1990 CONFERENCE ON MATRIX THEORY A Summary

of Research on Linear Algebra and Matrix Theory in Spain

by Vicente Her&&z7 This talk is devoted working

on topics

to giving a picture

related

paid to the research

with matrix

in the Universidad

of the different

theory

PolitCcnica

Some Recent Results on Singular-Value by Roger Horn.’ A quasilinear representation clearer

understanding

hold between inequality

from which

Special

attention

is

de Valencia.

invariant

norms

Ky Fan domination

for all unitarily

many known

groups in Spain which are algebra.

lnequulities

for unitarily

of the classical

two matrices

and linear

invariant

on matrices

theorem

norms.

and new inequalities

leads to a

on inequalities

that

It also leads to a master

can be extracted

as special

cases. Basic notions essential

of duality play a key role in obtaining

in deriving

an apparently

ucts that obey a fundamental

new characterization

majorization

to treat the ordinary and Hadamard generalization

of both products

inequality.

products

cannot

FFTs and the Sparse-Factorization

our results,

and they are also

of those bilinear

matrix prod-

This characterization

permits

us

in a unified way and shows why a natural

satisfy the basic inequality.

Idea,9

by Charles van Loan.” The FFT Algorithms subscript

literature

tend notations.

sen block-matrix Borrowing framework DFT

is vast, disconnected,

to be

detailed

ideas

from

“sparse factorizations.” and its connection

selected FFT

authors,

algorithms.

with

an array of tricks.

obscure

this situation

I have developed The central

of sparse matrices.

multidimensional through

a well-cho-

The theoretical

vehicle

of this activity

FFT algorithms.

has important

a high-level,

unifying

idea is the factorization

Different

FFTs

correspond

that surface

of the

to different

for doing this is the Kronecker

with the kind of data transpositions

fringe benefit

factorizations

level

notation.

for describing

of vector/parallel

and (to an outsider)

scalar

This talk is about how to correct

matrix into a product

important

at the

in FFT

product work. An

is that our notation facilitates the development

So once again we see that the language

computational

overtones.

‘Dpto. Sistemas Informaticos y Computation, Camino de Vera, s/n, 46071 Valencia, Spain.

of matrix

Notation is everything.

Universidad Politecnica

de Valencia,

‘Department of Mathematics, Johns Hopkins University, Baltimore, MD 21218 with Roy Mathias and Yoshihiro Nakamura).

(jointly

‘This paper gives an overview of the book Matrix Frameworks for the Fast Fourier Transform, SIAM, Philadelphia, 1992. “Department

of Computer Science, Cornell University, Ithaca, NY 14853.

718

FRANK

UHLIG,

TIN-YAU

TAM, AND DAVID

CARLSON

Matrix Complements by Carl Meyer.” The

purpose

is to introduce

complementation

tion is focused on stochastic a partitioned

some variants

in block-partitioned

irreducible

matrices

complementation,

stochastic

concept

special

of Schur

structure.

Atten-

which, in its simplest form, is defined on

matrix

Pll

P=

p12

p

21

i with square diagonal blocks.

of the well-known which possess

The stochastic

p22

1

complement

of Pii in P is defined to be the

matrix

sij = Pii Stochastic as important

+ P,j(

complements stochastic

complementation

z-

for

Pjj) - lPji

i = 1,2 and j = I,2.

have a variety of interesting

interpretations.

are discussed,

Several

theoretical

of the algebraic

and then applications

properties

aspects

as well

of stochastic

to Markov-chain

problems

are

developed. Extensions

of these

ered. The concept

ideas to general

nonnegative

of Perron complementation

well as its applications

Lijwner-Ordering

irreducible

matrices

are consid-

is put forth, and some of its properties

as

are presented.

Monotonicity

and

Convexity

Properties

of

Some

Matrix

Functions by K. Norah-iim.‘2 A survey

is given

some matrix functions the statistical

of LGwner-ordering encountered

literature

set of Hermitian

Combinatorial

monotonicity

in statistics.

only for nonnegative

and convexity

Some of these properties,

definite

matrices,

properties

of

considered

in

are here extended

to the

matrices.

Perron-Frobenius

Theory

by Hans Schneider.13 Combinatorial

spectral

a matrix to its spectral

“Department

theory is the study of the relation

properties.

of Mathematics,

of the graph (or pattern)

In this talk, we mainly consider

(reducible)

of

nonnega-

North Carolina State University, Raleigh, North Carolina,

27695. “Department of Mathematics and Statistics, University of Helsinki, Helsinki, Finland; current address: Institut fur Mathematik, Universitgt Augsburg, Memminger Str. 6, D-8900 Augsburg,

Germany.

13Department

of Mathematics,

University

of Wisconsin,

Madison,

WI 53706.

AUBURN

1990 CONFERENCE

tive matrices

ON MATRIX

(or, equivalently,

singular

M-matrices)-as

relate the graph of such a matrix to the structure spectral

radius.

We consider

properties,

such

particular

interest

(combinatorial)

positivity

are known.

possible

relations

between

height

Many conditions

We give a solution

by the title. eigenspace

of the eigenspace

of the Jordan

of the (spectral)

level characteristic.

teristics

part

indicated

of the (generalized)

properties

as the corresponding is the relation

719

THEORY

and its spectral

normal

(Weyr)

We for its

form.

A topic

characteristic

of

to the

for the equality of the two charac-

of a long-standing

problem

to characterize

all

the two characteristics.

Combinatorial Aspects of Multilinear Algebra by Jose Dias Da Silva. l4 We report Other cerning

on some recent

combinatorial

aspects

the permanent

results

on the multilinearity

of multilinear

spectrum

algebra

and the spectrum

On the Equality of the Ordinary

partition

are mentioned, of matrices

Least-Squares

of a character.

namely,

associated

those con-

with graphs.

Estimator and the Best Linear

Unbiased Estimator I5 by George Styan.”

It vector

known

is well

in the general

that the ordinary Gauss-Markov

least-squares

matrix V can be the best linear unbiased the identity

matrix.

dispersion izations

of the OLSE

of these conditions,

Block

We also consider

similarity

It comes

single square

“This

Zyskind.

the situation

of the mean

definite

dispersion

even if V is not a multiple perspective,

the model matrix

is a generalization

matrix

X nor the C.

several simple character-

and a rather

complete

set of

when all or part of X is allowed to vary.

of the usual similarity

of square

matrices

a matrix pair (A, B),

A square,

rather

than a

appeared

in the

A. The first studies on this equivalence

talk is based

just published

We present

of

the development

Problems on Block Similarity

up when we consider

“Department Canada

Neither

along with various examples

“Department of Mathematics, 30 Piso, 1700 Lisboa, Portugal. Finland)

in a historical

to be BLU.

Rao, and (the late) George

Some Outstanding by Ion Zaballa.‘? fields.

(OLSE)

matrix V need be of full rank. The key results are due to T. W. Anderson,

Radhakrishna references.

(BLU) estimator

In this talk we discuss,

of the several conditions

estimator

linear model with nonnegative

Universidade de Lisboa, Rua E. Vasconcelos,

on a joint

paper

(with discussion

of Mathematics

relation

and

with

Simo

and rejoinder) Statistics,

Puntanen

in Amer. McGill

(University Statist.

University,

over

Bloco CL

of Tampere,

43:153-164

(1989).

Montreal,

Quebec,

H3A 2K6.

17Department Vitoria-Gasteiz,

of Mathematics, Spain.

Facultad

de Farmacia,

Universidad

de1 Pais Vasco,

01007

720

FRANK UHLIG, TIN-YAU TAM, AND DAVID CARLSON

control

theory literature

and dealt with linear time invariant

systems of the form

f+) =Ax(t) +h(t). In this context

the equivalence

we say that block-similarity pencils

is usually called feedback

or feedback

equivalence.

In matrix theory

is the strict equivalence

of singular

of the form [sZ, - A, - B].

During relation.

the last years a number

This talk presents

OF

CONTRIBUTED

Extreme

Points of a Set of Positive

by William Fairleigh George

N. Anderson,

Dickinson

have been working

on this equivalence

results as well as some open problems.

TALKS”

Semidefinite

Matrices

Jr., Department

University,

Teaneck,

of Mathematics NJ 07666

and Computer

Science,

(jointly with T. D. Morley and

E. Trapp)

Operators

by LeRoy

Preserving

B. Beasley,

UT 84322-3900 Inversion

of people

some of the obtained

3. LIST

Linear

equivalence

Idempotent Department

Matrices

over Fields

of Mathematics,

Utah State University,

Logan,

(jointly with N. J. Pullman)

of Infinite

Matrices

by Kerry G. Brock,

Department

of Mathematics,

Georgia

Institute

of Technology,

Atlanta, GA 30332-0160 On the Convergence by Jack 36849 The

B.

of the Iterative

Brown,

Proportional

Department

Fitting

Procedure

of Mathematics-FAT,

Auburn

University,

AL

fjointly with P. Chase and A. Pittinger)

pth Roots of a Matrix by Bryan 50011

Elementary

Cain,

Department

Divisors

and Ranked

by Keith L. Chavey, son, WI 53706 Circularity

Iowa

State

University,

Ames,

IA

of Mathematics,

to Matrix Compounds

University

of Wisconsin,

Madi-

(jointly with R. A. Brualdi) Chien,

111, Republic

Study of Complex

Posets with Applications

Department

of the Numerical

by Mao-Ting Taiwan

of Mathematics,

(jointly with T. Laffey)

Range Department

Symmetric

by Dipa Choudhury, 4501 N. Charles

of Mathematics,

Soochow

University,

Taipei,

of China Matrices

Department

Str., Baltimore,

of Mathematics,

Loyola

College

in Maryland,

MD 21210-2699

‘*Only those are listed here that are not represented as papers in this issue.

AUBURN

1990

CONFERENCE

Determinantal

Inequalities

ON MATRIX

for Positive

Cox rg, Department

by B. Ann

721

THEORY

Definite

Matrices

of Mathematics,

Brigham

Young

University,

Provo,

UT 84602 Diagonahzing

the Adaptive

by Jerome Park, Some

Dancis,

Interlacing Apartado

Results Leal 3008,

Idempotence, Atlanta, Modified

3000

Hermitian

Department

Coimbra,

University

Acyclic

Iterative

Methods

(jointly

and Sign-Pattern

New

Lyngby,

State

University,

of Spectral

McEachin,

Ordering

Systems Ohio

and Pseudo

Institute,

at Fort

Wayne,

of Generalized

of Mathematical Fort

Wayne,

Inverses

of Almost

malai

Tamil

India

6008002,

School

GA 30332-0160

Nadu,

the Resolution

University,

of Mathematics,

(jointly

n’Ekwunife

Systems of Denmark,

Sciences,

Indiana

University,

Definite

Matrices

Annamalai

University,

Anna-

P. 0.

with

Georgia

Institute

of Technology,

Atlanta,

W. L. Green)

of E. Dittert’s

Conjecture

Muoneke,

Department

Drawer

125, Prairie

of Mathematics, View,

Prairie

View

A and

M

TX 77446-0125

Pivot

Larry

Neal, City,

Permutation New

Department

of Mathematics,

TN 37614-0002

(jointly

with

East

Tennessee

State

University,

G. Poole)

Matrices

by Christopher Bedford

University

of Operators

by T. D. Morley,

Johnson

45701-

IN 46805-1499

of Mathematics,

The Rook’s

OH

Variation

Department

Department

Intervals

Athens,

Gyroconservative

The Technical

by A. R. Meenakshi, Nagar

University,

and L. Snyder)

Symmetrizable,

Mathematical

University

On Partial

Similar

Georgia

Denmark

Examples

Purdue

Linear

of Mathematics,

A. Gunawardena

Kliem,

by Raymond

by

de Coimbra,

Matrices

of Mathematics,

for Consistent

of Normalizable,

DK-2800

by

Universidade

Portugal

Department

Department with

by Wolfhard

Towards

College

GA 30303

The Dynamics

Order

of Maryland,

Matrices

of Mathematics,

Idempotence,

A. Eschenbach,

by S. K. Jain, 2979

for Indefinite

Duarte,

Qualitative

by Carolyn

Some

Method

of Mathematics,

MD 20742

by Antonio

Some

SOR Iterative

Department

W. College,

Norman,

Department

Egham,

Matrix

Results:

Generated

lgCurrent 36849-5307.

address:

Department

Surrey

TW20

of Mathematics, OEX,

by Two Statistical

of Mathematics-ACA,

Royal

Holloway

and

England

Applications

Auburn

University,

Auburn,

AL

722

FRANK UHLIG, by Patrick 76798-7328

L. Odell,

On the Commutant Vector Space by Christian 2800

Department

of a Linear

Mechanical

Baylor University, Operator

Department,

Danish

Waco,

on a Finite

TX

Complex

Engineering

Academy,

College

of William

and Mary,

College

of William

and Mary,

Lyngby, Denmark

A Review of Recent by Leiba

Results on Inertia-Preserving

Rodman,

Williamsburg, Inertia Theorems by Leiba

Department

for Matrix Polynomials

Rodman,

Approximations

Linear Maps

of Mathematics,

VA 23185

Williamsburg,

Department

of Mathematics,

VA 23185

of the Spectral Radius, Corresponding

Modulus of an Eigenvalue by Uriel Rothblum, University,

RUTCOR,

between

Ill-Conditioned by Burkhard Columbus,

Eigenvector,

for a Square Nonnegative

New Brunswick,

On a Relation

Robust

Rutgers

Schalfrin,

Irreducible

Center

and Second Largest

Matrix

for Operations

Research,

Rutgers

NJ 08903 Prediction

Linear Equation

and Accelerated

Iteration

Schemes

for

Systems

Department

of Geodetic

Science,

Ohio State University,

College

of William

OH 43210-1247

On Inversion of Symmetric by Tamir

Shalom,

Williamsburg, On Perturbations

Toeplitz Matrices

Department

of Mathematics,

VA 23187-8795

and Mary,

(jointly with L. Rodman)

and Transformations

by Nagwa Sherif, Department 13060

of Mathematics,

and Anticommulant Pommer,

TIN-YAU TAM, AND DAVID CARLSON

of Orthogonal

of Mathematics,

n-frames Kuwait University,

P.O.

Box 5969,

Safat, Kuwait

Parallel Sparse LU Decomposition by A. F.

van der

Amsterdam,

P.O.

Staapen,

Box 3003,

on a Mesh of Transputers Department 1003

MSE,

Koninkijke/Shell

AA Amsterdam,

The Netherlands

Laboratorium Cjointly with

R. Bisseling and J. G. van de Vorst) Matrices That Commute

with a Generalized

by Jeffrey L. Stuart, Department sippi, Hattiesburg, An Extension

Permutation

of Mathematics,

Matrix University

of Southern

Missis-

MS 39406-5045

to Outer Products

with a Variety of Applications

by Tien-Hsi Teng, 12D, Maple Mansion, Taikoo Shing, Hong Kong On a New Eigensolver by Anna Tsao, MD 20715-4300

for Real Diagonalizable

Supercomputing

Research

Matrices with Real Eigenvalues Center,

On the Structure of the Product of Boolean E-Functions by Gene Underwood, Department of Mathematics, UT 84322-3900

17100

Science

Drive,

Bowie,

Utah State University,

Logan,

(jointly with Steven Lederman)

AUBURN

1990 CONFERENCE

More on the Reverse-Order

ON MATRIX

Law

by Hans J. Werner,

Institut

Bonn, Adenauerallee

27-42,

Symmetric

Hankel

Operators:

4.

La Jolla, CA 92093,

SYNOPSES A DIXIE CUP:

consider

We

the

set of all

symmetric

for future

n

Universitlt

(jointly with J. W. Helton)

INTERVALS

x

real

n

convenience.)

OF MATRICES

and G. E. TRAPP23

symmetric

space with inner product

as an

n(n + 1)/2-

(A, B) = tr(AB)/2.

matrices

(The factor

We shall use the following

basis of the real

2 X 2 matrices:

A,=

Note that this is an orthonormal

[

;

_;

1 ,

basis with respect

If A and B are n x n real symmetric

matrix is termed

denote

n

the

set

of

positioe

n

x

symmetric

B - AEPos,.

The set Pos is a cone, i.e.,

X > 0. Thus

A 6 B is the sort of order

and

1 1

A,=

;

:,

to the above inner product.

matrices,

for all vectors x. A real symmetric

relativity:

Research,

Completions and Eigenstructure of Mathematics, University of California,

JR.,21 T. D. MORLEY,22

real inner product

of two is chosen

und Operations

Bonn 1, Germany

VISUALIZINGORDER

by W. N. ANDERSON,

dimensional

fur 6konomie D 5300

Minimal-Norm Department

by Hugo J. Woerdemans’ San Diego,

723

THEORY

we say that A < B if Ax

* x < Bx * x

positioe if A 2 0. We let Pos = Pos,

matrices;

then

A Q B if and only if

Pos + Pos C Pos, and hPos C Pos for all real that one usually

encounters

first in special

if we think of A + Pos as the light cone of future events to A, then A < B if

and only if B is in this cone. To visualize

the cone Pas,,

we simply note that

X = ArA, + AsA, + AsA, =

h

+

x,

h

A3

3

is positive

A,

-

h

1

if and only if Xi + hs > 0, X, - hz > 0, and A; - % - A” 2 0. Putting these

20Current address: Department burg, VA 23185.

of Mathematics,

21Department of Mathematics Teaneck, NJ 07666.

and Computer

22School of Mathematics, morley~cerc.uvu.uMet.edu. 23Department town, WV 26506;

of Computer

Georgia Science

[email protected].

Institute

College of William and Mary, WilliamsScience, of

and Statistics,

Fairleigh

Technology, West

Virginia

Dickinson Atlanta,

University, GA

University,

30332; Morgan-

724

FRANK

conditions

together,

UHLIG,

we see that the cone

TIN-YAU Pas,

TAM, AND DAVID

CARLSON

is simply the geometric

((A,, &> A,) I Al 2 0, AI 2 22 + A?,. If A is a positive symmetric matrix, we define the order inter&,

cone

denoted

{X =

[O,A], by

[O,A] = {X]O
study

generalized (with respect

where

to a suitable

A,, : Y-

e.g., A,, Either

the structure

of [O,A], we need

Y,

the matrix

matrix viewpoint

Ai2:

basis) write A as

YL++

etc.

Y,

viewpoint requires,

of shorted

(If we think of A as a linear where P is the orthogonal

or the linear-operator

however,

bases.)

where dagger denotes

to Y,

operator

operator

the Moore-Penrose the (1,l)

dimensions,

alternative

are needed.

definitions

then we define the parallel We refer the reader

can be expressed

pseudoinverse.

the same as A: then

We also need the concept

viewpoint

entry

then

onto

can be adopted.

Y. The

track of the

as

If A is invertible,

of Y(A)

or

we can

operator,

projection

that some care be taken in keeping

The shorted

partitioned

and A-’

is ([A-‘],,)-l.

is

In infinite

of paraZle2 sum. If A and B are two positive

matrices,

sum A : B as A : B = A(A + B)+B.

to [l, 21 for the basic facts about the shorted

operator

and the

sum.

If VE Pos is a convex set, i.e., an

concept

orthogonal

is simply PA restricted

various orthogonal

parallel

the

Schur complement. If A is a positive matrix, and Y is a subspace,

XV+

extreme point of V if whenever

then Y = Z = X. Intuitively,

(1 - A) VS

@? whenever

0 < X Q 1, then X is

X = hy + (1 - A)Z E $? for some Y and Z in Y,

the extreme

points of V are the corners

theorem

THEOREM

Let A be a positioe n x n real matrix. Then X E [O,A] is an extreme conditions are satisfwd:

1.

is from [3]; see also [7, 91 for related

of V.

The following

results.

point of [O,A] if and only if any of the foU&ing

(a) Then matrix X is the shorted operator of A to some subspace Y. (b) X : (A - X) = 0. (c) X = A1f2PA1/’ for some orthogonal projection P. (d) range(X)

tl range(A - X) = (0).

The above theorem

allows us to give a picture

Let +A denote the linear function

of [O,A]. Suppose that A is invertible.

@A : X - A1/‘XA1/‘. Note that a;’

= +*-I.

It is easy

to see that @A maps [O,I] to [O,A]. Thus the convex structure of [O,I] is the same as the convex

structure

of [O,A]. But the shorts of I are precisely

the orthogonal

projections

AUBURN

1990 CONFERENCE

FIG. 3.

(see [l]); indeed,

ON MATRIX

A polygonal approximation

to the set [O,I] for 2 X 2 matrices.

the fact that the orthogonal

projections

is known independently

of shorted

operators

projections

Ps =

Expanding

I 1 cos 8

sin 8

[COST

p = tI

extreme

We now consider

matrices)

we notice that the rank 1

co?e

sin e cos 8

sin e cos 8

sinse

1

sin@] =

and using trigonometric

A, + (sin20)Aa

we get

+ (cos20)Aa

points of [OJ] are a geometric

the intersection

of the two equal expressions

of two order intervals.

circle. The following result is due

(see [2]), we write A : B : C for either

(A : B) : C or A : (B : C).

24The figures in this paper from the second

identities,

I’

2

to T. Ando [4]. Since the parallel sum is associative

available

points of [O,I]

are of the form

out the matrix elements

Thus the rank-one

are the extreme

(see [8]).

To see this picture 24 (for the 2 x 2 symmetric orthogonal

725

THEORY

author

were produced via E-mail.

by MathematicaTM.

The computer

codes

are

726

FRANK UHLIG, TIN-YAU TAM, AND DAVID CARLSON THEOREM 2.

Let A and B be two positive n x n real matrices, and set V = ‘8”. B

= [O,A] n [O,B]. Then the matrix X E V is an extreme point of V ifand following

conditions

(a) range(A - X) fl range(B (b) (A - X):(B

- X):X

The above theorem

- X) Cl range(X)

either

allows us to explicitly

of V=

therefore

construct

semidefinite

the extreme

matrices.

points in the case

If either A or B is singular,

%“,B = [O,A] fl [O,B] re d uces to the one-dimensional

A < B or B < A, then

[O,A] n [O,B] reduces

that A and B are invertible,

the extreme

= (0).

= 0.

n = 2. Let A and B be 2 x 2 positive then the geometry

only ifany ofthe

are true:

and that neither

to the previous

case.

case. If Assuming

A < B nor B < A, we can classify

points of V by their rank:

Rank 0:

The zero matrix.

Rank 1:

For each angle 0, let Y0 Since ble.

be subspace

spanned by the vector

[cos 0, sin OIT.

YO(A) and Y@(B) are rank one with the same range, they are comparaThe

operators

min{ Y@(A), Y@(B)}

are precisely

the rank-one

extreme

points. Rank 2:

The set of 8’s for which min{ Y@(A), YO(B)} = YO(A) is an interval, For each such 0, let &, be chosen B}.

Then the rank-two

extreme

points are precisely

as 0 ranges over 0 E (@o, 0,).

At the extreme

extreme

in this

points

constructed

[0,, 8,].25

to solve max{ A( YO(A) + h[ A values of 0, i.e.,

manner

Y@(A)] <

Y@(A) + b[ A -

reduce

YO(A)]

B0 and 8,, the

to rank-one

extreme

points. The above theorem construction extreme

generalizes

points generalizes

modifications.

to a construction

The

of the rank-n

points.

We now work out a numerical write (x,

to the n x n case, with suitable

of the rank 2 extreme

y, z) for the matrix

example.

xA,

Let A,, A,, and A, be as before, and let us

+ yA, + zAa. Let A and B be the matrices

A = (2,1,0)

= 2A, + A, =

and B = (2, -

1,0)

= 2A, - A,=

Then [O,A] n [O,B] is, in terms of coordinates, (z - 2)2 > (y -

I

:,

the intersection

;

I

. of the cones

1)2 + z2,

(x-2)2~(y+1)2+z~, x2 2 y2 + 22, x > 0.

25Via the correspondence 0 ++eie, we consider angles as points on the unit circle; thus an interval means an interval of the unit circle.

AUBURN

1990CONFERENCE

FIG. 4.

ON MATRIX

[O,A] rl

This is easily solved. The rank-one

and the same with the y-coordinate points is the piece

“triple

the points

727

[O,B], top transparent.

extreme

points are pieces

negated.

The “upper

of the two ellipses

handle”

of rank-two extreme

of the hyperbola y = 0,

between

THEORY

z = $,

(r

- 2)” - 22 = I

y = 0, z = + f.

The ellipses

and hyperbola

meet

at the

points”

We close with a picture

of

[O,A] fl[O,B].

REFERENCES

1 W. N. Anderson, Jr., Shorted operators, SIAM J. Appl. Math. 20:520-525 (1971). 2 W. N. Anderson and G. E. Trapp, Shorted operators II, SIAM J. Appl. Math. 28:60-71

(1975).

728 3

FRANK N. Anderson

W.

and G.

semidefinite operators,

E.

UHLIC, Trapp,

TIN-YAU The

TAM, AND DAVID

extreme

Linear Algebra A&.

points

106:209-217

4

T. Ando, unpublished communication.

5

G. P. Barker, The lattice of faces of a finite-dimensional 7:71-82

6

S.

L.

Eriksson

and H.

Math.

7

W. L. Green

8

Gert

9

of a set of positive

(1988). cone, Linear Algebra Appl.

(1973).

addition,

Sot.

CARLSON

Monogr.

Extreme

C*-Algebras

theoretic

of Operators

approach

points of order intervals,

and Their Automorphism

14, ISBN 0076-0560,

Shorts

A potential

to parallel

(1986).

and T. D. Morley,

K. Pedersen,

Pekerev,

Leutwiler,

Ann. 274:301-317

Academic,

and

to appear. London

Groups,

Math.

New York, 1979.

Some Extremal

Odessa,

Problems,

1989,

to

appear. 10

A. Shapiro,

Extremal

Appl. 67:7-18 11

S. Wolfram, Redwood

INDUCED

problems

on the set of nonnegative

matrices,

Linear Algebra

(1985). MathematicaTM-A

City, Cahf.,

System for Doing Mathematics,

Addison-Wesley,

1988.

NORM OF THE SCHUR MULTIPLIER OPERATOR

by T. ANDOz6 and K. 0KUBOz7 Let

M,, be the vector

[bij] E M,, denote product

An B =

space of all n square complex

by A0 B their

Schur

matrices.

(or Hadumard)

product,

For A = [aij], B = that is, the entrywise

[ aijbij]. Then each A EM, gives rise to a linear operator

called the Schur multiplier norm of S, with respect

operator,

to a norm 1) -

defined

by S,(X)

S, on M,,

= A0 X (X E M,). The induced

)(on M, is defined by 11S,\( := sup 11 x 11 Q111A0 X (1.

A familiar and useful norm on M, is the spectral norm:

another

where

useful norm is the numerical

radius:

w(A)

= s;p

)I .

‘(,;‘;(,;”

\I and ( * ) . ) denote the Euclidean

,

norm and the standard

inner product,

26Division of Applied Mathematics, Research Institute of Applied Electricity, Hokkaido University, Sapporo 060, Japan. 27Mathematics Laboratory, Sapporo College, Hokkaido University of Education, Sapporo 002, Japan.

AUBURN 1990 CONFERENCE

729

ON MATRIX THEORY

respectively. We denote the induced norms of SA with respect to the spectral norm and the numerical radius by 1)S,ll_ and 11S,ll w, respectively. It is mentioned in [2] that Haagerup succeeded in determining IIS, II_ in the following form. For A = [aij] EM,,

HAACERUP’S THEOREM.

the following

assertbns

are mutually

equivalent : A = B*C such that

(ii) A admits a factorization

B*BoI&Z

C*CoZ
and

where Z is the identity (or unit) matrix. (iii) There are vectors 2, iji E G” (i = 1,2, . , . , n) such that 1,2, . . . , n) and aij

=

(iv) There are 0 Q R,, R, E M,,

(Zjliji)

(i,j

= 1,&n).

such that

R,oZ
and

In this note we are going to give the characterizations derive Haagerop’s theorem for it as a consequence. THEOREM.

(9, (ii),

II?ziI(, I( GilI Q 1 (i =

R,oZ,
of the norm I(S,II w, and to

For A = [ aij] E M,, the following assertions are mutually equivalent:

II%ll, G 1. A admits a fwtorieation

A =’ B*WB such that B*B-Z&Z

(iii)w There

are vectors

II$1) Q 1 (i = 1,2,.

?ieO”

(i = 1,2,.

. . , n) and a matrix

. . , n), W*W Q I, and aU = (WGjlZi)

(iv), There is 0 < REM,

W*WQZ.

and

(i = 1,2,.

..,n).

such that

[

: 1 A R

20

and

RoZgZ.

WE M,, such

that

730

FRANK

UHLIC,

We can derive from the theorem Schur multiplier (I)

llS,ll,

TIN-YAU

the following

TAM, AND DAVID

properties

CARLSON

of the induced

norms of

operators: C IIS,ll,

Q 2llS,ll0,

(AeM,).

(2) llS,ll, Q II AlI, (AEM,). (3) I]S,]] oD= I(S,]] w if A is Hermitian. (4)

]IS,I] m = ]]S,]] w = 1 if A is unitary.

(5)

llS,ll, G llSI~~+~~*~ llw (AEWJ

(6)

](S,]],.(]SIA~)]u;if

Aisnormal.

The details will he published

in [l].

REFERENCES 1

T. Ando and K. Okubo, Induced

2

Appl., to appear. V. I. Paulsen, Completely Bounded Maps and Dilation, Math.

146, Longman,

Essex,

norms of Schur multiplier

U.K.,

operator, Pitman

Linear Algebra

Research

Notes in

1986.

AN APPLICATION OF VALUATION THEORY TO THE CONSTRUCTION OF RECTANGULAR MATRIX FUNCTIONS by

1.

JOSEPH A. BALL and MAREK RAKOWSK12’ Zntroduction The problem

functions

we address

with a given

complex

plane C,2

B

formed

m

X n

by functions

rational

m x n rational

matrix

and

function

WmXn(u)

WE gnx”,

analytic

functions.

do there

with m

x

exist rational

subset

matrix

c of the extended

exist, how to find one? and by B(u)

the subring

will denote the g-vector on u. Bnx” B? mXn (u) will denote the %?(o)-module

which are analytic

algebraically find

When

over a proper

the field of scalar rational functions

matrix functions

can be characterized Bmx”

structure

If such functions

We will denote by 9 of

here is as follows.

zero-pole

Nw, fiwe

over

BmX”(u),

of of

on 0. The zero and pole structure

in terms of coprime

n matrices

space space

B

factorizations. and

g(u),

Namely, identify

respectively.

DWs kJfflx”(u), and fiw~

Given a gmxm(u)

such that (i) det Dw # 0 and ew,

NW are right coprime

(ii) det D, # 0 and D,, NW are left coprime (iii) W = N,D&’ = fi&‘6w. In this notation, functions

[over g(u)], [over g(u)],

W, HE 92 lnxn have the same left zero structure if NH = NwQ W and H have the same right pole structure

for some Q such that Q, Q- ’ E %! “x”(u).

28Department of Mathematics, Virginia Tech, Blacksburg, VA 24061.

AUBURN

1990 CONFERENCE

if fin = Rfiw

for some

THEORY

731

R such that R, R- ’ E .%’“.“(u)

(see [2] for the regular case,

that is, when the functions vanish

ON MATRIX

involved

are square

and

determinants

which

do not

identically).

A concept null-pole

which

refines

subspace

the notion

W over

of

WS? nx1( a). For the regular

u,

case,

l] (see [2] for a comprehensive matrix functions have been Clearly,

functions

of zero

i.e.

null-pole

in S? mx”

subspaces

the same

subspaces For matrix

over some

functions

methods

right

pole

have played

u is that

9?‘nx1,

of a

yO(W) =

a key role in [7, 8, 3, rational

over u have the

over u. Indeed, if W, HE 9 mx”, then Q such that

left zero

Y,(W)

=

(see Proposition

Q, Q- ’ E W mxn(u)

and HE W mx”ff, then W = HQ for some

structure

S’rm pl e examples over

u may

have

show

that

different

functions null-pole

u. time

the

analytic

has been

for finding

been

and

over of

Null-pole subspaces of nonregular

Q E S? “*x”“( u) if and only if J$( W) C Y,(H). with

structure

with the same null-pole subspace

if WE .%’n’x”u

generally,

and pole

W(u)-submodule

in [6].

investigated

Yo( H) if and only if W = HQ for some 1.1 in [4]). More

the

exposition).

same right pole and left zero structure

have

have

a regular

available.

description

known

One

rational

subspaces

matrix

function

with

there). a given

of regular

rational

Also, constructive null-pole

subspace

to the general case is via embedding in the regular

approach

case. A tool for such embedding

of null-pole

(see [2] and the references

is provided by valuation theory (see [9], [12], and, for

the general theory [ll]).

2.

Orthogonality Let

from

in 2 n

h be a point

of the

where

r) is the unique

integer

r(z)

with

F analytic

complex

stronger

triangle

such

plane.

We define a function

( * 1z=h

=

that

1

(Z-X)“?(z)

if

XeC,

z-V?(z)

if

X=03

at h. The function

and nonzero

1n ) z=h < 1 for every

integer

n, the valuation

) . ) ==Ais a real valuation of W . Since ) * 1z=x is non-Archimedean and the

inequality

I r1 holds

extended

S? into the set of real numbers by putting

for all rr, rs E 9.

+ r2

I z=x Q m=4 I t-1 I z=h, I f-2 I z=hl

732

FRANK UHLIG,

TIN-YAU TAM, AND DAVID CARLSON

Let n be a positive integer and let he C,. the product

We define a function

)] .

1)z=x on W”,

of n copies of ~8, by putting

ll(rIpf-2,.

..,

rn)Ilz=x= m={lrll.=~, lr21z=h....,ImldJ

In this way 9? ” becomes a normed vector space over the real valued field ( 1, Following

the definition of orthogonality

shall say that two subspaces

in a non-Archimedean

A and Q of W * are orthogonal

I * ( ==,,_).

normed

at XE C,

space, we

if

IIx + Y II==h= m={ IIr IIdI II Y IIz=h} for each

r E A, y E Q. We shall say that A and Q are orthogonal

orthogonal

at every point of u. We shall say that vectors

at

(respectively,

XE C,

on a subset

{xi:i#j)areorthogonalat

X(on

o of C,)

u)forj=

W” and AEC,,

Ifhisasubspaceof

on u C C,

if they are

rr, ~2,. . . , rk are orthogonal

if the spans over

W of { xj}

and

1,2,...,k. we will denote by A(A) the set of values at X

of those functions in A which are analytic at A. Plainly, A(A) is a subspace of C”. The space A(A) can be characterized coefficients

equivalently

as the linear span over C of the leading

in the Laurent expansions at A of the functions in A. Using this notation, we

can characterize

orthogonality

in 8 ” as follows (see Proposition

2.3 in [6]).

Let A and Q be two subspaces of W “, and let A E C,.

PROPOSITION2.1.

Then A

and R are orthogonal at A if and only if A(A) fl R(A) = (0). It follows from the definition of orthogonality at a single point AEC, subspaces

of

9”.

necessarily

that two subspaces of W ” orthogonal

have the trivial intersection.

We say that the subspace

Q is an orthogonal

subspace A in (C, a) if A and n are orthogonal the next proposition,

see Proposition

PROPOSITION2.2. proper

subset

u of C,.

Let A, 0, and C be of the

complement

on o and A + Q = X. For the proof of

2.5 in [5].

Let A and Q be subspaces Then Q has an extenskm

of 92 n which are orthogonal

on a

complement

of A in

c C C. Also, Wax”

will

to an orthogonal

(W”, 0). 3.

Analytic Description of Pole and Zero Structure For notational convenience, we will assume hereafter

denote m x n rational matrix functions analytic in C-1

u and vanishing at infinity. By

partial fraction expansions, we have an exact sequence

where all vector spaces are over C. We will use the symbol I’,= for the projection (3.1) for arbitrary positive integers

m and n.

as in

AUBURN

1990 CONFERENCE

If WEWmxn, C-linear

space

ON MATRIX

we define

P,c( yO( W)).

the

733

THEORY

right pole structure

of W over

8,

For th e p roof of the next proposition,

o to be the

see Proposition

4.1

in [6]. PROPOSITION3.1.

8,

= {C,(z The

Plainly,

pair (C,,

A,) in Proposition

denote

the left annihilator

is any matrix polynomial Let

3.1

is called

a right pole pair

such that

for W over

(I.

C u.

u( A,)

Let W”

There exists an obseruable pair of matrices (C,, A,) : x a constant vector}.

- A,)-‘x

of W in g1 xm. A

Zdt kernel polynomial of W

whose rows form a minimal polynomial

XE u be a zero

of W. Choose

(a lxm,X). One can show (see Proposition of matrices (A, Bx) such that

P,c({~En,:4WcWlx”(c)})

an orthogonal

basis (see [9]) for W”.

complement

Ax of W”l

4.3 in [S]) that there exists a controllable

= (~(.a

- A,)-‘Bx:

in pair

z isaconstantvector}.

If x,, &, . . . , X, are the zeros of W in u, the pair

diag(Ax,,A&

,...,

Aht),[B$

BE

a..

B<’ 1)

is called a lej? null pair for W over u. PROPOSITION3.2. One canjnd such thatPO~({q5~A:~W~W1Xn(u)}) For the proof of Proposition Proposition

an orthogonal complement A of W” in ( glx”‘, = {x(z -A,)-‘B,: raconstantuector}. 3.2,

see Lemma

3.16

3.2 a subspace associated with the pair (At,

If (C,,

A,),

(Al.

a)

in [5]. We will call A as in

Br).

Br), and P, are as above, there exists (see Theorems I’ such that rA, - ATr = BrC, and

2.7, 3.1, and

3.3 in [S]) a unique matrix

%(W)

=kerP,n

C,(z-A,)-‘x+h(z):+~C”‘~~,h~W~~~(u),

and c

Res,,,O

(z - A,)-‘B,h(

z) = TX

Z&l A triple ((C,, A,), (AC, Br), r) is called a I& u-spectral triple of W.

4.

Construction A construction

triple

of a Function

is given in [lo].

function Theorem

with prescribed 6.2 in [6].

with the Prescribed

of a regular rational matrix function Below,

we utilize

left u-spectral

Null-Pole

Subspace

with a prescribed

it to construct

a rectangular

triple and kernel polynomial.

left u-spectral rational

matrix

For the proof, see

734

FRANK THEOREM 4.1.

given. left

Let

matrices

Then there exists a rational a-spectral

conditions

0)

triple

and

P,

UHLIG, C,,

A,,

TIN-YAU Ay, B,, r

matrirfunction

TAM, AND DAVID and

with

polynomial

as a left kernel

a matrix

7, = ((C,,

polynomial

A,),

g and

CARLSON P,

be

( Ay, Bc), I’) as a

only

if the following

hold:

the pair

(C,,

A,)

(ii) t& pair (At.

is obseruable

and u( A,)

C u,

and u( Ar) C u,

Br) is controllable

at infinity,

(iii) P, has no zeros in C, and its rows are orthogonal

(iv) the rational matrix function PK( z)C,( z - A,)- ’ is analytic on C, h,} and lk is the identity matrix with the same number {A,,&,..., (v) $a(A&= rows as P,, the pair diag( A(, X,1,,

. . . , A&),

[ B;

P(A1)’

...

PN’]

of

T,

is controllable, (vi) I’A,

- A$

= B< C,.

If rS and P, satisfy conditions 7, as a left u-spectral

triple

(i)-(vi)

in Theorem

and P, as a left kernel

4.1, a function polynomial

WE 9?“x”

with

can be constructed

as

follows.

step 1.

Find a regular Choose

Step 2.

rational

matrix function

a Smith-McMillan

Let Y be the largest largest geometric of the geometric

factorization

geometric

multiplicity multiplicity

H with 7, as a left u-spectral

multiplicity

of a pole of H in u, let /J be the

of a zero of H in u, and let q be the largest sum of a pole and the geometric

of H at any single point of u. Let di denote

multiplicity

9ij

manic polyno-

has a zero at a point XE u of order k then

zero at h of order k. Define

=

an m x r) matrix polynomial

1

if

‘i=jQv-p,

Pi

if

v-r
if

i=j-q+m>m-CL,

1

i 0

of a zero

the ith diagonal entry of D. For

v, let pi be the minimal-degree

i=v--+l,q-p+2,..., mial such that if d,_s+i

triple.

EDF of H, and put W, = ED.

pidi has a

Q = [qij] by

otherwise,

and put W, = W,Q. Step 3.

Find a subspace

E associated

with the pair (A,,

W, onto Ker P along the subspace Step 4.

Find an orthogonal U u( AI)),

complement

and a basis

{ul, 02,.

of W m xl

Bt). Project

annihilated

every column of

by E to get W,.

A of the column span of W, in (Ker P, u( A,) . . , ul} for A such that the function [ul up

. . * ul] has neither zeros nor poles on u( A,) U u( A(). Put W, = [Ws 01 ~2 . . . 011. Step 5.

Multiply

W, on the right by a regular

poles or zeros on u( A,)

poles nor zeros in u \ (u( A,) Particular

rational

matrix

U u( Ar), so that the resulting

steps of this construction

function

function

Q, without

W has neither

U a( A()). are illustrated

in [5] with specific

examples.

AUBURN

1990 CONFERENCE

ON MATRIX

735

THEORY

REFERENCES 1

J. A. Ball,

N. Cohen,

improper

rational

and A. C. M. Ran, Inverse

matrix functions,

Matrix Functions (I. Gohberg, J. A. Ball, I. Gohberg,

3

J. A. Ball, I. Gohberg,

and L. Rodman,

problems

matrix functions,

Sot., 4

and L. Rodman,

Boston,

for rational

zero-pole

Two-sided

OT (I. Gohberg,

functions,

submitted

McMillan structure

Birkhbser,

Null-pole

subspaces

G. D. Forney, I. Gohberg

Jr.,

matrix functions

to appear.

of nonregular

rational

matrix

rational

matrix

problems

for rational

matrix

problems

for rational

matrix

of nonregular

spectral

inverse

spectral

Zntegral Equations Operator Theuy 10:349-415

multivariable functions

rational

(1987).

Linear Algebra Appl. 86:237-282

functions,

10

Math.

for publication.

J. A. Ball and A. C. M. Ran, Local

9

interpolation

to appear.

J. A. Ball and A. C. M. Ran, Global inverse functions,

degree

Linear Algebra Appl.,

Zero-pole

Ed.),

J. A. Ball and M. Rakowski,

8

Lagrange-Sylvester

in Proc. Symp. Pure Math., Amer.

Minimal

structure,

J. A. Ball and M. Rakowski, functions,

7

1988, pp. 123-175.

Znterpolatixm of Rational Matrix Functions,

to appear.

with prescribed

6

for regular

1990.

J. A. Ball and M. Rakowski,

5

problems

Ed.), OT 33, BirkhHuser, Boston,

2

OT 45, Birkhguser,

spectral

in Topics in Znterpolatiun Theory of Rational

Minimal

linear systems,

bases of rational

spaces,

SZAMJ. Control 13(3):493-520

and M. A. Kaashoek,

and minimal

vector

(1987).

divisibility,

An inverse

spectral

with applications

to

(May 1975).

problem

for rational

matrix

Zntegral Equations Operator Theory 10:437-465

(1987). 11

4. F. Monna,

12

G. Verghese

Analyse Non-archimdienne, and T. Kailath,

Rational

Springer-Verlag,

Matrix Structure,

New York, 1970.

in Proceedings of the 18th

ZEEE Conference on Decision and Control, Fort Lauderdale, FL,

Vol.

2, Wiley, New

York, 1970, pp. 1008-1012.

CLASSES OF STABLE MATRICES by ABRAHAM

BERMAN2’a

The inertia, i+(A)

3o and DAFNA

in A, of a square

is the number

of pure imaginary

matrix

of eigenvalues

eigenvalues

SHASHA2’

A is a triple

(i+(A),

io( A), i_(A)),

where

of A in the right open half plane, io( A) the number

of A, and i_(A)

the number

of eigenvalues

in the left

open half plane. A matrix

A E R”* ”

is stable for every positive

diagonal matrix (a diagonal matrix whose diagonal entries

“Department

of Mathematics,

IS (positive) stable if i+(A) = n. A is D-stable if AD

Technion-Israel

Institute

of Technology,

Israel. 30Research

Supported

by the M. and M. Bank Research

Fund.

are

Haifa 32000,

736

FRANK

positive) matrix

D.

A is (Lyapwwv)

D such that

diagonally

diagonally

are D-stable.

in differential

5, 4, 8, 91. A real matrix

study these

preserving

are D-stable.

[semildefinite.

Stable,

equations,

TAM, AND DAVID

if there

@ni]stable

A is inertia-preseruing

in AD = in D. We matrices

TIN-YAU

AD + DAT is positive

stable matrices

arise in problems

UHLIG,

D-stable,

ecology,

chemistry,

is not true, as shown by the following

EXAMPLE 1.

Let

A=[;

Here

A is D-stable

[6] but not inertia

[2], that

diagonal

because,

e.g. [2,

matrix

clearly,

D,

inertia

example.

-5;].

preserving,

since for D = diag{ - 1,3, - 1) the

AD is also stable.

A subclass

of the

[l]: matrices

matrices

D-stable

matrices

A such that

only if D is positive. Manus

1

diagonal

e.g.

and economics,

invertible

This is of interest

The converse

matrix

exists a positive It is known,

and diagonally stable matrices

if for every

matrices.

CARLSON

is the

class

AD is stable,

where

Again, it is clear that inertia

of

Arrow-McManus

D-stable

D is a diagonal matrix,

preserving

matrices

if and

are Arrow-Mc-

D-stable.

Example D-stable.

1 is also an example

Observe

A real matrix preserving

D-stable.

D-stable A is

invertible)

and diagonally

strongly

matrix

inertia

semistable,

preserving

submatrices

that

real

but not strongly inertia preserving.

diagonal

A is strongly

are inertia preserving.

The matrix

is inertia preserving

but not inertia preserving.

if for every

D, in AD = in D. Observe

if and only if all its principal

EXAMPLE 3.

[7]. In fact, we show

Let

A is Arrow-McManus

necessarily

matrix which is not Arrow-McManus

semistable

Let A be a diagonally semistable matrix. Then A is D-stable if and only

THEOREM 1.

if it is Arrow-McManus EXAMPLE 2.

of a D-stable

that it is not diagonally

(not

inertia

AUBURN 1990 CONFERENCE

737

ON MATRIX THEORY

An important class of inertia preserving (and even strongly matrices is the class of diagonally stable matrices.

inertia

preserving)

A diagonally stable matrix is strongly inertia preserving.

THEOREMS.

Clearly, if A is inertia preserving, then io( A) = 0. The following theorem terizes the real square matrices A such that io( A) = 0. Recall [7] that B,(A) denotes the cone B,,(A)

(a) (b) (c) (d)

= (BEPSD[(BA)~~=O,~=I,...,~)

The folhving

THEOREMS.

charac-

properties

of A E R”, n are equivalent:

io( A) = 0.

BEPSD, BA+ATB=O * B=O. BcPSD,BA+ATB=0,rankB<2 t+ B=O. B EB,,( A), BA + ATB = 0, o B = 0. (e) BEB,,(A), BA+ATB=0,rankB<2 o B=O. For matrices which have no pure imaginary eigenvalues THEOREM

4.

one has

Suppose io( A) = 0. Then

(a) in A = in AD for every positioe diagonal matrix D if and only if

(b) io( AD) = 0 fw every positive diagonal matrix D. Given that a matrix A is stable, we obtain D-stability as a corollary of Theorems 3 and 4. Let A be a real stable matrix.

COROLLARYl.

every B E Bo( A) of rank

4.

a simple

characterization

of

Then A is D-stable ifand only iffor

< 2 and for every positive diagonal matrix D, BAD+DATB=O

EXAMPLE

here

o

B=O.

The matrix

is stable. To show that it is D-stable we observe that B E B,( A) if and only if it is of the form a 0 ob’

B=

-4c

0

-4c 0, C

a, b, c > 0, I

a > 16~.

738

FRANK

UHLIG,

TIN-YAU

TAM, AND DAVID

CARLSON

In this case

4c-

a

16c

a0 0

and if BAD is skew-symmetric

for an invertible

are equal to zero, so BA = 0, which implies Since

inertia

matrices,

preserving

matrices

it is natural to ask whether

preserving.

Note that Example

two classes coincide

THEOREM

are

matrix

D-stable

and include

diagonally

this question

acyclic

c, a, and b

the diagonally

semistable

matrices

in the negative.

stable

are inertia

However,

the

matrices.

Let A be an acyclic irreducible

5.

D, then necessarily

that B = 0.

D-stable

2 answers

for irreducible

I I

matrix.

Then A is D-stable if and only

ifA is inertia presweruing. If the irreducible

2.

COROLLARY preserving

if and only if it

The following property

THEOREM

components

of A are acyclic,

of diagonally

semistable

matrices

is of great importance.

Let A E R”,” be a diagonally semistable matrix,

6.

invertible diagonal matrix.

then A is inertia

is D-stable.

and let F be an

The following are equivalent:

(a) in AF = in F (b) io( AF) = 0. (c) BAF + FATB # 0 for every nonzero B E B,( A) s.t. rank B < 2. (d) BAF + FATB # 0 for every nonzero B E B,( A). Observe general,

that

the

implication

as shown by Example

COROLLARY

(b),(c),(d)

i* (a) in Theorem

Let A be a diagonally semistable

3.

6 does

not hold

in

1.

matrix.

The following are equiva-

lent: (a) A is inertia preserving. (b) io( AF) = 0 for every real invertible diagonal matrix F. (c)

BAF + FATB # 0 for

every

nonzero

BE Bo( A) such

that rank B Q 2, and fw

every real diagonal invertible matrix F. (d) BAF + FATB f 0 fat every nonzero B E B,( A), and for every real diagonal invertible matrix F. We conclude

QUESTIONS.

by asking the following

questions:

Is every strongly inertia preserving

matrix diagonally

stable?

AUBURN 1990 CONFERENCE QUESTION 2.

Is every irreducible

The proofs and additional

739

ON MATRIX THEORY inertia preserving

matrix diagonally semistable?

examples will appear in [3].

We wish to thank Professor Daniel Hershkowitz for many suggestions which improved the paper.

REFERENCES K. J. Arrow and M. McManus, A note on dynamic stability, Econometrica 26 (1958). G. P. Barker, A. Berman, and R. J. Plemmons, Positive diagonal solutions to the Lyapunov equations, Linear and Multilinear Algebra 5:249-256 (1978). A. Berman and D. Shasha, Inertia preserving matrices, SZAM /. of Matrix Anal. Appl., to appear. B. L. Clarke, D-stability and chemical reaction networks, presented at the Combinatorial Matrix Analysis Conference, Victoria, 1987. G. W. Cross, Three types of matrix stability, Linear Algebra Appl. 20:253-263 (1978). D. J. HartRel, Concerning the interior of the D-stable matrices, Linear Algebra Appl. 30:201-207 (1980). D. Hershkowitz and D. Shasha, Cones of real positive semidefinite matrices associated with matrix stability, Linear and Mu&linear Algebra 23:165-181 (1988). J. F. B. M. Kraaijevanger, A characterization of Lyapunov diagonal stability using Hadamard products, Linear Algebra Appl., submitted for publication. J. F. B. M. Kraaijevanger and J. Schneid, On the unique solvability of the Runge-Kutta equations, Numer. Math., submitted for publication.

CONJUGATE-SUBSPACE DECOMPOSITION AND ITS APPLICATION SOLVING LINEAR SYSTEMS WITH MANY RIGHT-HAND SIDES by MEI-QIN

IN

CHEN31

The inspiration for the conjugate subspace decomposition (CSD) has come from the updated conjugate subspace method for solving unconstrained minimization problems whose objective functions are twice differentiable, as first introduced in [5]. We assume throughout this paper that A is an n x n symmetric positive definite matrix. The CSD can be described as follows [3].

31Department of Mathematics/Computer

Science,

The Citadel, Charleston,

SC 29409.

740

FRANK THEOREM 1.

UHLIG,

TIN-YAU

Let R” be a conjugate sum of T,,

TAM, AND DAVID

CARLSON

. . , T,,, with respect to A, that is,

(1) R” = T, + *.. +T,; (2)

rfArj

= 0 for xi E Ti, rj E Tj, and i # j.

+ * * . +I(~)

Then x* = 8) XE ‘1;:. The

following

equations

is an algorithm

based

on the CSD

for solving

systems

of linear

Ax = 6. (CSD).

ALGORITHM matrices

solves Ax = b, x E R”, where each xci) E Ti solves Ax = b,

Let R” be the conjugate

sum of T, . . . T,, and Zi be the basis T,, respectively. Let nj = rank Ti, where n, + .- . + n, = n. x* of Ax = b can be computed as follows:

of the subspaces

Then the solution (1) Compute

bj = Zfb, i = 1,. . . , m.

(2) Solve Z:AZ, fci) = bi for ??ci)E R”i, i = 1, . . . , m. I ,..., m. (3) Evaluate x (i)=Z,&i),i= (4) Set r* = x(l) +

*. * +x(@.

The

allows

CSD

strategy

ni-dimensional problems.

subproblems

Some questions,

scale problems

however,

ment in efficiency?

subspaces

a class of problems

x(t)

numbers. when

need to be answered:

The

the

b(t)

(CC)

(BCG)

multiprocessor

In general,

time; for example,

the vector

this case, the BCG algorithms

algorithms

sequential

in

definite, of

P)

the right-hand-side

vector

t, and S is a set of discrete

is suitable

for solving

and without

an explicit

[7] are adaptable

for all

b

real

(P), especially form.

If the

t in S, then the

and work efficiently

on a

the b(t) may not be available at the same

are no longer adaptable.

If the vectors

then the Lanczos-Galerkin

r(tj) for j < i. In b(t) do not change

projection

error bound of the approximation

If this is not the case, then computing

such problems.

the solution

at step (2) to find each solution

It has a high parallelism

of the CSD,

of the CG.

t E S,

procedure

of the solution is

of (P) is completely

t. The algorithm combining the CSD strategy with the CG algorithm, that

is, applying the CG algorithm because

subspaces?

b(tj) may depend on previous solutions

in [9], and its theoretical

given in [lo].

(1) For what type of large

simultaneously

however,

too much from one solution to another, is suggested

method sparse,

are available

block-conjugate-gradient machine.

positive

functions

A in (P) is large,

vectors

m

with less cost and with great improve-

for each

= b(t)

conjugate-gradient

right-hand-side

by solving

for solving large scale

of the form

that are real-valued

matrix

problem

(2) For a given linear system of equations,

be chosen

E R”, A E R “xn is symmetric

has components

n-dimensional

(3) What is the cost of forming these conjugate

Ax(t) where

an

and has great potential

should the CSD be adapted?

how should the conjugate Consider

us to solve

in parallel

and it does not require

in computation the explicit

x-ci), is suitable for solving for solving

(P) for each

form of the matrix

A, because

t

AUBURN

1990 CONFERENCE

ON MATRIX

Notice that the basis matrices set S, the cost of forming So it is important efficiently. the

The performance

spectrum

unfavorable

of

distributions methods matrix

A in the

spectral

well-separated,

Zi need to be formed only once. With many t in the

Ti can be compensated

to choose

of the CG algorithm presence

is very sensitive

of roundoff

errors

In practice,

can be found, for instance, parameters

[l,

the spectrum

when the relaxed

w close

of solving (P).

solves each subproblem to the distribution

6, 8, 9, 11,

for the CC is when the spectrum

eigenvalues.

with varied

by the overall efficiency

Ti’s such that the CG algorithm

distribution

large

741

THEORY

has a few distinct,

of A in (P) with such

(block) incomplete

to 1 are adapted

of

121. One

Cholesky

as preconditioners

to the

A which results from discretizing

-Au=ffor(r,y)EOand

u=Ofor(x,y)~afi,

by linear

finite

examples

with different

element

approximations choices

where

over a uniform

the relation

between

following are two theorems projections approximate

on the

eigenspace

subspaces

the spectrum

triangulation.

Some

one

of the

for a given A, it is necessary

of A and its projections

[3] which describe

Ti’s when

isosceles

x (0,l)

of w are given in [2].

In order to form a set of proper conjugate investigate

0 = (0,l)

the relation subspaces

of the spectrum

contains

to

on the Ti’s. The of A and its

an eigenspace

or an

of A.

THEOREM 2. Let (X,,qJ,i = l,...,m, be the eigenvalue-eigenvector pairs of A, and let Q = [ql. . . . , q,,J. If span Q E Ti f or some i, then span Q I ?; for j + i. lf Zj’Zj = I,,, for each j, and (hi, qi) are extreme eigenpairs of A, then X,, . , . , A,,,are not in the spectrum of A on Tj for j f i.If in addition ZfZj = 0, then the former statement is aLso true for intermediate eigenpairs.

. . ,4,1, w/m-e 1 C ml3 m2

THEOREM 3. Let iT, = [&, . . . , g,,,], & = [q_,,+l,. Q n, and let pk = i$ A& /&& for each k be such that

I pk

-

‘k

I = “$

I Pk

-

$1,

k=

l,...,m,,

1~

k=

and I &t-k

If span[&,

-

h-k

I =

jF;yk

I /hi-k

-

h-k

l,...m,.

G2] E Ti for some i, then fm j # i,

[

k=.,$,,,,,

(:)‘1’:.

742

FRANK

Furthermore, in the inter&

UHLIG,

TIN-YAU

TAM, AND DAVID

CARLSON

if ZiZj = l,,, fm each j, then the spectrum of A on q fw j # i is contained [a, b], where

a= &,+1 -

k$l (xm,+1 - Ak)Ck(Ek)>

where the functions ck are such that

ck(o) = 0

Theorem

2 and Theorem

the CC algorithm, of the conjugate

to (i) choose a subspace

of A whose corresponding proper algorithm

eigenvalues

Ax(t)

ri are normalized = b(t), for some

(1) The vectors -

appear in the spectrum

of A on the

conjugate

vectors

we need

eigenspace

to the CG, so that the CG

subspaces

efficiently;

(ii) find a

on T1.

to form such a subspace

residual

more efficient,

or an approximate

are not favorable

to solve the subproblem

to

of A on one

an eigenspace

of A on the other

One of the natural choices where

which are not favorable

in the spectrum

In order to make the CG algorithm

T, which contains

solves the subproblems

= 0

are embedded

then they no longer

subspaces.

&k)

&k

3 show that if the eigenvalues

or their approximations subspaces,

other conjugate

ck(

lim E-o

and

generated

T, is to let Z, = [?,,,

by the CG algorithm

. , Pm],

for solving

t E S. There are four advantages to such a natural choice:

ri can be formed throughout

the computations

by their recurrence

relation. (2) The quantities can be computed

ci used to estimate

explicitly

(3) Since T1 = K,_,,

the Krylov subspace

m such that T, contains well-separated

extreme

(4) The matrix performance

an eigenspace

3

of dimension

or its approximation

m, there exists an integer corresponding

to those

eigenvalues.

Zf AZ, is a tridiagonal

is not affected

of A, such as the direct

algorithms

whose

method,

can be

Z: AZ, f(l) = b,.

complement

T, = span Z, = null[( AZ,)‘].

matrix, and many efficient

by the spectrum

adapted to solve the subproblem To form a conjugate

the lower and upper bounds in Theorem

or can be easily estimated.

of subspace

A Householder

T,

with respect

orthogonalization

to A, we may let

procedure

can be ap-

AUBURN 1990 CONFERENCE

ON MATRIX THEORY

743

plied here to form an orthonormal basis which generates the subspace Ts, and the spectrum of A on T, can be estimated by Theorem 3. This decomposition can be carried on again on the subspaces to that a set of conjugate subspaces are formed successively. Observe that if Ta is chosen as an orthogonal complement of Tr, then its basis matrix 2, can be formed as 1 - ZrZi without any extra cost. With this choice of ‘I’,, the solution of(P) is no longer simply the sum of the subsolutions on Tr and Ta, but it can be computed from those subsolutions by using the Sherman-Morrison-Woodbury formula. The complexity and the performance of the algorithm with these choices, and the scheme used to determine the dimension of Tr, are discussed with details in [4].

REFERENCES 1

2

3

4

5

6 7 8 9 10 11 12

0. Axelsson and G. Lindskog, On the Eigenvalue Distribution of a Class of Preconditioning Methods, Group Report 3, Dept. of Computer Sciences, Chalmers Univ. of Technology, Goteborg, Sweden, 1985. 0. Axelsson and G. Lindskog, On the Rate of Convergence of the Preconditioned Conjugate Gradient Methods, Group Report 5, Dept. of Computer Sciences, Chalmers Univ. of Technology, Goteborg, Sweden, 1986. M.-Q. Chen, The Updated Conjugate Subspace Method in Optimization and in Solving Linear Systems of Equations, Ph.D. Thesis, Dept. of Mathematics, Univ. of Illinois at Champaign-Urbana, 1989. M.-Q. Chen and A. Sameh, The Conjugate Subspaces Decomposition and Its Application in Solving Linear Systems of Equations with Many Right-Hand Sides, Tech. Report 1024, Center for Supercomputing Research and Development, Champaign-Urbana, Aug. 1990. S.-P. Han, Optimization by updated conjugate subspaces, in Numerical Analysis (D. F. Griffiths and G. A. Watson, Eds.), Pitman Res. Notes Math. Ser., 1986, pp. 82-97. A. Jennings, Influence of the eigenvahre spectrum on the convergence rate of the conjugate gradient method, J. Inst. Math. AppZ. 20:67-72 (1977). D. P. O’Leary, The block conjugate gradient algorithm and related methods, Linear Algebra Appl. 29:243-322 (1980). C. C. Paige, The Computation of Eigenvalues and Eigenvectors of Very Large Sparse Matrices, Ph.D. Thesis, Univ. of London, 1971. B. N. Parlett, A new look at the Lanczos algorithm for solving symmetric systems of linear equations, Linear Algebra Appl. 29:323-346 (1980). J. &ad, On the Lanczos method for solving symmetric linear systems with several right-hand sides, Math. Cump. 48(178):651-662 (Apr. 1987). A. van der Sluis and H. A. Van der Vorst, The rate of convergence of conjugate gradients, Numer. Math. 48:543-560 (1986). A. van der Sluis and H. A. Van der Vorst, The convergence behavior of Ritz values in the presence of close eigenvahres, Linear Algebra Appl. 88/89:651-694 (1987).

744 INVERSES

FRANK UHLIG, TIN-YAU TAM, AND DAVID CARLSON OF MATRICES

ARISING

FROM

DIFFERENCE

OPERATORS

by SUI SUN CHENG3*

Explicit inverses and inversion algorithms for square matrices have been a major concern since the early days of matrix theory. As the areas of application of matrix theory began to broaden, band and Toeplitz matrices [2] found their way into several methods of numerical analysis and approximation theory. For example, they would arise when using finite difference methods for differential equations, when using polynomial splines, and in studying discrete random processes [3] and statistics [4]. When using finite difference methods, for instance, it is desirable to find accurate error bounds. Explicit inverses or properties such as bounds or asymptotic behavior of the elements of the inverses are needed for this purpose. A common way to obtain this information is to observe that each column of the inverse of, say, a Toeplitz matrix satisfies a linear recurrence equation wth constant coefficients. This recurrence equation can then be solved, at least in theory, and the solution expressed as a linear combination of powers of roots of the characteristic equation associated with the recurrence equation (see e.g. [5, Chapter 41). Thus, the properties of the elements of the inverse can be deduced by this process. As an example, consider the well-known tridiagonal matrix A = (u~~)“~,,, where aii = - 2, aij = 1 if ( i - j 1 = 1, and aij = 0 otherwise. If we denote the jth column vector of the inverse A- ’ by col( x(l), x(2), . . , x(n)), then the components of this vector satisfy

x(k -

1) -

2x(k)

+ x(k +

1) = 6kj,

k=

1,2 ,...)

It,

where we have defined x(O) = 0 and x(n + 1) = 0, and Likj= 1 if i = j and iikj = 0 otherwise. It is convenient to employ the forward difference operator A, defined by Au(k) = u(k + 1) - u(k), to write the above equations as

A%(k

-

1) = i_ikj,

k=1,2

,...,

n,

x(o) = 0 = %(rt + 1). Taking our cue from the theory of Green’s functions in differential would guess that the solution of the above problem is of the form

equations,

we

x(k)=a+bk+(k-j),,

32Department

of Mathematics,

work was funded by the National

Tsing Hua University, Research

Council

Taiwan,

of the Republic

Republic of China.

of China.

This

AUBURN 1990 CONFERENCE

745

ON MATRIX THEORY

where (Y+= Q if cr > 0 and LY+= 0 if (Y< 0. Then it is easily calculated that a = 0 and b = - (n + 1 - j)/(n + 1). By symmetry considerations, we would also guess that

x(k) = c + d(n + 1 - d) + (j - k)+ and deduce that c = 0 and d = - j/(n + 1). By means of the definitions and (j - k) +, we see that x(k) is also given by

of (k - j),

k(n+l-j)

x(k)

=

I

n+l



k
j
_j(n+l-k), n+l

This example motivates a generalization for constructing explicit inverses of matrices for which the inner product of its kth row with the jth column of its inverse can be written in the form A”%(k - t) = Skj,

(1)

where m + 1 is the band width and 1 Q t Q m - 1. The details of this generalization will appear elsewhere [l] and will not be repeated here. However, we shall quote the following theorem, which is central to the derivations of explicit inverses. THEOREM. Let 2 4 m < n, 1 4 t 6 m - 1, and 1
k=

. . . . -1,&l

G n. Let P,,,_l be the set of

with degree less than or equal to m - 1. Then x(k), ,... by

+)

p(k)

z

+

defined fw

(k-‘(; t-l;/:“-1), P~P?l-,

or

+)

=

+)

+

(-l)“(j -;m’_ml-,l + #?-“,

satisfies Equation (l), wher-e fw c@ = cY(a - 1) *. 9 (a -p+l) j3 = 0.

any integers if

a>0

9EP,.-,,

(Y and & @) = 0 if (Y < 0 or /3 < 0,

and

/3>0,

and c@)=l

ifa>0

and

We remark that even though explicit inverses can be constructed in principle by the above theorem, we have not derived subsequent properties of the inverses. It will be of interest to find norms or bounds or asymptotic behavior of the elements of the inverses by means of the method mentioned above.

746

FRANK UHLIG, TIN-YAU TAM, AND DAVID CARLSON

REFERENCES 1

S. S. Cheng

and L. Y. Hsieh, Inverses

of matrices

arising from difference

operators,

Utilitas Math., to appear. 2

D. S. Meek, A survey of the results on the inverses of Toeplitz Proceeding of the Conference RIMS,

Implementations, 3

B. N. Mukherjee matrices

W. F. Trench,

5

K. S. Miller,

12:515-522

definite

Toeplitz (1988).

An algorithm for the inversion of finite Toeplitz matrices,

J. SIAM

to the Cakx~us of Finite Differences

An Introduction

and Di&jerential

Dover, New York, 1966.

MINIMAL RANK AND MAXIMAL FOR CERTAIN BAND MATRICES by JEROME

RANK HERMITIAN

COMPLETIONS

DANCIS33

In the last decade Hermitian

of positive

Linear Algebra Appl. 102:21 l-240

applications,

(1964).

Equations,

specified

in

and the

Kyoto Univ., Kyoto, Japan, July, 1982.

and S. S. Maiti, On some properties

and their possible

4

and band matrices,

on Standard Algorithms jw Linear Computation

Hermitian

a popular

matrix either

problem

has been

to a Hermitian

to try to “complete”

matrix with prechosen

a partial

inertia

or to a

matrix with the maximum or the minimum possible rank. The purpose of this

paper is to present band matrices

the standard

together

method

for constructing

Hermitian

completions

of

with some results on minimal and maximal rank Hermitian

completions. The

DEFINITION.

{ rr( H), v(H), 6(H)}

inertia

consisting

ues of H. We let r(H), Our starting

THEOREM 1 [I].

n x n Hermit&

a

Hermitian

matrix

H

of positive,

is

negative,

a

is our generalization

of Poincare’s

triple

In H =

and zero eigenval-

Y(H), and 6(H) denote the three coordinates

point

interlacing theorem

of

of the numbers

of In H.

inequalities

and Cauchy’s

(notation: 0’ = 0 E Cr):

Let R, be the leading principal

matrix

H.

(n -

r) x (n - r) submatrix

of an

We set A = Dim Ker 23, - Dim[(Ker R,) @ Or] fl Ker H.

Then (a) 7r( H) 2 T( R,) + A and (b) r(H) Clearly,

> r(R,)

+ A.

the inertia

of every

Hermitian

completion

of a (band) matrix

must be

consistent with this theorem.

33Department of Mathematics, University of Maryland, College Park, MD 20742.

AUBURN 1990 CONFERENCE

ON MATRIX THEORY

747

BORDERED-MATRIXHYPOTHESES. Let H, be an (r - 2) x (r - 2) Hermitian matrix. Let o and w be vectors in C’-‘, and let a and b be real numbers. Let H(z) be the bordered matrix

H(Z)

=

a v

::

.z* w ,

Iz

w*

b 1

and let

and

H, =

H(z) is called a one-step compkiun of H(0). The one-step-completion problem for these bordered matrices is usually the inductive step in the proofs of completion theorems. We used Theorem 1 as we classified all the possible kernels of bordered matrices in Lemmas 3.3-3.9 of [2]. A sample of the “common threads” of these lemmas is Theorem 2. THEOREM 2 (Theorem 1.2 of [2]). Given the bordered-matrix hypotheses. Suppose that the nullities are non-decreasing, namely,

6(H2)
and h(h) 6

B(h)

(that is, the s&matrix does not have a larger nullity). Then there is a number z. such that there is a preservation of positivity and negativity, namely,

“(H(Q)) =M~{+++G)j

and +(G))

= M~{~(H+OS))

and again the nullities are non-decreasing, namely,

6(HI) g 6(H(G)) (that is, the new s&matrices

and 6( Hs) 6

h(H( ~0))

do not have larger nullities).

DEFINITION. A matrix with all zeros off the main diagonal and the first m pairs of superdiagonals is called an m-band matrix, that is, R = (rjk) and rjk= 0 for all (k - j ( > m. An n X n Hermitian matrix F = (f$) is a compktion of an m-band matrix fl if fjk = rjk for all 1k - j ) < m. The maximal Hermitian (m + 1) x (m + 1) submatrices R,, R,, . . . , R,_, within an m-band n x n matrix R are Ri = ( rjk Ij,k = i, i + 1, . . . , m + i). The almost maximal Hermitian m x m submatrices of R are RT = (rjk (j,k = i, i + 1, ...,m + i - 1). RT, R;, . . . , &,,+I

748

FRANK UHLIG, TIN-YAU TAM, AND DAVID CARLSON A simple diagonal completion R’ of an m-band Hermitian

DEFINITION. an m + l-band

Hermitian

of an m-band

completion

(N - 1)st (successive)

Hermitian

as the s&matrices

hypotheses.

Also,

completion

matrix

An

N th (successioe)

R is a simple

H,,

H,,

and

H,,

completion

Ri and Ri+l

of an

and their “com-

of the bordered-matrix

respectively,

the ith maximal

the maximal

almost maximal s&matrices

diagonal

R is

diagonal

RT of a band matrix R always overlap in the same

H( .zo) becomes

R’. In addition,

matrix

simple

of R.

Each pair of maximal submatrices

almost maximal submatrix

manner

of R.

simple diagonal completion

OBSERVATION 3. mon”

completion

of the simple

submatrix

submatrices

diagonal

of a band matrix R become

the

of its simple diagonal completion R’ (with the same ordering

from the upper left corner). STANDARD METHOD (For matrices). R’, then

Complete complete

diagonal completion completion

R’ to R” =

completions will construct Theorem

a Hermitian 4 below

of our

and negativity

completion

3, and the Standard

Method

rank Hermitian submatrix

completion

and 6(Ri+1)

property.

will establish completion

of a Hermitian

= Max{*(Ri),i

method”

As an example, the next theorem.

result.

It says that

band matrix

R is not

then there is a positivity

Given a Hermitian

RI, R,,

m-band matrix R with

. . , R,_, and R:, RX,. . , R*,-,+l.

namely, foreach

i=1,2

,...,

n-m-l.

F of R such that there is preservation

= 1,2 ,...,

= Max(v( A,), i = 1,2,. (We announced

of

completions

the “standard

namely, z(F)

kernels

on one-step

of R.

2 6(Ry)

completion

possible

By using these one-step

of its pair of almost maximal submatrices,

Then there is a Hermitian

simple diagonal

theorems

completions,

with the desirable

Suppose that the nullities are nondecreasing,

and negativity,

of the

properties.

THEOREM 4 (Minimal rank completions).

> 6( Ry)

classification

simple diagonal

is a minimal

preserving

A to

completions.

desirable

maximal and almost maximal submatrices

6(R,)

3, each successive

one-step

of [Z]), one may obtain

the nullity of each maximal

less than the nullities

n x n

complete

F is an (n - m + 1)st simple

3.3-3.9

of successive

2, Observation

Theorem whenever

threads”

(Lemmas

which propagate

in the construction

of m-band

namely,

is the method used in [3-71.

“common

matrices

completions

(R’)‘. . . to F, where

by a set of independent

This standard method bordered

Hermitian

simple diagonal completions;

of R. After noting Observation

is achieved

By finding

constructing

successive

n - m}

and

V(F)

. . , n - m).

this result at the ILAS meeting

in Provo, Utah, 1989.)

of positivity

AUBURN 1990 CONFERENCE

749

ON MATRIX THEORY

Of course, the inertia of a Hermitian completion F of a partial Hermitian n X n matrix R must be consistent with Poincare’s inequalities and Cauchy’s interlacing theorem, that is, for each maxim+ specified principal submatrix R, of R, r(F) ) *( RJ and V(F) ) v( Ri). Therefore Theorem 4 provides the minimal possible rank among all possible Hermitian completions. Dym and Gohberg presented the standard method in [5] as they showed that, when all the maximal submatrices of a Hermitian band matrix R are positive definite, then R has a Hermitian completion F which is also positive definite, such that F-’ is also a band matrix. They report that this result has connections with signal processing and system theory. Johnson and Rodman have shown (in [7]) that when all the maximal submatrices of a Hermitian band matrix R (or even more generally a matrix with a “chordal” graph) are invertible, then R has an invertible Hermitian completion. Ellis, Gohberg, and Lay have shown (in [S]) that when all the maximal submatrices and all the almost maximal Hermitian submatrices of R are invertible, then R has an invertible Hermitian completion F for which F-’ is also an m-band matrix. THEOREM 5 [3]. < n - m, suppose

LetRbeanm-bandn~nmutrix. that R,

Furanintegerr,m+5<2r

or R’f is invertible.

Then R has an invertible

Hermitian

completion. COUNTEREXAMPLE 6. An example of a l-band 3 x 3 matrix, with an invertible maximal Hermitian submatrix, which does not have an invertible completion is

1

010 1 01

0 0.

I

Theorem 7 is a generalization of Theorem 5, but its Hermitian or may not be the one with the maximal possible rank.

completion

F may

THEOREM 7 [3]. Let R be a Hermitian m-band n X n matrix. Suppose that m 6( R,) + 5 Q 2 r < n - m for some integer r. Then R has a Hermitian completion F with S(F) Q 6(R,).

DEFINITION. Given an m-band n x n matrix R = (rij), its maximal full-column is the unique specijied n x (2m + 2 - n) submattix

submatrix

M=

(rij(i=

1,2,...,

r

and

j=n-m,n-m+l,...,

m+l).

REMARK. Since the columns of M will also be columns of any completion Ker F > On-m-1

e Ker M e O”-m-1

and

RankF,<2(n-m-l)+RankM

F of R,

750

FRANK UHLIG,

for any completion matrix,

TIN-YAU TAM, AND DAVID CARLSON

F of R. In the trivial case, when 2m + 2 < n and M is an empty

we set Ker M = 0 and Rank M = 0.

provided by Theorem

Therefore

the

Hermitian

completion

8 has the maximal possible rank among all (including non-Hermi-

tian) completions. 8 (A maximal rank, minimal kernel completion

THEOREM

n x n Hermitian matrix R with maximal s&matrices IRank Ri - Rank Ri+l

1< 1

forall

R,, R,,

[4]).

1,2 ,...,

i=

Given an m-band

, R,_,.

.

Suppose that

n-m-

1.

Then for almost all Hermitian completions F of R, Ker F = O”-m-1

@ Ker M @ On-m-1

Rank F = 2( n - m - 1) + Rank M,

and

where M is the full-column maximal submatrix

of R. if R is also a real symmetric matrix, then the conclu.sions are valid for all but possibly a finite number of the real symmetric completions. Furthermore,

Curiously, Theorem

REMARK. demonstrated

8 is not valid for non-Hermitian

band matrices,

as is

by this l-band 3 x 3 matrix:

F( n, y) = 1 Here Rank F( x, y) = 2 # 3 = 2(3 -

1 -

0 0

1 0

x 0

Y

1

0

. i

1) + Rank M for all values of x and y, even

though Rank R, = Rank R, for the only maximal specified submatrices.

REFERENCES Jerome

Dancis,

operators,

On the inertias

of symmetric

Linear Algebra Appl. 105:67-75

Jerome

Dancis, Bordered

Jerome

Dancis,

matrices,

Invertible

matrices

and bounded

Linear Algebra Appl. 128:117-132

completions

self-adjoint

(1988).

of band matrices,

(1990).

Linear Algebra Appl., to

appear. Jerome Dancis, Minimal Rank and Maximal Rank Hermitian Completions Certain Band Matrices, Technical Report Tr 89-58, Univ. of Maryland, 1989. Harry Dym and Israel Gohberg,

Extensions

of band matrices

for

with band inverses,

(1981).

Linear Algebra Appl. 36:1-24 Robert L. Ellis, Israel Gohberg,

and David C. Lay, Invertible self-adjoint extensions

of band matrices and their entropy,

SIAM J. Algebraic

Discrete Methods 8:483-500

(1987). Charles

R. Johnson

partial Hermitean

and Leiba

matrices,

Rodman,

Inertial

Linear and Mu&linear

possibilities

for completions

Algebra 16:179-195

(1984).

of

AUBURN

ON CHARACTERIZATIONS OPERATORS by

THEORY

~~~~C~NFERENCEONMATRIX

OF THE SPECTRAL RADIUS OF POSITIVE

SHMUEL FRIEDLAND Let B be a Banach space over the real numbers

the Banach space of all real-valued pointed

(K fl - K = (0))

function&

with

cone.

respect

to

K.

bounded Denote

then let

x ( y iff

B = K - K and bounded

K* f (0).

in norm, then

linear operator.

Denote

Note

let

y 2 x and

Furthermore,

interior,

called

positive

if

Collatz-Wielandt

AK C K. Assume

v(A)

sets associated

-e

: B + B be a bounded

the local spectral

radius of A at x.

linear operator

A is a positive

operator,

A : B + B is

and define

= (o:3r>O,

Ax
X1(A)

= {a:3x%+O,

Ar
“(A)

= {w:3x>O,

Ax>wr},

$(A)

= (w:3r+O,

Ax>wx}

the upper and lower Collatz-Wielandt =inf{a)O:Ar
R(A,x)

the

with A [2]:

“(A)

For xcK

that

Q x < e is

= mimi)lhl.

p( A, r). A bounded that

A

by K,.

implies

of A. Set

For x E B let p( A, x) = lim sup I( A’% ((‘lrn denote ,tal

K, # 0

the segment

B* = K* - K*. See [7j or [S]. Let

linear

y - r E K and

and denote its interior

if for eEK,

by a( A) the spectrum

T c B set V( A, T) = inf,..r

by B*

K C B be a closed

y > x iff

that the assumption

P(A) = pan, I XI,

For

]] I]. Denote

Let

by K* C B* the cone of nonnegative

As usual,

y - XEK,.

with the norm

linear function&.

y - r E K \ (0). Assume that K has a nonempty We

751

numbers

are defined as follows:

r( A, x) = sup{w

> 0: Ax > UK}.

(1)

Note that R( A, r) = 00 if no u exists such that An < UT. Clearly,

sup Qr( A) = :“,“or( A, x) G sup Q( A) = supr( X>O infX(A)

In the case of B = numbers

= mf,R(A,x)


R”, K = R;, and A an irreducible

A, x),

\

= I=f,R(A,x).

nonnegative

matrix, all the above

are equal to p(A). This is the classical result due to Wielandt.

any finite-dimensional

34Department

B with a closed

of Mathematics,

pointed

spanning

cone

I

It also holds for

K and a positive

University ofIllinois at Chicago, Chicago, Illinois 60680.

752

FRANK UHLIG,

irreducible

operator

A. However,

TIN-YAU TAM, AND DAVID

if A is not irreducible

have to be equal even in the finite-dimensional following theorem

characterizes

local spectral

do not

[6] and [9]. The

in terms of the spectral

radius

radii of A and A* [5]:

Let B be a real Banach space with a closed pointed cone K. Assume interior and B* = K * - K*. Let A : B + B be a

THEOREM 1. bounded

the above four numbers See for example

the above four numbers

and the minimal distinguished

furthermore

case.

CARLSON

that K has a nonempty

linear operator which leaves K inuariunt.

Then

(3) ji$R(

A, x) = v( A, K),

sup r( A, x) = r~( A*, K*),

(5)

x*0

supr(A,x)


x>o Zj p(A) i.s in Recall pointed;

of A, then

the point spectrum

that

a finite-dimensional

Let C be a C*-algebra. of the form aa*. Let

Denote

K

is generating

B Theorem

(B = K - K)

of K, i.e.,

iff

is

K*

1 is due to [9].

by K the cone of all self-adjoint

be the interior

K,

equality holds in (6).

cone

e.g. [l]. For a finite-dimensional

(6)

(positive)

elements

K, is the set of invertible

elements

in K. THEOREM 2 [5].

Let C be a C*-algebra.

positioe linear operator p(A)

Assume

that A : C -+ C is a bounded

(AK C K). Then

= ,i{

u( A*, K*)

@‘Ax),

= sup v( &4x).

0 The above theorem tion.

Indeed,

product

on

self-adjoint

can be considered

C: (a, b) = +(ab*),

QC Endz(C).

as an extension

if C is a finite-dimensional

positive

elements

where

of C, e.g.

Note that the subalgebra

$? is a semisimple

algebra.

(7)

EK”

C* algebra, 4

is a positive

[8]. Then

functional

characteriza-

on the

C can be viewed

@? is invariant

More precisely,

of Wielandt’s

then one can define

vector

cone

of

as a subalgebra

under the involution

the underlying

an inner

space

*. Hence,

C splits to a

direct sum VI + * * * + Uk where each Vi is an irreducible invariant subspace under the action of ‘iR. It then follows that the restriction of V to uj is isomorphic to M,,(C), where mi = dim Vi. Thus, any finite-dimensional

C*-algebra

c = Mm,(C) + *. * +%f,,(c) .

is isomorphic

to

AUBURN

1990 CONFERENCE

ON MATRIX

753

THEORY

Here,

a=

( a,,...,

Q),

aiEMmi(C),

@I,..

i = l,...,

.>a#$.

.>bk) = (a$,,..

Thus, if A : C + C is a positive operator elements,

ly,ykP(x;‘( Ax)i),

a;)>

.dQbk)>

to the cone of self-adjoint

v( x-'Ax)

. .

(xl,. . ., xk),

Ax = ((Ax)~,..

Note that C is isomorphic C is a commutative i = 1,.

with respect

a* ( r,...,

That

under the pointwise

is, in (8) we have

characterization

C = M,(C)

for a nonnegative

and a positive

Consult

irreducible

operator

[4] for other characterizations

with respect

to general

irreducible

Ax)i),

i = 1,. . .,k.

xi = aia’,

.,(Ax)~),

to Ck as a C*-algebra

C*-algebra.

mh,v( x;l(

=

the

matrix

(9)

multiplication

equalities

. . , k. As p(a) = Y(U) = a for 0 < a E M,(C) = C, we deduce

Wiehmdt

positive

we deduce

P( x-‘Ax) = LX=

a* =

k,

iff

ml = 1,

that (7) is the

A. Theorem

2 for

A was proven already in [4].

of the spectral

radius of positive

operators

cones, and [3] for related results.

REFERENCES A. Berman

and R. J. Plemmons,

Academic,

New York,.1979.

G. P. Barker and H. Schneider,

Appl. 11:219-233 K. H. Fijrster

S.

matrix,

Characterizations

Friedland,

theory,

numbers

Linear Algebra

and the local spectral

Linear Algebra Appl. 120:193-205 of the spectral

radius of positive

(1989). operators,

Linear

operators

on

(1990).

Characterizations

of spectral

radius

of positive

C*

/. Funct. Anal., to appear.

S. Karlin, Positive

operators,

J.

M. G. Krein and M. A. Rutman,

(1959).

Math. Mech. 8:907-937 Linear operators

space, Uspekhi Mat. Nauk 3:95 (1948); G. K. Pedersen,

leaving invariant cone in a Banach

Amer. Math. Sot. Transl.

No. 26.

C*-Algebras and Their Automorphism Groups, Academic,

B. S. Tam and S. F. Wu, preserving

Perron-Frobenius

and B. Nagy, On the Collatz-Wielandt

Algebra Appl. 134:93-105 algebras,

Algebraic

(1975).

radius of a nonnegative S. Friedland,

Nonnegative Matrices in the Mathemutical Sciences,

On the Collatz-Wielandt

map, Linear Algebra Appl. 125:77-95

sets associated

(1989).

1979.

with a cone

754

FRANK UHLIG,

TIN-YAU TAM, AND DAVID

EXTREME DOUBLY NONNEGATIVE

CARLSON

MATRICES WITH PRESCRIBED

ROW SUMS by ROBERT GRONE35

Let either

Zf, denote

the convex

the real or the complex

extreme

cone

of all n-by-n

positive

semidefinite

case, it is obvious that a matrix

ray in H, if and only if the rank of A is 1. Several

structure

of the set of extreme

example,

the

diagonal. points,

correlation

The extreme have been

set the notation,

for example,

rays of subcones that

familiar convex

convex

(undirected,

interest

set of matrices

set in

obtained

by establishing

is of some

H,

matrices

matrices,

the ranks of between

the

this result being

and methods

interest.

if p = (p,,

employed K,

role.

previously

One useful

Our

in describing for simplicity,

mentioned

concept

a lot of

that it held for A in K,, techniques

extreme

be the set

. , p,. Although points of K,

and because

of extreme graph

are

some of the

points will be similar

in that rank and iero patterns

will be the

so

are valid in

> 0, let K(p)

. . . , pJT

papers deal only with K,. The investigation

the extreme

stimulated

in H, which have row sums pr,

we will use the notation

to that of the other problems an important

To

is a,,, the set of n-by-n doubly stochastic

conjecture

In particular,

nonnegative

valid for K(p),

authors

with no loops or

and there is a distinction

points of 62, are just the permutation

more generality.

of all entrywise

referenced

various

a given sparsity pattern.

some 40-odd years ago. In this note we wish to consider

this particular

For

on the

. . , n}. and let M(G) consist of all A in H, such that

activity on the van der Waerden

the theorems

l’s

ranks of extreme

[7], and [lo]

points of the set K, = a2, fl H,. In 1962, Marcus and Newman [ll]

somewhat

of H,.

have

cases.

The extreme

due to Birkhoff

subsets

which

i # j and (i, j) is not an edge in E. In these investigations

real and complex Another

authors have studied the

the possible

in [9]. In [l],

of H, which respect

points or rays are of particular

matrices.

In an

convex

in H,

C = (V, E) is a simple

multiple edges) graph on V = (1, extreme

matrices

points of this set, and in particular

suppose

aij = 0 whenever

or rays of certain

are those

matrices

studied,

studied the extreme

points

matrices.

A in H, generates

of a symmetric

play

matrix.

If

A = AT is n-by-n, we let G(A) be the graph on V = { 1,. . , n} with edges (i, j) for all i, j for which V = (1,.

i # j and aij # 0. If G, = (V, E,)

. , n}, we say that G, is a s&graph

we use the notation

and Fisher

in K,.

They established

This work

DMS-9007048.

matrix in K,

in H,. Furthermore,

35Department 92182.

noted that I, is always an extreme

in Q,, and that I,, is the only extreme

noted that the unique rank-l it is extreme

In this case

G, C G,.

In [5], Christensen it is extreme

and G = (V, Ea) are two graphs on

of G, if and only if El C E,.

(l/n)J,

has been

Sciences,

supported

since

They also

is (l/n)J,, and that it is extreme in K, since is the only entrywise positive extreme point

also that rank-2 matrices

of Mathematical

point in K,

point of rank n in K,.

are extreme

San Diego

by the

National

State

in K,

if and only if they

University,

Science

San Diego,

Foundation

under

CA grant

AUBURN

1990 CONFERENCE

have zero entries.

These

the authors

obtained

established

an inequality

ON MATRIX

observations

tell the complete

some tridiagonal

extreme

which relates

755

THEORY story for n = 2,3.

For n = 4,

points of rank 3. For general

the number

of nonzero

entries

n, they

of an extreme

point with its rank. These

results

determine

suggest

extremality

the possibility

for A in K,.

the rank of G be the minimum

that rank A and G(A)

If G is a connected

rank of A in K,

might suggest that for a given G, either no matrices or else the extreme

matrices

in K,

with G(A)

might be sufficient

graph on V = (1,.

with G(A) in K,

= G. The results

with G(A)

= G are exactly

to

. . , n}, let so far

= G are extreme,

those with rank A =

rank G. We shall see later that this is not the case. In testing any A E K, matrix C a perturbation

for extremahty,

the following notion from [8] is useful.

Call a

of A if and only if

(i) C = CT, (ii) Ce = 0,

A C nullspace C, and

(iii) nullspace

(iv) G(C) C G(A). These four constraints perturbation K,

space is trivial.

is replaced The

are linear and homogeneous,

of the n-by-n real matrices.

a subspace

A matrix

results

rank A = n criteria

of A form

The reader should note that this definition

is unchanged

in [5] give a characterization 1. We assume without

that

C(A)

of the extreme the structure

loss of much generality

is connected.

By counting

points

of extreme that

equations

A in K, A in K,

rank A = n -

with

and unknowns

or

in the

to exist, we saw that if rank A = n - 1 and A is extreme,

for a perturbation We established

with

A is irreducible,

then G(A) has less than n edges. This forces G(A) to be a tree, or equivalently, acyclic.

if

by K(p).

rank A = 1, 2, or n. In [8], we considered equivalently,

and so the perturbations

A in K, is extreme if and only if this

that when

G(A)

is a tree,

then

A is extreme

1. Also, if rank A = n - 1 and A is irreducible,

A to be

if and only if in K,

then A is extreme

if and only if G(A) is a tree. In [2] a lemma

was obtained

which

was also useful in [7j and [8]. The matrices

studied in [2] were doubly nonnegatiue, that is, positive semidefinite nonnegative. irreducible enabled

Clearly,

in K,

the matrices

and doubly nonnegative,

and G(A)

us in [8] to give a complete

bipartite.

and

K(p)

is bipartite,

answer to when

if A in K, has G(A) bipartite

Specifically,

as well as entrywise

fall into this category.

If

A is

then rank A 2 n - 1. This

A in K,

is extreme

and connected,

if G(A)

is

then A is extreme

ifandonlyinG(A)isatreeandrankA=n-1. In [8] we also considered exactly

the case when G is unicyclic

n edges and one cycle).

bipartite

and there

is no extreme

If the unique

(that is, G is connected

cycle in G is of even length,

with

then G is

A in K, with G(A) = G. If G has a cycle of odd

length, then any A in K, with G(A) = G has rank at least n - 2. In this case, then A in K, with G(A) = G is extreme if and only if rank A = rank G = n - 2. We also established

the existence

rank, n - 2. Showing result

for K(p)

of A in K,

which has this graph and also has the minimal

that this minimum

by invoking

rank is obtained

the DAD theorem

[4]. Lastly,

in K,

also establishes

the

we noted that if G is any

756

FRANK UHLIG,

graph on V = (1,.

TIN-YAU TAM, AND DAVID

. . , n} for which there is a matrix

A in K,

rank A = k, then for every m, k < m 4 n there exists a matrix andrankB=m. In [6], we considered Say that agraph common

edge.

a class of graphs which generalizes

G on V= Suppose

{l,...,

n} is nonchordal

G is a connected

cycles

and

nonchordal

has exactly

r even

s odd cycles.

corresponds

to a tree, and that r + s = 1 corresponds

results in [8] can be used to establish

an induction

that rank G = n - s - 1, and we constructed minimal

rank. Again, the DAD theorem

has G(A) nonchordal

connected

such a matrix is extreme

Berman

graph on V = (1,.

. , n} which

that the case

in K,

example

points

points in K, lays to rest

qualitatively Lastly,

when

r = s = 0 Hence

with specified

this to K(p).

Suppose

and s odd cycles.

that rank A and G(A)

for n = 4. For completely

n = 5 this pattern

determine

different

possibility

of a nice,

points

in K,.

for certain

p.

the presentation

in [l], [7j, and [lo]. of these

exactly the same if K,

results

characteriIn [3],

of two matrices

simple

answer

investigations

to the K,

problem

and

are closely

the extremality

in K,

and the other not. This

As R. Brualdi quite correctly

in Auburn,

is replaced

1.

to determine

fails to hold.

It may even turn out that

we would like to note that these

sparsity questions

A in K,

We found that

the rank and sparsity possibilities

for n = 5. They also give an example

the

extreme

the

graph and

that

are sufficient

with equal ranks and the same graph, but where one is extreme characterizing

graphs.

of G share a

on r + s. For such a graph we found

with r even cycles

suggests

and Shaked-Monderer

of extreme

trees and unicyclic

= G

if no two cycles

This is true for n < 4, and our results in [6] provide a complete

zation of the extreme

= G and

to G being unicyclic.

matrices

extends

with G(A)

B in K, with G(B)

if and only if r = 0 and rank A = rank G = n - s -

So far the evidence extremahty.

Note

CARLSON

related

pointed

question

K(p)

of are

to the

out during

for A in K,

is

with K, Il M(G( A)).

REFERENCES J. Agler, J. W. Helton,

S. McCullough,

and L. Rodman,

Positive

definite

with a given sparsity pattern, Linear Algebra Appl. 107:101-149 A. Berman and R. Crone, Bipartite completely positive matrices, Philos. Sot. 103:269-276 A. Berman

(1988).

and N. Shaked-Monderer,

More

doubly stochastic matrices, preprint. R. Bruahh, The DAD theorem for arbitrary 45:189-194 J.

points,

R. Grone, matrices, R.

Grone

on extremal row sums,

positive

Proc.

semidefinite

Amer.

Math. Sot.

(1974).

Christensen

extreme

matrices

(1988). Proc. Cambridge

and P. Fisher,

R. Loewy,

and S. Pierce,

Linear and Multilinear and

Positive

Linear Algebra A&.

S.

Pierce,

131:39-50 (1990). R. Grone and S. Pierce,

Extremal

doubly (1986).

Nonchordal

positive

stochastic

matrices

semidefinite

and

stochastic

Algebra, to appear.

Extremal

Linear Algebra Appl. 150:107-117

definite

82:123-132

bipartite

positive (1991).

matrices,

semidefinite

Linear

Algebra

doubly stochastic

Appl.

matrices,

AUBURN 9

1990 CONFERENCE

R. Crone,

ON MATRIX

757

THEORY

S. Pierce,

and W. Watkins, Extremal correlation matrices, Linear (1990). J. W. Helton, S. Pierce, and L. Rodman, The ranks of extremal positive semidefinite matrices with given sparsity pattern, SZAMJ. MatrZx Anal. 10:407-423 (1989). M. Marcus and M. Newman, Inequalities for the permanent function, Ann. of Math. 75:47-62 (1962). B. Ycart, Extremales du cone des matrices de type non negatif a coefficients Algebra Appl. 134:63-70

10 11 12

positifs ou nuls, Linear Algebra AppZ. 48:317-330

RECENT

RESULTS

by SHU-AN

ON THE

(1982).

PERMANENTAL

HU36 and TIN-YAU

NUMERICAL

RANGE

TAM37

The purpose of this synopsis is to give an account of recent advances on the permanental numerical range. The kth permanental numerical range of A E enxn (the set of n x n complex matrices),

where 1 < k < n, is defined as

Pk( A) = {per(U*AU)IUe$,,k,

U*U = Zk},

and per B = CoeS, Htr bj,(i) is the permanent function for BE @kxk. This definition is motivated by the classical numerical range of A E B,xn:

W(A)

If k = 1, then Pr( A) = W(A). upon five properties

= {x*Axlz~V,

IIxII = 1).

There are many nice results for W(A).

here and investigate generalizations

We shall touch

to the permanental

numerical

range. Maybe the most interesting W(A)

one is the. celebrated

Toeplitz-Hausdortf

theorem:

i.sconoexfwanyA~@&,.“.

In particular,

there is a complete

Let A E @2zx2 have eigenvahm

description

Xl and &.

for the shape of W(A)

Then W(A)

when n = 2:

is an elliptical disk with foci

at A, and &,, minor axis of length

dtr(A*A) -

l&l’-

l&l’,

3sDepartment of Mathematics, University of Connecticut, Storrs, CT 06269. 37Department of Algebra, Combinatorics 5307.

and Analysis, Auburn University, AL 36849-

758

FRANK

UHLIG,

TIN-YAU TAM, AND DAVID

CARLSON

and major axis of length

Vtr( In particular,

A*A)

- 2 Re( A,&)

if

c

Al o

A=

hz>

[

1

A, and &,

Recently,

the authors

[5] obtained

Let ~~~~~~

THEOREM 1.

the following

have eigenvalues

minor axis of length

1c 1, and

analogy for Ps( A):

A, and AZ. Then Pz( A) is an elliptical

disk with foci at X,X, and i( XT + hi), minor axis of length

d[tr(

A*A)

-

] X, ] ’ -

] X, ] “] [tr( A*A)

- 2 Re( A1x2)]

,

and major axis of length

tr( A*A)

In particular,

-

IhI2

2

-

-

l&l2

if

Al

A=

0

c

then P2( A) is an elliptical disk with foci at AlA2 IA,-X,12+

= (0)

In [8] Marcus

ifand

W(A)

Recently

Jc12+

)A,-A212/2.

enjoys is:

only ifA

= 0.

and Wang asked whether

they posted it as a conjecture.

THEOREM 2.

and +(A; + A$, minor axis of length

Ic(2,andmajoralrisofZength

Another property W(A)

1

%>

[

ICI

- 2Re(Xr&).

2

Let A E G,,,,

Chan [l] extended

Then

the above property

is true for Pk( A), and

Hu [4] proved the conjecture

1 < k Q n. Then Pk( A) = (0)

the result to arbitrary

ter. Let 1 ,< k < n, and let G < Sk be a subgroup

subgroups

ifand

in the affirmative: only ifA

with principal

of the full symmetric

= 0. charac-

group Sk of

AUBURN degree

1990 CONFERENCE

k, and x

759

ON MATRIX THEORY

: G -+ G an irreducible

character

of C. Then define the following

range:

P,“(A)

= {~;(U*AU)IUEC”,~,

where d:(B) = C oeC x(u)H!=, bio(i), associated with x. Chan [l] obtained:

for

BE

U*U = Zk},

ejkxk~

is the generalized

matrix function

THEOREMS (a) Zfx 3 1, then P:(A)

= {0}

ifand

only ifA = 0.

(b) Zf x f 1, then rank A < 1 implies P,“( A) = (0). The third property

we want to discuss is:

W( A) is a line segment if and only if 5 A is Hermitian for some .$ E G with

1[ 1 = 1.

In [5] the authors obtained a similar result for Pk( A):

LetAEGnx,,.

THEOREMS.

(a) Zf n = k = 2, then Pz( A) is a line segment if and only if A is normal. (b) If 1 < k < n, excepting n = k = 2, then the following statements are equivalent: (i) Pk( A) is a line segment. (ii) Pk( A) is a line segment on a ray containing (iii)t A is Hermitian for some nonzero

COROLLARY

the origin.

t E G with

1.$1 = 1.

&AE@$,,,.

1.

(a) Zf n = k = 2, then P2( A) lies on the real axis (nonnegative real axis; positive real axis, respectively)

if and only if

(i) A or iA is Hermitian ;)

(A or -A

is positive semidefinite;

positive definite, respectively),

A is uniturily similar to diag(a + bi, a - bi) or diag(a + bi, - a + bi), where a and

b are

nonzero

real

numbers

(A is unitarily

similar

to diag(a + bi, a - bi),

where

a2 - b2 > 0; a2 - b2 > 0, respectively). (b) Zf 1 < k Q n, excepting n = k = 2, then Pk( A) lies on the real axis (nonnegative real axis; positive real axis) if and only if 5 A is Hermitian for sollze 5 E G such that tk = 1 or - 1 (.$ A is positioe semidefinite;

positive definite, fm some k th root of unity

.$, respectively). The fourth property

is:

Zf A is positive semidefinite, largest eigenvalues

as endpoints.

then W(A)

is a line segment with the smallest and the

FRANK UHLIG,

760 It is a long-standing semidefinite,

of U*AU are equal.

[lo]

to find max{

min{ 1z 1 I .z EP,( A)}

whereas

Mehta [9] conjectured conjecture,

problem

TIN-YAU TAM, AND DAVID

that the maximum Recently

CARLSON

1z 1 ) z E P,( A)} if A is positive

= det A is well

known

(see

e.g.

is attained when all the main-diagonal

Drew and Johnson

[3] gave a counterexample

[lo]). entries

to Mehta’s

and in [2] they proved that:

THEOREM 5. If A is a 3 x 3 real positioe semideftnite matrix, then max{ I .zI I z E P3( A)} is always of the form $A,( A! + A:) or $[3Ai(h, + A,) + A\ + A” + (% + A\ Ai)3/2] for sine ordering of the eigenvalues.

It turns out that there exists a persymmetric the maximum persists

(i.e.,

symmetric

with respect

to the

diagonal as well as the main diagonal) matrix U*AU that yields

upper-right-to-lower-left

for n = 2,3.

It was then asked in [3] whether

the persymmetry

criterion

for n 2 4.

Lastly, in [ll]

Pellegrini proved that:

A linear operator T: G,x, -+ Ctnxn satisfws W(T( A)) = W(A) and only if there exists U E U,(G) such that

for all A E enxn

if

(i) T(A) = U*AU for all A E Gnxn or (ii) T(A) = U*ATU for all A E GnX,. In [5, 121 the authors extended

the result.

Let T: Gnxn + Gnxn be an operator. THEOREM 6. Pk( A) for all A E C?,x, if and only if: Case I.

Then T satisfis

If 1 6 k < n, excepting k = n = 2, there exist U E U,(c)

Pk(T( A)) =

and a k th root of

unity 5 such that either (i) T(A) = f;U*AU for all A E Gnxn, or (ii) T(A)

= tU*ATUfw

If k = n = 2, there exists UE U,(G) such that either

Case 2. (i) T(A)

all AE@,~,,.

= ~U*AU~~~U~EAE@&~,,

(ii) T(A) = +U*ATUfor (iii) T(A)=

+-I ‘iA

(iv) T(A)=

+ y’l+““*(

REMARK.

2 +iU*

Recently,

or

all AE@&~~, ( AATLei

or

CA 2

z)

yIS)

[7] obtained

UforallAEGZxZ,

or

UforaZlAEG2x2. part

of Corollary

1 independently.

recently the authors [6] extended Theorem 4 and Corollary 1 for arbitrary Sk with principal

More

subgroups

of

character.

REFERENCES 1

C.

T.

Chan,

M&linear

Some

more

on a conjecture

Algebra 25:101-105

(1989).

of Marcus

and Wang,

Linear

and

AUBURN 2

1990 CONFERENCE

J. H. Drew and C. R. Johnson,

The maximum

semi-definite matrix, given

eigenvalues,

25:243-251 3

J.

H.

4

the

and C.

R. Johnson,

permanental

(1989). S. A. Hu,

On

20:191-196

(1987).

the

Counterexample

and

of a 3-by-3

positive

M&&near

Algebra

Marcus-Wang

conjecture,

S. A. Hu and T. Y. Tam, Operators

6

line, to appear. S. A. Hu and T. Y. Tam,

to a conjecture

Linear and Mu&linear

maximization,

5

character,

permanent Linear

(1989).

Drew

regarding

761

ON MATRIX THEORY

Linear

with permanent

On the generalized

of Mehta

Algebra 25:253-254

and M&linear

numerical

numerical

Algebra

ranges on straight

range with principal

preprint.

Tian-Gang Lei, On the numerical M. Marcus and B. Y. Wang, Mu&linear Algebra 9:111M. L. Mehta,

range of an induced power, preprint.

Some variations

on the numerical

range,

Liner and

120 (1980). Hindustan

Elements of Matrix Theuy,

Publishing,

Delhi, 1977,

p.

156. 10

H.

Mint,

21:109-148 11

Theory

of permanents,

V. J. Pellegrini, Numerical T. Y. Tam,

COMPLETELY

KOGAN3’

applications

range of induced

MATRICES

Algebra

n x m matrix.

positive matrices

include

on a Banach algebra, St&a

power,

Linear and Multilinear

AND GRAPHS

and ABRAHAM

is called the factorization

Completely

operators

BERMAN3”

3g

A is completely positive if it can be decomposed

B is a nonnegative

factorization

and M&linear

(1988).

POSITIVE

An n x n matrix where

range preserving

On the numerical

Algebra 23:207-211

by NATALIA

Linear

(1975).

Math. 54:143-147 12

1982-1985,

(1987).

“a proposed

sectors of the U.S. economy”

The minimal number

as A = BB’,

m that admits such a

index of A and is denoted by q(A).

are important in the study of block designs [q. Other mathematical

model of energy demand

for certain

and statistics [5].

A matrix which is both elementwise

nonnegative

and positive semidefinite is called

doubly nonnegative. It is obvious that every completely

positive matrix is doubly nonnegative,

but the

converse is not always true [3, 5, 61. It depends in some sense on the zero pattern of the matrix. To describe this dependence,

we associate with an n x n symmetric

38Department of Mathematics, Technion-Israel Israel.

matrix

A a

Institute of Technology, Haifa 32000,

3QResearch supported by the C. Wellner Research Fund at the Technion.

762

FRANK

graph G defined by V(G) graph G is completely completely positive.

= { 1,

positive

UHLIG,

TIN-YAU TAM, AND DAVID

CARLSON

,n}, E(G) = {(i, j) : i # j, aij # 0). We say that a

if every doubly nonnegative

The aim of this work is to characterize

completely

matrix

positive

A with G(A) graphs.

= G is

The following

results are known:

PROPOSITION1 [5, 91.

A graph

PROPOSITION2 [2]. completely

Bipartite

graphs

(graphs

positive i$n < 5.

is completely

which

contain

no odd

cycle)

are

positive.

PROPOSITION3 [3]. If a graph then G is not completely The characterization A is completely

m-dimensional

vectors,

doubly nonnegative tive inner products, that the coordinate

G contains

an odd cycle

of length

greater

4,

than

positive. is completed

when G is not completely matrix

G with n vertices

positive. positive where

by proving

that Proposition

that an n x n

if and only if it is the Gram matrix of n nonnegative m may be greater

than

n. Using the fact that every

matrix is a Gram matrix of a set of vectors we find an m-dimensional vectors

3 is the only case

The proof is based on the observation

with mutually nonnega-

space with an orthonormal

basis such

of the set in this basis are nonnegative.

To prove our main result we use the following lemmas:

LEMMA 1. Let k be a cutpoint {k}.

If both

G, and G,

LEMMA 2.

of a graph

are completely

The graph

G, i.e.,

positive,

T, consisting

G = G, U G,

and G, fl G, =

then so is G.

of n triangles

with a common

base is completely

positive. Based on the above facts, we obtain

THEOREM 1.

Let G be a nondirected

graph

without

equivalent : (1) G is compl&ely

positive.

(2) G has no odd cycle of length greater (3) G consists

of blocks of the following

than 4. types:

(a) blocks with less than 5 vertices; (b) bipartite (c) families

blocks; of triangles

with the common

(4) All the blocks of G are completely (5) G is the root graph of a pafect

base.

positive.

line graph.

loops.

Then the following

are

AUBURN

1990 CONFERENCE

The proofs of the lemmas and of the equivalence A slightly different equivalence

proof

of the equivalence

of(l),

(2), and (3) are given in [S].

of (2) and (3) is given

in [4]. The

of (2) and (5) is given in [lo].

Let G be a nondirected completely

763

ON MATRIX THEORY

graph without loops. The set of all factorization

positive realizations

indices of

of G is denoted by Z(G). In the next theorem

we list

some facts we know about Z(G).

THEOREM 2. U G, such that G,, . . . , G, are connected by a cutpoint (see Lemma l), then Z(G) = Z(G,) + *.. +Z(G,) + {O,l}. (2) Zf K, is a complete graph, G(A) = K,, and A is a completely positive matrix, then (p(A) = rank A, and thus Z(K,) = (1, . . , n}. (3) If ) V(G)) = n Q 4 then Z(G) E {l,. . . , n}. (4) IF G is bipartite then Z(G) E { ( E(G) (; ( E(G) ( + 1). Zf G is a tree then (1) Zf G = G, U ...

Z(G) = { I E(G) I + 1). (5) ZfG=T,(seeLemma2)thenZ(G)S{n,...,2n+l}. (6) Zf there are k independent vertices in G, then Z(G) fI (1,.

. . , k} # 0.

Results (I), (2), and (5) could be improved by settling the following question:

QUESTION. Let a, b E Z(G). Does

Z(G) contain all the integers between

a and b?

The authors would like to thank Professor T. Ando for suggesting matrix theoretic proofs, using Schur complements, of Lamas Z and 2 [Z].

REFERENCES 1

T. Ando, private communication.

2

A. Berman and R. Grone, Bipartite completely

PhiZos. Sot. 103:269-276 3

A. Berman matrices,

4

at

Jerusalem,

Maximum

the

Combinatorial

k-coloring

French-Israeli

Proc. Cambridge

results

on completely

positive

(1987).

of perfect

line graphs and their roots, pre-

Symposium

on

Combinatorics

Nonnegative

factorization

and

Algorithms,

Nov. 1988.

L. J. Gray and D. G. Wilson, nonnegative

6

Hershkowitz,

Linear Algebra Appl. 95:111-125

Gavril Fanica, sented

5

and D.

positive matrices,

(1988).

matrices,

Linear Algebra AppZ. 31:119-127

M. Hall, Jr., A survey of combinatorial

of positive semidefinite (1980).

analysis, in Surueys Appl. Math. IV, Wiley,

1958, pp. 35-104. 7

M. Hall, Jr., Cumbinatorid

8

N. Kogan and A. Berman, Math., to appear.

Theory, Blaisdell, Lexington, Mass., 1967. Characterization

of completely

positive graphs, Discrete

764 9

FRANK UHLIG,

TIN-YAU TAM, AND DAVID CARLSON

J. E. Maxtleld and H. Mint, On the equation Sot. 13:125-129

10

L. E. Trotter,

RANK

Line perfect graphs,

PRESERVERS

by RAPHAEL

X’X

= A, Proc.

Edinburgh

Math.

(1962).

AND

Math. Programming

INERTIA

12:255-259

(1977).

PRESERVERS

LOEWY4’

Let V be a vector space which is one of the following: (1)

P”,

the

throughout

set of all m x n matrices

with entries

in a field

F.

We assume

that m Q n and F is infinite.

(2) H,, the set of all n x n Hermitian

matrices.

(3) S,, the set of all n x n real symmetric Given a matrix

matrices.

A, let p(A) denote the rank of A. Let

ttj=

{AEV:~(A)

For A E H,, or A E S,, let In A = (r, s,

=j).

t), where r is the number of positive eigenvalues t the number of zero eigenvalues

of A, s the number of negative eigenvalues of A, and of A. Let

C( r, s, t) = { A : In A = (r, s,

We

assume

throughout

that

k is a fixed positive

t)} .

integer

and T

: V + V a linear

transformation.

DEFINITION

(a) We say T is a rank-k preserver

if p(A) = k implies that p(T( A)) = k, i.e., if R,

is invariant under T. (b) We say T is rank-k nonincreasing

if p(A) = k implies p(T( A)) Q k. It is easy to

see that (under our assumptions) this is equivalent to the statement that the set lJf=,

Rj

is invariant under T. (c) If V = H,, or S,, we say T is a G(r, s, t)-preserver

if the set G(r, s,

t) is

invariant under T. We consider here the following three problems, which seem to have attracted of interest in recent years.

40Department of Mathemaics, Technion, Haifa 3200, Israel.

a lot

AUBURN

1990 CONFERENCE

ON MATRIX

PROBLEM

1.

When is T a rank-k nonincreasing

PROBLEM

2.

When

PROBLEM

3.

When is T a G(r, s, t)-preserver?

In thefrrst

765

THEORY map?

is T a rank-k preserver?

two problems

we assume V = Fmv”, while in

the third we assume V = H,, or

SIL.

Problem 1 A full solution functional

is known

THEOREM

1[12].

PI)

(III)

T(A)

= L’AtV

T(A)

=

(IV) It should be noted course,

nonincreasing

T is rank-l

I’( A) = UAV

(1)

depend

only for the case

k = 1. We denote

by (p(A) a linear

on A. Botta showed:

on

forsome

[a,

that in (III)

A. As indicated,

some related

THEOREM

U, VEF”‘,“,

ez(A)

a2

...

...

dA$

a,].

and (IV) ai are fixed elements

no analogous

it is clear that if T satisfies

VEF”,“,

UEF”‘,“‘,

for some

[v&4)

q and only if it is one of the following:

statement

of F and do not

holds for any arbitrary

(I) or (II), then it is rank-k nonincreasing.

k. Of

We state

results. 2 [lo]. Suppose F is algebraically

closed and T is rank-k nonincreasing.

Then either Im T c lJj”=, Rj or dim Ker T < mn - (k + 1)2. THEOREM 3 [8, 131. Suppose F is algebraically closed. If k < m and T(Ujk_ 1 Rj) c UT=, Rj, then either (I) holds, or m = n and (II) holds, where U and V are nonsingular. Note that the assumption a significant

restriction

on T in Theorem

3 means that Ker T n { UJ=, Rj} = 0,

on the rank-k nonincreasing

map T.

766

FRANK

UHLIG,

TIN-YAU TAM, AND DAVID

The following result is useful in the investigation was also used by Loewy in the investigation

THEOREM 4 [18].

CARLSON

of rank-k nonincreasing

of Problem

maps, and

3.

Suppose that T is rank-k nonincreasing.

Then it is rank-l nonin-

creasing for every 1 > k. Using this theorem,

Loewy has recently

THEOREM 5 [22]. nonincreasing,

proved the following:

Suppose that F is algebraically

closed and k < m. If T is rank-k

and Im T contains a matrix B such that p(B) 2 k + I, then either (I) or

(II) must hold. It is now clear, in light of Theorem set of rank-k nonincreasing

said if T is rank-k nonincreasing

DEFINITION.

5, that in order to completely

A subspace

It is clear that given any l-subspace For

example,

can

one

if p(A) < k for all A EL.

L, one can build rank-k nonincreasing

it is desirable

characterize

the

What can be

Q k for all A E lm T? This leads us to:

and p(A)

I_. is said to be a x-subspace

whose image is L. Therefore inclusion)?

characterize

maps we are faced with the following question:

the

to obtain information

maximal

i-subspaces

about (with

maps

Z-subspaces.

respect

to set

The task seems quite formidable.

Atkinson

and Lloyd [2, 31 and Atkinson

They defined

the concepts

weak canonical Eisenbud

of primitive

form for a %-subspace.

and Harris

roughly equivalent

[14]. They

[l] obtained

and imprimitive Another

recent

on l-subspaces.

Z-subspaces,

and obtained

paper on z-subspaces

state that the problem

to the problem

some results

of classifying torsion-free

%subspaces

of classifying

certain

the maximal

dimension

of a &subspace

of F”‘,“.

Then dim L Q kn.

a

is due to is

sheaves on projec-

tive spaces. The

problem

Flanders

showed

of determining

THEOREM 6 [15]. Flanders

Suppose L is a x-subspace

also characterized

the case where

equality

is attained.

is easier.

He assumed

that

1F 1 > k + 1, and for the case of equality also char F # 2. Meshulam [25] reproduced Flanders’s

results,

removing

the restrictions

on F. His proof uses the KGnig-Egervary

theorem. PROBLEM 2.

We assume that F is algebraically

holds, or m = n and (II) holds, where preserver. direction

The question was obtained

THEOREM 7 [24].

here is whether

closed.

It is easy to see that if (I)

U and V are nonsingular, the converse

is true.

then

T is a rank-k

The first result

in this

by Marcus and Moyls.

Suppose that T is a rank-l

preserver.

m = n and (II) holds, where U and V are nonsingular.

Then either

(I) holds, or

AUBURN

1990 CONFERENCE

Marcus

ON MATRIX

and Moyls assumed

767

THEORY

that char F = 0. Westwick

[29] obtained

the same

result for char F # 0. In their paper, Marcus and Moyls also raised the following: CONJECTURE1.

The conclusion

of Theorem

7 holds for a rank-k preserver,

where

k is any integer such that 0 < k < m. This conjecture is to date not completely resolved. We give some partial results, all confirming the conjecture. Moore [26] proved the case k = 2. Beasley [5, 7, 91 obtained various results.

They include

the confirmation

of Conjecture

1 in the cases

k = 3,

k = m, and k < $n. Recently Beasley has been able to show: THEOREM8 [ll].

Conjecture 1 h&s

solution in case F = a? It turns out that in the problem

Why do we get a complete of characterizing

rank-k preservers,

role, much as z-subspaces

a certain

family of subspaces

are associated with rank-k nonincreasing

plays an important maps.

L of Fmq” is said to be a k-subspace if p(A) = k for any

DEFINITION. A subspace

AEL,

if F = c.

AfO.

There are several papers dealing with k-subspaces.

Earlier

ones are due to West-

wick [3O] and Beasley [6]. They obtained bounds for the dimension of these subspaces. Beasley, for example, showed that if L is a k-subspace then dimL
F = @ is made. His result is somewhat too invol&l

based on Sylvester’s

work, Westwick

to be stated here.

[31] was able to show that if L is a

k-subspace of G”‘* ” then dimLgm+n-2k+l. Based on this inequality, Beasley was able to cover (for F = G) the cases k > fn that were not covered

PROBLEM 3.

in his earlier work, thus obtaining Theorem Suppose that (r, s,

t) is a fixed inertia triple, and let T : H,, + H,.

Suppose that there exists a nonsingular

(“1

8.

S E en*” and E such that either

T(A)

= ES*AS

Or

(“I)

T(A)

= ES*A’S,

where E = 1 if r # s and E = f 1 if r = s. Then T is a G(r, s, t)-preserver.

768

FRANK UHLIG,

TIN-YAU TAM, AND DAVID CARLSON

The obvious analogue holds for S,. Johnson and Pierce showed:

THEOREM 9 [17].

Suppose

that T

: H,, --) H,, is an invertibb G( r, s, t)-preserver. 0, 0), (0, n, 0), (0, 0, n), (n /2, n/2,0). Then either

Suppose that (r, s, t) is not one of (n, (V) or (VI) must hold. The corresponding

result for S, also holds. What about the four exceptional

The set G(O,O, n) consists of 0, so it is of no interest.

classes?

We clearly have G(0, n, 0) =

- G( n, 0,O). The class G( n, 0,O) consists of all n X n positive definite matrices, set of G(n, O,O)-preservers definite matrices

consists

so the

of all linear maps that map the set of positive

into itself. This is a well-known

open problem.

Pierce

and Rodman

showed:

THEOREM 10 [27]. invertible G( n/2,

Suppose

n/2,0)-preserver.

n is an even integer,

n > 4, and T

Pierce and Rodman also characterized

the set of G(l, 1, 0)-preservers,

contains the set given by (V) and (VI). The proof of Theorem proof

of Theorem

subspaces

of en,

9.

It uses the Grassmannian,

and the gap metric

the real symmetric

THEOREM 11 [20].

Suppose

such that T(A)

elements

are the

It should be noted that

n > 4, and T : S, -+ S, is an

Then there exist nonsingular

11 uses a result of Friedland

- I + 2) contains a nonzero

S E R n, n and E = f 1

9, 10, and 11 assume

Loewy and Pierce

1, any subspace

and Loewy [16] which states L of S, with dim L > i(l -

matrix whose largest eigenvalue has multiplicity

least 1. The method of proof of Theorem Theorems

whose

= ES~AS.

The proof of Theorem

dropped?

which strictly

10 is different from the

10. However, we managed to prove

n is an even integer,

that given any 1 such that 2 Q I Q n I)(2n

a space

is put into this space.

case is not covered by Theorem

invertible G( n 12, n /2,0)-preserver.

: H,, + H,, is an

Then (V) or (VI) must hoM.

11 can be used to prove Theorem that

T is invertible.

at

10 as well.

Can this assumption

be

[23] gave a positive answer for the class G(l, 1, n - 2) in

case n 2 3. Johnson and Pierce [17] gave a positive answer for the classes G(n and G(k + 1, k,O), and therefore

also for G(l, n - 1,0)

1, 1,O)

and G(k, k + 1,O). They also

stated:

CONJECTURE 2.

Suppose that n > 3 and rs > 0. If T is a G(r, s, t)-preserver,

then

either (V) or (VI) must hold. We have

THEOREM 12 [21].

Conjecture

The proof of Theorem earlier. The assumption class

G(r, 0, n - r),

2 holds ifr

12 relies heavily on rank-k nonincreasing

rs > 0 in Conjecture

where

# s.

0 < r < n. The

2 is essential. map

T

maps, discussed

Indeed, consider now the

: H, + H, defined by T(H) =

AUBURN 1990 CONFERENCE

ON MATRIX THEORY

(tr H)I,. $ 0 is easily seen to be a singular G( r, 0, n - r)-preserver. showed:

THEOREM13 [4]. Let T : H,, + H,, be a G( r, 0, n - r)-preserver, p(T) > r2. Suppose that 0 < r < n. Then (V) or (VI) must hdd.

769 Baruch and Loewy

and suppose that

It can be shown that the bound r2 cannot be improved. It comes from the possible dimensions of faces of the cone of n x n positive semidefinite matrices in H,, which are 12, 1 = O,l,. . . n. In the real symmetric case r2 should be replaced by $-(r + 1). The case r = 1 was proved earlier by Loewy [19].

REFERENCES 1

2 3 4 5 6

M. D. Atkinson, Primitive spaces of matrices of bounded rank II, J. Au&d. Math. Sot. Ser. A 34:306-315 (1983). M. D. Atkinson and S. Lloyd, Large spaces of matrices of bounded rank, Quart. J. Math. Oxford 31:253-262 (1980). M. D. Atkinson and S. Lloyd, Primitive spaces of matrices of bounded rank, J. Au&d. Math. Sot. Ser. A 30:473-482 (1981). M. Baruch and R. Loewy, in preparation. L. B. Beasley, Linear transformations on matrices: The invariance of rank k matrices, Linear Algebra Appl. 3~407-427 (1970). L. B. Beasley, Spaces of matrices of equal rank, Linear Algebra Appl. 38:227-237 (1981).

7

L. B. Beasley, Linear transformations which preserve fixed rank, Linear Algebra AppZ. 40:183-187 (1981). 8 L. B. Beasley, Linear transformations on matrices: The invariance of sets of ranks, Linear Algebra Appl. 48:25-35 (1982). 9 L. B. Beasley, Rank-k preservers and preservers of sets of ranks, Linear Algebra Appl. 55:11-17 (1983). 10 L. B. Beasley, Linear transformations preserving sets of ranks, Rocky Mountain /. Math. 13:299-307 (1983). 11 L. B. Beasley, Linear operations on matrices: The invariance of rank-k matrices, Linear Algebra Appl. 107:161-167 (1988). 12 P. Botta, Linear maps preserving, rank less than or equal to one, Linear and Mu&linear Algebra 20:197-201 (1987). 13 G. H. Chan and M. H. Lim, Linear transformations on tensor spaces, Linear and Multilinear Algebra 14:3-9 (1983). 14 D. Eisenbud and J. Harris, Vector spaces of matrices of low rank, Adu. in Math. 70:135-155 (1988). 15 H. Flanders, On spaces of linear transformation with bounded rank, J. I,.ono!on Math. Sot. 37:10-16 (1962). 16 S. Friedland and R. Loewy, Subspaces of symmetric matrices with a multiple first eigenvahie, Pa&c J. Math. 62:389-399 (1976).

770 17

FRANK UHLIG, C. R. Johnson and S. Pierce,

TIN-YAU TAM, AND DAVID CARLSON

Linear maps on Hermitian matrices: The stabilizer of

an inertia class II, Linear and M&&near 18

T. J. Laffey and R. Loewy, Linear and M&linear

19

Linear

Algebra 26:181-186

R. Loewy, Linear transformations Appl. 121:151-161

Algebra 19:21-31

transformations

(1986).

which do not increase

rank,

(1990).

which preserve or decrease rank, Linear Algebra

(1989).

20

R. Loewy, Linear maps which preserve a balanced nonsingular inertia class, Linear

21

R. Loewy,

Algebra Appl. 134:165-179 Linear

Appl. 11:107-112 22

(1990).

maps which preserve

an inertia class,

SIAM ].

Matrix Anal.

(1990).

R. Loewy, Linear mappings which are rank-k nonincreasing,

submitted for publica-

tion. 23

R. Loewy and S. Pierce,

24

M. Marcus and B. N. Moyls, Transformations Math. 9:1215-1221 (1959).

25

R. Mesbulam,

of matrices,

spaces,

Pacifi

Quart.

J.

J.

Math.

(1985).

C. F. Moore, Characterization a Tensor

on tensor product

On the maximal rank in a subspace

Oxford 36:215-229 26

unpublished.

Product

Space,

of Transformations

Univ. of British

Preserving

Columbia,

Rank Two Tensors of

Vancouver,

B.C.,

Canada,

1966. 27

S. Pierce and L. Rodman, Linear preserves balanced inertia, SZAM I.

28

J. Sylvester, conditions,

On the dimension

R. Westwick,

30

(1967). R. Westwick, 5:49-64

31

on tensor

satisfying rank

(1986). spaces,

Spaces of linear transformations

Pucifz

J.

Math.

23:613-620

of equal rank, Linear Algebra Appl.

(1972).

R. Westwick, 20:171-174

MATRICES INEQUALITIES by ROY

Transformations

(1988).

of spaces of linear transformations

Linear Algebra Appl. 78:1-10

29

of the class of Hermitian matrices with

Matrix Anal. Appl. 9:461-472

Spaces

of matrices

of fixed rank,

Linear

and M&linear

Algebra

(1987).

WITH

POSITIVE AND

DEFINITE

LINEAR

HERMITIAN

PART:

SYSTEMS

MATHIAS41

Let M,(C) real] matrices.

[respectively, We call A EM,

M,(R)]

denote the space of n x n complex [respectively,

positbe definite (respectively,

positioe semideftnite) if A

41Department of Mathematics, The College of William and Mary, Williamsburg, VA 23187. Research supported by an Eliezer Naddor postdoctoral fellowship in the Mathematical Sciences from the Johns Hopkins University during the year 1989-90 while the author was in residence at the Department of Computer Science at Cornell University.

AUBURN

1990

is Hermitian

CONFERENCE and

Hermitian

part

ON MATRIX

x*Ax > 0 (respectively,

and the skew-Hermitian

further

definite

results

definite

can be found

largest

Frobenius

We write

norm

use

that

A,,

being

have many

some

[respectively,

1

x

1

analogous

in this

synopsis.

and

to those Proofs

and

norm

the

alge-

. 11 2) and

(I]

semidefinite.

His proof

A,,

to denote

on M, by

Ar = b by Gaussian

is backward

being

algorithm

&,(A)]

of A. The spectral

stable

if

of A [JC~(A) = II AlI 2 II A-‘11 2 = L(

Cholesky

properties

of these

eigenvalue

solving

precise)

is not too large.

product

part

&,,,,(A)

smallest]

(I] . ]IF) are defined

to be

number

[with

The

in [6].

we

[8] showed

decomposition,

outer

x E C”.

2,

discuss

A Q B if B - A is positive

tive definite]

all nonzero

A + A*

=

Hermitian We

[respectively,

Wilkinson condition

) 0) for

of A is

matrices.

A is Hermitian,

If braically the

part

with positive

of positive

r*Ax

of A is

H(A)

Matrices

771

THEORY

was based

(n -

1)

x

elimination

A is positive A)/&,,(

(the Cholesky definite

A) because

and

on the fact that if we partition

(TV -

l)], then

we are left with the (n -

1)

after

the

A is posi-

one step

A as

of the

x (n - 1) matrix

A21 42

A=A,,-

-,

All

which

is also positive

definite

and for which

and Lax(A) which

together

imply

K~( A)

Q K~(

(2)

Q %,m,( A).

A). This fact ahm

the induction

to proceed.

772

FRANK UHLIG, When

A has positive

definite),

the leading

definite

principal

Gaussian elimination

TIN-YAU TAM, AND DAVID

Hermitian

minors

of

and run more

so one would like

efficiently,

elimination

the inequalities generalize

is not necessarily and so one

without pivoting (but there is no guarantee arithmetic).

positive definite.

(but

A are nonzero

be stable in fmite-precision Gaussian

part

without

Algorithms

to determine

will be backward

This is the motivation

for this research.

(1) and (2) to matrices argument

In [4] it was argued Gaussian elimination

with positive

heuristically

precision

definite

Hermitian

solving

Ax = b provided

that the algorithm

conditions

under

stable assuming Our approach

definite

will

structure

that

which H(A)

is

is to generalize

Hermitian

part and then

to these matrices. that

without pivoting,

in which u is machine

positive

can perform

that do not pivot preserve

pivoting

Wilkinson’s

CARLSON

i,

the solution

and c, is a linear function

part. Using this result,

to

Ax = b computed

by

( A + E) 12= b, where

satisfies

of n, when

A has positive

they argued that it is safe not to pivot when

the ratio

1)H + STH- ‘S II2

IIAll, is not large. (It is easy to show that this quantity is at least I.) We make their argument rigorous

and give a sufficient

arithmetic

(without pivoting)

completion

with positive

son’s argument, matrices First

condition

for the

pivots.

Our approach

definite

let us consider

Hermitian

some

part, in particular

the properties

a matrix.

Previous

Hermitian

part [5, 1, 2, 71 has concentrated

interlacing

inequalities

on matrices

for the arguments

this

results

follows

from

the

identity

part to run to

(Theorem

of the Hermitian

with positive

definite

part of the inverse of such

definite

on the properties

of the eigenvalues

of Wilkin2) involving

and submatrices.

of matrices

with positive

ues of AA- l* all have unit modulus.) We start by determining the Hermitian

in finite-precision

Hermitian

of inequalities

part, their inverses

of the properties

Hermitian

research

definite

is based on a generalization

and for this we need a variety

with positive

LU factorization

of a matrix with positive

Hermitian of AA-‘*,

of AA-‘*.

or skewespecially

(The eigenval-

part of the inverse of a matrix. The proof of + Y-l = X-‘(X + Y)Y-’ applied with

X -’

X = A, Y = A*. LEMMA 1. Let A have positive definite Hermitian part, and let H = H(A) and S = S(A). Then A is invertible and A-’ has positive definite Hermitian part given by

A-’

H(A-‘)

=

+

2

A-‘*

= (H + s*H-‘s)-~,

AUBURN

1990 CONFERENCE

ON MATRIX

THEORY

and we have the inequalities IIA-‘112 Define

IIH-‘II,

G

the functions

llAll,< llff+S*H-‘Sl12.

ad

j and K~ on the cone of positive

definite

matrices

by

( A-ly‘-l*)jl~H+s*H-ls

f(A)=

and K+)

where

and S = S(A).

H = H(A)

Many results matrices

involving

with positive

that K~( A) =

=

K~(

1IH+S*H-‘SI12IIH-‘ll,,

Notice

the condition deftnite just as

A-‘),

is convex with respect

that

Hermitian K~(

A) =

A) =

K~(

numbers

part when K~(

to the partial order

A) if A is positive

K~(

of positive K

definite

is replaced

matrices by

K*.

definite. hold for

Notice ako

A-‘). Th e next result, which states that f ,< , is crucial, and numerous inequalities

follow from it (we only state a few here).

2.

THEOREM

order

Q

Let

f be

defined

by (5). Then f is convex with respect

. That is, for any A,, As EM,

with positive definite Hertnitian

to the partial part and any

t E [O, 11,

f(% + (1 - +z) Furthermore, partitioned

suppose

that A, B EM,,

have positive definite Hermitian

(7) part and A is

as

A=

and bt

Q tf( 4) + (1 - t)f( As).

f( A)

with

be partitioned

A,, E Mn-k,

in the same way. Then

1.

Ijf(A22

2.

f(4,

3.

f(h)

Gf(Ah,

4.

aH(A)

2 II Allzll A-‘112

5.

q,(

6. 7.

K”(A)

- ~zlK&)((,

@

A,, E Mk,

4422) Gf(4,

~ljf(A)22112~ ef(A)zz,

= KZ(A) > 1,

tA + (1 - t) B) ~mmax{~~(A),K~(B)}f~anytE[O,1], 2 KH(&

@ AZ,) 2 KH(&)~

KH(A) 2 KH(AZZ - Az,4?‘4,).

Notice that if A has positive of A then, combining

definite Hermitian

4 and 6, we have

K~(

B) Q

part and B is a principal submatrix K~(

B) Q K”(A).

That is, we have a

774

FRANK UHLIC,

bound

on the e-norm

positive definite

condition

Hermitian

number

TIN-YAU TAM, AND DAVID

of any principal

submatrix

CARLSON

of a matrix with

part.

We also prove the following perturbation

result for the function

f by a straightfor-

ward argument.

Let A = H + S have positive definite Hermitian part, and let E be such

LEMMAS. that

[If(A) -f(A+E)II

~~ll~ll,Il~-‘IlzlI~+~*~-‘~l~~=~ll~ll~~~(~)~

Note that if we restrict stronger than (9):

A and E to be Hermitian,

IIf(A+E) -.@)]I,= regardless

of the value of

purposes,

since

A). However,

K~(

our results

in Theorem

Now we consider algorithm

without

the backward

pivoting

part using finite-precision for a complete

arithmetic.

discussion.

matrices

to Hermitian

LU factorization definite

Hermitian

a rigorous

see [3]

are real in our final result.

part of A, is the same as the Hermitian

error of the LU factorization

solution to Ax = b computed

for our matrices,

are many reasons to avoid pivoting;

already proved above and an induction

This in turn can be used to derive

is

in [S] (up to a constant).

of the outer-product

to a matrix with positive

There

[In this case, (A + AT)/2, the symmetric

R =

restricted

We will assume that all matrices

A.] Using the inequalities bound on the backward

the bound (9) is quite satisfactory

stability

when applied

which

IIEII,,

4, when

reduce to the bounds proved for Hermitian

then we have a result

(9)

argument,

part of

we have a

of A with H(A) positive definite.

bound

by Gaussian elimination

on the backward without pivoting.

error

in the

Given a matrix

[ bjj]. we define I B I = [ I bij I]. THEOREMS.

and S = S(A).

Let A E M,(R) have positive dejkzite Hermitian part, and let H = H( A)

Then L and V, the exact LU factors

IIlL Let u be machine

precision.

of A, satisfy

IUl(I,~nllH+STH-‘SII~.

lf

%n3/2~H(

A)” < 1,

(10)

AUBURN

1990 CONFERENCE

775

ON MATRIX THEORY

then the LU factorization algorithm runs to completion and the computed factors i and fi satisfy IIifi-

7un3”]]H+

All,<

S*H-lS]Js.

(12)

Block LU factorization algorithms (see, e.g., [3, Algorithms 3.2.5, 3.2.61) typically will not produce exactly the same computed LU factorization as scalar algorithms (e.g., [3, Algorithm conclusions, x2(B)

3.2.4]),

but

one

with different

may

constants

< xn( A) for any submatrix

expect

the error

in (11)

analysis to produce

and (12),

since

we have

B of a positive definite matrix

similar

shown

that

A.

REFERENCES K. Fan, On real matrices

with positive definite symmetric

M&linear

(1973).

Algebra 1: l-4

K. Fan, On strictly dissipative matrices, G. Golub and C. Van Loan, Baltimore,

component.

Linear Algebra AppZ. 9:223-241

Linear and (1974).

2nd ed., Johns Hopkins U.P.,

Matrix Computations,

1989.

G. H. Golub and C. Van Loan, Unsymmetric

positive definite linear systems, Linear

(1979).

Algebra AppZ. 28:85-97

C. R. Johnson, An inequality for matrices whose symmetric Linear Algebra AppZ. 6:13-18 R. Math&,

Matrices

with positive definite Hermitian

systems, SIAM J. Matrix Anal.

Appl., to appear.

R. C. Thompson,

matrices

11:255-269

Dissipative

part is positive definite,

(1973).

and related

part: Inequalities results,

and linear

Linear Algebra

AppZ.

(1975).

J. H. Wilkinson, International

A priori error

analysis of algebraic

Congress of Mathematicians,

WARING’S

PROBLEM

FOR

by BORIS

REICHSTEIN4’

processes,

in Proceedings

of

1968, pp. 629-640.

SMOOTH

CUBIC

Let (p be a cubic form, i.e., a homogeneous

CURVES

polynomial of degree 3. We would like

to determine the smallest integer k such that (p can be expressed as a sum of cubes of k linear forms and find all corresponding

representations

tions. This question is known as Waring’s proposed belonging

an algorithm

that allows one, for some values of k and for cubic

to a wide class of forms

42Deparhnent 20064.

that we call Waring presenta-

problem for cubic forms. In [l, 21 we have

in n variables,

to find almost

of Mathematics, The Catholic University of America,

forms

all the Waring

Washington,

D.C.

776

FRANK UHLIG,

presentations algorithm, almost

or to prove however,

all Waring

presentations

form

form

written

in canonical

linear

forms

form

appear

by cube

root

two-dimensional. investigate

In this case

coordinates in Waring

is unique

that

the cubic

presentations.

formulas

It turns

vanishes;

the variety

X,

obtained

by methods

of algebraic

that, opposite to the case of quadratic forms, X,

and

X,

work

of the we find

corresponding

(p smooth.

For the

multiplication

of Waring

inspired

geometry. is irreducible

of

k = 3 if and only if

k = 4. In the first case

of terms

in this

work

for the coefficients

out that

otherwise

(up to reordering

formulas

curve

we also call the form

of 1). In the second case the variety The

Some of the final steps

n > 3. In the present

we find explicit

[3, p. 3021 of the curve

presentation

exists.

only to the case for n = 3 provided

(p is smooth.

that

the j-invariant Waring

that no such presentation

are applicable

to the given

TIN-YAU TAM, AND DAVID CARLSON

Zinovy

the

of each

presentations

is

Reichstein

to

It has been established and irrational [4].

In order to derive the desired formulas we first exploit those steps of the algorithm in [1, 21 that are applicable to the case n = 3. The algorithm requires form (p = x: + 3x3(x:

Any smooth cubic fom

LEMMA 1. appropriate

linear

transfotmation

cp in three variables

of variables,

Let cp(zl, z2, z3) be an arbitrary

Proof. linear

cp to be in the

+ ~22) + (pp(x1, x2) where p2 is a cubic form in xl and x2.

can be written,

after an

as

cubic

form.

It is known

that there

exists a

T, : ( zlr z2, z3) + { y,, yz, y.J such that

transformation

$0( y1, yz> Y3) = Y? + Y$ + Y33+ 3OYl YzY31

(2)

0 # 0 [3, p. 2931. Let

xl

Tz: { ~1, YZ~ ~3) -+

+

ix,

J;;’

x1

J;;’

-

ix, ~3 I

Then the form Q becomes T,T,

maps

the arbitrary

as in (1) with p = 2/ 0.

form

Since the inverse transformation find all Waring presentations In order prescribes

T; ‘T;’

n

maps (1) into (p(zl, z2, .z3), it suffices to

for the form (1).

to find a Waring

introducing

Thus, the linear transformation

cp into the form (1).

presentation

the following matrices:

of the form (1) the algorithm

in [1, 21

AUBURN These

1990

matrices

CONFERENCE commute

if and

expressible

as a sum of cubes

~1 = z/2,

then

gl=

ON MATRIX only

if $

c3 = 8. Thus,

to [2], these

rp(%~,A,)

vectors

the Waring

5 j=l

: g3jkTl

gli,

g,i,.

. . , g3i are coordinates

(

-fix,+

Therefore,

for the form

form

g,=(-$-

$,lJ1.

presentation

(gljrl

+

g2jx2

+

g3jx3)3.

(4)

dj

of the vector

vGx,+2x,

3

gi, i = 1,2,3.

- fix,

-

&x2

Thus,

+ 2x3

3

+ 2% (2) with

(2) is

of Ds are

define

=

the

forms if and only if o = 0 or o3 = 8.43 If

{l,o,2E]t, g,=(-$.$Ji’.

According

where

= $. Then

of three linear

the eigenvectors

777

THEORY

I u = 2 we obtain

i the following

2%

(5) I

representation

as a sum

of three cubes:

(

(l-iv5)yl+(1+iv%)y2+2y3 2fi

i (l+ifi)yr+(l-iv5)y2+2y3 2%

13 +

1.3

430f course, this result is consistent with the fact that the j-invariant of the form (2) is a3(8 - a3)/(03 + 1)3 [3, p. 3021 and therefore the form (2) with (r = 0 is equivalent to the forms (2) with o3 = 8.

778

FRANK UHLIG,

TIN-YAU TAM, AND DAVID

CARLSON

From now on we assume

u # 0,

Under these conditions cannot be expressed

a3 - 8 # 0,

the matrices

2pa -

(3) do not commute

1 # 0.

(6)

and the form (2) as well as (1)

as a sum of cubes of three forms. In order to express

cubes of four linear forms we introduce,

in accordance

with the algorithm

c as a sum of in [l, 21, the

following matrices:

fii=

; 1P

-p

0

r

0

0IP

0

r

OS

p

Here

p, 9, r, s, and t are arbitrary

show that the commutator

0

I

El,=

)

i

complex

[Dir Da] vanishes

2~’ + r2 9=

1

P

s=

parameters.

-

p(2$

-

2,s

P(2P2

For the further

calculations

this expression the characteristic

polynomial

4(A)

=x4+

-

ci = -r{2P4

c3 = P{2Pz

-

-

1)

- 1

I)

(8)

.

for s only. After substituting

c,x3 + c2x2 + c,x + Co),

1

+ ( p2 + r2 + 3)9

-

+ 2r2

( p2 - 3r2

calculations

for

- N) of 6,:

( p” - r2 + l)$

cs = - {2P4 + (2p2

I

Straightforward

where co = - (2~” -

(7)

in (7) we obtain the following expression

4(X) = det(fir

‘i2$

t

- 3)

we shall need the expression

from (8) into the matrix 6,

r 9

- p2 + 3r2

- p2 + 3rs

rr(@

0

0 I 100’ 0 9

if and only if



r=

-P

-n 0 r

+ r”),

- 2( p2 - 2r2

+ 1)~~ -

+ 1)).

+ l)},

( p2 + r2 + l)},

(9)

AUBURN

1990 CONFERENCE

If h,, &, Xs, X, are eigenvalues of d,, corresponding

gi=

eigenvectors

779

ON MATRIX THEORY

they are distinct for almost all p and r. The

are

{-p&(P+X,),r(p&-

+CLjtT

l),-P(~+Ai),-A~+~(S+l)

g+

i = 1,2,3,4.

Each of the eigenvalues

Xi is a function of the parameters

(o(% X2-Tx3) =

5

:

( gljxl

+

(II)

p and r. The formula

g2jx2 +

(12)

g3jx3)3p

j= ’ g3jkGldj where

{ gii, g2i, g3i, gdi} are coordinates

find Waring presentations the coordinates

of the vector

that we have found so far, none of the coefficients

vanishes. Now we will find Waring presentations linear forms the variable the scalar product

(

x(

x3 does not appear explicitly.

of x3

that there

exists no

of x3; otherwise,

Let again p( xi, x2, x3)

a new cubic form

x2>

x3)

=

v(

x1,

x2,

x3)

-

p and r are arbitrary complex parameters,

defined later. We compute The commutator

Notice

, ) defmed in [l, 21 would be degenerate.

Xl,

of

of the form (1) such that in one of the

of (1) where more than one linear form is independent

be the form (1). Introduce

where

allows us to

of gi from (11) into (12), we obtain almost all Waring presentations

the form (1). In all Waring presentations

representation

gi (i = 1,2,3,4),

for almost all values of p and r (see [I, 21). Thus, substituting

the matrices

4(

PXl + fl2),,

(13)

and q is a function of p and r to be

(3) for the form (13) and their commutator.

vanishes if and only if

2/P4=

1

PP( P’ - 3r2)

If p and r are arbitrary parameters



(14

and q is as in (14), the form (13) is expressible

as a sum of cubes of three linear forms. Let s = p/r

be a nonhomogeneous

parameter.

780

FRANK UHLIG,

TIN-YAU TAM, AND DAVID CARLSON

The matrix D, associated with the form (13)-(14)

[see (3)] now becomes

_ /q ss + 3) - ss P(s2 D, =

- 3) _ p2(s2 -1)-l

- +;:31/

0

P( s2 - 3) 1

I Its characteristic

0

polynomial 4(X) = det(N

01

- Dr) is

(s2 + 1) 2p

f&(X) =x3+

-

1

x2 +

P(S2 - 3)

cl”( ss + 1) - 2( s2 - 2) x _

cl”( s2 -

ss - 3

1) -

1

(16)

q-3)

If h,, &,, A3 are the roots of the last polynomial, the linearly independent s) eigenvectors

(for almost all

of the matrix (15) are

g, = { Xi[ /.&(3 - S2)Xi + $(l

- s”) + 11,

XiS(2$

(i = 1,2,3).

(15)

.

-

l),p(3

- S2)hi + p2(1 - s”) + l}t

As above, we obtain the desired set of representations

of the form (1):

+&-3/4x,x,2=

SXl + x2) 3+

c

j=l

1

g3j

(

g12j +

) gij

+

( gljrl

+

g2jx2

+

g3jx3)3.

dj

From the results of [4] it follows that the roots of the polynomial (9)-(10) made rational by a substitution of variables

can be

p and r if and only if /.I~+ 4 = 0 and hence

u3 + 1 = 0. In this case the cubic form (8) [and therefore

(9)] is a product

of three

AUBURN

1990 CONFERENCE

ON MATRIX

THEORY

781

forms. If (r = - 1, we have

Y?+Y23+Y33-3Y1Y2Y3=

(Yl + Yz + YB)(%Yl

+ EZYZ + YB)(%Yl

+ ElYZ + Y3)p (17)

where cl and Ed are cube roots of 1:

The form (17) can be expressed

as a sum of cubes of four linear forms as follows:

It is trivial to obtain the similar formulas for the form (8) with (I = E~ and u = Ed.

REFERENCES B. Reichstein,

An algorithm

to express

a cubic form as a sum of cubes of linear

forms, in Current Trends in Matrix Theory, Proceedings of the Third Auburn Matrix

Theory Confwence, North Holland, 1987, pp. 273-283. B. Reichstein, appear.

On Waring’s

problem

for cubic

forms,

Linear

Algebra A&.,

to

782

FRANK and H. Knorrer,

UHLIG,

3

E. Brieskom

4

B. Reichstein, Z. Reichstein, On Math. -I., submitted for publication.

TIN-YAU

Plane Algebraic Waring

TAM, AND DAVID

Curoes,

Birkhsuser,

presentations

CARLSON

1986,

of plane

pp.

cubits,

1-721. Michigan

OUTPUT FEEDBACK CONTROL OF LINEAR REPETITIVE PROCESSES-A 2D POLYNOMIAL MATRIX APPROACH by

E. ROGERS44 and

Repetitive,

or multipass,

can be illustrated piece,

H. OWENS45

D.

involved

processes

by considering is processed the output,

forcing function

on, and hence

length a is constant, hence

operations

of sweeps,

or passes,

contributes

to, Yk+l(t),

processing

structural problems

and Owens

of the

over a by Y,(t),

processes

[Rogers and Smyth length

also

(1989)],

to the current M, or simply

The

termed unit-memory.

essential

studies

system. obviously appropriate

Roesser

example,

image-

model

[Rogers

Such processes and

process

are

unit-memory

can be regarded

bench-mining

systems

M > 1 passes which contribute termed

nonunit-memory

in the special

case

of

of M = 1.

as the natural generalization

results Further, totally

control

problem

Such behavior

on actual

is easily

processes.

from simulation it is clear

stability

and

Rogers

in the special from

control

[see,

for

example,

the

of its

control

44Department of Aeronautics 45SchooI of Engineering,

action in

methodology

and Astronautics,

University

studies Smyth

is required {Yk}kal.

is required, linear-dynamics

in Rogers

University

of Exeter, U.K.

and

is the

increase

possible

in amplitude and observed (1989)

contains

case of one type of bench-mining

appearing

treatment

process

which

in simulation

example,

appropriate

feature

analysis

a repetitive

generated

For

studies

that

undesirable

for

{ Yk}k 5 1 of oscillations

sequence

aspect is not necessarily feedback-based. A rigorous stability theory for the constant-pass developed

operations,

and 2D

state-space

so-called

it is the previous

pass profile.

unique

in the output

extensive

subclasses

counterpart.

from pass to pass. in field

certain

by the

exist-for

where

nonunit-memory

Hence a nonunit-memory unit-memory presence

described

0 Q t < a,

(1990)].

Repetitive directly

type

exist between

a

finite pass

acts as a forcing function on, and

0 6 t < a, k > 0, and is therefore

similarities

In

To introduce

Industrial examples include long-wall coal cutting and certain metal-rolling and strong

tool.

pass acts as a

suppose that the necessarily Y,(t)

which

or work-

of the processing

to, the next pass profile.

is one where

process

action

the material,

on the current

and denote the pass profile generated

a repetitive

contributes

by a recursive where

or pass profile, produced

[Rogers and Owens (1990)],

definition

k > 0. Then

characterized

by a series

such operations, formal

are

machining

and

Owens

to prevent

this

In particular, where

the

an latter

case has been (1990)]

using

of Southampton, U.K.

an

AUBURN 1990 CONFERENCE

ON MATRIX THEORY

783

abstract model formulated in functional-analysis terms which includes almost all known examples, or subclasses, as special cases. This has shown that two distinct concepts are required. These are termed asymptotic stability and stability along the pass; the former is a necessary condition for the latter, which is clearly required for all practical purposes. In effect, stability along the pass demands that, given well-defined inputs or driving terms, { Yk)k a r converges strongly to a steady, or limit, profile irrespective of the pass length a. The results of applying this abstract theory to a wide range of special cases have been reported [Rogers and Owens (1989a, b)], and it is known that the resulting conditions have well-defined physical interpretations. In particular, the results for the subclass of so-called differential nonunit-memory linear repetitive processes are well known and understood. This subclass includes the bench-mining systems as special cases and has the following state-space model:

y/r+l(t)= ‘X,+1(t) +

5

Djyk+l-j(t)-

j=l

Alternatively, suppose, for simplicity, that X k+r(O) = 0, k 2 0, and Y,_j(t) = 0, 0 Q t < a, 1 < j < M. Then it can be shown [Rogers and Owens (1990)] that (1) has the 2D transfer-function matrix description

Y(s, 2) = G(s, z)U(s, z), where the m

x 1 2D

transfer-function

Go(s)

(2)

matrix G(s, z) is defined by

= C(s& - A)-‘B

(4)

and Gj(s)

= C(s& - A)-‘Bj_,

+ I+,

l
(5)

FRANK UHLIG,

784 This subclass

has clear

structural

TIN-YAU TAM, AND DAVID CARLSON

similarities

to standard

or, in repetitive-systems

language, conoentional linear systems. Suppose that the previous pass terms are deleted from (l), the subscript Then

k + 1 is dropped, and the concept

the result is just the standard

(A, B, C), which is termed has transfer-function B = 0, B,_r

state-space

model

of a pass length is irrelevant. characterized

by the triple

the derioed conoentionaZ linear system in this context,

matrix

G,,(s), i.e. a constituent

= 0, Di = 0, 1 < i # j Q

and

element of G(s, z). Similarly, set

M, drop the subscripts, and ignore the concept of

a pass length. In this case the result is just the standard state-space model characterized by the quadruple

(A,

Bj_l, C,Dj),1 Q j Q M, which is termed the jth associated

conoentional linear system, and has transfer-function constituent The

structural

developing

matrix

Gj(s) of (5), i.e.,

similarities

a comprehensive

summarized control

above

theory

have

motivated

an approach

for (1) based on using directly

extending (where possible) the well-established

conventional

feasible stability tests (asymptotic

pass) based on G,(s)

Q

however,

and Gj(s), 1
to

and/or

linear systems theory.

date, this has yielded computationally remains,

another

element of G(s, z).

To

and along the

M, or their state-space realizations. There still

work to be done,

aspects of the role of G( s, z). In particular,

and this exposition

will focus on other

the following results, conjectures,

and open

questions will be addressed.

THEOREM 1.

Theprocess (1) is stable along the pass if, and only if, the characteris-

tic polynomial p(s, 2) satisfws Res>O,

P(S,2) f 0,

(.zI >l,

(6)

where

p(s,z):=

sl, - A c

-W

Q(z)

and

B(Z)

Proof.

=

5 Bj_lz-j,

Q(z)

j=l

= ,$Djz-j. 1, -

See Rogers and Owens (1991).

Suppose that (1) is embedded in an output-feedback-based unity-negaTHEOREM 2. tive-feedback control scheme dej%ed by

U(s, 2) = K(s, z)e(s,

2) = K(s,

z)[R(s, 2) - Y(s, z)],

(9)

AUBURN

1990 CONFERENCE

where R( s, z) is the reference

signal and K(s, z) has the structure

the open-loop forward-path p,( s, z) respectively.

785

ON MATRIX THEORY

and closed-loop

characteristic

of (3). Further,

polynomials

denote

by p,,(s, z) and

Then

Pc(s, z) =p(s, Z)I> Po(STz)

(10)

-

where the return difffence

matrix T(s, z) is given by

T(s,

z) = Z, + G(s,

z)K(s,

2).

(‘1) n

See Rogers and Owens (1991).

Proof.

DEFINITION. The natural definition of a pole for (1) is a pair of complex numbers (& z^)which satisfy p(s, z) = 0.

CONJECTURE. A pole of (1) has a well-defined physical interpretation used to characterize

stability in a similar manner

which can be

to its conventional

linear-system

counterpart.

OPEN QUESTIONS. 1.

What is the equivalent of the Rosenbrock

system matrix for (l), and how (if at

all) can it be used to define and answer fundamental 2.

systems-theoretic

questions?

How (if at all) can p(s, z) and T(s, z) be used in the development

controller

of efficient

design algorithms?

REFERENCES Rogers, E. and Owens, D. H. 1989a. processes, Rogers,

E. and Owens,

processes

D. H. 1989b.

with non-unit memory,

Rogers, E. and Owens, D. H. 1990. Lecture

Systems, Research E.

Stability analysis for discrete

Stability Analysis for Linear Repetitive Processes, Sci., Springer-Verlag,

D. H.

differential non-unit memory submitted for publication.

to appear.

Modeling and Simulation Studies on Bench Mining

Report, Div. of Dynamics

and Owens,

linear multipass

ZMA J. Math. Control Znfonn. 6(4):399-409.

Notes in Control and Inform.

Rogers, E. and Smyth, K. J. 1989. Rogers,

Axis positivity and the stability of linear multipass

Linear Algebra Appl. 122/123/124:779-796.

1991.

and Control,

An output feedback

linear

repetitive

processes,

Univ. of Strathclyde. based control Linear

theory

Algebra

for

Appl.,

786

FRANK UHLIG, TIN-YAU TAM, AND DAVID CARLSON

MINIMUM

PERMANENTS

ON CERTAIN

DOUBLY

by SEOK-ZUN

SONG46

I.

AND

MINIMIZING

STOCHASTIC

MATRICES

MATRICES

Introduction For a pair ( p,

q) of positive integers, J,, 9 will denote the p x 9 matrix all of whose

entries are 1. Let the set of all n-square doubly stochastic This

is known

to be a polytope

n2-dimensional

Euclidean

Let D = [d,,J

= {X=

with

n! vertices

in the

[x~,~] ~Q,(x~,~=Owheneverd,,~=O).

Euclidean

X E Q(D). Such a matrix

matrices be denoted by Q,. 1)’

matrix, and let

Q(D) is a face of the polytope

finite-dimensional

(n -

space.

be an n x n (0,l)

Q(D)

Then

of dimension

and hence,

Q,,

space, contains a matrix

being a compact

subset of a

A such that per A Q per X for all

A will be called a minimizing matrix on n(D).

Without any doubt, one of the most interesting

and important problems concerning

the face Q(D) is that of determining

the minimum value of the permanent

the set of all minimizing

on it, of which many studies have been done by

several

authors.

determined

For

matrices

example,

the minimum

Knopp

and Sinkhom

permanents

on Q( Dl),

[6], Mint

function and

[7], and Brualdi

[l]

Q( Dz), and Q( D3), respectively,

where

D,

=

_“_-~~_::-_-l-, ”

*..

D,=

1,n

1

0

0

. . .

0

1

. . .

0

1

10

. . .

1

1

. . .

1

. . .

0

1

..------

1

1’

Jn-z,n

0

1

D,=

i 1

.

1

.

1

. .:’

1

I.

Friedland [4], Hwang [5], Foregger [3], and Chang [2] also determined permanents on certain faces (see [9]).

the minimum

4sDepartment of Mathematics, Cheju National University, Cheju 690-756, Republic of Korea, and Department of Mathematics and Statistics, Utah State University, Logan, Utah 84322-3900.

AUBURN 1990 CONFERENCE

787

ON MATRIX THEORY

In [I], Brualdi determined n( W,), where

the minimum

permanent

and minimizing

1

0

0

..:

1

0

1

0

0

a**

0

1

matrix on

We wanted to extend this result from W,, to V,,,,nr defined as

In [ll], we determined the’ minimum permanent and minimizing matrices on Q(V,,J for arbitrary n. In [lo], we determined them on Q(V,,s) for m = 2 and m 3 5, but not for m = 3,4. In [12], we determined them on Q(V,,,) for all m 2 3 by a method somewhat different from that in [lo]. In [12], we also determined them for W,,” and

W,, .(O), where I

02,”

I m,m 1 I 12,”

WIn.” =

----L_-__

i

1 7l.m ,’

I

1 ’

An-2.m -----

%“(q

I Om-2,n 1------

02 _--F_~___~“_

=

[

I n, m

I

12 I

1

for n ) 2, m ) 3. LEMMA 1 (Foregger

[3]).

Let D = [d, j] be an n x n fdy

indecomposable

matrix, and A = [ ai, j] be a minimizing m&-ix on Il( D). Then A is fdly

(0,l)

indecomposable,

and for (i, j) such that di, j = 1,

per A( i 1j) = per A

if

ai, j > 0,

per A( i 1j) > per A

if

ai,j = 0.

LEMMA 2 (Mint [7J). [d,, . . . >d,],

and if&

If A = [al,. . . , a,] is a minimizing matrix on Q(D), D =

= d2, then

per[cual+/3a2,pa,+ora2,a3,...,a,] fOranyol,@>Owithcu+j3=1.

=perA

788

FRANK UHLIG, TIN-YAU TAM, AND DAVID CARLSON We will call this the aweraging method when c~ = /_?= i.

THEOREM 1. For m > 3, let

Then a minimizing matrix form A on Q( V,,,,3) is

(I.‘) and the minimum permanent is mlam-‘[(m

- l)mb4 + 2maxb2+

P-2)

?a”],

where mu = 1 - 3 b, x = 1 - mb, and b is a real root of 27 + m

m2+ 6m+20

21 - ?. m

9 3 +m” m3

b4+

i

b-$=0.

(1.3)

THEOREM 2. For m p 2, n 2 3, let

I Om-2,n

Wm,Il=

/ m,m 1

I --- L__*Ln__ 2, i J“,rn ,’

Then the minimum permanent un O(W,,,,) m! &c-

1 .

is

4(n - l)(n g+l

- Z),-’

(2.1)

AUBURN 1990 CONFERENCE

ON MATRIX THEORY

789

fwna4,and

(m - l)!

5b + 6mb2

2mmm2

fm n

(2.2)

= 3, where b is the unique real root of the equutiun

llm2b3

- 16mb’ + 9b - 2 = 0. m

(2.3)

We remark that the matrix W,,” is cohesive and not barycentric for m 2 2 and n ) 4. (For definitions, see [l].) By the averaging method of Lemma 2 and the proof of Theorem 2, we obtain the following result about one of the faces of n( W,. J: COROLLARY3.

For m 2 2, n > 3, let

Then the minimum permunent on Q( W,,,, *(O)) is the same as (2.1) in Theorem 2, which occurs at the barycenter b(W,,,, JO)), where the batycenter of Q(D) is giuen by b(D) = P, and the summation extends over the set of all permutation matrices P (llP~D)&<, with P < D, and per D is their number. REFERENCES R. A. Brualdi, An interesting face of the polytope of doubly stochastic matrices, Linear and Multilinear Algebra 17:5-18 (1985). D. K. Chang, Minimum permanents of doubly stochastic matrices with one fixed entry, Linear and M&linear Algebra 15:313-317 (1984). T. H. Foregger, On the minimum value of the permanent of a nearly decomposable doubly stochastic matrix, Linear Algebra Appl. 32:75-85 (1980). S. Friedland, A proof of a generalized van der Waerden conjecture on permanents, Linear and Multilinear Algebra 11:107-120 (1982). S. G. Hwang, Minimum permanent on faces of staircase type of the polytope of doubly stochastic matrices, Linear and M&linear Algebra 18:271-306 (1985). P. Knopp and R. Sinkhorn, Minimum permanents of doubly stochastic matrices with at least one zero entry, Linear and Multilinear Algebra 11:351-355 (1982). H. Mint, Minimum permanents of doubly stochastic matrices with prescribed zero entries, Linear and Multilinear Algebra 15:225-243 (1984). H. Mint, Permunents, Encyclopedia Appl. 6, Addison-Wesley, 1978.

FRANK UHLIG,

790

TIN-YAU TAM, AND DAVID CARLSON

9

H. Mint, Theory of permanents

1982-1985,

10

(1987). S. Song, Minimum permanents

on certain faces of matrices containing an identity

submatrix, 11

Linear Algebra Appl. 108:263-280

S. Song,

Minimum

permanents

Algebra AppZ. 143:49-56 12

Linear Multilinear Algebra 21:109-148

(1988).

on certain

doubly

stochastic

matrices,

Linear

(1991).

S. Song, Minimum permanents

on certain

doubly stochastic

matrices

II, Linear

Algebra Appl., to appear.

ON RATIONAL by EIVIND

1.

n

MATRICES

and BOONCHAI

K. STENSHOLT4’

and Observations

Let n, t, x be integers,

1.

DEFINITION x

STOCHASTIC

STENSHOLT4’

De$nitions

set of n

DOUBLY

matrices

with integer

entries

n > 0, t > 0, r ) 0, and N,(t, x) be the from

(0, 1, . . . , t} such that all row and

column sums equal r. Let M,,(t) be given by

iv,(t)

The cardinahty

A

of a set S is denoted

COMBINATOBIAL

owners

hold

u

= N”&O)

Aqt,1)

each,

none

0..

u N&q.

) S (.

In each

INTERPRETATION.

r shares

u

more

than

of n companies

t shares

are

x shares;

in any company.

n

A matrix

(aij) E N,(t, x) specifies a possible distribution of shares, owner i holding aij shares in company j. The special problem Anand, matrices

Dumir,

of determining

and Gupta

[l];

in NJ r, r) are sometimes

indicates additional requirements the entry set is {1,2,.

47Norwegian School

( NJ r, r) ] was studied by Mano [14] and

they denote

] NJ X, r) ( as H(n, r)

called magic squares,

for r = x. The

although usually this term

(such as that the two diagonal sums also equal r and

. , n2}; see [2]).

of Economics

481nstitute of Marine Research,

P.O.

and Business Administration, 5035 Bergen, Box 1870,

5024 Bergen,

Norway.

Norway.

AUBURN 1990 CONFERENCE Several properties

791

ON MATRIX THEORY

of these matrix sets follow from the definition:

N,( t, x) = 0

if

t
(1.1)

N”( t - I, x) c N”( t, x)

if

m -‘
(1.2)

N,( t - 1, x) = N,( t, x)

if

x+l
(1.3)

IN,(t,x)l=IN,(t,nt-x)1. [For (1.4) consider

(1.4)

the l-l map (ajj) * (bij) where aij + bij = t for all i,j.]

NOTATION. Let Q, be the set of doubly stochastic n x n matrices 31, and aQ, its interior and boundary in the relative topology of its &ne span W of dimension m = (n - 1)‘. Let r,(x) be the lattice of n x n matrices with entries zx-I, z E Z, and C,,(e) the n2-dimensional cube of n x n matrices with entries in [O, e]. J, denotes the n x n matrix where all entries equal 1. The set 0, is the intersection of W and the nonnegative orthant of @“‘. Its geometry was studied in [4]. From Definition 1 it follows that for x 2 1,

AEN,,(t,

x)

ifandonlyif

n n,.

n C”(K’)

x-~AEI’,(x)

(1.5)

By (1.5) NJ t, r) splits up as follows if x >, 1: N,(t,

TX)= I$( t, x) U aN,( t, x) (disjoint union),

(1.6)

where &(t,

x) = {AEN,@,

and

x)(x-‘Ad,}

aN,(t,

x) = {AEN+,

x)/x-‘A&Q,).

Let (aij) E NJ t, r), x > 1. a62, consists of those matrices in Q, which are limits for convergent sequences in W \ Q,; hence (aij)

Efin(t,X)

(aij)EaN”(t,

X)

ifand only if

aij > 0

for all i, j,

(1.7)

ifandonlyif

aij=

forsome

(I .8)

0

i, j.

From this it follows that

qt,

x) = 0

&(t* n) = {k) AsN,(t

-

1, x - n) ifandonlyif

if

l
when A +],,Efi,,(t,

(1.9) (1.10)

t>l,

x)

when

x > n.

(1.11)

FRANK

792 We notice the following

UHLIG,

consequence

TIN-YAU TAM, AND DAVID

of these observations:

IN,(x-n,x-n)I=IN,(x-1,x-n)I

and

CARLSON

By (1.3) and (1.11)

IN,(x-1,x-n)(=l~“(x,x)(; (1.12a)

hence

p,(x-n,r-n)l=p”(x,x)I, 2.

Triangulations

Proper

A triangulation of any polytope 62 is a finite set y= {T,, Ts, . . . , T,} all faces of simplices in ? are in 9-, n = T, U T, U *. . U T,, and

DEFINITION2. of simplices

where

if Tj tl Tk # 0,

moreover,

(1.12b)

x>n.

said to be proper

then Tj 17 Tk is a face of both Tj and Tk. The triangulation

if all corners

of each Tj belong to the corner

is

set of fl.

THEOREM(Fuglede [8], Brbndsted [S]). Let Q be a d-dimensional conuex polytope with corners p,, p,, . . . , p,. Then there exists a proper triangulation of Cl. Birkhoffs matrices

permutation taining r,(x)

theorem

for corners. matrices

x- ’ * ( aij).

[3] states

that

For a proper

51, is a convex

triangulation,

that are comers

in the unique

( N,,(cc,x) 1 is determined

polytope

(aid) E

with the permutation

N,(t, x) is a unique sum of the

lowest-dimensional

by counting

the number

simplex

con-

of points from

in each simplex. So, let A be a simplex

with the permutation

matrices

PO,P,, . . . , Pd for comers.

The points

PO+ CkiX-‘(Pi form a sublattice A of r,(x).

- PO),

kiEZ7

l
If d Q 3, it is easy to see that A = r,,(x) whether

A actually

(2.1) fl D, D being the

alline

span of A. It is not known

can be a proper

r,(x)

tl D for (high) values of d, but the possibility must be considered.

sublattice

of

Since the two

lattices have the same affine span, A is of finite index, say s, in I’,,(x) il D. From the kth coset we pick the unique representative

Qke

{p.

+

=j+x-‘(Pi

Qk such that

- P,,)jO < ri G 1, 1 G i 6 +

thus r,(x)nD=(~,+n)~(~,+a)u~~~u(~,+A).

(2.2)

AUBURN Let

1990 CONFERENCE

Qi + A = A, i.e.

793

ON MATRIX THEORY

Qi = x-‘[(x

- d)Pa + P, + .*.

+Z’d]. Each

x-l*

(aij)~r,(x)

fl n, is counted in the simplex of smallest dimension which contains the point, i.e., we count the inner lattice points in each simplex. Using

we count layer by layer. Here

y = z ‘when we count the points of (Qr + A) tl A; the

points is in a face of A and is not counted.

Qk = Pa + -&ix-1(P,

- P,,),

Now, write

0 < q.‘kiQ 1,

(2.3)

and let bi E Z be such that b, Q Crki < b, + 1. Then, for 1 < k < s, we have 0 < b, < b, = d, and the number of points from Qk + A in the interior H of A is

(2.4) thus (r,(x)nAI=

2

‘+d,‘-,,i,

for k>l.

d=b,>b,

(2.5)

k=l

Stanley [17] has proved the following result, originally conjectured

in [l]:

THEOREM (i) ] N,( x, r) ] is a poZytaomiaZin r of degree (n -

l)‘,

x )

1.

When x is allowed to assume also nonpositive values, (ii) the polynomial has n - 1 zeros: IN”(_l,

-

l)/

=

**.

=IN,(l

- n,l

- n)I = 0,

and (iii) it has the following symmetry

1%(-n-x9

-

A simple proof of (i) and (ii).

property:

n-x)/ = (-l)“-‘(~,(~, The exi$ence

of proper

x)I, triangulations

and (2.5)

show that ]N,(r,x)l, la&(x,x)], and I&,(x,x)] = INn(x,x)) - l?JN,,(x,~)l all are polynomials for x > 0. So (i) holds, and (ii) follows from (1.9) and (1.12) with x=1,2 ,..., n-l. n

FRANK UHLIG,

794 (iii) is a special

case of Ehrhart’s

TIN-YAU TAM, AND DAVID

reciprocity

deeper nature than (i) and (ii); see Remark

theorem

CARLSON

[7]. It still seems to be of a

2. A proof in line with this account

is in [12];

see also [6, 7, 15, 171.

REMARK 1. If s = 1 for a simplex of maximal dimension s = 1 for each

of its face simplices

too.

Since

tributed,

the volume of a maximal-dimensional

construct

maximal-dimensional

such that the number increasing integers;

i. [Then hence

simplices

of zero

(n -

points

1)2, then clearly are uniformly

simplex is proportional

in

P, + P, + * . . + Pi decreases

PO, P,, strictly

fl W and x:zi = 1 imply that the

X.ziPj = (aij) Ed,

dis-

to s. It is easy to

such that s = 1 in (2.5); just choose

entries

. with

xzi are

( uij) E A.]

PROBLEM. Are there maximal-dimensional are there

the lattice

cases with

s > l? If so, are there

simplices proper

with different

triangulations

volumes,

which

i.e.,

avoid these

cases? Let the polynomial

1N,,(x, x) ) be expressed as follows:

(2.6) A proper

triangulation

simplices.

The symmetry

a small number computer. With

The

more

functions

s = 1 for all simplices

(iii) allows the polynomials

of values; sequences

refined

completed

with

for

n < 5 sufficiently

{ ( N,( x, x) ( },

techniques

must

many values

r = 0, 1,2,

and a computer, a,

a,_i

ai i-dimensional

have been

from

found

by

. . appear in [16] for n = 3,4.

Jackson

the case n = 6 too. They also gave equivalent [17]. Thus one obtains

contain

) NJ x, x) 1 to be determined

and van Rees

[lo]

have

results in terms of generating

a, for n = 3;4;5:

.

3 12 19 15 6; 352 2464 7544 4.718075

2,905,658,575 14,062,951 These

13,232

51.898825

14,620

2,463,775,850 1,784,345

coefficients

indicate

complex.

In a proper

with

common);

9 vertices

1468 258 24; 1706.729525

2584.561500

770,476,155

280,134,105

74,580,465

120. triangulations,

ai > 0 and u,_r

with ai i-dimensional

= [n + $(n” - 3n + 2)]a,.

simplices, So the sim-

seem, on average, to have n facets (faces of dimension remaining n2 - 3n + 2 are walls in the triangulation

triangulation of valency

of Qs, the 3-dimensional

simplices

4 (adjacency

a 2-dimensional

is to have

similarly Iah+

indicates

6090

that proper

plices of maximal dimension m - 1) on aa,,while the

4945

811.572625 1,584,408,615

140,740

exist and are very regular:

graph

10,532

262.803150

x)1 =IN&,

a graph with 1408 vertices

x)] -IN+

- 4, x - 4))

of valency 9 on aQ,.

on an,

form a face

in

AUBURN 1990 CONFERENCE

795

ON MATRIX THEORY

PROBLEM. What is true for general

n?

Factorized forms of (2.6) for n = 3,4 illustrate (ii): 8- ‘( x + I)( x + 2)[( x + l)( x + 2) + 21 and 11,340-l( x + l)( x + 2)(x + 3)[11( x + 2)6 + 23(x + 2)4 + 128(x + 2)’ + 3061. The first is (33) in [l], and MacMahon [13, Section 4071 gave the form

3(x:3)

REMARK2.

+ (x:2).

By (1.6), (1.12), and part (iii) of Stanley’s theorem

P”(X, x)1=lN,(--)I -I NJ r -

we have

.,x-.)I=IN,(x,x)l+(-l)“lN,(-x,-x)1;

hence (aN,(O,O)j

= [1 + (-1)“]

*IN,,(O,O)l.

(2.7)

For x = 0, (2.5) becomes

i

-;I=(_l)d

Summing for the simplices in 62, and an,, we see that ] N,,(O,0) ] and ] aNJO, 0) ] are the Euler characteristics of 0, and do,, i.e. of the ball and sphere of dimensions m and m - 1. Hence (2.7) is a topological relation which follows from (iii); for this reason it would be interesting to prove (iii) as simply as (i) and (ii). [From topology ] N,(O, 0) ] = 1, i.e., the polynomial gives the correct value also for x = 0.1

3.

Error-Correcting

Codes Related to fl,

The set M,,(t) from Definition 1 is a code with alphabet {0, 1,2, . . . , t}. If (aij) E M,(t), n >, 3, is received with an error in a single entry, this is located by means of the deviating row and column sums, and uniquely corrected. The Hamming distance is at least 3 if n = 3, at least 4 if n is at least 4. This code may well be used as an identification code like the ISBN book code [ll], the universal product bar code UPC [18], or the codes for bank account numbers. For general information see [9]. These well-known codes detect errors only, and a code which also corrects errors may be a worthwhile alternative. The t&,-codes proposed here can be organized in different ways, e.g. according to the row and column sum x, and according to the simplex in a given proper triangulation to which a codeword is associated. Thus codewords may be assigned systematically and without repetitions, e.g. by different local authorities, each with its own simplices. The connection of these codes to the polytopes II, is described in (1.5). We report a few formulas for ( N,( t, x) I and ] M,,(t) ] = C ] N,,(t, x) ( . Apart from the case n = 2,

FRANK UHLIG, TIN-YAU TAM, AND DAVID CARLSON

796

where one may list all codewords,

lM4k

- l)l=

they are based on computer

5578~”

&

+ 12,705~’

+22,520n4

JN2(u-l,x)l=u-

-18(u2

1)2 -

Difference

- 2)1=

+ 1008~’

--&u(1007u8

schemes show that

12(u2+

+ 1)(2x

if u - 1 ,< x < 2u - 2 [otherwise use (1.4),

- 1,2u

+ 19,614u6

+ 14,175

jr-u+ll,

INa(u - 1, x)1 = &[21(u2+

IA?+

counting of the N,( t, x):

1) + 4

- 3u + 3)” +5(2x

- 3u + 3)‘]

(1.3) and reduce to x = t = u -

+ 2766~~

+ 3759u4

+ 3844~~

11,

- 36).

1N4(t, x) 1 fits no polynomial formula, even for t < x <

3t.

PROBLEM.

Get formulas or other results for

1M,(t) 1 directly.

Part of the work was done while the authors were visiting at the University of Wisconsin - Madison with support from the research funds at the Norwegian School of Economics and Business Administration and the Norwegian Council for Science and Humanities (NAVF). The authors are grateful to these institutions, and to Hedge Tverberg fm essential references. REFERENCES 1

H. Anand, V. C. Dumir,

and H. Cupta,

2

(1966). W. H. Benson and 0. Jacoby, New Recreations with Magic Squares, Dover, 1976.

A combinatorial

distribution

problem,

Duke Math. J. 33:757-769

3

4

G. Birkhoff, Tres observaciones sobre el algebra lineal, Rev. Uniu. Nat. Tucuman Ser. A 5:147-151 (1946). FL A. Brualdi and P. M. Gibson, Convex polyhedra of doubly stochastic matrices, I. Applications of the permanent function, /. Combin. Theory A 22:194-230 (1977); II. The graph of Q,, J. Combin. Theory B 22:175-198 (1977); III. Affine and combinatorial properties of n,, J. Combin. Theory A 22:338-351 (1977).

AUBURN

1990 CONFERENCE

ON MATRIX THEORY

5

A. Brbndsted,

Continuous

barycenter

6

Math. 4:179-187 (1986). W. Dahmen and C. A. Micchelli,

functions

797

on convex pOlytOPes, Exposition.

The number of solutions to linear diophantine

equations and multivariate splines, Trans. Amer. Math. SOC.308509-532 7

E. Ehrhart, DBmonstration de la loi de &ciprocite Acud. Sci. Paris 265A:91-94 (1967).

8

B. Fuglede,

Continuous

Math. 4:163-178 9

D. M. Jackson

11

G. Knill, International I. G. Macdonald,

integer matrices,

SIAM].

standard book numbers,

Polynomials

Math. Sot. (2) 4:181-192

in the marketplace,

Amer.

Mbh.

associated

of generalized

double

Comput. 4:474-477 (1975). Math. Teacher 74:47-48 (1981).

with finite cell-complexes,

1.

London

(1971).

Combinatoq

Analysis I, ZZ, Cambridge

in one volume by bhelsea, 14

Modular arithmetic

and G. H. J. van Rees, The enumeration

nonnegative

P. MacMahon,

C. R.

(1988).

12 13

(1988).

selection in a convexity theorem of Minkowski, Exposition.

J. A. Gallian and S. Winters,

stochastic

rationnel,

(1986).

Monthly 95:548-551 10

du polyedre

U.P.,

1915,

1916; reprinted

1960.

K. Mano, On the formula of ,,H,,

Sci. Rep. Fuc. Lit. Sci. Hirosaki Unio. 8:58-60

(1961). 15

P. McMullen,

Valuations

and Euler-twe

relations

for certain

classes of convex

polytopes, 17

Proc. London Math. Sot. 35:113-135 (1977). N. J. A. Sloane, A Handbook oflnteger Sequences, Academic, New York, 1973. R. P. Stanley, Combinatorics and Commutative Algebra, Prog. Math. 41, Birkhlser,

18

E.

16

1983. F. Wood,

Self-checking (1987).

codes-an

application

of modular

Teacher 80:312-316

Received 19 August 1991; final manuscript accepted 30 August 1991

arithmetic,

Math.