Using dynamic mode decomposition to extract cyclic behavior in the stock market

Using dynamic mode decomposition to extract cyclic behavior in the stock market

Accepted Manuscript Using dynamic mode decomposition to extract cyclic behavior in the stock market Jia-Chen Hua, Sukesh Roy, Joseph L. McCauley, Gemu...

2MB Sizes 0 Downloads 90 Views

Accepted Manuscript Using dynamic mode decomposition to extract cyclic behavior in the stock market Jia-Chen Hua, Sukesh Roy, Joseph L. McCauley, Gemunu H. Gunaratne PII: DOI: Reference:

S0378-4371(15)01087-0 http://dx.doi.org/10.1016/j.physa.2015.12.059 PHYSA 16685

To appear in:

Physica A

Received date: 26 May 2015 Revised date: 25 October 2015 Please cite this article as: J.-C. Hua, S. Roy, J.L. McCauley, G.H. Gunaratne, Using dynamic mode decomposition to extract cyclic behavior in the stock market, Physica A (2015), http://dx.doi.org/10.1016/j.physa.2015.12.059 This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

*Highlights (for review)

Highlights: • Application of a new technique (Koopman mode analysis) to stock valuations. • Revealed four, hereto unknown, cyclic variations in stock market data. • One of these has a period of 1 year which represents seasonal variations. • Differentiated robust, repeatable features from noise and irregular characteristics. • Opens up new possibilities of applying Koopman mode analysis in other research areas.

1

*Manuscript Click here to view linked References

Using Dynamic Mode Decomposition to Extract Cyclic Behavior in the Stock Market Jia-Chen Huaa,c,∗, Sukesh Royb , Joseph L. McCauleya , Gemunu H. Gunaratnea a Department

of Physics, University of Houston, Houston, TX 77204, United States Energies, LLC, Dayton, OH 45431, United States c School of Electrical and Information Engineering, University of Sydney, NSW 2006, Australia b Spectral

Abstract The presence of cyclic expansions and contractions in the economy has been known for over a century. The work reported here searches for similar cyclic behavior in stock valuations. The variations are subtle and can only be extracted through analysis of price variations of a large number of stocks. Koopman mode analysis is a natural approach to establish such collective oscillatory behavior. The difficulty is that even non-cyclic and stochastic constituents of a finite data set may be interpreted as a sum of periodic motions. However, deconvolution of these irregular dynamical facets may be expected to be non-robust, i.e., to depend on specific data set. We propose an approach to differentiate robust and non-robust features in a time series; it is based on identifying robust features with reproducible Koopman modes, i.e., those that persist between distinct subgroupings of the data. Our analysis of stock data discovered four reproducible modes, one of which has period close to the number of trading days/year. To the best of our knowledge these cycles were not reported previously. It is particularly interesting that the cyclic behaviors persisted through the great recession even though phase relationships between stocks within the modes evolved in the intervening period. Keywords: Econophysics, Business cycle, Stock markets, Dynamical system, ∗ Corresponding

author Email address: [email protected] (Jia-Chen Hua)

Preprint submitted to Physica A

October 24, 2015

Koopman operator, Dynamic mode decomposition, Complex system

1. Introduction The existence of cyclic expansions and contractions in economic activity on a scale of 5-50 years has been long established [1]. Their study can be traced back to 1800’s [2] and an early report of a cycle (of duration 7–11 years) 5

was made by the French statistician Clement Juglar in 1860 [3, 4]. Subsequent studies established evidence for additional cycles including the Kitchin inventory cycle [5] of period 3–5 years, the Kuznets swing of duration of 15–25 years [6], and the Kondratiev wave (or long technological cycle) of period 45–60 years [7]. Much less is known about cyclic behavior in financial markets. The goal of the

10

work reported here is to utilize daily variations in stock prices to search for such cyclic activity. Speculations abound on precursors to and derivatives of economic changes; however, due to their qualitative nature, establishing such causalities is nontrivial. One quantifiable measure of activities related to the economy or to

15

financial market activity is the valuation of stocks. The valuation of a stock emerges from a series of complex interactions between participants such as investors, traders, and banks under constrains imposed by regulations and exchange rules, and reflect their collective estimation of the performance of the entity and the overall economy.

20

Variation of the price of a single stock depends on a multitude of factors, a majority of which is specific to the commodity. Hence, the search for changes in the overall financial market using a single index or a small set of indices is unlikely to bear fruit. Indeed, it has been found that a very large number of wavelet modes are required to decompose financial market dynamics [8, 9]. A

25

collective analysis of a large set of stocks is more likely to yield global market dynamics. Since little is known a-priori on relationships between individual stocks, empirical basis expansions, such as proper orthogonal decomposition (POD) [10, 11, 12, 13, 10, 14], is a natural approach to follow. In fact, POD

2

and its variants, such as singular spectrum analysis (SSA) and its extension 30

Multivariate SSA (M-SSA), have proved effective in the study of financial markets [15, 16, 17, 18, 19, 20]. SSA uses a short length sliding window to filter the original time series and constructs the trajectory matrix whose columns are lagged vectors with same length as the sliding window; M-SSA constructs a stacked matrix of multivariate time series from a set of individual times series.

35

SSA and M-SSA perform singular value decomposition on the trajectory matrix, implicitly assuming that the original time series is stationary for the entire duration. Koopman decomposition [21, 22, 23, 24] is a natural approach to search for collective oscillatory behavior. It is based on Koopman operator theory [25,

40

22, 24], which generalizes eigendecomposition to nonlinear systems. Koopman modes is a generalization of normal modes [24] and each mode represents a global collective motion in (the assumed) market dynamics. Importantly, spectral properties of market dynamics will be contained in the spectrum of the Koopman operator [22, 24]. A fast algorithm proposed by Schmid [25] and referred to as

45

dynamic mode decomposition (DMD), can be used for computing approximately (a subset of) the Koopman spectrum from the time-series of valuations of a collection of stocks. Each Koopman mode is associated with a complex eigenvalue whose real and imaginary parts represent the growth rate and frequency of the mode. Interest-

50

ingly, Koopman eigenvalues may be used to differentiate robust characteristics of the underlying dynamics from those that depend on the specific data set [26]. The starting point is a search for reproducible Koopman modes, i.e., those that persist in different sub-groupings of the time-series. The differentiation is based the following conjecture: robust characteristics of the market dynamics can be

55

associated with reproducible Koopman modes while non-robust features can be associated with non-reproducible modes. Only robust features are expected to be useful in an analysis of financial market dynamics. The methodology is summarized in Figure 1. Our studies are conducted using valuations of the 567 stocks, each of whose market capitalization exceeded 3

60

7.5 billion dollars as of October 2014 and price history can be traced back to November 2002. The total market capitalization of these stocks is over 23 trillion dollars as of October 2014. The daily adjusted close prices were retrieved from “Yahoo! finance” (http://finance.yahoo.com/). Additional details of the filtering and choice of stocks, as well as data retrieval, is given in the Supple-

65

mentary Materials. The 567 stocks are grouped according to sector and industry, and placed in a two-dimensional array to aid visualization. Dynamic mode decomposition of the data yield a large number of modes, the spectrum of the most significant of them is shown in Figure 1(a). The data is then partitioned into different subgroups, e.g., the time-series for even and odd numbered trading

70

days. We find only four common dynamic modes, which (in order of decreasing contributions to the data) are associated with periods 250 ± 4, 108 ± 2, 91 ± 1, and 76 ± 2 trading days respectively. To the best of our knowledge, these cycles

shown in Figure 1(b), have not been reported before. The corresponding eigenfunctions capture the phase relationships between different stocks in the sample; 75

for example, one stock (or a sector) may react to economic changes immediately, while others may lag the changes by a few weeks. Figure 1(c) shows the real part of one of the reproducible eigenfunctions. Finally, Figure 1(d) illustrates the daily variations in the valuations of Intel Corporation in one reproducible mode.

4

Figure 1: (a) The DMD spectrum, and (b) reproducible modes computed from the time series of the 567 stocks, each with a market capitalization of over 7.5 billion dollars as of October 2014 and a price history that can be traced back to November 2002. (c) The real part of one of the reproducible dynamic modes, and (d) the valuation of Intel Corporation under another reproducible mode.

80

2. Methods & Results 2.1. Koopman Decomposition Denote the state of the market by a vector z, whose constituents may include factors like the interest rate, expectations of economic growth, rate of unemployment, etc.; at the outset we recognize the precise form of z is unknown.

85

Next, assume that market dynamics can be expressed as z˙ = F(z); once again, its form is unknown. What we have available is “secondary” data, namely stock

prices, which we expect to reflect changes in z. Let us index the stocks in a pre-specified array x, and express their values at time t through a field u[z](x, t).

5

When the context is clear, we will simplify the notation by eliminating the state 90

z from this expression. Koopman analysis [21, 22, 23, 24] of the system is conducted on the “observables” u[z](x, t). Suppose the initial state z0 of the market evolves under F to

a state zt at time t. During this time interval, an initial function u[z0 ](x, t = 0) evolves to a function u[zt ](x, t). The transformation between the functions is given by the Koopman (or composition) operator, defined as U t : u[z0 ](x, t = 0) → u[zt ](x, t).

(1)

Note that, unlike F, which is finite-dimensional and may be nonlinear, the

Koopman operator U t is infinite-dimensional and linear [22, 24]. Interestingly, spectral properties of (the unknown) F are contained in the eigenspectrum

of U t . Eigenvalues of U t form the Koopman spectrum and the corresponding

95

eigenfunctions are referred to as Koopman eigenfunctions. Coefficients in the projection of the state u[zt ](x, t) to the Koopman eigenfunctions form the Koopman modes of the state [22]. As the system evolves under F, the k th Koopman

modes evolves as eΛk t , where Λk is the associated eigenvalue. Thus, the Koopman formalism is a natural approach to search for collective cyclic behavior in 100

high dimensional time-series. The challenge of establishing spectral properties of F reduces to computing

the spectrum of U t . Schmid [25] proposed a fast algorithm, referred to as dynamic mode decomposition (DMD), for computing approximations to (a subset of) the Koopman spectrum. The analysis, outlined in Supplementary Materials, 105

is based on the computation of a matrix A from a series of snapshots of the secondary field u[z](x, nδt) at equally spaced set of time points. The eigenvalues of A, {Λk }, are approximations to (a subset of) Koopman eigenvalues and their eigenfunctions {Φk (x)}, referred to as dynamic modes, are approximations to

the corresponding Koopman eigenfunctions. Typically, DMD modes are ordered 110

according to the contributions they make toward the spatio-temporal dynamics u(x, t); these contributions are referred to as “energies” of the modes and are defined in the Supplementary Materials. Below, DMD modes are assumed to 6

be indexed in non-increasing order of their energies. 0.01

0

104.9 104.4

0 -0.02

21 -0.01

103.9

33 27

Λr

Λr

-0.04

24

103.4

-0.02 102.9

-0.06

-0.08

-0.03

102.4

-0.04

101.9 101.4

-0.05

-0.1 0

0.05

0.1

0.15

0.2

0

Λi (a)

0.02

0.04

0.06

0.08

0.1

Λi (b)

Figure 2: (a) DMD spectrum for the evolution of the 567 stocks during the 3000 trading days from November 2002 to October 2014. Only the 100 modes with the largest energies are selected and only those with non-negative imaginary parts are shown. (b) Reproducible modes, i.e., those that remain (nearly) unchanged between several sub-groups of the time series. Mode numbers are given beside each point. The error bars quantify standard deviations of the real part of the eigenvalues for different sub-groupings of the time series. Deviations in the imaginary parts are significantly smaller. The colors assigned to the modes represent their energies according to the color bar or the right.

In x, stocks are arranged according to the sectors “Basic Materials,” “Con115

glomerates,” “Consumer Goods,” “Financial,” “Healthcare,” “Industrial Goods,” “Services,” “Technology,” and “Utilities.” Within sectors, entities are sub-grouped by industry, and finally, within each industry, ordered according to descending market capitalization. The input data consist of end-of-the-day valuations of the 567 stocks over the 3000 trading days from November 2002 to October 2014.

120

u(k, t) is the return (i.e., , logarithm of the price) of the k th stock “de-trended” by removing the linear growth (calculated using a linear regression on its time series). The DMD spectrum consists either of real eigenvalues or pairs of complex conjugate eigenvalues (Supplementary Materials). Figure 2(a) shows the eigen-

125

values of the 100 dynamic modes with the largest energies. Only those with non-negative imaginary parts are shown. (Energies of modes corresponding to 7

conjugate pairs of eigenvalues are identical.) Color coding given on the right as well as the size of a dot rely on the energy of the mode. The largest energy is contained in the time-averaged mode Φ0 (x), the remainder distributed among 130

a large number of dynamic modes. Figure 3 shows energies of the dynamic modes with the 36 largest values. Note that mode energies [except for the real mode Φ0 (x)] appear in pairs, as required for complex conjugate pairs (see Supplementary Materials). 10 6 Non-reproducible Modes Reproducible Modes

10 5

Energy

10 4 10 3 10 2 10 1 10 0 0

10

20

30

Dynamic Mode Number Figure 3: Energies of 36 dynamic modes with the largest energies. Reproducible modes are shown in black.

2.2. Reproducible Modes 135

Our goal is to search for cyclic changes in the markets. In particular, such oscillations should present in any subset of the time series. The data is also expected to contain constituents that depend on the subset, i.e., those that are not robust. It is necessary to identify and eliminate these non-robust components from consideration. Toward this end, we associate robust features of financial

140

markets with dynamic modes that persist in different sub-groupings of the time series. In order to implement the robustness criterion, we partition the stock data from the 3000 trading days into the following subgroups: (1) odd trading days,

8

(2) even trading days, (3) trading days divisible by 3, (4) trading days with 145

a remainder 1 when divided by 3, and (5) trading days with a remainder 2 when divided by 3. The determination of whether a dynamic mode is to be considered as reproducible is made using the proximity of the eigenvalues and the corresponding eigenfunctions between the subgroups. The precise conditions used to establish reproducible modes are given in Supplementary Materials.

150

We find only four reproducible dynamic modes which are shown in Figure 2(b), and whose energies are shown in black in Figure 3. Note that many high-energy dynamic modes are not reproducible. This is to be expected in financial markets due to the high-dimensionality of the dynamics and the presence of stochasticity.

155

Interestingly, the eigenvalues of the four reproducible modes are maintained when the data is partitioned into trading days 1-1000, 1001-2000, and 2001-3000; however, the corresponding dynamic modes are not close to those of the full data set. This observation suggests that the robust cyclic behavior was maintained through the great recession of 2008, even though the phase relationships between

160

different stocks changed in the interim. Figures 4 and 5 show the real and imaginary parts of the reproducible dynamic mode Φ21 (x), whose period is 250 ± 4 trading days. Here the standard deviation is derived from the periods computed from the different sub groupings

of the data. The period of the mode is approximately the number of trading 165

days in a year and it likely reflects seasonal changes in the market. As discussed before, the trading symbols are grouped first by sectors, then by industries, and ordered according to descending market capitalization. The notation used is as follows: the number “(1)” indicates the start of a new sector, and a different number (e.g., (6)), indicates the beginning of the corresponding

170

industry within the sector. The detailed numbering of industries within the 9 sectors can be found in Supplementary Materials. The remaining reproducible modes are also given in the Supplementary Materials.

9

Figure 4: The real part of the reproducible dynamic mode Φ21 (x) with period of approximate

250 trading days, i.e., a year.

10 Basic

Mat

E

MON CNQ

PBR CEO RIG HP ESV (10)

DVN PXD MRO NBL CHK

COG

(2) AA (3) DOW EQT

SWN

AGU

PX APD EMN RRC

BHI WFT

BHP RIO VALE TCK (8)

FCX SCCO (5) GG ABX

HAL

(7)

(4)

HRL

TSN

(15)

JAH

TRP PAA ETP

CVX PTR TOT BP

COP OXY EPD EOG

s es erat Good glom er Con Consum

(9)

RAI

MO

BTI

(8)

ENB

SJM

(19)

BK

BLK

(2)

UL

UN

PG

MS

GS

(7)

AON

BAP

STI

IBN

CM

STT SCHW BSAC

IVZ

(4)

EFX

COF

AXP

(3)

AMG

(5)

CS

BCS

UBS

SAN

l

ncia

Fina

KB

HDB

DB

BBD

(23) BBVA

MAT ITUB

(22)

FL

COH

NKE

(21)

PVH HSBC

RL

VFC

(20)

PII

HOG

(17) PCAR

BLL

IP

(16)

(6) RDS−B

ls eria

CLX

(14)

MHK

PPC

(7)

MMP

BG (13)

BMO

BNS

MTU

TD

RY

BAC

JPM

WFC

(10)

LNC

PFG

AEG

MFC

PRU

ING

PUK

MET

(9)

RJF

(8)

LUK

KEY

FITB

USB

(20)

BAM

(19)

FRT

KIM

MAC

O

SLG

GGP

SPG

(18)

ESS

AVB

EQR

(17)

BXP

(16)

PLD

PSA

(15)

HST

(14)

HCP

VTR

HCN MTB

(21)

(5)

MYL

ACT

(6)

(7) UNH

AGN

NVO

(4)

SHPG

LLY

BMY

AZN

GSK

SNY

MRK

PFE

NVS

JNJ

(3)

VRX

(2)

(10) (1)

LLL

COL

RTN

GD

LMT

BA

UTX

(2)

TXT

NOC

PH

(10)

EMR

(9)

FLR

(8)

PWR

(7)

VMC

MLM

MAS

(6)

DE

CAT

(5)

PLL

FLS

AME

DOV

ROP

IR

ROK

CMI

ETN

ITW

ABB

DHR

HON

WM

(15)

DHI

LEN

(14)

PCP

(13)

SWK

(12)

WY

(11)

MMM PNR

GE

(4)

CX

CRH

(3)

ods l Go

DVA

FMS

CVS

(12) BEAV

LH

DGX

A

TMO

(11)

COO

WAT

BCR

BDX

BAX

stria Indu

VAR

EW

SNN

BSX

ISRG

ZMH

STJ

SYK

ABT

MDT

(9)

UHS

(8)

HUM

CI

AET

MDVN WLP

INCY ESRX

PCYC

BMRN

ILMN SLXP

VRTX

ALXN

REGN

CELG PRGO

BIIB

AMGN RDY

GILD ENDP

TEVA

Re(Φ21 (x)) (1)

are lthc

RF

BBT

(22)

Hea

(13) HBAN

NLY

VNO

AMT

(12)

CINF

XL

MKL

CNA

PGR

L

HIG

SLF

CB

ALL

TRV

ACE

AMTD (11)

CPB TROW NMR CMA

CAG

(6)

CIB

UNM MMC PNC

AFL

(1)

ADM MKC NTRS

(12)

SNE

PHG

STZ

BF−B

DEO

(6)

CCE

K

GIS

(18)

EL

KMB

CL

AAPL GMCR BEN

(11)

HSY

MDLZ

(10)

CHD

ECL

MNST NWL

PEP

KO

(5)

TAP

FMX

(4)

ALV

BWA

JCI

(3)

F

HMC

TM

(2)

WHR

(1)

CAJ

IEP

(1)

KMP

GGB

NUE

MT

PKX

(14)

IFF

SIAL

SHW

PPG

(13)

TSO

HFC

UGP

VLO

(12)

BPL

SXL

EEP

KMR

OKS

MWE

NEM XOM WMB

(11)

FTI

CAM

NOV

TLM CNX

ASH

SLB

MUR

(9)

ECA

FMC

ARG

SSL

HES

POT MOS YPF

IMO

APA

SYT

STO

APC

DD

SNP

SU

(1) RSG

(1) MHFI DIS

(17) TW (27)

DISH

(10)

CTAS

ADS

(11)

SJR

TGT

COST

PCLN

(9)

TV

(8)

SIRI

(7)

LUV

(32)

WAB

KSU

CSX

NSC

CP

CNI

UNP

BBY

(16)

ABC

CAH

MCK

URI

RCL

CCL

(34)

(26)

HOT

TSCO

GPC

EBAY

(38)

CA

CTSH

ADP

MSFT

(2)

RHT

(7) SYMC

(3)

FFIV

(16) T

CTL

BCE

TSM

(17)

NTT

TEF

MXIM (23)

TXN

INTC

VZ

(22)

(15) ADSK

TRMB INFY

CSC

XRX

FIS

KT

VOD

CHL

(24)

(19)

gy

S

CHU

RCI

VIV

TU

KLAC

(14)

JNPR

MU

(20)

ASX

(13) LRCX

EA

(1)

ENI

(3)

CMS

AEE

WEC

AES

EOC

DTE

ETR

FE

HNP

XEL

ED

EIX

PCG

PPL

AEP

KEP

SO

D

NEE

(2)

NI

NU

PEG

SRE

EXC

DUK

ies Utilit

MBT

TKC

VIP

ATVI AMAT TSU

(12) ASML

GIB

NTES MCHP

LVLT GRMN (21)

TLK

CCI

LLTC

EQIX NVDA SKM

(11)

DCM CSCO

(5)

NTAP

SNDK

WDC

EMC

(4)

ALU

MSI

NOK AKAM ALTR

ERIC YHOO ARMH AMX

(18)

SWKS

XLNX

ADI

WIT BRCM ORAN

ACN

IBM

(9)

CERN

(8)

APH

QCOM (10)

nolo

JBHT

(40)

PAYX CTXS

(39)

Tech

MCD

(35)

MAR MGM

(25)

SIG

TIF

(24)

(14) FAST SBAC

(15)

SAP

(6) HPQ

(37) ADBE GLW CHKP

TYC

(31) SPLS

TRI

(30)

PSO

(29)

(23) RYAAY

LOW

HD

(22)

(1)

(36) ORCL

YUM

HRB SBUX INTU

(28)

FDO GWW (33)

MGA WAG

(6)

AAP

ORLY DLTR

AZO

(5)

(21)

SWY

CBD

WFM

KR

(20)

CUK

(19)

SYY

(18)

TWX

FOXA

(13) BBBY

KSS

M

TJX

(12)

KMX WMT

(4)

JWN

ROST

GPS

LB

LUX AMZN

(3)

EXPD

CHRW NFLX

FDX

UPS

(2)

IPG

OMC FISV CMCSA HSIC

ices Serv

VE

SRCL WPPGY MCO

(4)

CNP

OKE

−0.16713

−0.092848

−0.01857

0.055709

0.12999

0.20427

Figure 5: The imaginary part of the reproducible dynamic mode Φ21 (x).

2.3. Dynamics

the dynamics of the secondary field [22, 25]. Thus, u(x, t) can be expanded as

Under general conditions, dynamic modes {Φk (x)} form a complete basis of

u(x, t) =

N −1 X

k=0

ak (t)Φk (x),

11

(2)

Basic

Mat

E

MON CNQ

SSL

HES DVN PXD

POT MOS AGU

CEO RIG HP

NBL CHK

COG SWN

(3) DOW EQT PX APD

BHI WFT

(7) BHP RIO VALE TCK (8)

(4) FCX SCCO (5) GG ABX

HRL

TSN

(15)

JAH

TRP PAA ETP

CVX PTR TOT BP

COP OXY EPD EOG

s es erat Good glom er Con Consum

(9)

RAI

MO

BTI

(8)

ENB

SJM

(19)

BK

BLK

(2)

UL

UN

PG

MS

GS

(7)

AON

BAP

STI

IBN

CM

STT SCHW BSAC

IVZ

(4)

EFX

COF

AXP

(3)

AMG

(5)

CS

BCS

UBS

SAN

l

ncia

Fina

KB

HDB

DB

BBD

(23) BBVA

MAT ITUB

(22)

FL

COH

NKE

(21)

PVH HSBC

RL

VFC

(20)

PII

HOG

(17) PCAR

BLL

IP

(16)

(6) RDS−B

ls eria

CLX

(14)

MHK

PPC

(7)

MMP

BG (13)

BMO

BNS

MTU

TD

RY

BAC

JPM

WFC

(10)

LNC

PFG

AEG

MFC

PRU

ING

PUK

MET

(9)

RJF

(8)

LUK

KEY

FITB

USB

(20)

BAM

(19)

FRT

KIM

MAC

O

SLG

GGP

SPG

(18)

ESS

AVB

EQR

(17)

BXP

(16)

PLD

PSA

(15)

HST

(14)

HCP

VTR

HCN MTB

(21)

(5)

MYL

ACT

(6)

(7) UNH

AGN

NVO

(4)

SHPG

LLY

BMY

AZN

GSK

SNY

MRK

PFE

NVS

JNJ

(3)

VRX

(2)

(10) (1)

LLL

COL

RTN

GD

LMT

BA

UTX

(2)

TXT

NOC

PH

(10)

EMR

(9)

FLR

(8)

PWR

(7)

VMC

MLM

MAS

(6)

DE

CAT

(5)

PLL

FLS

AME

DOV

ROP

IR

ROK

CMI

ETN

ITW

ABB

DHR

HON

WM

(15)

DHI

LEN

(14)

PCP

(13)

SWK

(12)

WY

(11)

MMM PNR

GE

(4)

CX

CRH

(3)

ods l Go

DVA

FMS

CVS

(12) BEAV

LH

DGX

A

TMO

(11)

COO

WAT

BCR

BDX

BAX

stria Indu

VAR

EW

SNN

BSX

ISRG

ZMH

STJ

SYK

ABT

MDT

(9)

UHS

(8)

HUM

CI

AET

MDVN WLP

INCY ESRX

PCYC

BMRN

ILMN SLXP

VRTX

ALXN

REGN

CELG PRGO

BIIB

AMGN RDY

GILD ENDP

TEVA

Im(Φ21 (x)) (1)

are

lthc

RF

BBT

(22)

Hea

(13) HBAN

NLY

VNO

AMT

(12)

CINF

XL

MKL

CNA

PGR

L

HIG

SLF

CB

ALL

TRV

ACE

AMTD (11)

CPB TROW NMR CMA

CAG

(6)

CIB

UNM MMC PNC

AFL

(1)

ADM MKC NTRS

(12)

SNE

PHG

STZ

BF−B

DEO

(6)

CCE

K

GIS

(18)

EL

KMB

CL

AAPL GMCR BEN

(11)

HSY

MDLZ

(10)

CHD

ECL

MNST NWL

PEP

KO

(5)

TAP

FMX

(4)

ALV

BWA

JCI

(3)

F

HMC

TM

(2)

WHR

(1)

CAJ

IEP

(1)

KMP

GGB

NUE

MT

PKX

(14)

IFF

SIAL

SHW

PPG

(13)

TSO

HFC

UGP

VLO

(12)

BPL

SXL

EEP

KMR

OKS

MWE

NEM XOM WMB

(11)

FTI

CAM

NOV

TLM CNX

ASH FMC

HAL

(10) SLB

ARG MUR

EMN RRC

ESV

PBR

MRO

(2) AA

(9)

ECA

YPF

IMO

APA

SYT

STO

APC

DD

SNP

SU

(1) RSG

(1) MHFI DIS

(17) TW (27)

DISH

(10)

CTAS

ADS

(11)

SJR

TGT

COST

PCLN

(9)

TV

(8)

SIRI

(7)

LUV

(32)

WAB

KSU

CSX

NSC

CP

CNI

UNP

BBY

(16)

ABC

CAH

MCK

URI

RCL

CCL

(34)

(26)

HOT

TSCO

GPC

EBAY

(38)

CA

CTSH

ADP

MSFT

(2)

RHT

(7) SYMC

(3)

FFIV

(16) T

CTL

BCE

TSM

(17)

NTT

TEF

MXIM (23)

TXN

INTC

VZ

(22)

(15) ADSK

TRMB INFY

CSC

XRX

FIS

KT

VOD

CHL

(24)

(19)

gy

S

CHU

RCI

VIV

TU

KLAC

(14)

JNPR

MU

(20)

ASX

(13) LRCX

EA

(1)

ENI

(3)

CMS

AEE

WEC

AES

EOC

DTE

ETR

FE

HNP

XEL

ED

EIX

PCG

PPL

AEP

KEP

SO

D

NEE

(2)

NI

NU

PEG

SRE

EXC

DUK

ies Utilit

MBT

TKC

VIP

ATVI AMAT TSU

(12) ASML

GIB

NTES MCHP

LVLT GRMN (21)

TLK

CCI

LLTC

EQIX NVDA SKM

(11)

DCM CSCO

(5)

NTAP

SNDK

WDC

EMC

(4)

ALU

MSI

NOK AKAM ALTR

ERIC YHOO ARMH AMX

(18)

SWKS

XLNX

ADI

WIT BRCM ORAN

ACN

IBM

(9)

CERN

(8)

APH

QCOM (10)

nolo

JBHT

(40)

PAYX CTXS

(39)

Tech

MCD

(35)

MAR MGM

(25)

SIG

TIF

(24)

(14) FAST SBAC

(15)

SAP

(6) HPQ

(37) ADBE GLW CHKP

TYC

(31) SPLS

TRI

(30)

PSO

(29)

(23) RYAAY

LOW

HD

(22)

(1)

(36) ORCL

YUM

HRB SBUX INTU

(28)

FDO GWW (33)

MGA WAG

(6)

AAP

ORLY DLTR

AZO

(5)

(21)

SWY

CBD

WFM

KR

(20)

CUK

(19)

SYY

(18)

TWX

FOXA

(13) BBBY

KSS

M

TJX

(12)

KMX WMT

(4)

JWN

ROST

GPS

LB

LUX AMZN

(3)

EXPD

CHRW NFLX

FDX

UPS

(2)

IPG

OMC FISV CMCSA HSIC

ices Serv

VE

SRCL WPPGY MCO

(4)

CNP

OKE

−0.069787

−0.03877

−0.0077541

0.023262

0.054279

0.085295

where N is the number of DMD modes. Since dynamic modes are only approximations to Koopman modes, the time evolution ak (t)’s are not precisely of the form eΛk t ; they need to be evaluated through a projection. The DMD matrix A, defined in Supplementary Materials, is real but is generally not symmetric [25]. Consequently, its eigenvalues are not necessarily real and dynamic modes are not necessarily orthonormal. The computation of ak (t) requires the eigendecomposition of the operator A† conjugate to A. Its eigenvalues are Λ∗k and the ˜ k (x)}. They are orthonormal to corresponding eigenfunctions are denoted {Φ P ˜ the dynamic modes; i.e., x Φm (x)Φn (x) = δmn . Consequently, ak (t) =

X

˜ k (x)u(x, t). Φ

(3)

x

Figure 6(a) shows the time evolution a21 (t) of the reproducible dynamic mode Φ21 (x). The power spectrum, Figure 6(b), shows that the dynamics contains a small broadband component, due to the fact that dynamic modes are only approximations to the Koopman modes. The scatter plot of the real vs. imaginary parts of a21 (t) in Figure 6(c) exhibits irregular dynamics. Interestingly, the “angular” progression of the phase-space orbit is highly regular. To illustrate this point, we define θ(t) = tan−1

Im(a(t)) , Re(a(t))

(4)

where θ(t)/2π returns to its original value after a period T = 2π/ Im(Λ). Fig175

ure 6(d) shows that the dynamics of θ21 (t) is highly regular while that of the magnitude |a21 (t)| is irregular. Spatial structures Φk (x)’s and time evolutions

ak (t)’s for the remaining reproducible modes can be found in the Supplementary Materials. The time series of the valuation of any stock contains cyclic components 180

associated with the four reproducible modes. For example, Figure 7 shows these components for the Exxon-Mobil Corporation (XOM), the Wells-Fargo & Comapany (WFC), and the Intel Corporation (INTC) for the mode Φ21 (x). It is interesting to speculate if a trading strategy can be developed to exploit these highly regular and robust oscillations. 12

1.5

1000 Re(a21 (t))

a21 (t)

0.5 0 -0.5 -1

800 600 400 200

-1.5 11/02 03/05 08/07 12/09 05/12 10/14

0 0

0.02

(a)

(b) 1.4 1.2 1 0.8

|a21 (t)|

1

0

15 θ21 (t)/2π 10

0.6 5 0.4 0

-1 0.2 -2 -2

0.03

Frequency (cycles/trading day)

2

Im(a21 (t))

0.01

Date

θ21 (t) 2π

1

Power (arb. unit)

Re(a21 (t)) Im(a21 (t))

-1

0

1

|a21 (t)|

-5 11/02 03/05 08/07 12/09 05/12 10/14

2

Date

Re(a21 (t))

(c)

(d)

Figure 6: (a) The time evolution of the coefficient a21 (t) of the reproducible dynamic mode Φ21 (x) whose period is approximately 250 trading days. (b) Power spectrum of a21 (t) shows that, unlike for Koopman modes, the dynamics of a21 (t) is not precisely periodic. (c) The points of a21 (t) in the complex plane exhibit significant irregularities. (d) However, the behavior of the phase angle, defined as θ21 (t) = tan−1 [Im(a21 (t))/ Re(a21 (t))], evolves smoothly; the modulus |a21 (t)| fluctuates in time.

185

We finally inquire if the cyclic behaviors can be observed on single indices. One such quantity may be the average u ¯(t) of the 567 stock valuations, weighted by their market capitalizations. Figure 8(a) shows its evolution and Figure 8(b) shows the power spectrum. The arrows in Figure 8(b) are the frequencies of the four reproducible modes. It is clear that the robust cyclic behavior of the

190

collection of stocks cannot be extracted from u ¯(t).

13

0.08 XOM WFC INTC

0.06

2Re[a21 (t)Φ21 (x)]

0.04 0.02 0 -0.02 -0.04 -0.06 -0.08 -0.1 11/02

03/05

08/07

12/09

05/12

10/14

Date

Figure 7: Evolution of the valuation of the Exxon-Mobil Corporation (XOM), the Wells-Fargo & Comapany (WFC), and the Intel Corporation (INTC) associated with the reproducible mode Φ21 (x).

10 3

0.4 0.3 0.2

10 2

Power (arb. unit)

0.1

u(t)

0 -0.1 -0.2 -0.3

10 1

10 0 -0.4 -0.5 -0.6 11/02

10 -1 03/05

08/07

12/09

05/12

10/14

0

Date (a)

0.005

0.01

0.015

0.02

0.025

0.03

Frequency (cycle/day) (b)

Figure 8: (a) The dynamics u ¯(t) of the mean return of the 567 stocks weighted by their market capitalizations. (b) The power spectrum of u ¯(t). Frequencies of the four reproducible modes are labeled by arrows.

2.4. Are Reproducible Modes Artifacts of Dynamic Mode Decomposition? The presence of the four modes in several sub-groupings of the data, and the persistence of their periodicity through the great recession lends credence to the assertion that reproducible modes are reflections of financial market dynamics. 14

195

However, we wish to verify that they are not artifacts of the computation of dynamic modes. We test this using two model data sets. • The first contains 3000 steps from each of 567 realizations of the same

Brownian process. Thus, the dynamics of each synthetic stock is assumed to follow the same stochastic process. We conduct dynamic mode de-

200

composition on this synthetic data in a search for reproducible dynamic modes. • The second data set is identical to the first, except for the inclusion of a

periodic drift in the stochastic process. The drift for different entities have the same amplitude and period but random initial phases. Once again,

205

we search for reproducible modes in the simulated data. The Brownian process in the first simulation has an annualized standard deviation of 0.1. Dynamic mode decomposition of the 567 distinct trajectories was implemented using the same subgroups and thresholds as in the analysis of financial market data. No reproducible modes were found. The second set of

210

simulations contained, in addition, a sinusoidal drift term A sin( 2πt T + φ), where the period T was chosen to be 250 trading days and the phase shift φ for each “stock” was random. A reproducible mode of period T could be identified for A > Acr = 0.056. No other reproducible modes were found. 3. Discussion and Conclusions

215

The existence of economic cycles has been known for over a century. The goal of the work reported here was to search for similar cyclic behavior in the dynamics of the stock valuations. Since they have not been reported previously, one expects their consequences to be subtle; in particular, it is unrealistic to expect to extract them from the time series of a single index or a small set of

220

indices. This motivated our choice to study the collective dynamics of stocks with the highest market capitalizations.

15

Koopman decomposition is a natural approach to extract spectral properties of stock valuations. However, the ability to differentiate robust and non-robust dynamical characteristics is critical to complete the analysis. We implemented 225

the differentiation by associating robust features of the market dynamics with reproducible dynamic modes, i.e., modes that persist in distinct sub-groupings of the time series. Our analysis identified four reproducible modes with periods 250±4, 108±2, 91±1, and 76±2 trading days. The period of the first of these is a year, and likely reflects seasonal changes in stock valuations. That we identified

230

this period lends credence to our analysis. As illustrated in Figure 7 using the time series for Exxon-Mobil Corporation (XOM), Wells-Fargo & Comapany (WFC), and Intel Corporation (INTC), reproducible modes constitute part of the valuations of individual stocks. At this point we cannot determine if the robust cycles are caused by changes in economic activities or have their origins

235

in trading rules and practices. It was particularly interesting that the cyclic behavior persisted through the great recession, even though the form of the dynamic mode, or equivalently the phase relationships between different stocks, changed over the intervening period. It will be of interest to speculate if a trading strategy can be developed to exploit these highly regular and robust

240

oscillations. As seen in Figure 3, the four reproducible modes contribute only a very small fraction of the energy of the market dynamics. For this reason, the modes can only be identified through a composite analysis of the dynamics of a large number of stock valuations. Studies of single indices, such as the example shown

245

in Figure 8(b), are not sufficient to extract these modes. In our exposition, we interpreted the remaining variations as due to non-robust dynamics; i.e., contributed by non-reproducible dynamic modes. However, we are unlikely to know the dynamics F underlying the market; we may thus interpret (at least part of) these fluctuations as resulting from stochastic processes.

250

Analysis of the synthetic data introduced in Section 2.4 supports our contention that the observed robust cyclic behavior is not an artifact of the dynamic mode decomposition. The simulations also show that in order for a cyclic be16

havior to be observable, its amplitude need to be larger than a critical value. Thus, there may be additional smaller amplitude cyclic features of the markets. 255

Intra-day variations in currency exchange rates [27] and stock prices [28] are governed by non-stationary stochastic processes. In addition, while autocorrelation functions for increments in the return decay rapidly, those for the absolute values of the increments decay slowly [29]. It will be of interest to determine if these intra-day variations contain cyclic behaviors as well. We are

260

currently investigating this issue. Acknowledgments We wish to thank Robert Azencott for discussions and important suggestions to our work. References

265

[1] Madhani, P. M. Rebalancing Fixed and Variable Pay in a Sales Organization: A Business Cycle Perspective. Compens. Benefits Rev. 42, 179–189 (2010). [2] de Sismondi, J. C. L. S. Nouveaux principes d’´economie politique, ou de la richesse dans ses rapports avec la population, vol. 2 (Delaunay, Paris,

270

1827). [3] Juglar, C. Des Crises commerciales et de leur retour periodique en France, en Angleterre, et aux Etats-Unis. Paris: Guillaumin (1862). [4] Lee, M. W. Economic Fluctuations: An Analysis of Business Cycles and Other Economic Fluctuations (R. D. Irwin, 1955).

275

[5] Kitchin, J. Cycles and Trends in Economic Factors. Rev. Econ. Stat. 5, 10 (1923). [6] Kuznets, S. S. Secular Movements in Production and Prices: Their Nature and Their Bearing Upon Cyclical Fluctuations (A. M. Kelley, 1967). 17

[7] Kondratieff, N. D. & Stolper, W. F. The Long Waves in Economic Life. 280

Rev. Econ. Stat. 17, 105 (1935). [8] Pomˇenkov´a, J., Fidrmuc, J. & Korhonen, I. China and the World Economy: Wavelet Spectrum Analysis of Business Cycles. Bank of Finland Discussion Papers 5 (2014). [9] Raihan, S. M., Wen, Y. & Zeng, B. Wavelet: A new tool for business cycle

285

analysis. Federal Reserve Bank of St. Louis Working Paper Series (2005). [10] Lumley, J. L. The Structure of Inhomogeneous Flow. In Atmospheric Turbulence and Radio Wave Propagation (Publishing House Nauka, Moscow), 166-176 (1967). [11] Sirovich, L. Turbulence and the Dynamics of Coherent Structures .1. Co-

290

herent Structures. Q. Appl. Math. 45, 561–571 (1987). [12] Sirovich, L. Turbulence and the Dynamics of Coherent Structures .2. Symmetries and Transformations. Q. Appl. Math. 45, 573–582 (1987). [13] Sirovich, L. Turbulence and the Dynamics of Coherent Structures .3. Dynamics and Scaling. Q. Appl. Math. 45, 583–590 (1987).

295

[14] Meng, H. et al. Systemic risk and spatiotemporal dynamics of the US housing market. Sci. Rep. 4 (2014). [15] Hassani, H. & Thomakos, D. A review on singular spectrum analysis for economic and financial time series. Stat. Interface 3, 377–397 (2010). [16] Groth, A., Ghil, M., Hallegatte, S. & Dumas, P. Identification and re-

300

construction of oscillatory modes in US business cycles using Multivariate Singular Spectrum Analysis. In Workshop on Frequency Domain Research in Macroeconomics and Finance (Bank of Finland, Helsinki, 2011). [17] Hassani, H., Soofi, A. S. & Zhigljavsky, A. A. Predicting daily exchange rate with singular spectrum analysis. Nonlinear Anal. Real World Appl.

305

11, 2023–2034 (2010). 18

[18] Afshar, K., Bigdeli, N. & Shayeghi, H. Singular spectral analysis applied for short term electricity price forecasting. IJTPE 4, 32–40 (2012). [19] Wen, F., Xiao, J., He, Z. & Gong, X. Stock Price Prediction based on SSA and SVM. Procedia Computer Science 31, 625–631 (2014). 310

[20] de Carvalho, M., Rodrigues, P. C. & Rua, A. Tracking the US business cycle with a singular spectrum analysis. Econ. Lett. 114, 32–35 (2012). [21] Mezic, I. & Banaszuk, A. Comparison of systems with complex behavior. Physica D 197, 101–133 (2004). [22] Rowley, C. W., Mezic, I., Bagheri, S., Schlatter, P. & Henningson, D. S.

315

Spectral analysis of nonlinear flows. J. Fluid Mech. 641, 115–127 (2009). [23] Budisic, M., Mohr, R. & Mezic, I. Applied Koopmanism. Chaos 22, 047510 (2012). [24] Mezic, I. Analysis of Fluid Flows via Spectral Properties of the Koopman Operator. In Davis, SH and Moin, P (ed.) Ann. Rev. Fluid Mech., Vol 45,

320

357–378 (2013). [25] Schmid, P. J. Dynamic mode decomposition of numerical and experimental data. J. Fluid Mech. 656, 5–28 (2010). [26] Roy, S., Hua, J.-C., Barnhill, W., Gunaratne, G. H. & Gord, J. R. Deconvolution of reacting-flow dynamics using proper orthogonal and dynamic

325

mode decompositions. Phys. Rev. E 91, 013001 (2015). [27] Bassler, K. E., McCauley, J. L. & Gunaratne, G. H. Nonstationary Increments, Scaling Distributions, and Variable Diffusion Processes in Financial Markets. Proc. Natl. Acad. Sci. USA 104, 17287–17290 (2007). [28] Hua, J.-C., Chen, L., Falcon, L., McCauley, J. L. & Gunaratne, G. H.

330

Variable diffusion in stock market fluctuations. Physica A 419, 221–233 (2015).

19

[29] Heyde, C. & Leonenko, N. Student processes. Adv. in Appl. Probab. 37, 342–365 (2005).

Author Contributions Statements 335

J.-C.H, G.H.G., and S.R. developed the technique for separating robust and non-robust constituents of spatio-temporal dynamics. J.-C.H. proposed its application to financial markets and performed the computations. J.L.M. contributed important insights to the project. J.-C.H. and G.H.G. wrote the paper.

340

Additional Information The authors declare no competing financial interests.

20