Exploiting the self-organizing financial stability map

Engineering Applications of Artiﬁcial Intelligence 26 (2013) 1532–1539 Contents lists available at SciVerse ScienceDirect Engineering Applications o...

Download PDF

811KB Sizes 0 Downloads 42 Views

Report

PDF Reader
Full Text

Engineering Applications of Artiﬁcial Intelligence 26 (2013) 1532–1539

Contents lists available at SciVerse ScienceDirect

Engineering Applications of Artiﬁcial Intelligence journal homepage: www.elsevier.com/locate/engappai

Exploiting the self-organizing ﬁnancial stability map Peter Sarlin n ˚ Akademi University, Joukahaisenkatu 3–5, 20520 Turku, Finland Department of Information Technologies, Abo

a r t i c l e i n f o

a b s t r a c t

Article history: Received 9 July 2012 Received in revised form 5 November 2012 Accepted 7 January 2013 Available online 5 February 2013

This paper enhances the visualization and extraction of information on the self-organizing ﬁnancial stability map (SOFSM). The SOFSM uses the self-organizing map to represent a multidimensional ﬁnancial stability space on a two-dimensional grid and allows monitoring economies in the ﬁnancial stability cycle represented by four states. The SOFSM lacks, however, means for thorough assessment of general structures and individual data. We enhance the visualization and information extraction of the SOFSM by the means of four tasks: (1) fuzziﬁcation of the map, (2) probabilistic modeling of state transitions, (3) contagion analysis and (4) outlier detection. The usefulness of the extensions is shown with sample visualizations and predictive performance. & 2013 Elsevier Ltd. All rights reserved.

Keywords: Financial crisis Self-organizing ﬁnancial stability map Fuzziﬁcation Transition probabilities Contagion Outlier detection

1. Introduction Machine learning and artiﬁcial intelligence techniques are common means for prediction, modeling and intelligence augmentation in a wide range of engineering applications (e.g., Cheng et al., 2008; Fuertes et al., 2010; Jia et al., 2008; Lin et al., 2006; Muttil and Chau, 2007; Xie et al., 2006; Zhang and Chau, 2009). Yet, the application of such techniques in ﬁnancial modeling, such as ﬁnancial stability surveillance, is still in its early beginning. In particular, visual exploration of high-dimensional data, while being an important task for the understanding of data structures, is oftentimes a neglected part in ﬁnancial modeling. A starting point to this has been set by two attempts to map the state of ﬁnancial stability: a spider-chart published by the International Monetary Fund (IMF) (Dattels et al., 2010; IMF, 2012) and a self-organizing ﬁnancial stability map (SOFSM) published by the European Central Bank (ECB) (Sarlin and Peltonen, 2011; ECB, 2011). This paper focuses on enhancing the SOFSM that employs a self-organizing map (SOM) (Kohonen, 1982, 2001) based method for mapping the state of ﬁnancial stability onto a two-dimensional plane.1 The SOFSM allows monitoring economies in the ﬁnancial stability cycle represented by four stages: pre-crisis, crisis, post-crisis and tranquil states. It uses

n

Corresponding author. Tel.: þ358 22154670; fax: þ 3582 215 4809. E-mail address: psarlin@abo.ﬁ 1 SOMs have been applied in a wide range of topics, e.g., on currency crisis Sarlin and Marghescu (2011) and general economic (Resta, 2009) indicators. In addition, see Deboeck and Kohonen (1998) for an overview of applications in ﬁnance and Sarlin (2012b) for a comparison on using data and dimension reduction methods for ﬁnancial performance analysis. 0952-1976/$ - see front matter & 2013 Elsevier Ltd. All rights reserved. http://dx.doi.org/10.1016/j.engappai.2013.01.002

the SOM to perform a neighborhood-preserving projection onto a grid of nodes with a predeﬁned regular shape. The predeﬁned grid shape facilitates using the SOFSM as a display or basis for additional information, such as visualizing individual data samples or general structural properties of data. This paper attempts to enhance the exploitation of the SOFSM by further developing the visualization and extraction of information on the map. In Sarlin and Peltonen (2011), the SOFSM was not only used for visualizing cross-sectional and temporal data on the two-dimensional display, but also for non-linear correlation hunting in relation to all four states, as well as evaluated in terms of out-of-sample predictive capabilities. It was not, however, paired with other techniques for enhanced visualization, information extraction and ﬁnancial stability surveillance. This paper uses the SOFSM as its basis and enhances the exploitation from four perspectives:

1. Fuzziﬁcation of the SOFSM for visualizing temporal belongingness to ﬁnancial stability states of individual data and class distance structures on the map. 2. Probabilistic modeling of state transitions on the SOFSM for visualizing probabilities of transition to ﬁnancial stability states of individual data and for assessing the general cyclical and temporal structure of the ﬁnancial stability cycle. 3. Using neighborhoods on the SOFSM and superimposed portfolio network topologies to assess the spread of ﬁnancial distress and shock propagation. 4. Computing distances between data and their mean proﬁles on the SOFSM to ﬁnd extreme events and imbalances in economies’ macro-ﬁnancial conditions.

P. Sarlin / Engineering Applications of Artiﬁcial Intelligence 26 (2013) 1532–1539

First, we fuzzify and classify the SOFSM with a distance-based metric. Instead of applying a crisp clustering algorithm on the units, as is commonly done (see, e.g., Vesanto and Alhoniemi, 2000), we use perfect ﬁnancial stability states derived from classes in data as representative cluster centers. Then, an adapted version of the membership functions in Cottrell and Letre´my (2005) and Sarlin and Eklund (forthcoming) is used to compute class memberships. By a defuzziﬁcation of the results using the maximum-membership method, we also provide a crisp clustering. This enhances the visualization capability by enabling assessment of temporal belongingness to the ﬁnancial stability states, where the states are expressed by representative cluster centers and ﬂuctuations in macro-ﬁnancial conditions are represented by the temporal variation of belongingness. In addition, visualizations of the memberships on a SOM grid enable assessment of the class structures. Second, we assess temporal variation on the SOFSM. Following previous works on transition probabilities on the SOM (Sulkava and Hollme´n, 2003; Luyssaert et al., 2004; Fuertes et al., 2010; Sarlin et al., 2012), we exploit the temporal patterns by computing, summarizing and visualizing probabilities of future state transitions. This enables not only assessing the most likely future state transitions for each node, and thus also pairing to individual data, but also country proﬁling of low- and high-risk ﬁnancial stability states. Most notably, while this validates the assumed ﬁnancial stability cycle, intra cluster differences in transition patterns show that the cycle is not entirely well-behaving or continuous. Third, we assess contagion effects on the SOFSM. We analyze shock propagation from two perspectives: neighborhoods on the SOFSM and a superimposed portfolio network topology. While the former approach was suggested already in the stand-alone SOFSM, this paper extends the concept by evaluating the value added in terms of predictive performance. The latter perspective on contagion applies the approach in Sarlin (2012a) by combining a portfolio network topology with a ﬁnancial stability topology, i.e., superimposing a network of bilateral portfolio exposures on the SOFSM. This enables two approaches for analyzing the spread of ﬁnancial instabilities: the ﬁnancial stability topology indicates propagation of ﬁnancial distress to similar macro-ﬁnancial conditions, while the portfolio network topology indicates contagion through asset-based direct linkages. Fourth, we assess whether or not, and to what extent, distances between each datum and its mean proﬁle on the SOFSM (best-matching unit, BMU) is an indication of ﬁnancial imbalances. The rationale for this is that the units of the SOM, while being topologically ordered, tend to approximate the probability density function of data (Kohonen, 2001). The distance to a BMU represents thus the ﬁt of a single datum to the multivariate data distribution, i.e., its degree of extremity. When monitoring ﬁnancial stability, one could assume that large distances represent ﬁnancial imbalances in macro-ﬁnancial conditions. Our approach goes beyond applying a pre-deﬁned threshold value on the distance to assess whether or not a datum is an outlier (see, e.g., Vesanto et al. (1998) and Saunders and Gero (2001)), by setting the threshold to optimize predictive performance. Out of the above approaches, we show the usefulness of the fuzziﬁcation, transition probabilities and portfolio network topology with sample visualizations on or together with the SOFSM. The usefulness of neighborhood contagion and outlier analysis is tested in terms of their performance in predicting ﬁnancial crises. The paper is structured as follows. Section 2 introduces the data, the SOM, the fuzziﬁcation, transition probabilities, contagion analysis, outlier detection and the evaluation scheme. We present the enhanced visualization and extraction of information on the SOFSM in Section 3, and conclude in Section 4 by presenting our key ﬁndings, limitations and future research directions.

1533

2. Methodology This section presents the data, methods and evaluations used in this paper. 2.1. Data and pre-processing The data used in this paper are of two sorts: macro-ﬁnancial indicators with class information and asset-based linkages. The ﬁrst type of data follow the dataset in Lo Duca and Peltonen (2012) and consist of a set of vulnerability measures commonly used in the macroprudential literature (see, e.g., Alessi and Detken, 2011; Borio and Lowe, 2002, 2004) and binary class information representing pre-crisis, crisis, post-crisis and tranquil periods. The quarterly dataset consists of 28 countries, 10 advanced and 18 emerging economies, spanning from 1990: 1–2010:3, that is a panel dataset consisting of both a crosssectional and a temporal dimension. Hence, a data vector xjAR18 is formed of a class vector xj(cl)AR4 and an indicator vector xj(in)AR14 for each quarter and country in the sample. To control for cross-country differences and ﬁxed effects, each column of the input data is transformed into its historic country-speciﬁc percentile distribution. The second types of data consist of a network of ﬁnancial linkages. Portfolio exposures are based upon external assets (equities and bonds) as reported in the Coordinated Portfolio Investment Survey by the IMF. It is worth noting that exposures of central banks are not included due to the different nature of their holdings. 2.2. Self-organizing maps (SOMs) The SOM (Kohonen, 1982) is a technique with both clustering and projection capabilities that performs a neighborhoodpreserving projection onto a grid of nodes with a predeﬁned regular shape. As the SOM algorithm is well-known and the main emphasis is on exploiting additional information on the SOM, we do not present details of it here—for details see Kohonen (2001). For its excellent visual representation, the Viscovery SOMine 5.1 package is used as a basis in this study. After an initialization of the reference vectors as to the two principal components, the training process follows two steps. First each input data vector xj (where j ¼1,2,y,N) is paired with its best-matching unit (BMU) mb of all reference vectors mi (where i¼1,2,y,M) using Euclidean distance. The second step adjusts each reference vector mi with the batch updating formula: N P

mi ðt þ 1Þ ¼

hibðjÞ ðtÞxj

j¼1 N P

,

ð1Þ

hibðjÞ ðtÞ

j¼1

where t is a discrete time coordinate and hib(j) is a decreasing function of neighborhood radii and time. However, we employ a semi-supervised version of the SOM by using data vector xjAR18, formed of a class vector xj(cl) and an indicator vector xj(in), in both steps of training. It is important noting the use of the class and indicator vectors when mapping data onto the SOM. After training a SOM model, only the indicator vector xj(in) is used for locating data to their BMUs (mb(in)). Then, the class values of their BMU (mb(cl)) are used as estimates of the current state. The two-dimensional SOM grid describes the topology of a multidimensional space. As all information on the map cannot be visualized in two dimensions, additional information can be shown on own grids. The predeﬁned grid shape of the SOM facilitates this type of visualization linking. For instance, a socalled feature plane can be used for showing the spread of an

1534

P. Sarlin / Engineering Applications of Artiﬁcial Intelligence 26 (2013) 1532–1539

indicator on the SOM, where cold colors (blue) indicate low and warm colors (red) high values. As these are different views of the same grid, a unique point represents always the same node.

2.3. Fuzziﬁcation of SOMs The SOM is commonly partitioned using a crisp clustering technique (see, e.g., Vesanto and Alhoniemi, 2000). However, as we not only have class information, but also utilize a semisupervised SOM with the classes in the ordering process, we do not have to estimate clusters and their centroids. We adapt a fuzziﬁcation of the SOM proposed in Sarlin and Eklund (2011, forthcoming) for computing class memberships. Thus, we can classify and fuzzify the SOM based on the class vectors xj(cl)AR4 by assuming the following: the number of states C equals the number of classes K, i.e., C¼K, and the state center ck (where k¼1,2,y,C) for each class is a perfect representative state vector, i.e., 1 if k equals the state of ck : ð2Þ 8k ckðclÞ ¼ 0 otherwise While there exist other methods for class visualization on the SOM, such as Voronoi regions (Mayer et al., 2007), they fall short in dealing with partial truth, uncertainty and imprecision in class memberships. We follow the approach in Sarlin and Eklund (forthcoming) by computing a membership degree using Euclidean distances between nodes and state centers. The fuzziﬁcation resembles that in Cottrell and Letre´my (2005), by being implemented on second-level state centers instead of directly on the nodes, by not assuming inverse exponential distances and by introducing a fuzziﬁcation parameter. In addition, our aim is a visualization rather than a method for imputing missing values which is the key focus of Cottrell and Letre´my. Our approach differs from that in Sarlin and Eklund (forthcoming) by only using xj(cl)AR4 for computing the distances. The rationale for this is the focus on distances between mean proﬁles of classes rather than those between indicators. The SOM is fuzziﬁed by computing the inverse distance between reference vector mi(cl) and each state center ck(cl): uik ¼

1 2

1 þ 99miðclÞ ckðclÞ 99y1

,

ð3Þ

where yA(1,N) is a similar fuzziﬁer as the one used in fuzzy C-means clustering. The fuzziﬁer y affects the distance metric by setting the extent of overlap between the states, where low values provide crisp results and high fuzzy results. The fuzziﬁers y ¼2 and y ¼3 can be seen as benchmark values by resulting in simple and squared Euclidean distances. Eq. (3) extends the approach in Sarlin and Eklund (forthcoming) by also adding a 1 to the denominator to be deﬁned for all distances. We normalize the similarity matrix uik to the following matrix of state membership degrees for each node: vik ¼

uik , C P uik

ð4Þ

k¼1

P to fulﬁll the probabilistic constraint Ck ¼ 1 vik ¼ 1: In addition to computing membership degrees, we follow previous studies by defuzzifying the results using the maximum-membership method. The crisp states on the SOM can be shown by contours, while nodes’ membership degrees to each state can be visualized on own grids per state, i.e., membership planes. Further, the evolution of individual data can be visualized using line graphs by associating each datum with the membership degrees of its BMU.

2.4. Transition probabilities on SOMs The standard SOM paradigm does not explicitly address the issue of temporality on the grid. While trajectories have been a common means to illustrate temporal movements, small samples give no indication of overall patterns and large clutter the display. A more general probabilistic model of the temporal variations in a SOM model has been introduced through node-to-node transition probabilities (Sulkava and Hollme´n, 2003; Luyssaert et al., 2004; Fuertes et al., 2010). Recently, node-to-cluster transition probabilities have been proposed to be computed on SOMs with a second-level clustering and visualized on their own SOM grids (Sarlin et al., 2012). Movements between nodes on the twodimensional SOM are used to compute probabilities of switching from one place to another in a speciﬁed time period. In general, we can compute transition probabilities of switches from each node to every other node. To facilitate visualization, we focus on node-to-state switches by computing pik, where the transition refers to movements from reference vector i to state k: pik ðt þ sÞ ¼

nik ðt þsÞ : C P nik ðt þsÞ

ð5Þ

k¼1

where nik is the cardinality of data switching from mi to ck, t is a time coordinate and s is the time span for the switch. The probability of switching to a particular state can again be associated with each of the SOM nodes, and linked to the SOM visualization. That is, we visualize the probabilities on their own transition planes, where color codes show the probability of transition. To summarize the most likely switches, we show maximum transition probabilities (maxk ðpik Þ) conditional on switching on an own transition plane, where labels show location and color probability of transition. The evolution of individual data can again be associated to the transition probability of their BMU and visualized using line graphs. 2.5. Shock propagation on SOMs The process of some event being transmitted to another entity goes by different names, such as contagion, shock propagation or the spread of an event. We analyze this occurrence through two transmission channels: links between entities and similarities in indicator values. The ﬁrst approach follows that in Sarlin (2012a) by superimposing a network of bilateral exposures on the SOM. While the standard SOM provides a, for example, twodimensional mapping of input data, the SOM does not per se account for relationships or links in data. Visual clustering of network data has been directly performed by applying a standalone SOM on data relationships (Boulet et al., 2008). However, we extend the standard SOM with a superimposed network for simultaneous assessment of dependencies in data, such that positions are set by the indicator-based SOM while links are set by the relationships between the entities. Networks of relationships are mostly expressed in matrix form, where the link between entities g and l in matrix A is represented by element agl. The matrix is of size n2, where n is the number of entities. Matrices of directed graphs, as the one we use, can be read in two directions: rows of A represent the relationship of g to l and columns of A represent the relationship of l to g. To combine SOMs and network analysis, we superimpose network relations on top of the standard SOM grid by visualizing relationships between entities. Labels of entities under analysis, say g and l, are projected to their BMUs of the SOM based upon their data xj ¼ g and xj ¼ l. After that, relations between entities g and l are visualized by edges between the locations of the BMUs of xg and xl on the SOM grid using elements agl and alg.

P. Sarlin / Engineering Applications of Artiﬁcial Intelligence 26 (2013) 1532–1539

The second approach measures neighborhood effects on the SOM, as previously suggested but not quantitatively tested in the SOFSM (Sarlin and Peltonen, 2011). First, we assign one of the classes K to be the event of interest. Then, instead of using the associated class vector (pre-crisis class) based upon historical data as an estimate of probable future events, we assign the locations of the event of interest in period t to be signals of similar events in that location in period tþs, where s is the time span for transmission. 2.6. Outlier analysis with SOMs The SOM paradigm is well suited for outlier detection. The nodes mi of the SOM, while being topologically ordered, tend to approximate the probability density function of data p(x) (Kohonen, 2001). The standard quantization error (QE) can be seen as the correspondence between the nodes and the data. However, a more meaningful estimate of event rarity is computing the distance of individual data xj to their BMUs mb. Due to the property of approximating the probability density function, the individual QE represents the ﬁt of a single datum to the multivariate data distribution. An outlier, and its degree of extremity, can thus be estimated by the distance to the SOM in a multidimensional setting, i.e., dj ¼ 99xjðinÞ mbðinÞ 99. The literature has commonly applied a pre-deﬁned threshold value on the distance to assess whether or not a datum is an outlier (see, e.g., Vesanto et al. (1998) and Saunders and Gero (2001)). The approach in the present paper goes, however, beyond using a pre-deﬁned threshold by setting the threshold to optimize predictive performance. 2.7. Performance evaluation The performance of the extensions, in particular contagion and outliers, is evaluated with a Usefulness measure that incorporates the preferences of a policymaker. We assume the predictive model to issue a signal when the estimated probability of a crisis is above a speciﬁed threshold. The signals and non-signals are then collected into a contingency matrix comprising of true positives (TP), true negatives (TN), false positives (FP) and false negatives (FN). In this paper, we follow the approach proposed in Sarlin (2013) by maximizing the usefulness for a policymaker with speciﬁc preferences about giving false alarms (type I errors) and missing crises (type II errors) when calibrating an optimal model and threshold for policy action. The used approach extends ¨ the measures used in Demirguc-Kunt and Detragiache (2000) and Alessi and Detken (2011). We derive the following loss function for a policymaker: LðmÞ ¼ mT 1 P 1 þð1mÞT 2 P 2 ,

ð6Þ

where the parameter m sets the policymaker’s relative preferences between missing crises and giving false alarms, type I errors T1A[0,1]¼FN/(TPþ FN) represent the proportion of missed crises relative to the number of crises in the sample, type II errors T2A[0,1]¼FP/(FPþTN) the proportion of false alarms relative to the number of tranquil periods in the sample, and unconditional probabilities P1 and P2 correspond to the frequency of pre-crisis and tranquil times. Thus, the preference parameter m deﬁnes the risk-aversion between type I and II errors of the policymaker. The policymaker is equally concerned of missing crises and issuing false alarms for a value of m ¼0.5, more concerned about issuing false alarms when m o0.5, and more concerned about missing crises when m 40.5. In this paper, we follow the preferences in the SOFSM by being substantially more concerned of missing a crisis (m ¼0.8), which also reﬂects the imbalanced relative costs between missing a crisis and giving a false alarm. Following Sarlin (2013), the usefulness of one speciﬁc threshold is derived by subtracting L(m) from the expected value of a

1535

best-guess given the preferences and unconditional probabilities ðminðmP1 ,P 2 ð1mÞÞLðmÞÞ. Hence, we can deﬁne the relative Usefulness by computing the share of usefulness to the available usefulness of the model (minðmP 1 ,P 2 ð1mÞÞ): minðmP1 ,P 2 ð1mÞÞLðmÞ , Ur m ¼ minðmP 1 ,P2 ð1mÞÞ

ð7Þ

where the Usefulness of the model is compared with the loss of disregarding the model and divided by the available usefulness. Thereby, the interpretation of the relative Usefulness Ur(m) is the percentage gain relative to not using a model. Finally, the performance of a model is solicited by choosing a threshold on the probability of a crisis such that usefulness Ur is maximized.

3. Exploiting the self-organizing ﬁnancial stability map The SOFSM uses the SOM for mapping the state of ﬁnancial stability onto a two-dimensional plane. It uses as inputs a set of indicators commonly used in the macroprudential literature and class information representing pre-crisis, crisis, post-crisis and tranquil periods. Sarlin and Peltonen (2011) put forward a technical framework for building a two-dimensional ﬁnancial stability map, evaluate it and test its robustness, as well as show its usefulness for visualizing data. The parameters of the SOFSM are tuned with respect to three aspects: (1) parsimonity and generalization, (2) objective learning on in-sample data, and (3) visualization capabilities. The SOFSM and trajectories for the United States (US) and Euro area conditions can be found in Fig. 1a, and sample feature planes in Fig. 1b. This section applies the extensions described in Section 2 to the SOFSM for the following four tasks: (1) fuzziﬁcation of the map, (2) probabilistic modeling of state transitions, (3) contagion analysis and (4) outlier detection. The extensions of the SOFSM, and their usefulness, are illustrated with sample visualizations and predictive performance. 3.1. Fuzziﬁcation of the SOFSM We employ the distance- and class-based fuzziﬁcation of the SOFSM. Class information is accounted for by setting the number of states equal to the number of classes, i.e., four, and their centers as perfect states of the ﬁnancial stability cycle: pre-crisis, crisis, post-crisis and tranquil states. The choice of C ¼4 is hence not a free parameter as it is speciﬁed from the number of classes K. We test different speciﬁcations of the fuzziﬁer (y ¼{1.0,1.2,...,5.0}), but end up using squared Euclidean distances (y ¼2) since that allows for fuzzy, but not entirely erased, state borders. While the differences between the most extreme choices are signiﬁcant, the results are stable for values close to y ¼2. In Fig. 2, memberships to each state are shown on membership planes, where also the defuzziﬁed crisp states are shown by contour lines (as well as on all other grids, e.g., Fig. 1). The crispest part is the upper right corner of the tranquil state, whereas the rest have more overlap. Fig. 3 shows how the fuzziﬁcation can be turned into line graphs for the trajectories in Fig. 1, where membership in states and their variation over time represent ﬂuctuation in the current state of ﬁnancial stability. The line graphs clearly depict increases in memberships in pre-crisis states prior to crises. 3.2. Transition probabilities on the SOFSM We exploit temporal variation on the SOFSM by employing probabilistic modeling of state transitions. The transition probabilities are computed for node-to-state switches and visualized on own transition planes (Fig. 4), and summarized as maximum transition probabilities conditional on switching, where labels

1536

P. Sarlin / Engineering Applications of Artiﬁcial Intelligence 26 (2013) 1532–1539

US 2004-05

Pre-crisis

US 2006

Tranquil US 2002 2010

US 2007

Euro 2002

Euro 2006 Euro 2007

US 2008-09

Euro 2008

Real GDP growth

Real credit growth

CA deficit

Government deficit

Global inflation

Global leverage

Global equity valuation

Pre-crisis periods

Euro 2004-05

Euro 2010

Post-crisis US 2003

Crisis

Inflation

Euro 2003

Euro 2009

Fig. 1. SOFSM and trajectories for the US and Euro area conditions (a), and feature planes (b). Notes: The feature planes in (b) are layers of the SOM grid in (a) and show the spread of indicators and classes on the grid. While the ﬁrst eight feature planes represent sample indicators, the last feature plane shows the spread of a class, the pre-crisis periods.

Tranquil

Pre crisis

Crisis

Post crisis

0.2 0.3 0.3 0.4 0.5 0.6 0.7 0.7 0.8 0.9 1.0 0.00 0.05 0.10 0.14 0.19 0.24 0.29 0.33 0.38 0.43 0.00 0.05 0.09 0.14 0.18 0.23 0.27 0.32 0.36 0.41 0.00 0.04 0.09 0.13 0.17 0.22 0.26 0.30 0.35 0.39

Fig. 2. Membership planes for the SOFSM, where X-marks represent state centers.

State memberships for US from 2002-2011

State memberships

1.0 Tranquil Pre-crisis Crisis Post-crisis

0.8 0.6 0.4 0.2 0.0 2002

2003

2004

2005

2006

2007

2008

2009

2010

2011

Time t State memberships for the euro area from 2002-2011

State memberships

1.0 Tranquil Pre-crisis Crisis Post-crisis

0.8 0.6 0.4 0.2

show location and color probability. We have tested a wide range of time spans (s¼{6,12,18,24,48}), but chose for analysis a time span of 18 months (s¼18). Hence, transition probabilities represent the likelihood of switching to a state within 18 months. The rationale behind choosing s ¼18 is that the SOFSM is also calibrated for optimal performance in terms of predicting vulnerable states 18 months prior to a crisis. Moreover, the transition patterns are considerably robust to changes in s. The length of movements increases with increases in s, as expected, while the directions of movements are stable. Most notably, while the transition patterns validate the assumed ﬁnancial stability cycle, the cycle is shown not to be entirely wellbehaving or continuous. For instance, the SOFSM shows high probability of transition to the crisis state on the border between the tranquil and pre-crisis states, as well as during extreme tranquil times. We also perform a similar line graph representation as that for the fuzziﬁcation, but instead with indications of future states. Fig. 5, while depicting probabilities according to the ﬁnancial stability cycle, illustrates the crisis indications in the extreme part of the tranquil state and a rise of new instabilities in 2010.

0.0 2002

2003

2004

2005

2006

2007

2008

2009

2010

2011

3.3. Shock propagation on the SOFSM

Time t Fig. 3. Line graphs of US and Euro area state membership degrees (see trajectory in Fig. 1).

We assess transmission of shocks on the SOFSM with two methods: a portfolio network topology and SOM neighborhoods.

P. Sarlin / Engineering Applications of Artiﬁcial Intelligence 26 (2013) 1532–1539

Tranquil

Pre crisis

Crisis

Post crisis

1537

Summary (t+{1,2,...,6}) 3 3 3 3 1 2 2 3 3 3 1 3 3 3 3 2 2 2 3 3 3 3 1 3 3 3 3 2 2 2 3 3 3 3 3 3 3 3 2 2 2 3 3 3 3 3 3 3 2 1 2 3 3 3 3 2 3 2 1 3 2 3 3 2 3 2 1 2 2 2 1 2 1 2 2 1 1 1 2 2 1 1 1 1 1 1 1 1 2 1 1 1 1 1

Fig. 4. Transition planes and summarized maximum state transitions for the SOFSM. Notes: In the summary grid, 1–4 stands for tranquil, pre-crisis, crisis and post-crisis states and E for empty nodes, respectively.

Transition probabilities for US from 2002-2011 Tranquil Pre-crisis Crisis Post-crisis

Transition probabilities

1.0 0.8 0.6 0.4 0.2 0.0 2002

2003

2004

2005

2006

2007

2008

2009

2010

2011

Time t

Transition probabilities for the euro area from 2002-2011 Transition probabilities

1.0

Tranquil Pre-crisis Crisis Post-crisis

0.8 0.6 0.4 0.2 0.0 2002

2003

2004

2005

2006

2007

2008

2009

2010

2011

Time t Fig. 5. Line graphs of US and Euro area transition probabilities (see trajectory in Fig. 1).

Tranquil

Pre-crisis

economy are located in their BMUs and their size is scaled as to the sum of exposures to other economies. The thickness of the edges represents the size of external exposure to total exposures per economy, where the color of the edge indicates the address of the exposure holder. Indeed, Fig. 6 combines the state of ﬁnancial stability, or probability of a crisis, with the systemic relevance of each economy. The size of ﬁnancial linkages to high-risk economies (e.g., Brazil and Mexico) enlighten about both past and present: high levels of previous ﬁnancial stress in the US may have impacted their current state and they still have a high risk of current and future shock propagation from the US. While crises are often transmitted through asset-based contagion channels, such as ﬁnancial linkages, they may also be propagated through similarities in macro-ﬁnancial conditions. Independent of location on the map, an economy in the neighborhood of countries in crisis could through contagion experience a similar wave of ﬁnancial distress. This is particularly important when dealing with data of changing nature, as it does not necessitate macro-ﬁnancial conditions to follow patterns during historical crises. Thus, an economy in the upper left part of Fig. 6, say Mexico, could propagate ﬁnancial instabilities to countries with similar macro-ﬁnancial vulnerabilities, e.g., Argentina and Brazil. While this is particularly useful for visual real-time surveillance, we can also test this by letting locations of crises in period t be signals of crises in that location in period t þ1. More precisely, we create a leading indicator that signals a crisis in node mb in period tþ 1 if a country that experienced a crisis in t was located in mb in period t. Table 1 shows the predictive performance of neighborhoods on the SOFSM with forecast horizons of 6–48 months, where a horizon of 24 months outperforms the rest. A policymaker with m ¼0.8 derives the largest usefulness. While the table conﬁrms the usefulness of detecting transmission of crisis, the nature of the contagion measures suggest that they are rather complements than substitutes to standard early-warning models. 3.4. Outlier analysis with the SOFSM

Crisis Post-crisis Fig. 6. A ﬁnancial network topology on the SOFSM, where the US is at the centre.

This enables analyzing spread of ﬁnancial instabilities from two points of views: contagion via asset-based linkages and propagation of ﬁnancial distress to similar macro-ﬁnancial conditions. On the SOFSM grid in Fig. 6, we have superimposed a network of ﬁnancial links in 2010 with the US as its center. Nodes of each

We assess whether or not, and to what extent, distances between each data vector and its BMU is an indication of ﬁnancial imbalances. The rationale for this is that large distances are by deﬁnition extreme events with respect to the data distribution on the SOFSM. Table 1 shows the predictive performance of QEs on the SOFSM with forecast horizons of 6–48 months. The table illustrates that, while the aim is conceptually different, outliers do not provide equally good means to predict ﬁnancial instabilities than contagion does. The predictive capability improves with shorter forecast horizons and yields the largest usefulness for a horizon of 6 months. The measure is most useful for a policymaker with m ¼0.7. This reﬂects the fact that the measure is binary rather than a probability on which a threshold is chosen to optimize usefulness. Moreover, as the imbalance may be located in different states of the ﬁnancial stability cycle, it provides only

1538

P. Sarlin / Engineering Applications of Artiﬁcial Intelligence 26 (2013) 1532–1539

Table 1 Predictive performance of contagion and quantization error on the SOFSM. Model

Horizon

Threshold

RP

RN

PP

PN

Accuracy

Ur(l ¼0.7)

Contagion Contagion Contagion Contagion Contagion

6 12 18 24 48

BMU BMU BMU BMU BMU

0.23 0.22 0.19 0.17 0.13

0.92 0.91 0.89 0.88 0.85

0.21 0.32 0.35 0.39 0.47

0.93 0.86 0.78 0.71 0.48

0.86 0.80 0.73 0.67 0.48

0.30 0.30 0.38 0.56 0.42

0.27 0.26 0.35 0.57 0.36

0.03 0.01 0.12 0.45 0.13

QE QE QE QE QE

6 12 18 24 48

0.65 0.66 0.66 0.63 0.71

0.49 0.48 0.54 0.72 0.53

0.66 0.69 0.71 0.71 0.79

0.12 0.23 0.30 0.35 0.44

0.93 0.87 0.86 0.92 0.84

0.65 0.65 0.67 0.71 0.73

0.19 0.17 0.13 0.11 0.05

0.02 0.00 0.03 0.06 0.12

0.38 0.40 0.44 0.49 0.56

(dj) (dj) (dj) (dj) (dj)

Ur(l ¼0.8)

Ur(l ¼0.9)

AUC – – – – – 0.56 0.55 0.60 0.73 0.69

Notes: The table reports results for contagion and quantization errors (QEs) on the SOFSM. QE is a percentile transformation of dj and contagion neighborhood is the BMUs of previous pre-crisis periods. The following measures are reported: TP (true positives), FP (false positives), TN (true negatives), FN (false negatives), precision positives (PP) ¼ TP/(TP þFP), recall positives (RP) ¼TP/(TPþ FN), precision negatives (PN) ¼ TN/(TN þ FN), recall negatives (RN) ¼TN/(TN þFP), accuracy ¼ (TP þTN)/(TPþ TNþ FP þ FN), AUC ¼area under the receiver operating characteristic curve (not computed for contagion as it is binary), and Usefulness (Ur(m))¼ [Min(mP1,P2 (1 m)) (mT1P1 þ(1 m)T2P2)]/Min(mP1,P2(1 m)), where m stands for relative preferences between false alarms and missing crises. For QE, the threshold is chosen as for optimal usefulness. Best values per measure and method are bolded.

information of possible impending instabilities rather than information on the exact timing of a crisis.

4. Conclusions This paper has enhanced the exploitation of the SOFSM by the means of four tasks: (1) fuzziﬁcation of the map, (2) probabilistic modeling of state transitions, (3) contagion analysis and (4) outlier detection. To illustrate the usefulness of the extensions, we have provided some sample visualizations and tested their predictive performance. This could, however, be extended along several directions. For transition probabilities, different time spans (e.g., moving averages) and regions (e.g., early warning nodes) may also be applied, as appropriate for the given task. Neighborhood contagion could as well use different deﬁnitions of neighborhood functions. When detecting outliers, one could decide to solely focus on imbalances in speciﬁc locations, such as ﬁnancial stability states. These questions are, however, dependent upon the questions being asked and the task one is attempting to solve, and could hence be varied within the framework in this paper. The present paper does not, however, deal with changes in cluster structures. A limitation is hence the assumption of a stationary ﬁnancial stability cycle. An important issue in earlywarning modeling is the changing nature of crises due to, e.g., ﬁnancial innovation. Hence, a key question remains: How should models be adapted to events that potentially surpass historical experience? These suggestions, among other things, set the directions for future research and work in progress. References Alessi, L., Detken, C., 2011. Quasi real time early warning indicators for costly asset price boom/bust cycles: a role for global liquidity. Eur. J. Polit. Econ. 27 (3), 520–533. Borio, C., Lowe, P., 2002. Asset Prices, Financial and Monetary Stability: Exploring the Nexus. BIS Working Papers, No. 114. Borio, C., Lowe, P., 2004. Securing Sustainable Price Stability: Should Credit Come Back from the Wilderness? BIS Working Papers, No. 157. Boulet, R., Jouve, B., Rossi, F., Villa, N., 2008. Batch kernel SOM and related Laplacian methods for social network analysis. Neurocomputing 71 (7–9), 1257–1273. Cheng, C.T., Wang, W.C., Xu, D.M., Chau, K.W., 2008. Optimizing hydropower reservoir operation using hybrid genetic algorithm and chaos. Water Resour. Manage. 22 (7), 895–909. Cottrell, M., Letre´my, P., 2005. Missing values: processing with the Kohonen algorithm. In: Proceedings of Applied Stochastic Models and Data Analysis (ASMDA 2005), Brest, pp. 489–496.

Dattels, P., McCaughrin, R., Miyajim, K., Puig, J., 2010. Can You Map Global Financial Stability? IMF Working Paper, WP/10/145. Deboeck, G., Kohonen, T., 1998. Visual Explorations in Finance with SelfOrganizing Maps. Springer-Verlag, Berlin. ¨ Demirguc-Kunt, A., Detragiache, E., 2000. Monitoring banking sector fragility: a multivariate logit approach. World Bank Econ. Rev. 14, 287–307. ECB, 2011. Mapping the state of ﬁnancial stability. In: ECB (Ed.), Financial Stability Review, December 2011, pp. 149–157. Fuertes, J.J., Domı´nguez, M., Reguera, P., Prada, M.A., Dı´az, I., Cuadrado, A.A., 2010. Visual dynamic model based on self-organizing maps for supervision and fault detection in industrial processes. Eng. Appl. Artif. Intell. 23 (1), 8–17. IMF, 2012. Global Financial Stability Report. International Monetary Fund, USA, Washington D.C. Jia, W., Ling, B., Chau, K.W., Heutte, L., 2008. Palmprint identiﬁcation using restricted fusion. Appl. Math. Comput. 205 (2), 927–934. Kohonen, T., 1982. Self-organized formation of topologically correct feature maps. Biol. Cybern. 66, 59–69. Kohonen, T., 2001. Self-Organizing Maps. Springer-Verlag, Berlin. Lin, J.Y., Cheng, C.T., Chau, K.W., 2006. Using support vector machines for longterm discharge prediction. Hydrol. Sci. J. 51 (4), 599–612. Lo Duca, M., Peltonen, T.A., 2012. Assessing systemic risks and predicting systemic events. J. Bank Financ. Luyssaert, S., Sulkava, M., Raitio, H., Hollme´n, J., 2004. Evaluation of forest nutrition based on large-scale foliar surveys: are nutrition proﬁles the way of the future? J. Environ. Monit. 6 (2), 160–167. Mayer, R., Abdel Aziz, T., Rauber, A., 2007. Visualising class distribution on selforganising maps. In: Proceedings of the International Conference on Artiﬁcial Neural Networks (ICANN’07), Porto, Portugal, September 9–13. Muttil, N., Chau, K.W., 2007. Machine learning paradigms for selecting ecologically signiﬁcant input variables. Eng. Appl. Artif. Intell. 20 (6), 735–744. Resta, M., 2009. Early warning systems: an approach via self organizing maps with applications to emergent markets. In: Proceedings of the 18th Italian Workshop on Neural Networks, pp. 176–184, IOS Press, Amsterdam. Sarlin, P., 2012a. Chance discovery with self-organizing maps: discovering imbalances in ﬁnancial networks. In: Ohsawa, Y., Abe, A. (Eds.), Advances in Chance Discovery. Springer Verlag, Berlin, pp. 49–61. Sarlin, P., 2012b. Data and Dimension Reduction for Visual Financial Performance Analysis. TUCS Technical Report No. 1049. Sarlin, P., 2013. On Policymakers’ Loss Functions and the Evaluation of Early Warning Systems. Econ. Lett. 119 (1), 1–7. Sarlin, P., Eklund, T., 2011. Fuzzy clustering of the self-organizing map: some applications on ﬁnancial time series. In: Proceedings of the 8th International Workshop on Self-Organizing Maps, Springer, Helsinki, pp. 40–50. Sarlin, P., Eklund, T. Financial performance analysis of European Banks using a fuzziﬁed self-organizing map. Int. J. Knowledge-Based Intell. Eng. Syst, forthcoming. Sarlin, P., Marghescu, D., 2011. Visual predictions of currency crises using selforganizing maps. Intell. Sys. Acc. Fin. Mgmt. 18 (1), 15–38. Sarlin, P., Peltonen, T.A., 2011. Mapping the State of Financial Stability. ECB Working Paper, No. 1382. Sarlin, P., Yao, Z., Eklund, T., 2012. Probabilistic modeling of state transitions on the self-organizing map: some temporal ﬁnancial applications. Intell. Syst. Acc. Fin. Mgmt. 19 (1), 189–203. Saunders, R., Gero, J.S., 2001. Designing for interest and novelty: motivating design agents. In: de Vries, B., van Leeuwen, J., Achten, H. (Eds.), CAADFutures 2001. Kluwer, Dordrecht, pp. 725–738. Vesanto, J., Alhoniemi, E., 2000. Clustering of the self-organizing map. IEEE Trans. Neural Networks 11 (3), 586–600.

P. Sarlin / Engineering Applications of Artiﬁcial Intelligence 26 (2013) 1532–1539

Vesanto, J., Himberg, J., Siponen, M., Simula, O., 1998. Enhancing SOM based data visualization. In: Proceedings of the International Conference on Soft Computing and Information/Intelligent Systems (IIZUKA’98), Iizuka, Japan, pp. 64–67. Sulkava, M., Hollme´n, J., 2003. Finding proﬁles of forest nutrition by clustering of the self-organizing map. In: Proceedings of the Workshop on Self-Organizing Maps, September 11–14, Hibikino, Kitakyushu, Japan.

1539

Xie, J.X., Cheng, C.T., Chau, K.W., Pei, Y.Z., 2006. A hybrid adaptive time-delay neural network model for multi-step-ahead prediction of sunspot activity. Int. J. Environ. Pollut. 28 (3-4), 364–381. Zhang, J., Chau, K.W., 2009. Multilayer ensemble pruning via novel multisub-swarm particle swarm optimization. J. Univers. Comput. Sci. 15 (4), 840–858.

Exploiting the self-organizing financial stability map

Exploiting the self-organizing financial stability map

Recommend Documents