Engineering Applications of Artificial Intelligence 26 (2013) 1532–1539
Contents lists available at SciVerse ScienceDirect
Engineering Applications of Artificial Intelligence journal homepage: www.elsevier.com/locate/engappai
Exploiting the self-organizing financial stability map Peter Sarlin n ˚ Akademi University, Joukahaisenkatu 3–5, 20520 Turku, Finland Department of Information Technologies, Abo
a r t i c l e i n f o
a b s t r a c t
Article history: Received 9 July 2012 Received in revised form 5 November 2012 Accepted 7 January 2013 Available online 5 February 2013
This paper enhances the visualization and extraction of information on the self-organizing financial stability map (SOFSM). The SOFSM uses the self-organizing map to represent a multidimensional financial stability space on a two-dimensional grid and allows monitoring economies in the financial stability cycle represented by four states. The SOFSM lacks, however, means for thorough assessment of general structures and individual data. We enhance the visualization and information extraction of the SOFSM by the means of four tasks: (1) fuzzification of the map, (2) probabilistic modeling of state transitions, (3) contagion analysis and (4) outlier detection. The usefulness of the extensions is shown with sample visualizations and predictive performance. & 2013 Elsevier Ltd. All rights reserved.
Keywords: Financial crisis Self-organizing financial stability map Fuzzification Transition probabilities Contagion Outlier detection
1. Introduction Machine learning and artificial intelligence techniques are common means for prediction, modeling and intelligence augmentation in a wide range of engineering applications (e.g., Cheng et al., 2008; Fuertes et al., 2010; Jia et al., 2008; Lin et al., 2006; Muttil and Chau, 2007; Xie et al., 2006; Zhang and Chau, 2009). Yet, the application of such techniques in financial modeling, such as financial stability surveillance, is still in its early beginning. In particular, visual exploration of high-dimensional data, while being an important task for the understanding of data structures, is oftentimes a neglected part in financial modeling. A starting point to this has been set by two attempts to map the state of financial stability: a spider-chart published by the International Monetary Fund (IMF) (Dattels et al., 2010; IMF, 2012) and a self-organizing financial stability map (SOFSM) published by the European Central Bank (ECB) (Sarlin and Peltonen, 2011; ECB, 2011). This paper focuses on enhancing the SOFSM that employs a self-organizing map (SOM) (Kohonen, 1982, 2001) based method for mapping the state of financial stability onto a two-dimensional plane.1 The SOFSM allows monitoring economies in the financial stability cycle represented by four stages: pre-crisis, crisis, post-crisis and tranquil states. It uses
n
Corresponding author. Tel.: þ358 22154670; fax: þ 3582 215 4809. E-mail address: psarlin@abo.fi 1 SOMs have been applied in a wide range of topics, e.g., on currency crisis Sarlin and Marghescu (2011) and general economic (Resta, 2009) indicators. In addition, see Deboeck and Kohonen (1998) for an overview of applications in finance and Sarlin (2012b) for a comparison on using data and dimension reduction methods for financial performance analysis. 0952-1976/$ - see front matter & 2013 Elsevier Ltd. All rights reserved. http://dx.doi.org/10.1016/j.engappai.2013.01.002
the SOM to perform a neighborhood-preserving projection onto a grid of nodes with a predefined regular shape. The predefined grid shape facilitates using the SOFSM as a display or basis for additional information, such as visualizing individual data samples or general structural properties of data. This paper attempts to enhance the exploitation of the SOFSM by further developing the visualization and extraction of information on the map. In Sarlin and Peltonen (2011), the SOFSM was not only used for visualizing cross-sectional and temporal data on the two-dimensional display, but also for non-linear correlation hunting in relation to all four states, as well as evaluated in terms of out-of-sample predictive capabilities. It was not, however, paired with other techniques for enhanced visualization, information extraction and financial stability surveillance. This paper uses the SOFSM as its basis and enhances the exploitation from four perspectives:
1. Fuzzification of the SOFSM for visualizing temporal belongingness to financial stability states of individual data and class distance structures on the map. 2. Probabilistic modeling of state transitions on the SOFSM for visualizing probabilities of transition to financial stability states of individual data and for assessing the general cyclical and temporal structure of the financial stability cycle. 3. Using neighborhoods on the SOFSM and superimposed portfolio network topologies to assess the spread of financial distress and shock propagation. 4. Computing distances between data and their mean profiles on the SOFSM to find extreme events and imbalances in economies’ macro-financial conditions.
P. Sarlin / Engineering Applications of Artificial Intelligence 26 (2013) 1532–1539
First, we fuzzify and classify the SOFSM with a distance-based metric. Instead of applying a crisp clustering algorithm on the units, as is commonly done (see, e.g., Vesanto and Alhoniemi, 2000), we use perfect financial stability states derived from classes in data as representative cluster centers. Then, an adapted version of the membership functions in Cottrell and Letre´my (2005) and Sarlin and Eklund (forthcoming) is used to compute class memberships. By a defuzzification of the results using the maximum-membership method, we also provide a crisp clustering. This enhances the visualization capability by enabling assessment of temporal belongingness to the financial stability states, where the states are expressed by representative cluster centers and fluctuations in macro-financial conditions are represented by the temporal variation of belongingness. In addition, visualizations of the memberships on a SOM grid enable assessment of the class structures. Second, we assess temporal variation on the SOFSM. Following previous works on transition probabilities on the SOM (Sulkava and Hollme´n, 2003; Luyssaert et al., 2004; Fuertes et al., 2010; Sarlin et al., 2012), we exploit the temporal patterns by computing, summarizing and visualizing probabilities of future state transitions. This enables not only assessing the most likely future state transitions for each node, and thus also pairing to individual data, but also country profiling of low- and high-risk financial stability states. Most notably, while this validates the assumed financial stability cycle, intra cluster differences in transition patterns show that the cycle is not entirely well-behaving or continuous. Third, we assess contagion effects on the SOFSM. We analyze shock propagation from two perspectives: neighborhoods on the SOFSM and a superimposed portfolio network topology. While the former approach was suggested already in the stand-alone SOFSM, this paper extends the concept by evaluating the value added in terms of predictive performance. The latter perspective on contagion applies the approach in Sarlin (2012a) by combining a portfolio network topology with a financial stability topology, i.e., superimposing a network of bilateral portfolio exposures on the SOFSM. This enables two approaches for analyzing the spread of financial instabilities: the financial stability topology indicates propagation of financial distress to similar macro-financial conditions, while the portfolio network topology indicates contagion through asset-based direct linkages. Fourth, we assess whether or not, and to what extent, distances between each datum and its mean profile on the SOFSM (best-matching unit, BMU) is an indication of financial imbalances. The rationale for this is that the units of the SOM, while being topologically ordered, tend to approximate the probability density function of data (Kohonen, 2001). The distance to a BMU represents thus the fit of a single datum to the multivariate data distribution, i.e., its degree of extremity. When monitoring financial stability, one could assume that large distances represent financial imbalances in macro-financial conditions. Our approach goes beyond applying a pre-defined threshold value on the distance to assess whether or not a datum is an outlier (see, e.g., Vesanto et al. (1998) and Saunders and Gero (2001)), by setting the threshold to optimize predictive performance. Out of the above approaches, we show the usefulness of the fuzzification, transition probabilities and portfolio network topology with sample visualizations on or together with the SOFSM. The usefulness of neighborhood contagion and outlier analysis is tested in terms of their performance in predicting financial crises. The paper is structured as follows. Section 2 introduces the data, the SOM, the fuzzification, transition probabilities, contagion analysis, outlier detection and the evaluation scheme. We present the enhanced visualization and extraction of information on the SOFSM in Section 3, and conclude in Section 4 by presenting our key findings, limitations and future research directions.
1533
2. Methodology This section presents the data, methods and evaluations used in this paper. 2.1. Data and pre-processing The data used in this paper are of two sorts: macro-financial indicators with class information and asset-based linkages. The first type of data follow the dataset in Lo Duca and Peltonen (2012) and consist of a set of vulnerability measures commonly used in the macroprudential literature (see, e.g., Alessi and Detken, 2011; Borio and Lowe, 2002, 2004) and binary class information representing pre-crisis, crisis, post-crisis and tranquil periods. The quarterly dataset consists of 28 countries, 10 advanced and 18 emerging economies, spanning from 1990: 1–2010:3, that is a panel dataset consisting of both a crosssectional and a temporal dimension. Hence, a data vector xjAR18 is formed of a class vector xj(cl)AR4 and an indicator vector xj(in)AR14 for each quarter and country in the sample. To control for cross-country differences and fixed effects, each column of the input data is transformed into its historic country-specific percentile distribution. The second types of data consist of a network of financial linkages. Portfolio exposures are based upon external assets (equities and bonds) as reported in the Coordinated Portfolio Investment Survey by the IMF. It is worth noting that exposures of central banks are not included due to the different nature of their holdings. 2.2. Self-organizing maps (SOMs) The SOM (Kohonen, 1982) is a technique with both clustering and projection capabilities that performs a neighborhoodpreserving projection onto a grid of nodes with a predefined regular shape. As the SOM algorithm is well-known and the main emphasis is on exploiting additional information on the SOM, we do not present details of it here—for details see Kohonen (2001). For its excellent visual representation, the Viscovery SOMine 5.1 package is used as a basis in this study. After an initialization of the reference vectors as to the two principal components, the training process follows two steps. First each input data vector xj (where j ¼1,2,y,N) is paired with its best-matching unit (BMU) mb of all reference vectors mi (where i¼1,2,y,M) using Euclidean distance. The second step adjusts each reference vector mi with the batch updating formula: N P
mi ðt þ 1Þ ¼
hibðjÞ ðtÞxj
j¼1 N P
,
ð1Þ
hibðjÞ ðtÞ
j¼1
where t is a discrete time coordinate and hib(j) is a decreasing function of neighborhood radii and time. However, we employ a semi-supervised version of the SOM by using data vector xjAR18, formed of a class vector xj(cl) and an indicator vector xj(in), in both steps of training. It is important noting the use of the class and indicator vectors when mapping data onto the SOM. After training a SOM model, only the indicator vector xj(in) is used for locating data to their BMUs (mb(in)). Then, the class values of their BMU (mb(cl)) are used as estimates of the current state. The two-dimensional SOM grid describes the topology of a multidimensional space. As all information on the map cannot be visualized in two dimensions, additional information can be shown on own grids. The predefined grid shape of the SOM facilitates this type of visualization linking. For instance, a socalled feature plane can be used for showing the spread of an
1534
P. Sarlin / Engineering Applications of Artificial Intelligence 26 (2013) 1532–1539
indicator on the SOM, where cold colors (blue) indicate low and warm colors (red) high values. As these are different views of the same grid, a unique point represents always the same node.
2.3. Fuzzification of SOMs The SOM is commonly partitioned using a crisp clustering technique (see, e.g., Vesanto and Alhoniemi, 2000). However, as we not only have class information, but also utilize a semisupervised SOM with the classes in the ordering process, we do not have to estimate clusters and their centroids. We adapt a fuzzification of the SOM proposed in Sarlin and Eklund (2011, forthcoming) for computing class memberships. Thus, we can classify and fuzzify the SOM based on the class vectors xj(cl)AR4 by assuming the following: the number of states C equals the number of classes K, i.e., C¼K, and the state center ck (where k¼1,2,y,C) for each class is a perfect representative state vector, i.e., 1 if k equals the state of ck : ð2Þ 8k ckðclÞ ¼ 0 otherwise While there exist other methods for class visualization on the SOM, such as Voronoi regions (Mayer et al., 2007), they fall short in dealing with partial truth, uncertainty and imprecision in class memberships. We follow the approach in Sarlin and Eklund (forthcoming) by computing a membership degree using Euclidean distances between nodes and state centers. The fuzzification resembles that in Cottrell and Letre´my (2005), by being implemented on second-level state centers instead of directly on the nodes, by not assuming inverse exponential distances and by introducing a fuzzification parameter. In addition, our aim is a visualization rather than a method for imputing missing values which is the key focus of Cottrell and Letre´my. Our approach differs from that in Sarlin and Eklund (forthcoming) by only using xj(cl)AR4 for computing the distances. The rationale for this is the focus on distances between mean profiles of classes rather than those between indicators. The SOM is fuzzified by computing the inverse distance between reference vector mi(cl) and each state center ck(cl): uik ¼
1 2
1 þ 99miðclÞ ckðclÞ 99y1
,
ð3Þ
where yA(1,N) is a similar fuzzifier as the one used in fuzzy C-means clustering. The fuzzifier y affects the distance metric by setting the extent of overlap between the states, where low values provide crisp results and high fuzzy results. The fuzzifiers y ¼2 and y ¼3 can be seen as benchmark values by resulting in simple and squared Euclidean distances. Eq. (3) extends the approach in Sarlin and Eklund (forthcoming) by also adding a 1 to the denominator to be defined for all distances. We normalize the similarity matrix uik to the following matrix of state membership degrees for each node: vik ¼
uik , C P uik
ð4Þ
k¼1
P to fulfill the probabilistic constraint Ck ¼ 1 vik ¼ 1: In addition to computing membership degrees, we follow previous studies by defuzzifying the results using the maximum-membership method. The crisp states on the SOM can be shown by contours, while nodes’ membership degrees to each state can be visualized on own grids per state, i.e., membership planes. Further, the evolution of individual data can be visualized using line graphs by associating each datum with the membership degrees of its BMU.
2.4. Transition probabilities on SOMs The standard SOM paradigm does not explicitly address the issue of temporality on the grid. While trajectories have been a common means to illustrate temporal movements, small samples give no indication of overall patterns and large clutter the display. A more general probabilistic model of the temporal variations in a SOM model has been introduced through node-to-node transition probabilities (Sulkava and Hollme´n, 2003; Luyssaert et al., 2004; Fuertes et al., 2010). Recently, node-to-cluster transition probabilities have been proposed to be computed on SOMs with a second-level clustering and visualized on their own SOM grids (Sarlin et al., 2012). Movements between nodes on the twodimensional SOM are used to compute probabilities of switching from one place to another in a specified time period. In general, we can compute transition probabilities of switches from each node to every other node. To facilitate visualization, we focus on node-to-state switches by computing pik, where the transition refers to movements from reference vector i to state k: pik ðt þ sÞ ¼
nik ðt þsÞ : C P nik ðt þsÞ
ð5Þ
k¼1
where nik is the cardinality of data switching from mi to ck, t is a time coordinate and s is the time span for the switch. The probability of switching to a particular state can again be associated with each of the SOM nodes, and linked to the SOM visualization. That is, we visualize the probabilities on their own transition planes, where color codes show the probability of transition. To summarize the most likely switches, we show maximum transition probabilities (maxk ðpik Þ) conditional on switching on an own transition plane, where labels show location and color probability of transition. The evolution of individual data can again be associated to the transition probability of their BMU and visualized using line graphs. 2.5. Shock propagation on SOMs The process of some event being transmitted to another entity goes by different names, such as contagion, shock propagation or the spread of an event. We analyze this occurrence through two transmission channels: links between entities and similarities in indicator values. The first approach follows that in Sarlin (2012a) by superimposing a network of bilateral exposures on the SOM. While the standard SOM provides a, for example, twodimensional mapping of input data, the SOM does not per se account for relationships or links in data. Visual clustering of network data has been directly performed by applying a standalone SOM on data relationships (Boulet et al., 2008). However, we extend the standard SOM with a superimposed network for simultaneous assessment of dependencies in data, such that positions are set by the indicator-based SOM while links are set by the relationships between the entities. Networks of relationships are mostly expressed in matrix form, where the link between entities g and l in matrix A is represented by element agl. The matrix is of size n2, where n is the number of entities. Matrices of directed graphs, as the one we use, can be read in two directions: rows of A represent the relationship of g to l and columns of A represent the relationship of l to g. To combine SOMs and network analysis, we superimpose network relations on top of the standard SOM grid by visualizing relationships between entities. Labels of entities under analysis, say g and l, are projected to their BMUs of the SOM based upon their data xj ¼ g and xj ¼ l. After that, relations between entities g and l are visualized by edges between the locations of the BMUs of xg and xl on the SOM grid using elements agl and alg.
P. Sarlin / Engineering Applications of Artificial Intelligence 26 (2013) 1532–1539
The second approach measures neighborhood effects on the SOM, as previously suggested but not quantitatively tested in the SOFSM (Sarlin and Peltonen, 2011). First, we assign one of the classes K to be the event of interest. Then, instead of using the associated class vector (pre-crisis class) based upon historical data as an estimate of probable future events, we assign the locations of the event of interest in period t to be signals of similar events in that location in period tþs, where s is the time span for transmission. 2.6. Outlier analysis with SOMs The SOM paradigm is well suited for outlier detection. The nodes mi of the SOM, while being topologically ordered, tend to approximate the probability density function of data p(x) (Kohonen, 2001). The standard quantization error (QE) can be seen as the correspondence between the nodes and the data. However, a more meaningful estimate of event rarity is computing the distance of individual data xj to their BMUs mb. Due to the property of approximating the probability density function, the individual QE represents the fit of a single datum to the multivariate data distribution. An outlier, and its degree of extremity, can thus be estimated by the distance to the SOM in a multidimensional setting, i.e., dj ¼ 99xjðinÞ mbðinÞ 99. The literature has commonly applied a pre-defined threshold value on the distance to assess whether or not a datum is an outlier (see, e.g., Vesanto et al. (1998) and Saunders and Gero (2001)). The approach in the present paper goes, however, beyond using a pre-defined threshold by setting the threshold to optimize predictive performance. 2.7. Performance evaluation The performance of the extensions, in particular contagion and outliers, is evaluated with a Usefulness measure that incorporates the preferences of a policymaker. We assume the predictive model to issue a signal when the estimated probability of a crisis is above a specified threshold. The signals and non-signals are then collected into a contingency matrix comprising of true positives (TP), true negatives (TN), false positives (FP) and false negatives (FN). In this paper, we follow the approach proposed in Sarlin (2013) by maximizing the usefulness for a policymaker with specific preferences about giving false alarms (type I errors) and missing crises (type II errors) when calibrating an optimal model and threshold for policy action. The used approach extends ¨ the measures used in Demirguc-Kunt and Detragiache (2000) and Alessi and Detken (2011). We derive the following loss function for a policymaker: LðmÞ ¼ mT 1 P 1 þð1mÞT 2 P 2 ,
ð6Þ
where the parameter m sets the policymaker’s relative preferences between missing crises and giving false alarms, type I errors T1A[0,1]¼FN/(TPþ FN) represent the proportion of missed crises relative to the number of crises in the sample, type II errors T2A[0,1]¼FP/(FPþTN) the proportion of false alarms relative to the number of tranquil periods in the sample, and unconditional probabilities P1 and P2 correspond to the frequency of pre-crisis and tranquil times. Thus, the preference parameter m defines the risk-aversion between type I and II errors of the policymaker. The policymaker is equally concerned of missing crises and issuing false alarms for a value of m ¼0.5, more concerned about issuing false alarms when m o0.5, and more concerned about missing crises when m 40.5. In this paper, we follow the preferences in the SOFSM by being substantially more concerned of missing a crisis (m ¼0.8), which also reflects the imbalanced relative costs between missing a crisis and giving a false alarm. Following Sarlin (2013), the usefulness of one specific threshold is derived by subtracting L(m) from the expected value of a
1535
best-guess given the preferences and unconditional probabilities ðminðmP1 ,P 2 ð1mÞÞLðmÞÞ. Hence, we can define the relative Usefulness by computing the share of usefulness to the available usefulness of the model (minðmP 1 ,P 2 ð1mÞÞ): minðmP1 ,P 2 ð1mÞÞLðmÞ , Ur m ¼ minðmP 1 ,P2 ð1mÞÞ
ð7Þ
where the Usefulness of the model is compared with the loss of disregarding the model and divided by the available usefulness. Thereby, the interpretation of the relative Usefulness Ur(m) is the percentage gain relative to not using a model. Finally, the performance of a model is solicited by choosing a threshold on the probability of a crisis such that usefulness Ur is maximized.
3. Exploiting the self-organizing financial stability map The SOFSM uses the SOM for mapping the state of financial stability onto a two-dimensional plane. It uses as inputs a set of indicators commonly used in the macroprudential literature and class information representing pre-crisis, crisis, post-crisis and tranquil periods. Sarlin and Peltonen (2011) put forward a technical framework for building a two-dimensional financial stability map, evaluate it and test its robustness, as well as show its usefulness for visualizing data. The parameters of the SOFSM are tuned with respect to three aspects: (1) parsimonity and generalization, (2) objective learning on in-sample data, and (3) visualization capabilities. The SOFSM and trajectories for the United States (US) and Euro area conditions can be found in Fig. 1a, and sample feature planes in Fig. 1b. This section applies the extensions described in Section 2 to the SOFSM for the following four tasks: (1) fuzzification of the map, (2) probabilistic modeling of state transitions, (3) contagion analysis and (4) outlier detection. The extensions of the SOFSM, and their usefulness, are illustrated with sample visualizations and predictive performance. 3.1. Fuzzification of the SOFSM We employ the distance- and class-based fuzzification of the SOFSM. Class information is accounted for by setting the number of states equal to the number of classes, i.e., four, and their centers as perfect states of the financial stability cycle: pre-crisis, crisis, post-crisis and tranquil states. The choice of C ¼4 is hence not a free parameter as it is specified from the number of classes K. We test different specifications of the fuzzifier (y ¼{1.0,1.2,...,5.0}), but end up using squared Euclidean distances (y ¼2) since that allows for fuzzy, but not entirely erased, state borders. While the differences between the most extreme choices are significant, the results are stable for values close to y ¼2. In Fig. 2, memberships to each state are shown on membership planes, where also the defuzzified crisp states are shown by contour lines (as well as on all other grids, e.g., Fig. 1). The crispest part is the upper right corner of the tranquil state, whereas the rest have more overlap. Fig. 3 shows how the fuzzification can be turned into line graphs for the trajectories in Fig. 1, where membership in states and their variation over time represent fluctuation in the current state of financial stability. The line graphs clearly depict increases in memberships in pre-crisis states prior to crises. 3.2. Transition probabilities on the SOFSM We exploit temporal variation on the SOFSM by employing probabilistic modeling of state transitions. The transition probabilities are computed for node-to-state switches and visualized on own transition planes (Fig. 4), and summarized as maximum transition probabilities conditional on switching, where labels
1536
P. Sarlin / Engineering Applications of Artificial Intelligence 26 (2013) 1532–1539
US 2004-05
Pre-crisis
US 2006
Tranquil US 2002 2010
US 2007
Euro 2002
Euro 2006 Euro 2007
US 2008-09
Euro 2008
Real GDP growth
Real credit growth
CA deficit
Government deficit
Global inflation
Global leverage
Global equity valuation
Pre-crisis periods
Euro 2004-05
Euro 2010
Post-crisis US 2003
Crisis
Inflation
Euro 2003
Euro 2009
Fig. 1. SOFSM and trajectories for the US and Euro area conditions (a), and feature planes (b). Notes: The feature planes in (b) are layers of the SOM grid in (a) and show the spread of indicators and classes on the grid. While the first eight feature planes represent sample indicators, the last feature plane shows the spread of a class, the pre-crisis periods.
Tranquil
Pre crisis
Crisis
Post crisis
0.2 0.3 0.3 0.4 0.5 0.6 0.7 0.7 0.8 0.9 1.0 0.00 0.05 0.10 0.14 0.19 0.24 0.29 0.33 0.38 0.43 0.00 0.05 0.09 0.14 0.18 0.23 0.27 0.32 0.36 0.41 0.00 0.04 0.09 0.13 0.17 0.22 0.26 0.30 0.35 0.39
Fig. 2. Membership planes for the SOFSM, where X-marks represent state centers.
State memberships for US from 2002-2011
State memberships
1.0 Tranquil Pre-crisis Crisis Post-crisis
0.8 0.6 0.4 0.2 0.0 2002
2003
2004
2005
2006
2007
2008
2009
2010
2011
Time t State memberships for the euro area from 2002-2011
State memberships
1.0 Tranquil Pre-crisis Crisis Post-crisis
0.8 0.6 0.4 0.2
show location and color probability. We have tested a wide range of time spans (s¼{6,12,18,24,48}), but chose for analysis a time span of 18 months (s¼18). Hence, transition probabilities represent the likelihood of switching to a state within 18 months. The rationale behind choosing s ¼18 is that the SOFSM is also calibrated for optimal performance in terms of predicting vulnerable states 18 months prior to a crisis. Moreover, the transition patterns are considerably robust to changes in s. The length of movements increases with increases in s, as expected, while the directions of movements are stable. Most notably, while the transition patterns validate the assumed financial stability cycle, the cycle is shown not to be entirely wellbehaving or continuous. For instance, the SOFSM shows high probability of transition to the crisis state on the border between the tranquil and pre-crisis states, as well as during extreme tranquil times. We also perform a similar line graph representation as that for the fuzzification, but instead with indications of future states. Fig. 5, while depicting probabilities according to the financial stability cycle, illustrates the crisis indications in the extreme part of the tranquil state and a rise of new instabilities in 2010.
0.0 2002
2003
2004
2005
2006
2007
2008
2009
2010
2011
3.3. Shock propagation on the SOFSM
Time t Fig. 3. Line graphs of US and Euro area state membership degrees (see trajectory in Fig. 1).
We assess transmission of shocks on the SOFSM with two methods: a portfolio network topology and SOM neighborhoods.
P. Sarlin / Engineering Applications of Artificial Intelligence 26 (2013) 1532–1539
Tranquil
Pre crisis
Crisis
Post crisis
1537
Summary (t+{1,2,...,6}) 3 3 3 3 1 2 2 3 3 3 1 3 3 3 3 2 2 2 3 3 3 3 1 3 3 3 3 2 2 2 3 3 3 3 3 3 3 3 2 2 2 3 3 3 3 3 3 3 2 1 2 3 3 3 3 2 3 2 1 3 2 3 3 2 3 2 1 2 2 2 1 2 1 2 2 1 1 1 2 2 1 1 1 1 1 1 1 1 2 1 1 1 1 1
Fig. 4. Transition planes and summarized maximum state transitions for the SOFSM. Notes: In the summary grid, 1–4 stands for tranquil, pre-crisis, crisis and post-crisis states and E for empty nodes, respectively.
Transition probabilities for US from 2002-2011 Tranquil Pre-crisis Crisis Post-crisis
Transition probabilities
1.0 0.8 0.6 0.4 0.2 0.0 2002
2003
2004
2005
2006
2007
2008
2009
2010
2011
Time t
Transition probabilities for the euro area from 2002-2011 Transition probabilities
1.0
Tranquil Pre-crisis Crisis Post-crisis
0.8 0.6 0.4 0.2 0.0 2002
2003
2004
2005
2006
2007
2008
2009
2010
2011
Time t Fig. 5. Line graphs of US and Euro area transition probabilities (see trajectory in Fig. 1).
Tranquil
Pre-crisis
economy are located in their BMUs and their size is scaled as to the sum of exposures to other economies. The thickness of the edges represents the size of external exposure to total exposures per economy, where the color of the edge indicates the address of the exposure holder. Indeed, Fig. 6 combines the state of financial stability, or probability of a crisis, with the systemic relevance of each economy. The size of financial linkages to high-risk economies (e.g., Brazil and Mexico) enlighten about both past and present: high levels of previous financial stress in the US may have impacted their current state and they still have a high risk of current and future shock propagation from the US. While crises are often transmitted through asset-based contagion channels, such as financial linkages, they may also be propagated through similarities in macro-financial conditions. Independent of location on the map, an economy in the neighborhood of countries in crisis could through contagion experience a similar wave of financial distress. This is particularly important when dealing with data of changing nature, as it does not necessitate macro-financial conditions to follow patterns during historical crises. Thus, an economy in the upper left part of Fig. 6, say Mexico, could propagate financial instabilities to countries with similar macro-financial vulnerabilities, e.g., Argentina and Brazil. While this is particularly useful for visual real-time surveillance, we can also test this by letting locations of crises in period t be signals of crises in that location in period t þ1. More precisely, we create a leading indicator that signals a crisis in node mb in period tþ 1 if a country that experienced a crisis in t was located in mb in period t. Table 1 shows the predictive performance of neighborhoods on the SOFSM with forecast horizons of 6–48 months, where a horizon of 24 months outperforms the rest. A policymaker with m ¼0.8 derives the largest usefulness. While the table confirms the usefulness of detecting transmission of crisis, the nature of the contagion measures suggest that they are rather complements than substitutes to standard early-warning models. 3.4. Outlier analysis with the SOFSM
Crisis Post-crisis Fig. 6. A financial network topology on the SOFSM, where the US is at the centre.
This enables analyzing spread of financial instabilities from two points of views: contagion via asset-based linkages and propagation of financial distress to similar macro-financial conditions. On the SOFSM grid in Fig. 6, we have superimposed a network of financial links in 2010 with the US as its center. Nodes of each
We assess whether or not, and to what extent, distances between each data vector and its BMU is an indication of financial imbalances. The rationale for this is that large distances are by definition extreme events with respect to the data distribution on the SOFSM. Table 1 shows the predictive performance of QEs on the SOFSM with forecast horizons of 6–48 months. The table illustrates that, while the aim is conceptually different, outliers do not provide equally good means to predict financial instabilities than contagion does. The predictive capability improves with shorter forecast horizons and yields the largest usefulness for a horizon of 6 months. The measure is most useful for a policymaker with m ¼0.7. This reflects the fact that the measure is binary rather than a probability on which a threshold is chosen to optimize usefulness. Moreover, as the imbalance may be located in different states of the financial stability cycle, it provides only
1538
P. Sarlin / Engineering Applications of Artificial Intelligence 26 (2013) 1532–1539
Table 1 Predictive performance of contagion and quantization error on the SOFSM. Model
Horizon
Threshold
RP
RN
PP
PN
Accuracy
Ur(l ¼0.7)
Contagion Contagion Contagion Contagion Contagion
6 12 18 24 48
BMU BMU BMU BMU BMU
0.23 0.22 0.19 0.17 0.13
0.92 0.91 0.89 0.88 0.85
0.21 0.32 0.35 0.39 0.47
0.93 0.86 0.78 0.71 0.48
0.86 0.80 0.73 0.67 0.48
0.30 0.30 0.38 0.56 0.42
0.27 0.26 0.35 0.57 0.36
0.03 0.01 0.12 0.45 0.13
QE QE QE QE QE
6 12 18 24 48
0.65 0.66 0.66 0.63 0.71
0.49 0.48 0.54 0.72 0.53
0.66 0.69 0.71 0.71 0.79
0.12 0.23 0.30 0.35 0.44
0.93 0.87 0.86 0.92 0.84
0.65 0.65 0.67 0.71 0.73
0.19 0.17 0.13 0.11 0.05
0.02 0.00 0.03 0.06 0.12
0.38 0.40 0.44 0.49 0.56
(dj) (dj) (dj) (dj) (dj)
Ur(l ¼0.8)
Ur(l ¼0.9)
AUC – – – – – 0.56 0.55 0.60 0.73 0.69
Notes: The table reports results for contagion and quantization errors (QEs) on the SOFSM. QE is a percentile transformation of dj and contagion neighborhood is the BMUs of previous pre-crisis periods. The following measures are reported: TP (true positives), FP (false positives), TN (true negatives), FN (false negatives), precision positives (PP) ¼ TP/(TP þFP), recall positives (RP) ¼TP/(TPþ FN), precision negatives (PN) ¼ TN/(TN þ FN), recall negatives (RN) ¼TN/(TN þFP), accuracy ¼ (TP þTN)/(TPþ TNþ FP þ FN), AUC ¼area under the receiver operating characteristic curve (not computed for contagion as it is binary), and Usefulness (Ur(m))¼ [Min(mP1,P2 (1 m)) (mT1P1 þ(1 m)T2P2)]/Min(mP1,P2(1 m)), where m stands for relative preferences between false alarms and missing crises. For QE, the threshold is chosen as for optimal usefulness. Best values per measure and method are bolded.
information of possible impending instabilities rather than information on the exact timing of a crisis.
4. Conclusions This paper has enhanced the exploitation of the SOFSM by the means of four tasks: (1) fuzzification of the map, (2) probabilistic modeling of state transitions, (3) contagion analysis and (4) outlier detection. To illustrate the usefulness of the extensions, we have provided some sample visualizations and tested their predictive performance. This could, however, be extended along several directions. For transition probabilities, different time spans (e.g., moving averages) and regions (e.g., early warning nodes) may also be applied, as appropriate for the given task. Neighborhood contagion could as well use different definitions of neighborhood functions. When detecting outliers, one could decide to solely focus on imbalances in specific locations, such as financial stability states. These questions are, however, dependent upon the questions being asked and the task one is attempting to solve, and could hence be varied within the framework in this paper. The present paper does not, however, deal with changes in cluster structures. A limitation is hence the assumption of a stationary financial stability cycle. An important issue in earlywarning modeling is the changing nature of crises due to, e.g., financial innovation. Hence, a key question remains: How should models be adapted to events that potentially surpass historical experience? These suggestions, among other things, set the directions for future research and work in progress. References Alessi, L., Detken, C., 2011. Quasi real time early warning indicators for costly asset price boom/bust cycles: a role for global liquidity. Eur. J. Polit. Econ. 27 (3), 520–533. Borio, C., Lowe, P., 2002. Asset Prices, Financial and Monetary Stability: Exploring the Nexus. BIS Working Papers, No. 114. Borio, C., Lowe, P., 2004. Securing Sustainable Price Stability: Should Credit Come Back from the Wilderness? BIS Working Papers, No. 157. Boulet, R., Jouve, B., Rossi, F., Villa, N., 2008. Batch kernel SOM and related Laplacian methods for social network analysis. Neurocomputing 71 (7–9), 1257–1273. Cheng, C.T., Wang, W.C., Xu, D.M., Chau, K.W., 2008. Optimizing hydropower reservoir operation using hybrid genetic algorithm and chaos. Water Resour. Manage. 22 (7), 895–909. Cottrell, M., Letre´my, P., 2005. Missing values: processing with the Kohonen algorithm. In: Proceedings of Applied Stochastic Models and Data Analysis (ASMDA 2005), Brest, pp. 489–496.
Dattels, P., McCaughrin, R., Miyajim, K., Puig, J., 2010. Can You Map Global Financial Stability? IMF Working Paper, WP/10/145. Deboeck, G., Kohonen, T., 1998. Visual Explorations in Finance with SelfOrganizing Maps. Springer-Verlag, Berlin. ¨ Demirguc-Kunt, A., Detragiache, E., 2000. Monitoring banking sector fragility: a multivariate logit approach. World Bank Econ. Rev. 14, 287–307. ECB, 2011. Mapping the state of financial stability. In: ECB (Ed.), Financial Stability Review, December 2011, pp. 149–157. Fuertes, J.J., Domı´nguez, M., Reguera, P., Prada, M.A., Dı´az, I., Cuadrado, A.A., 2010. Visual dynamic model based on self-organizing maps for supervision and fault detection in industrial processes. Eng. Appl. Artif. Intell. 23 (1), 8–17. IMF, 2012. Global Financial Stability Report. International Monetary Fund, USA, Washington D.C. Jia, W., Ling, B., Chau, K.W., Heutte, L., 2008. Palmprint identification using restricted fusion. Appl. Math. Comput. 205 (2), 927–934. Kohonen, T., 1982. Self-organized formation of topologically correct feature maps. Biol. Cybern. 66, 59–69. Kohonen, T., 2001. Self-Organizing Maps. Springer-Verlag, Berlin. Lin, J.Y., Cheng, C.T., Chau, K.W., 2006. Using support vector machines for longterm discharge prediction. Hydrol. Sci. J. 51 (4), 599–612. Lo Duca, M., Peltonen, T.A., 2012. Assessing systemic risks and predicting systemic events. J. Bank Financ. Luyssaert, S., Sulkava, M., Raitio, H., Hollme´n, J., 2004. Evaluation of forest nutrition based on large-scale foliar surveys: are nutrition profiles the way of the future? J. Environ. Monit. 6 (2), 160–167. Mayer, R., Abdel Aziz, T., Rauber, A., 2007. Visualising class distribution on selforganising maps. In: Proceedings of the International Conference on Artificial Neural Networks (ICANN’07), Porto, Portugal, September 9–13. Muttil, N., Chau, K.W., 2007. Machine learning paradigms for selecting ecologically significant input variables. Eng. Appl. Artif. Intell. 20 (6), 735–744. Resta, M., 2009. Early warning systems: an approach via self organizing maps with applications to emergent markets. In: Proceedings of the 18th Italian Workshop on Neural Networks, pp. 176–184, IOS Press, Amsterdam. Sarlin, P., 2012a. Chance discovery with self-organizing maps: discovering imbalances in financial networks. In: Ohsawa, Y., Abe, A. (Eds.), Advances in Chance Discovery. Springer Verlag, Berlin, pp. 49–61. Sarlin, P., 2012b. Data and Dimension Reduction for Visual Financial Performance Analysis. TUCS Technical Report No. 1049. Sarlin, P., 2013. On Policymakers’ Loss Functions and the Evaluation of Early Warning Systems. Econ. Lett. 119 (1), 1–7. Sarlin, P., Eklund, T., 2011. Fuzzy clustering of the self-organizing map: some applications on financial time series. In: Proceedings of the 8th International Workshop on Self-Organizing Maps, Springer, Helsinki, pp. 40–50. Sarlin, P., Eklund, T. Financial performance analysis of European Banks using a fuzzified self-organizing map. Int. J. Knowledge-Based Intell. Eng. Syst, forthcoming. Sarlin, P., Marghescu, D., 2011. Visual predictions of currency crises using selforganizing maps. Intell. Sys. Acc. Fin. Mgmt. 18 (1), 15–38. Sarlin, P., Peltonen, T.A., 2011. Mapping the State of Financial Stability. ECB Working Paper, No. 1382. Sarlin, P., Yao, Z., Eklund, T., 2012. Probabilistic modeling of state transitions on the self-organizing map: some temporal financial applications. Intell. Syst. Acc. Fin. Mgmt. 19 (1), 189–203. Saunders, R., Gero, J.S., 2001. Designing for interest and novelty: motivating design agents. In: de Vries, B., van Leeuwen, J., Achten, H. (Eds.), CAADFutures 2001. Kluwer, Dordrecht, pp. 725–738. Vesanto, J., Alhoniemi, E., 2000. Clustering of the self-organizing map. IEEE Trans. Neural Networks 11 (3), 586–600.
P. Sarlin / Engineering Applications of Artificial Intelligence 26 (2013) 1532–1539
Vesanto, J., Himberg, J., Siponen, M., Simula, O., 1998. Enhancing SOM based data visualization. In: Proceedings of the International Conference on Soft Computing and Information/Intelligent Systems (IIZUKA’98), Iizuka, Japan, pp. 64–67. Sulkava, M., Hollme´n, J., 2003. Finding profiles of forest nutrition by clustering of the self-organizing map. In: Proceedings of the Workshop on Self-Organizing Maps, September 11–14, Hibikino, Kitakyushu, Japan.
1539
Xie, J.X., Cheng, C.T., Chau, K.W., Pei, Y.Z., 2006. A hybrid adaptive time-delay neural network model for multi-step-ahead prediction of sunspot activity. Int. J. Environ. Pollut. 28 (3-4), 364–381. Zhang, J., Chau, K.W., 2009. Multilayer ensemble pruning via novel multisub-swarm particle swarm optimization. J. Univers. Comput. Sci. 15 (4), 840–858.