Comparing the distribution of tropical tuna associated with drifting fish aggregating devices (DFADs) resulting from catch dependent and independent data

Comparing the distribution of tropical tuna associated with drifting fish aggregating devices (DFADs) resulting from catch dependent and independent data

Journal Pre-proof Comparing the distribution of tropical tuna associated with drifting fish aggregating devices (DFADs) resulting from catch dependent...

13MB Sizes 0 Downloads 37 Views

Journal Pre-proof Comparing the distribution of tropical tuna associated with drifting fish aggregating devices (DFADs) resulting from catch dependent and independent data Blanca Orue, Jon Lopez, Maria Grazia Pennino, Gala Moreno, Josu Santiago, Hilario Murua PII:

S0967-0645(19)30069-4

DOI:

https://doi.org/10.1016/j.dsr2.2020.104747

Reference:

DSRII 104747

To appear in:

Deep-Sea Research Part II

Received Date: 21 February 2019 Revised Date:

31 January 2020

Accepted Date: 31 January 2020

Please cite this article as: Orue, B., Lopez, J., Pennino, M.G., Moreno, G., Santiago, J., Murua, H., Comparing the distribution of tropical tuna associated with drifting fish aggregating devices (DFADs) resulting from catch dependent and independent data, Deep-Sea Research Part II (2020), doi: https:// doi.org/10.1016/j.dsr2.2020.104747. This is a PDF file of an article that has undergone enhancements after acceptance, such as the addition of a cover page and metadata, and formatting for readability, but it is not yet the definitive version of record. This version will undergo additional copyediting, typesetting and review before it is published in its final form, but we are providing this version to give early visibility of the article. Please note that, during the production process, errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain. © 2020 Published by Elsevier Ltd.

Blanca Orue: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Software, Roles/Writing – original draft, Writing – review & editing. Jon Lopez: Conceptualization, Investigation, Methodology, Validation, Writing – review & editing. Maria Grazia Pennino: Conceptualization, Formal analysis, Investigation, Methodology, Software, Validation, Writing – review & editing. Gala Moreno: Funding acquisition, Investigation, Project administration, Supervision, Validation, Writing – review & editing. Josu Santiago: Funding acquisition, Investigation, Project administration, Validation, Writing – review & editing. Hilario Murua: Funding acquisition, Investigation, Project administration, Supervision, Validation, Writing – review & editing

1

Comparing the distribution of tropical tuna associated with drifting fish

2

aggregating devices (DFADs) resulting from catch dependent and independent

3

data

4

Blanca Oruea,*, Jon Lopez b, Maria Grazia Penninoc, Gala Morenod, Josu Santiagoa, Hilario Muruaa-d

5 6 7 8 9 10

a

11

*corresponding author.

12

E-mail address: [email protected] (B.Orue)

AZTI-Tecnalia, Herrera kaia portualdea z/g 20110 Pasaia (Gipuzkoa), Spain

b

Inter-American Tropical Tuna Commission 8901 La Jolla Shores Drive, La Jolla CA 92037-1509

c

Instituto Español de Oceanografía, Centro Oceanográfico de Vigo. Subida a Radio Faro, 50-52 36390 Vigo, Spain

d

International Seafood Sustainability Foundation (ISSF), 1440 G Street NW Washington DC 20005

13 14 15

Keywords: DFAD, Indian Ocean, INLA, Species distribution model, Tuna

16 17 18

ABSTRACT

19 20

Species distribution models (SDMs) are used for a variety of scientific and management applications.

21

For species associated with drifting fish aggregating devices (DFADs), such as tuna, spatial models

22

can help tuna Regional Fisheries Management Organizations (t-RFMOs) understand their habitat

23

characteristics and dynamics. DFADs are monitored and tracked with satellite linked echo-sounder

24

buoys, which remotely provide fishers rough estimates of the abundance of fish underneath them.

25

Although this type of catch-independent data has been recently used in scientific studies, SDMs

26

using these data have never been compared with models using catch-dependent data (i.e. nominal

27

catch data). This study investigates the results obtained with both data sources using Bayesian

28

Hierarchical spatio-temporal models, allowing to analyze their advantages and disadvantages, as well

29

as compare the predicted distributions. Although the two model outputs show, in general, similar

30

areas of tuna presence under the DFADs, the most remarkable result of the comparison between the

31

models derived from the two different data sources is the precision of the hotspots identified in the

32

prediction maps. The maps obtained with acoustic data allow identifying areas of high probability of

33

tuna presence under the DFADs with greater precision, whereas the maps derived from catch data do

34

not allow observing any variation on a finer scale. The application of spatio-temporal models of tuna

35

associated with DFADs using acoustic data provided by fishers´ echo-sounder buoys appears

36

promising to identify the distribution dynamics of the species in a cost-effective way and may help

37

designing integrated spatial programs for more efficient fishery management.

38 39 40

1. Introduction

41 42

Tropical tuna, such as skipjack (Katsuwonus pelamis), yellowfin (Thunnus albacares) and

43

bigeye (Thunnus obesus), aggregate under floating objects in the Atlantic, Indian and Pacific Oceans

44

(Castro et al., 2002). Man-made floating objects, also called drifting fish aggregating devices

45

(DFADs), have been deployed and used by the tropical tuna purse-seine fishery since the 1990s to

46

increase their catches and fishing efficiency (Fonteneau et al., 2013; Davies et al., 2014). These

47

devices have been extensively used since then, and especially in recent years, it has been shown a

48

shift in strategy towards the use of more DFADs (Lennert-Cody et al., 2018). For example, around

49

80% of the Spanish tuna purse-seine sets in the Indian Ocean in 2017 were made on DFADs (Báez et

50

al., 2018), while the rest of the catch comes from sets on unassociated tuna schools, also called free-

51

swimming schools (FSC). Currently, DFADs are monitored and tracked with satellite linked echo-

2

52

sounder buoys (Lopez et al., 2014), which remotely inform fishers in near real-time about the

53

geolocation of the object and provide rough estimates of the abundance of fish underneath them.

54

Because of the enormous importance and value of this fishery, increasing the knowledge of

55

the spatio-temporal distribution of tuna associated with DFADs may help to develop adequate

56

science-based advice for tuna Regional Fisheries Management Organizations (t-RFMOs). Species

57

distribution models (hereinafter SDMs) are an essential tool for science and management, as they

58

provides a clear picture of the distribution dynamics and extent of marine resources (Pennino et al.,

59

2014; Robinson et al., 2017; Martínez-Minaya et al., 2018). In general, information to build marine

60

SDMs can be obtained from two main sources: catch-independent and catch-dependent data.

61

To date, most studies of the distribution of tuna species are conducted using catch-dependent

62

data, such as catch data coming from fishers logbook (Chen et al., 2005; Lee et al., 2005; Rajapaksha

63

et al., 2013) or observers´ data onboard commercial vessels (Sequeira et al., 2012; Lezama-Ochoa et

64

al., 2016; Coelho et al., 2017). Catch-dependent data, due to its mandatory collection nature, are

65

generally easier to access, less expensive to collect and usually involve a relatively large spatially and

66

temporally sample size as it covers the whole distribution of the fishery. Yet, there are some biases

67

associated with such data. One of the main concerns associated with these data is that target-species´

68

samples are collected by preferential sampling as the fishing fleets are commercially driven (Conn et

69

al., 2017). In addition, they suffer from quality control and assurance issues due to legibility,

70

problems with standardization in data collection, aggregation of species into generic groups

71

transcription errors and misreporting of catches (Lowell et al., 2015; Pennino et al., 2016; Bradley et

72

al., 2019).

73

On the other hand, catch-independent data usually have fewer biases, but one of the major

74

associated problems is that they are often collected at high economic and human cost, resulting in

75

smaller sample sizes covering generally specific areas. In addition, for large migratory pelagic

76

species like tuna, catch-independent surveys covering their whole spatio-temporal distribution are 3

77

very difficult and expensive to carry out due to their enormous geographic range. Recent works have

78

highlighted the importance that fishers’ echo-sounder buoys may have to investigate several issues of

79

scientific relevance (Lopez et al., 2016; Moreno et al., 2016). Although these data are recorded by

80

tools that are not originally developed for scientific analyses, they present many notable advantages.

81

They are continuously recording and collecting geolocated acoustic data and, hence, have the

82

potential to stream huge amounts of information in a very cost-effective manner. Indeed, information

83

from these devices have already been used in recent years to investigate tuna dynamics at different

84

temporal and spatial scales (Lopez et al., 2010; Robert et al., 2013; Lopez et al., 2017a; Lopez et al.,

85

2017b; Orue et al., 2019b).

86

This study compares the resulting spatio-temporal distribution of tuna in the Western Indian

87

Ocean by implementing Bayesian Hierarchical spatial models (hereinafter B-HSM) using both catch-

88

dependent (i.e. nominal catch data) and independent (i.e. acoustic data from fishers´ echo-sounder

89

buoys) data. The results obtained will allow analyzing the advantages and disadvantages of both data

90

sources, as well as determining the suitability of using fishers´ echo-sounders buoys acoustic data for

91

species distribution models, and more generally, the validity of the same data for scientific purposes.

92

Moreover, results of this exercise will contribute to increase the knowledge of the spatial dynamics

93

and distribution of tuna in the Western Indian Ocean and assist the scientific-advice to t-RFMOs.

94 95 96

2. Materials and methods

97 98

2.1. Study area

99

4

100

Our study area is bounded by longitude 30ºE to 80ºE and latitude 15ºN to 30ºS in the Western

101

Indian Ocean, where tropical tuna purse seine fishery operates throughout the year using DFADs

102

(Davies et al., 2014) (Fig. 1). DFADs are not distributed evenly in space and their trajectories are

103

influenced by surface currents and winds (Dagorn et al., 2013; Davies et al., 2014). The ocean

104

circulation in the Indian Ocean is related to the marked monsoon system that greatly affects the

105

oceanography and production of the area (Schott and McCreary, 2001; Wiggert et al., 2006; Schott et

106

al., 2009). The drastic changes in circulation of the surface currents affect biophysical factors (e.g.

107

chlorophyll, temperature, salinity, dissolved oxygen) (Tomczak and Godfrey, 2013) and could affect

108

the presence and relative species composition in an area (Jury et al., 2010). In addition, it is known

109

that the aggregation dynamics of tuna associated with DFADs differed between monsoon periods

110

(Orue et al., 2019b).

111

112

2.2. Catch-independent data

113 114

The acoustic data were collected by Satlink buoys (SATLINK, Madrid, Spain,

115

www.satlink.es) attached to DFADs and were provided by the Spanish purse seine vessel company

116

Echebastar. The echo-sounder transmits to the user the amount of biomass aggregated under each

117

object using an Inmarsat satellite connection and a depth layer echo-integration procedure

118

(Simmonds and MacLennan, 2005). The observation depth range extends from 3 to 115 m, composed

119

of ten homogeneous layers, each with a resolution of 11.2 m (see Lopez et al. (2016) for more

120

technical details on the buoy used) (Fig. 2). A vertical depth limit of 25 m was established as a

121

potential boundary between tuna and non-tuna species, based on experimental evidences from

122

tagging and acoustic surveys around DFADs in the Indian Ocean (Moreno et al., 2007; Govinden et

123

al., 2010; Filmalter et al., 2011; Forget et al., 2015). Although temporary overlap may exist between 5

124

species, it is assumed that, in this region, most of the time tuna is below 25 m while non-tuna species

125

remain in shallower waters. Other studies using the same echo-sounder buoy, addressing similar

126

aggregation dynamics and distribution studies, used the vertical limit of 25 m to separate tuna from

127

non-tuna species (Robert et al., 2013; Lopez et al., 2017a; Lopez et al., 2017b; Orue et al., 2019b).

128

Therefore, if there was an acoustic record between 25 and 115 m, it was tuna presence and, if not,

129

absence.

130

In addition to the acoustic information, buoy data include information about the owner vessel,

131

buoy ID (unique alphanumeric code given by the manufacturer), spatial information (i.e. latitude and

132

longitude), and the date and GMT hour of the sampling. All this information was available for 962

133

buoys with 42,322 acoustic samples, including presence and absence information, which were active

134

in the Indian Ocean from January 2012 to May 2015. Although echo-sounders provide daily data and

135

precise positions, catch-independent information were aggregated at 1ºx1º degrees per month to have

136

the same spatio-temporal resolution as catch-dependent data. All the buoys were attached to newly

137

deployed DFADs (i.e. virgin DFADs; objects with no aggregation associated when first deployed),

138

with a maximum of 60 days at sea (i.e. time constrained based on a preliminary data distribution

139

analysis that showed that after 60 days only 50% of the DFADs were available). Data cleaning was

140

carried out following the protocol proposed by (Orue et al., 2019a). Buoy data and FAD and fishing

141

logbook information were crossed to assure that no fishing activity occurred in our virgin DFAD.

142

Potential fishing sets were identified on the logbooks based on the information of buoy code, position

143

and date. If a fishing set was identified to occur during the DFAD trajectory, the buoy information

144

after the set was eliminated

145 146

2.3. Catch-dependent data

147

6

148

Nominal tropical tuna catch data (i.e. any species; skipjack, yellowfin and bigeye) of the

149

purse-seine fisheries in the Indian Ocean were obtained from the Indian Ocean Tuna Commission

150

(IOTC; www.iotc.org). The dataset includes information on EU DFAD catches over the study period

151

(2012–2015) with a spatial resolution of 1° latitude-longitude grid cell and a monthly temporal

152

resolution. A total of 7357 strata (1ºx1º grid cell and month) with catch information reported by the

153

EU fleets were observed over a period of 4 years (i.e. 2012-2015).

154

Only fishery strata with positive samples (i.e. presences) are available in the original dataset.

155

To generate a presence/absence dataset, pseudo-absences were generated randomly for the study

156

area. The set of pseudo-absences was generated 100 times using the “srswor” function (simple

157

random sampling without replacement) from the “sampling” package (Tille and Matei, 2016) of the

158

R software (R Development Core Team, 2017). In each case, the number of generated pseudo-

159

absences was the same as the number of real presences (Sequeira et al., 2013). Pseudo-absences were

160

then combined with real presences into a single presence-absence dataset to be used in the binomial

161

modeling phase. Although the generation of pseudo-absence datasets is an active research topic and

162

is subject to some debate (e.g. Hastie and Fithian (2013)), we preferred to use such a binomial

163

distribution with a Bayesian spatial model instead of a less accurate model that allows the use of

164

presence-only data (e.g. BIOCLIM, DOMAIN) (Roos et al., 2015).

165 166

2.4. Environmental data

167 168

The environmental variables were obtained from the EU Copernicus Marine Environment

169

Monitoring Service (CMEMS; http://marine.copernicus.eu/). Eight abiotic and biotic variables were

170

downloaded and processed based on previous studies analyzing tropical tuna environmental

171

preferences (Fonteneau et al., 2008; Song et al., 2008; Fraile et al., 2010; Arrizabalaga et al., 2015;

7

172

Lopez et al., 2017b) (Table 1). For the acoustic data, the environmental variables were extracted for

173

each record, according to their position and date. In the case of tuna catch data, as they are monthly

174

data aggregated in 1x1°, we extract the average value of the strata for each variable. All

175

environmental variables were tested for correlation, collinearity, outliers and missing values before

176

their use in the models. Velocity was eliminated as it is highly correlated to eddy kinetic energy

177

(Pearson correlation, r > 0.9; p-value < 0.001). The rest of the variables presented low correlation and

178

collinearity values. Moreover, variables were standardized using the function “decostand” in the

179

“vegan” package (Oksanen et al., 2013) of the R software, in order to facilitate interpretation and to

180

enable comparison of relative weights between variables (Kinas and Andrade, 2017).

181 182

2.5.Species distribution model

183 184

B-HSMs were used to predict the probability of tuna presence with respect to the selected

185

environmental variables using both catch-dependent and independent data. In the case of models

186

performed using catch-independent data, the buoy ID was included as a random effect to account for

187

the potential autocorrelation of the sampling structure, as acoustic data are collected repeatedly in

188

various days by the same buoy for each DFAD. The days spent at sea by the DFAD was also

189

considered as a covariate for the catch-independent model. In both models the response variable was

190

a binary variable that represents the presence (1) or absence (0) of tuna (Yi) in each location i, and

191

then the occurrence is modelled as: ~

=

= ,…., +

+



~ !"# , $# % 8

=

0, ' (, )

2

( =

"+ , ,+

2

) =

"- , ,-

192

where πi represents the probability of the species presence for a given location (i), Xiβ represents the

193

matrix of the fixed effects of the linear predictor, Zi is the buoy random effect and Wi represents the

194

spatially structured random effect at the location i. Prior Gaussian distributions with a zero mean and

195

covariance matrix (Q) was assumed for the spatial component, which depend on the hyperparameters

196

k and τ, and determine the range of the effect and the total variance, respectively. Hyperpriors for k

197

and τ are cantered in values such that the range is about 20% of the diameter of the region and the

198

variance is equal to 1 (Lindgren et al., 2011).

199

We use the Integrated Nested Laplace Approximations (INLA) approach (Rue et al., 2009)

200

and the package INLA (http:\\www.r-inla.org) that is implemented in the R software to obtain

201

Bayesian parameter estimates and predictions. The spatial effects (W) were computed using the

202

Stochastic Partial Differential Equations (SPDE) approach implemented in INLA (Lindgren et al.,

203

2011), which ensures that the continuous spatial domain (also known as Gaussian Random Field;

204

GRF) is discretized into smaller spatial units (known as Gaussian Markov Random Field; GMRF). In

205

such a set-up, the GMRF is reduced to a multivariate Gaussian distribution that is characterized by a

206

zero mean and a spatial dependent covariance matrix that is defined by the Matérn function

207

(Lindgren et al., 2011). Default zero-mean Gaussian prior distributions with a variance of 100 were

208

used for all of the parameters involved in the fixed effects as recommended by Held et al. (2010).

209

Following the Bayesian framework, posterior distributions were obtained for each parameter.

210

This type of distribution, in contrast to the mean and confidence interval produced by classical

211

frequentist analyses, enables explicit probability statements about the parameters. Therefore, the 9

212

region bounded by the 0.025 and 0.975 quantiles of the posterior distribution indicates that the

213

unknown parameter is 95% likely to fall within this range of values (Dell'Apa et al., 2017).

214 215

2.5.1. Model selection

216

The selection of explanatory variables was conducted by comparing all possible interactions,

217

but only the best combination of variables was chosen, based on the Watanabe-Akaike (WAIC)

218

information criterion (Watanabe, 2010) and Log-Conditional Predictive Ordinations (LCPO) (Roos

219

and Held, 2011). Specifically, lower WAIC values indicate a better fit, while lower LCPO scores

220

represent better predictive quality. The best compromise between fit, parsimony and predictive

221

quality is the smaller the WAIC and LCPO values are. Thus, the best models were selected based on

222

the mentioned compromise between low WAIC and LCPO values, containing only relevant

223

predictors (i.e. those with 95% credibility intervals excluding zero (Fonseca et al., 2017)).

224 225

2.5.2. Model prediction

226

Once the inference was carried out, we predicted monthly probability tuna presence in the area

227

of interest using Bayesian kriging (Muñoz et al., 2013). The prediction in INLA was performed

228

simultaneously with the inference, considering the prediction locations as points where the response

229

is missing. The INLA SPDE module allows the construction of a Delaunay triangulation covering the

230

region of interest for the prediction. This has at least three advantages over a regular grid. First, the

231

triangulation is denser in regions where there are more observations and consequently there is more

232

information, and more detail is needed. Second, it saves computing time, because prediction

233

locations are typically much lower in number than those in a regular grid. Thirdly, it is possible to

234

take boundary effects into account by generating a triangulation with small triangles in the domain of

235

interest and using larger triangles in the extension used to avoid boundary effects. Once the

236

prediction is performed in the observed location, there are additional functions that linearly 10

237

interpolate the results within each triangle into a finer regular grid. As a result of the process, a

238

faceted surface prediction is obtained which approximates to the true predictive surface (Muñoz et

239

al., 2013). In order to have the same spatio-temporal resolution, all environmental data were

240

aggregated at 1ºx1º degrees per month, using the ‘raster’ package (Hijmans et al., 2017) of the R

241

software. The two final established models derived from the two different data sources (see section

242

3.1 and 3.2) were used to compute monthly predictions on the monthly means of the selected

243

environmental variables, to take into account possible intraannual variation based on the marked

244

oceanographic dynamics of the area.

245 246

2.5.3. Model validation

247

Model predictions were validated using the leave-group-out cross-validation procedure (Kuhn

248

and Johnson, 2013). Both datasets were randomly split into two main subsets: a training dataset

249

including 80% of the total observations, and a validation dataset containing the remaining 20% of the

250

data (Fielding and Bell, 1997). The training data were used to model the relationship between

251

observed data and the explanatory variables while the second dataset was used to assess the quality of

252

predictions. We repeated the validation procedure 10 times for the best model of each data source and

253

month. Model performance was assessed using the area under the receiver-operating characteristic

254

curve (AUC) (Fielding and Bell, 1997) and the true skill statistic (TSS) (Allouche et al., 2006). AUC

255

measures the ability of the model to discriminate species presence’s locations from those in which

256

the species is absent. AUC values range from 0 to 1, where values closer to 1 indicate better

257

predictions. TSS corrects AUC for the dependence of prevalence on specificity (i.e. ability to

258

correctly predict absences) and sensitivity (i.e. ability to correctly predict presences). TSS ranges

259

from -1 to +1, where +1 indicates perfect agreement and values of zero or less indicate a performance

260

no better than random (Pennino et al., 2017). Both AUC and TSS are used in combination when

261

evaluating the predictive power of a SDM (Pearson et al., 2006; Brodie et al., 2015). Model 11

262

validation was performed using the ‘PresenceAbsence’ package (Freeman and Moisen, 2008) of the

263

R software.

264 265

2.6. Comparison of model predictions

266 267

The monthly spatial structure similarity of predictions of species' distributions derived from

268

the two different data sources were compared using the Schoener's D (Schoener, 1968) and Warren's

269

I statistics (Warren et al., 2008). Both statistics assume probability distributions defined over

270

geographic space and are calculated using the difference between species suitability score at each

271

grid cell (Pennino et al., 2018). The two metrics ranged from 0, where two distributions have no

272

overlap, to 1, where two distributions are identical. These analyses were carried out using the

273

“nicheOverlap” function of the “dismo” package (Hijmans et al., 2016) of the R software. The

274

Kruskal-Wallis H test was used to elucidate whether there were significant differences between

275

months for each statistic index.

276

Moreover, Spearman rank correlation between monthly predictions derived from the two

277

different data sources was also calculated, as Schoener's D and Warren's I statistics may tend to

278

overestimate similarity between rasters when many grid cells are of similar values (e.g. when a large

279

part of the region contains low probability values). The Spearman rank correlation coefficient ranged

280

from -1 (perfectly negatively correlated) and 1 (perfectly positively correlated).

281 282 283

3. Results

284 285

3.1. Catch-independent data

286 12

287

The final B-HSM, after testing all the possible variable combinations, included SST, SSH,

288

EKE and days at sea as relevant predictors, included as linear terms. The spatial effect (W) and buoy

289

identification were included as random effects (Table 2).

290

According to the model, higher probability of tuna presence is expected in warmer waters (posterior

291

mean = 0.12; 95% CI = [0.09, 0.16]), with higher SSH values (posterior mean = 0.17; 95% CI =

292

[0.14, 0.20]) and low eddy kinetic energy values (posterior mean = -0.17; 95% CI = [-0.21, -0.13]).

293

Moreover, days at sea also explained the probability of presence of tuna, finding greater probabilities

294

with higher days at sea values (posterior mean = 0.03; 95% CI = [0.02, 0.03]) (Table 3).

295

Maps of the predicted probability of occurrence of tuna using catches-independent data show

296

relatively small and disaggregated hotspots. During autumn and part of winter (i.e. October to

297

January), hotspots are mostly distributed in the equatorial zone. During the rest of the year, small

298

hotspots distributed from -10°S to 10°N are shown, with some variation in February, where a marked

299

hotspot in the Horn of Africa is found, and in March, where highest probabilities were found close to

300

the coast of Somalia and Kenya (Fig. 3). The standard deviation maps (Fig. S.1) show very low

301

values in the area where data were collected, while the error increases along the edges and off the

302

data collection area.

303

The analyses of the model prediction performances obtained in the cross-validation showed AUC

304

values of 0.71±0.05, indicating a reasonable degree of discrimination between locations in which

305

tuna were present and those in which were absent. Likewise, TSS values of 0.31±0.05 were found,

306

which represents a reasonable ability of the model to predict true negative and true positive

307

predictions (Brodie et al., 2015).

308 309

3.2. Catch-dependent data

13

310 311

The main predictors of tuna habitats in the Western Indian Ocean were SAL, SST, SSH and a

312

stochastic spatial component that accounted for the residual spatial autocorrelation (W) (Table 4).

313

Specifically, results showed a positive relationship between SST (posterior mean = 0.17; 95% CI =

314

[0.07, 0.26]), SAL (posterior mean = 0.26; 95% CI = [0.18, 0.34]), and the probability of tuna

315

occurrence. On the contrary, the SSH showed a negative relationship (posterior mean = -0.20; 95%

316

CI = [-0.30, -0.09]) with tuna occurrence (Table 5).

317

From January to May the tuna probability hotspot moves from the central Western Indian Ocean to

318

the south, creating a second hotspot in the Mozambique Channel. In June, the hotspot in the

319

Mozambique Channel disappears and it becomes smaller and limited to the equatorial zone. From

320

July to December the hotspot remains more or less stable in the same area, with small variations.

321

Another less marked hotspot in the east can be also observed, varying in intensity throughout the year

322

and even disappearing in certain months (i.e. April, May and November) (Fig. 4). In this case, the

323

standard deviation maps (Fig. S.2) show low values along the edges and practically zero values in the

324

rest of the area.

325

The generation of pseudo-absences is an open and well-known discussion in the scientific

326

community. To check the reliability of our results, we generated randomly for the study area three

327

other sets of pseudo-absences and we ran again the models. The results were similar in terms of final

328

covariates selected, and model validation and prediction capabilities (i.e. performance scores) (Table

329

S.1, Fig. S.3), underlining that the model has been correctly established and that it can be a

330

representative selection of the average results.

331 332

AUC and TSS values showed reasonable values, with averages of 0.74±0.08) and 0.45±0.05, respectively, indicating good performance of the models for prediction as well. 14

333 334

3.3. Model comparison statistics

335 336

On the one hand, the similarity statistics indicate a good level of overlap in the predictions of

337

the two model outputs. In fact, for all months, Schoener's D ranged from about 0.43 to 0.67 (mean

338

0.58±0.07), while Warren's I ranged from 0.74 to 0.91 (mean 0.85±0.06). On the other hand, the

339

Spearman rank correlation coefficient indicated positive but not very high scores, which ranged from

340

0.43 to 0.67 (mean 0.58±0.07) (Table 6). Although intra-annual variations were detected, the

341

differences found between months are not statistically significant for either index (Kruskal Wallis,

342

p>0.05).

343 344 345

4.

Discussion

346 347

This study represents the first investigation of spatio-temporal distribution of tropical tuna

348

aggregated under the DFADs using observations from catches-dependent and independent data

349

sources. Moreover, this study provides new information about spatio-temporal habitat preferences of

350

tuna aggregated under the DFADs in the Western Indian Ocean.

351 352

4.1. Model comparison

353 354

Although the similarity statistics computed in this study (i.e. Schoener's D and Warren's I)

355

indicated a good level of overlap in the predictions of the two model outputs, Spearman rank

356

correlation showed low values. These differences could most likely be due to the areas of low

357

probability. This study considers large-scale habitat preference predictions in which, outside the main 15

358

fishing area, very low probabilities are estimated for both models using the two data sources, which

359

may result in misleading similarity statistics. This highlights the need of working with various

360

indices of similarity, and not only Schoener's D and Warren's I or Spearman rank correlation

361

statistics individually, to better understand the spatial structure of the results and obtain non-

362

misleading results.

363

Despite the two model outputs showing, in general, similar areas of tuna presence under the

364

DFADs, some discrepancies are present. While temperature is relevant, and it has a positive effect on

365

both models, SSH has the opposite effect in each model (i.e. positive in catch-independent model and

366

negative in catch-dependent model). Tuna’s preference for warm surface waters is consistent with

367

previous works, where warmer waters have been linked to both higher catch rates of large tuna

368

(Zagaglia et al., 2004; Lan et al., 2011; Lan et al., 2017) or acoustic biomass values (Lopez et al.,

369

2017b). The opposite effect found for SSH is more complex to interpret. Tropical tuna usually live in

370

warmer environments that display near null or positive SSH values (Arrizabalaga et al., 2015; Lan et

371

al., 2017; Lopez et al., 2017b), which is consistent with catch-independent model results (positive

372

SSH). Catch-dependent data also revealed a positive relationship with salinity, which is not identified

373

by catch-independent data as one of the main covariates. Also, the buoy-based model found a

374

negative relationship with EKE whereas the catch-based model did not find any significant

375

relationship. These differences in the relevance of environmental variables between the models could

376

be due to several factors. On the one hand, it must be taken into account that the environmental

377

variables were considered to play a less important role in DFADs than in for example free school

378

communities in the Indian Ocean (Lezama-Ochoa et al., 2015), which suggests that other ecological

379

processes may also be involved at different scales on the aggregation process of the species at

380

DFADs. On the other hand, in this study we have not considered the specific composition of the

381

species aggregated under the object, and it could be possible that the behavior of the species in

382

relation to the environmental variables varies between species. Current echo-sounder buoys provide a 16

383

single biomass value without determining species or size composition of the fish underneath the

384

DFAD, although work is being conducted to enable echo-sounder buoys provide abundance of tuna

385

by species in the near future (Boyra et al., 2018). Therefore, once this becomes possible, the specific

386

composition of the species aggregated under the DFADs should be included in the models, to

387

consider possible differences in species behavior and environmental variables.

388

It is also possible that the differences found between models are related to the characteristics

389

of the data (e.g. strata differences), data processing (e.g. aggregated or fine-scale environmental data)

390

or areas where catches occur and the trajectories of the DFADs. For example, there is a significant

391

difference in sampling size between the two datasets: for the same period, catch-independent data

392

have 42,322 observations compared to the 7357 of the catch-dependent data. These may be the main

393

reasons for the difference in the precision of the hotspots identified in the prediction maps. The maps

394

obtained with acoustic data allow identifying areas of high probability of tuna presence under the

395

DFADs with greater precision, whereas the maps derived from catch data do not allow observing any

396

variation on a finer scale. Maps illustrating the distribution of tuna associated with DFADs are

397

fundamental sources to help design management and conservation measures, so it is very useful to

398

obtain maps with as much detailed as possible within the large area occupied by tropical purse-seine

399

fisheries. With the availability of high-resolution data of tuna presence derived from fishers’ echo-

400

sounder buoys and environmental variables from satellite data, broad-scale studies can be made to

401

estimate tuna distribution at fine resolution.

402

One of the most important limitations of catch data is that they are constrained to the areas

403

where fishing occurs. Therefore, there are areas where, for several reasons (e.g. exclusive economic

404

zones without fishing agreements, distance to port, lower catch rates, etc.), fishing is not occurring

405

and, hence, there are no observations to build SDMs. In contrast, echo-sounders buoys are

406

continuously recording and providing information on the DFADs trajectories and biomass of fish

407

aggregated underneath in a non-invasive manner, covering areas and seasons where catch data are not 17

408

available. Based on our results, the application of spatio-temporal models for tuna associated with

409

DFADs using acoustic data provided by fishers´ echo-sounder buoys appears promising for the

410

identification of the spatial distribution of tuna in specific areas where fishing effort is limited or does

411

not exist. Nevertheless, echo-sounder buoys are sometimes switched off when they move away from

412

the fishing zone, to avoid communication fees for DFADs outside the practical range. However, in

413

this study, the presence of DFADs outside regular fishing grounds throughout the year suggest that

414

that is not the case for all the buoys. Despite all the advantages of acoustic data derived from fishers´

415

echo-sounder buoys, there are also some limitations. For example, using data obtained in DFADs,

416

only the distribution of small fish of the population can be modeled as large fish are mainly

417

unassociated to DFADs and, presently can only be modeled using a mixture of 3 species of tunas

418

(skipjack, yellowfin and bigeye tuna) and not a single species as it cannot still differentiate using

419

buoy’s echo-sounder biomass information.

420 421

4.2. Spatio-temporal distribution of tuna

422 423

Results showed month-specific distribution patterns, suggesting that tuna species may have

424

different habitat preferences depending on the season. The Indian Ocean is characterized by strong

425

environmental fluctuations associated with monsoon regimes that affect ocean circulation and

426

biological production (Schott and McCreary, 2001), seasonal variability in fishing grounds (Davies et

427

al., 2014), and catches (Kaplan et al., 2014). Changes in biophysical factors associated with

428

seasonality (i.e. chlorophyll, temperature, salinity, dissolved oxygen) may play an important role in

429

the distribution of tuna aggregated under floating objects.

430

In the case of the maps obtained using catch data, we observe a more general hotspot that

431

varies throughout the year but does not allow the identification of more precise areas. On the other

432

hand, the maps obtained with echo-sounder buoy data identify hotspots at a finer scale and the intra18

433

annual changes in tuna distribution. For example, an intermediate zone of low probability of tuna

434

presence was found between the two hotspots identified by the acoustic data (one in horn of Africa

435

and the another one around Seychelles and west of Seychelles) during February. This intermediate

436

zone of low probability of tuna presence may be related to the relatively high speed of the currents in

437

the area. In fact, tuna could be forced to leave floating objects in this area because of the high

438

energetic cost being associated with fast-DFADs carried by the South Equatorial Countercurrent

439

flowing eastwards at speeds > 1- 1.5 knots (Schott and McCreary, 2001; Tomczak and Godfrey,

440

2013). This finding is consistent with the study of Lopez et al. (2017b), that observed greater

441

abundances of tuna aggregated to DFADs at low speeds (i.e. 0.5-1 knots or lower). From July to

442

September, tuna distribution patterns seem to be related to the upwelling system of the Somali coast.

443

Indeed, during the summer monsoon (i.e. June to September) there is an extensive coastal upwelling

444

over the margin of continental areas in the northwestern Indian Ocean (Schott et al., 2009). Cold and

445

high-salinity waters shallow as a result of the upwelling with high nutrient contents that stimulate the

446

increase of primary production in the area (Veldhuis et al., 1997; Hitchcock et al., 2000). In addition,

447

one of the main characteristic of this upwelling is the rapid export of surface waters offshore reaching

448

over 500 km from the coast (Hitchcock et al., 2000; Wiggert et al., 2006), due to the presence of a

449

large anticyclonic eddy, called Great Whirl, in the vicinity of the upwelling zone (Beal and Donohue,

450

2013). These patterns are mainly reflected in the July and August prediction maps, which show the

451

highest probability of tuna in Somali waters. Moreover, in September higher tuna probability areas

452

move to the east. This may be related to the offshore export of surface productive waters after the

453

summer upwelling, indicating that tuna may aggregate to DFADs that follow this nutritive current

454

heading eastward. Although in this study chlorophyll has not been identified as a relevant variable,

455

previous studies have found positive relationship between tuna associated with DFADs and

456

chlorophyll concentration, particularly in the Indian Ocean (Fraile et al., 2010), but also in the

457

Atlantic Ocean (Lopez et al., 2017b). 19

458 459 460 461

5. Conclusions

462

The present study compares, for the first time, species distribution models developed using

463

catch-dependent (i.e. catch) data with models using catch-independent data (i.e. acoustic data derived

464

from fishers´ echo-sounder buoys), concluding that the maps obtained with acoustic data allow

465

identifying areas of high probability of tuna presence under the DFADs with greater precision than

466

maps derived from catch data. This highlights the great potential fishers´ echo-sounder buoys have

467

for addressing key scientific questions on tuna ecology and behavior. Furthermore, these data could

468

significantly contribute to the development of possible management measures, for example time–area

469

closures, in which, by having greater precision in the spatio-temporal distribution of tunas associated

470

with DFADs, management measures could be more efficient. The integration of novel sources of

471

information in science and management, such as fishers´ echo-sounder buoys data, are a valuable

472

tool.

473 474 475

Acknowledgments

476 477

The authors would like to thank the Spanish fishing company of tuna purse seiners in the

478

Indian Ocean ECHEBASTAR who kindly provided the acoustic data from their echo-sounder buoys

479

used in the present study.

480 481 482

References 20

483 484 485 486

Allouche, O., Tsoar, A., Kadmon, R., 2006. Assessing the accuracy of species distribution models: prevalence, kappa and the true skill statistic (TSS). J Appl Ecol. 43, 1223-1232. Arrizabalaga, H., Dufour, F., Kell, L., Merino, G., Ibaibarriaga, L., Chust, G., Irigoien, X., Santiago,

487

J., Murua, H., Fraile, I., 2015. Global habitat preferences of commercially valuable tuna. Deep

488

Sea Res. Part 2 Top. Stud. Oceanogr. 113, 102-112.

489

Báez, J.C., Fernández, F., Pascual-Alayón, P.J., Ramos, M.L., Deniz, S., Abascal, F., 2018. Updating

490

the statistics of the EU-Spain purse seine fleet in the Indian Ocean (1990-2017). IOTC-2018-

491

WPTT20-15.

492 493 494

Beal, L., Donohue, K., 2013. The Great Whirl: Observations of its seasonal development and interannual variability. J Geophys Res-Oceans 118, 1-13. Boyra, G., Moreno, G., Sobradillo, B., Pérez-Arjona, I., Sancristobal, I., Demer, D.A., 2018. Target

495

strength of skipjack tuna (Katsuwanus pelamis) associated with fish aggregating devices (FADs).

496

ICES J Mar Sci. 75, 1790-1802.

497

Bradley, D., Merrifield, M., Miller, K.M., Lomonico, S., Wilson, J.R., Gleason, M.G., 2019.

498

Opportunities to improve fisheries management through innovative technology and advanced

499

data systems. Fish Fish. 20 564-583.

500

Brodie, S., Hobday, A.J., Smith, J.A., Everett, J.D., Taylor, M.D., Gray, C.A., Suthers, I.M., 2015.

501

Modelling the oceanic habitats of two pelagic species using recreational fisheries data. Fish

502

oceanogr. 24, 463-477.

503

Castro, J., Santiago, J., Santana-Ortega, A., 2002. A general theory on fish aggregation to floating

504

objects: an alternative to the meeting point hypothesis. Rev. Fish Biol. Fish. 11, 255-277.

505

Coelho, R., Mejuto, J., Domingo, A., Yokawa, K., Liu, K.M., Cortés, E., Romanov, E.V., Silva, C.,

506

Hazin, F., Arocha, F., 2017. Distribution patterns and population structure of the blue shark

507

(Prionace glauca) in the Atlantic and Indian Oceans. Fish Fish. 19 90-106.

508

Conn, P.B., Thorson, J.T., Johnson, D.S., 2017. Confronting preferential sampling when analysing

509

population distributions: diagnosis and model‐based triage. Methods Ecol Evol. 8, 1535-1546.

510 511 512 513

Chen, I., LEE, P.F., TZENG, W.N., 2005. Distribution of albacore (Thunnus alalunga) in the Indian Ocean and its relation to environmental factors. Fish oceanogr. 14, 71-80. Dagorn, L., Bez, N., Fauvel, T., Walker, E., 2013. How much do fish aggregating devices (FADs) modify the floating object environment in the ocean? Fish oceanogr. 22, 147-153.

21

514 515 516

Davies, T.K., Mees, C.C., Milner-Gulland, E., 2014. The past, present and future use of drifting fish aggregating devices (FADs) in the Indian Ocean. Mar Policy. 45, 163-170. Dell'Apa, A., Pennino, M.G., Bonzek, C., 2017. Modeling the habitat distribution of spiny dogfish

517

(Squalus acanthias), by sex, in coastal waters of the northeastern United States. Fish. Bull. 115,

518

89-100.

519 520

Fielding, A.H., Bell, J.F., 1997. A review of methods for the assessment of prediction errors in conservation presence/absence models. Environ conserv. 24, 38-49.

521

Filmalter, J.D., Dagorn, L., Cowley, P.D., Taquet, M., 2011. First descriptions of the behavior of

522

silky sharks, Carcharhinus falciformis, around drifting fish aggregating devices in the Indian

523

Ocean. Bull. Mar. Sci. 87, 325-337.

524 525 526

Fonseca, V.P., Pennino, M.G., de Nóbrega, M.F., Oliveira, J.E.L., de Figueiredo Mendes, L., 2017. Identifying fish diversity hot-spots in data-poor situations. Mar. Environ. Res. 129, 365-373. Fonteneau, A., Chassot, E., Bodin, N., 2013. Global spatio-temporal patterns in tropical tuna purse

527

seine fisheries on drifting fish aggregating devices (DFADs): Taking a historical perspective to

528

inform current challenges. Aquat. Living Resour. 26, 37-48.

529 530

Fonteneau, A., Lucas, V., Tewkai, E., Delgado, A., Demarcq, H., 2008. Mesoscale exploitation of a major tuna concentration in the Indian Ocean. Aquat. Living Resour. 21, 109-121.

531

Forget, F.G., Capello, M., Filmalter, J.D., Govinden, R., Soria, M., Cowley, P.D., Dagorn, L., 2015.

532

Behaviour and vulnerability of target and non-target species at drifting fish aggregating devices

533

(FADs) in the tropical tuna purse seine fishery determined by acoustic telemetry. Can. J. Fish.

534

Aquat. Sci. 72, 1398-1405.

535

Fraile, I., Murua, H., Goni, N., Caballero, A., 2010. Effects of environmental factors on catch rates of

536

FAD-associated yellowfin (Thunnus albacares) and skipjack (Katsuwonus pelamis) tunas in the

537

western Indian Ocean, IOTC-2010-WPTT-46.

538 539 540 541 542 543 544 545

Freeman, E.A., Moisen, G., 2008. PresenceAbsence: An R package for presence absence analysis. Journal of Statistical Software. 23 (11): 31 p. Govinden, R., Dagorn, L., Filmalter, J., Soria, M., 2010. Behaviour of Tuna a ssociated with Drifting Fish Aggregating Devices (FADs) in the Mozambique Channel, IOTC-2010-WPTT-25. Hastie, T., Fithian, W., 2013. Inference from presence‐only data; the ongoing controversy. Ecography 36, 864-867. Held, L., Schrödle, B., Rue, H., 2010. Posterior and cross-validatory predictive checks: a comparison of MCMC and INLA. Statistical modelling and regression structures, 91-110.

22

546 547 548 549 550 551 552 553 554

Hijmans, R., Phillips, S., Leathwick, J., Elith, J., 2016. dismo: species distribution modeling. R package ver. 1.0-15. Hijmans, R., van Etten, J., Cheng, J., Mattiuzzi, M., Sumner, M., Greenberg, J., 2017. Raster: Geographic Data Analysis and Modeling. R package version 2.3–33; 2016. Hitchcock, G.L., Key, E.L., Masters, J., 2000. The fate of upwelled waters in the Great Whirl, August 1995. Deep Sea Res. Part 2 Top. Stud. Oceanogr. 47, 1605-1621. Jury, M., McClanahan, T., Maina, J., 2010. West Indian ocean variability and east African fish catch. Mar. Environ. Res. 70, 162-170. Kaplan, D.M., Chassot, E., Amandé, J.M., Dueri, S., Demarcq, H., Dagorn, L., Fonteneau, A., 2014.

555

Spatial management of Indian Ocean tropical tuna fisheries: potential and perspectives. ICES J

556

Mar Sci. 71, 1728-1749.

557

Kinas, P.G., Andrade, H.A., 2017. Introdução à análise bayesiana (com R). Consultor Editorial.

558

Kuhn, M., Johnson, K., 2013. Applied predictive modeling. Springer.

559

Lan, K.-W., Lee, M.-A., Lu, H.-J., Shieh, W.-J., Lin, W.-K., Kao, S.-C., 2011. Ocean variations

560

associated with fishing conditions for yellowfin tuna (Thunnus albacares) in the equatorial

561

Atlantic Ocean. ICES J Mar Sci. 68, 1063-1071.

562

Lan, K.-W., Shimada, T., Lee, M.-A., Su, N.-J., Chang, Y., 2017. Using Remote-Sensing

563

Environmental and Fishery Data to Map Potential Yellowfin Tuna Habitats in the Tropical

564

Pacific Ocean. Rem Sen. 9, 444.

565 566

Lee, P.-F., Chen, I., Tzeng, W.-N., 2005. Spatial and temporal distribution patterns of bigeye tuna (Thunnus obesus) in the Indian Ocean. Zool stud. 44, 260-270.

567

Lennert-Cody, C.E., Moreno, G., Restrepo, V., Román, M.H., Maunder, M.N., Poos, H.e.J.J., 2018.

568

Recent purse-seine FAD fishing strategies in the eastern Pacific Ocean: what is the appropriate

569

number of FADs at sea? ICES J Mar Sci. 75, 1748-1757.

570

Lezama-Ochoa, N., Murua, H., Chust, G., Ruiz, J., Chavance, P., de Molina, A.D., Caballero, A.,

571

Sancristobal, I., 2015. Biodiversity in the by-catch communities of the pelagic ecosystem in the

572

Western Indian Ocean. Biodivers conserv. 24, 2647-2671.

573

Lezama-Ochoa, N., Murua, H., Chust, G., Van Loon, E., Ruiz, J., Hall, M., Chavance, P., Delgado

574

De Molina, A., Villarino, E., 2016. Present and future potential habitat distribution of

575

Carcharhinus falciformis and Canthidermis maculata by-catch species in the tropical tuna purse-

576

seine fishery under climate change. Front. Mar. Sci. 3, 34.

23

577

Lindgren, F., Rue, H., Lindström, J., 2011. An explicit link between Gaussian fields and Gaussian

578

Markov random fields: the stochastic partial differential equation approach. J. R. Stat. Soc. 73,

579

423-498.

580

Lopez, J., Moreno, G., Boyra, G., Dagorn, L., 2016. A model based on data from echosounder buoys

581

to estimate biomass of fish species associated with fish aggregating devices. Fish. Bull. 114,

582

166-178.

583

Lopez, J., Moreno, G., Ibaibarriaga, L., Dagorn, L., 2017a. Diel behaviour of tuna and non-tuna

584

species at drifting fish aggregating devices (DFADs) in the Western Indian Ocean, determined

585

by fishers’ echo-sounder buoys. Mar. Biol. 164, 44.

586

Lopez, J., Moreno, G., Lennert-Cody, C., Maunder, M., Sancristobal, I., Caballero, A., Dagorn, L.,

587

2017b. Environmental preferences of tuna and non-tuna species associated with drifting fish

588

aggregating devices (DFADs) in the Atlantic Ocean, ascertained through fishers’ echo-sounder

589

buoys. Deep Sea Res. Part 2 Top. Stud. Oceanogr. 140, 127-138.

590

Lopez, J., Moreno, G., Sancristobal, I., Murua, J., 2014. Evolution and current state of the technology

591

of echo-sounder buoys used by Spanish tropical tuna purse seiners in the Atlantic, Indian and

592

Pacific Oceans. Fish. Res. 155, 127-137.

593 594 595 596 597 598 599

Lopez, J., Moreno, G., Soria, M., Cotel, P., Dagorn, L., 2010. Remote discrimination of By-cath in purse seine fishery using fishers' echo-sounder buoys. IOTC-2010-WPEB-03. Lowell, B., Mustain, P., Ortenzi, K., Warner, K., 2015. One name, one fish: why seafood names matter. Oceana, 1-12. Martínez-Minaya, J., Cameletti, M., Conesa, D., Pennino, M.G., 2018. Species distribution modeling: a statistical review with focus in spatio-temporal issues. Stoch. Env. Res. Risk A, 1-18. Moreno, G., Dagorn, L., Capello, M., Lopez, J., Filmalter, J., Forget, F., Sancristobal, I., Holland, K.,

600

2016. Fish aggregating devices (FADs) as scientific platforms. Fish. Res. 178, 122-129.

601

Moreno, G., Josse, E., Brehmer, P., Nøttestad, L., 2007. Echotrace classification and spatial

602

distribution of pelagic fish aggregations around drifting fish aggregating devices (DFAD).

603

Aquat. Living Resour. 20, 343-356.

604

Muñoz, F., Pennino, M.G., Conesa, D., López-Quílez, A., Bellido, J.M., 2013. Estimation and

605

prediction of the spatial occurrence of fish species using Bayesian latent Gaussian models. Stoch.

606

Env. Res. Risk A. 27, 1171-1180.

607

Oksanen, J., Blanchet, F.G., Kindt, R., Legendre, P., Minchin, P.R., O’hara, R., Simpson, G.L.,

608

Solymos, P., Stevens, M.H.H., Wagner, H., 2013. Package ‘vegan’. Community ecology

609

package, version 2. 24

610

Orue, B., Lopez, J., Moreno, G., Santiago, J., Boyra, G., Uranga, J., Murua, H., 2019a. From fisheries

611

to scientific data: A protocol to process information from fishers’ echo-sounder buoys. Fish. Res.

612

215, 38-43.

613

Orue, B., Lopez, J., Moreno, G., Santiago, J., Soto, M., Murua, H., 2019b. Aggregation process of

614

drifting fish aggregating devices (DFADs) in the Western Indian Ocean: Who arrives first, tuna

615

or non-tuna species? PloS one 14, e0210435.

616

Pearson, R.G., Thuiller, W., Araújo, M.B., Martinez‐Meyer, E., Brotons, L., McClean, C., Miles, L.,

617

Segurado, P., Dawson, T.P., Lees, D.C., 2006. Model‐based uncertainty in species range

618

prediction. J. Biogeogr. 33, 1704-1711.

619

Pennino, M.G., Conesa, D., López-Quílez, A., Munoz, F., Fernández, A., Bellido, J.M., 2016.

620

Fishery-dependent and-independent data lead to consistent estimations of essential habitats.

621

ICES J Mar Sci. 73, 2302-2310.

622

Pennino, M.G., Mérigot, B., Fonseca, V.P., Monni, V., Rotta, A., 2017. Habitat modeling for

623

cetacean management: Spatial distribution in the southern Pelagos Sanctuary (Mediterranean

624

Sea). Deep Sea Res. Part 2 Top. Stud. Oceanogr. 141, 203-211.

625 626

Pennino, M.G., Muñoz, F., Conesa, D., López-Quílez, A., Bellido, J.M., 2014. Bayesian spatiotemporal discard model in a demersal trawl fishery. J. Sea Res. 90, 44-53.

627

Pennino, M.G., Rufener, M.-C., Thomé-Souza, M.J., Carvalho, A.R., Lopes, P.F., Sumaila, U.R.,

628

2018. Searching for a compromise between biological and economic demands to protect

629

vulnerable habitats. Sci. Rep. 8, 7791.

630 631 632

R Development Core Team, 2017. R: A language and environment for statistical computing. R.Foundation for Statistical Computing, Vienna, Austria. URL https://www.R-project.org/. Rajapaksha, J.K., Samarakoon, L., Gunathilaka, A.A.J.K., 2013. Environmental Preferences of

633

Yellowfin Tuna in the North East Indian Ocean: An Application of Satellite Data to Longline

634

Catches. Int. J. Fish. Aquat.Stud 2(4): 72-80.

635

Robert, M., Dagorn, L., Lopez, J., Moreno, G., Deneubourg, J.-L., 2013. Does social behavior

636

influence the dynamics of aggregations formed by tropical tunas around floating objects? An

637

experimental approach. J. Exp. Mar. Biol. 440, 238-243.

638

Robinson, N.M., Nelson, W.A., Costello, M.J., Sutherland, J.E., Lundquist, C.J., 2017. A Systematic

639

Review of Marine-Based Species Distribution Models (SDMs) with Recommendations for Best

640

Practice. Front. Mar. Sci. 4, 421.

641 642

Roos, M., Held, L., 2011. Sensitivity analysis in Bayesian generalized linear mixed models for binary data. Bayesian Anal. 6, 259-278. 25

643 644 645 646 647 648 649 650 651 652 653 654 655

Roos, N.C., Carvalho, A.R., Lopes, P.F., Pennino, M.G., 2015. Modeling sensitive parrotfish (Labridae: Scarini) habitats along the Brazilian coast. Mar. Environ. Res. 110, 92-100. Rue, H., Martino, S., Chopin, N., 2009. Approximate Bayesian inference for latent Gaussian models by using integrated nested Laplace approximations. J. R. Stat. Soc. B 71, 319-392. Schoener, T.W., 1968. The Anolis lizards of Bimini: resource partitioning in a complex fauna. Ecology 49, 704-726. Schott, F.A., McCreary, J.P., 2001. The monsoon circulation of the Indian Ocean. Prog. Oceanogr. 51, 1-123. Schott, F.A., Xie, S.P., McCreary, J.P., 2009. Indian Ocean circulation and climate variability. Rev. Geophys. 47. Sequeira, A., Mellin, C., Rowat, D., Meekan, M.G., Bradshaw, C.J., 2012. Ocean‐scale prediction of whale shark distribution. Divers. Distrib. 18, 504-518. Sequeira, A.M., Mellin, C., Delean, S., Meekan, M.G., Bradshaw, C.J., 2013. Spatial and temporal

656

predictions of inter-decadal trends in Indian Ocean whale sharks. Mar. Ecol. Prog. Ser. 478, 185-

657

195.

658 659 660

Simmonds, J., MacLennan, D., 2005. Fishery acoustic theory and practice, Blackwell Scientific Publications, Oxford, UK. Song, L.M., Zhang, Y., Xu, L.X., Jiang, W.X., Wang, J.Q., 2008. Environmental preferences of

661

longlining for yellowfin tuna (Thunnus albacares) in the tropical high seas of the Indian Ocean.

662

Fish oceanogr. 17, 239-253.

663 664

Tille, Y., Matei, A., 2016. sampling: Survey Sampling. R package version 2.8. https://CRAN.Rproject.org/package=sampling.

665

Tomczak, M., Godfrey, J.S., 2013. Regional oceanography: an introduction. Elsevier.

666

Veldhuis, M.J., Kraay, G.W., Van Bleijswijk, J.D., Baars, M.A., 1997. Seasonal and spatial

667

variability in phytoplankton biomass, productivity and growth in the northwestern Indian Ocean:

668

the southwest and northeast monsoon, 1992–1993. Deep Sea Res. Part I: Oceanogr. Res. Pap. 44,

669

425-449.

670 671

Warren, D.L., Glor, R.E., Turelli, M., 2008. Environmental niche equivalency versus conservatism: quantitative approaches to niche evolution. Evolution 62, 2868-2883.

672

Watanabe, S., 2010. Asymptotic equivalence of Bayes cross validation and widely applicable

673

information criterion in singular learning theory. J. Mach. Learn. Res. 11, 3571-3594.

26

674

Wiggert, J., Murtugudde, R., Christian, J., 2006. Annual ecosystem variability in the tropical Indian

675

Ocean: Results of a coupled bio-physical ocean general circulation model. Deep Sea Res. Part 2

676

Top. Stud. Oceanogr. 53, 644-676.

677

Zagaglia, C.R., Lorenzzetti, J.A., Stech, J.L., 2004. Remote sensing data and longline catches of

678

yellowfin tuna (Thunnus albacares) in the equatorial Atlantic. Remote sens. environ. 93, 267-

679

281.

680 681 682 683

Table 1. Predictor variables used for modeling tuna occurrence using catches dependent and independent data in the Indian Ocean. Variables Variable name

Spatial

Temporal

resolution

resolution

Unit

acronym SST

Sea Surface Temperature

ºC

0.25°

Daily

SAL

Salinity

PSU

0.25°

Daily

SSH

Sea Surface Height

m

0.25°

Daily

VEL

Velocity

m s-1

0.25°

Daily

EKE

Eddy Kinetic Energy

m2s2

0.25°

Daily

HEADING

Direction

degrees

0.25°

Daily

CHL

Chlorophyll concentration

mg m-3

0.50°

Weekly

O2

Oxygen Concentration

mmol m-3

0.50°

Weekly

684 685 686 687 688 689

Table 2. Comparison of the most relevant models for the tuna B-HSMs selected using catchindependent data. Statistics acronyms are: Watanabe Akaike Information Criterion (WAIC) and Logarithmic Cross Validated Score (LCPO). Predictor acronyms are: W= spatial effect, SST = Sea Surface Temperature, SSH = Sea Surface Height, SAL = Salinity, O2 = Oxygen Concentration, EKE= Kinetic Energy, CHL= Chlorophyll. The best model is highlighted in bold. MODEL

WAIC

LCPO

b0+W+SST+SSH+EKE+Days at sea+ buoy ID

23415.36

0.53

b0+W+SST+SSH+EKE+ buoy ID

25489.22

0.55

27

b0+W+EKE+ buoy ID

25989.14

0.55

b0+W+SST+SSH+EKE+Days at sea

29475.19

0.57

b0+ CHL+SAL+SST+SSH+O2+EKE

30451.66

0.57

690 691 692 693 694 695

Table 3. Numerical summary of the marginal posterior distribution of the fixed effects for the best tuna B-HSMs selected using catch-independent data. For each variable, the mean, standard deviation, and a 95% credible central interval (Q0.025- Q0.975) is provided, containing 95% of the probability under the posterior distribution. Predictor acronyms are: W= spatial effect, B = buoy identification, SST = Sea Surface Temperature, SSH = Sea Surface Height, EKE = Eddy Kinetic Energy.

696 VARIABLE MEAN

MODEL

SD

Q0.025

Q0.975

SSH

0.17

0.02

0.14

0.20

SST

0.12

0.01

0.09

0.16

EKE

-0.17

0.02

-0.21

-0.13

Days at sea

0.03

0.01

0.02

0.03

b0 +W +SSH +SST +EKE +Days at sea+ B

697 698 699 700 701 702

Table 4. Comparison of the most relevant models for the tuna B-HSMs selected using catchdependent data. Statistics acronyms are: Watanabe Akaike Information Criterion (WAIC) and Logarithmic Cross Validated Score (LCPO). Predictor acronyms are: W= spatial effect, SST = Sea Surface Temperature, SSH = Sea Surface Height, SAL = Salinity, HEADING= Direction. The best model is highlighted in bold. MODEL

WAIC

LCPO

b0 +W +SST +SSH +SAL

8553.15

0.33

b0 +W +SST +SSH + HEADING+SAL

8599.92

0.33

b0 +W +SST +SSH

8600.34

0.34

b0+W

10214.16

0.42

b0

17878.28

0.69

703 28

704 705 706 707 708

Table 5. Numerical summary of the marginal posterior distribution of the fixed effects for the best tuna B-HSMs selected using catch-dependent data. For each variable, the mean, standard deviation, and a 95% credible central interval (Q0.025- Q0.975) is provided, containing 95% of the probability under the posterior distribution. Predictor acronyms are: W= spatial effect, SST = Sea Surface Temperature, SSH = Sea Surface Height, SAL = Salinity. MODEL

b0 +W +SST +SSH +SAL

VARIABLE

MEAN

SD

Q0.025 Q0.975

SST

0.17

0.05 0.07

SSH

-0.20

0.05 -0.30 -0.09

SAL

0.26

0.04 0.18

0.26

0.34

709 710

Table 6. Similarity statistics and spearman rank correlation coefficients obtained for each month Month Schoener's D Warren's I 0.647 0.907 January 0.624 0.894 February 0.523 0.792 March 0.512 0.731 April 0.586 0.871 May 0.655 0.917 June 0.432 0.743 July 0.586 0.874 August 0.864 September 0.558 0.662 0.902 October 0.874 November 0.601 0.671 0.911 December

Spearman rank correlation (p-value) 0.647 (<0.001) 0.624 (<0.001) 0.523 (<0.001) 0.512 (<0.001) 0.586 (<0.001) 0.645 (<0.001) 0.431 (<0.001) 0.586 (<0.001) 0.558 (<0.001) 0.662 (<0.001) 0.601 (<0.001) 0.671 (<0.001)

711

29

712 713

Fig. 1. Map of the study area (Western Indian Ocean) and spatial distribution of the tropical tuna

714

catch data of the purse-seine fisheries (green points) with a spatial resolution of 1° latitude/longitude

715

grid cell and a monthly temporal resolution.

30

716 717

Fig. 2. Characteristics of the Satlink echo-sounder buoy: Beam angle (a), depth range (h), and

718

diameter (d) at 115 m. An example of the echogram display for the 10 depth layers (ranging from 3

719

m to 115 m) (taken from Lopez et al. (2016)).

720

31

721 722 723

Fig. 3. Maps of the posterior mean of tuna occurrence probability for different months using catchindependent data.

724

32

725 726 727

Fig. 4. Maps of the posterior mean of tuna occurrence probability for different months using catchdependent data.

728 729 730

Supplementary material

731 732

Table S.1. Final covariates selected and model validation and prediction capabilities for each of the pseudo-absence creation tests. Model

Selected variables

WAIC

LCPO

Test 1

SST +SSH +SAL

8591.25 0.333

Test 2

SST +SSH +SAL

8549.14 0.331

33

Test 3

SST +SSH +SAL

8557.90 0.331

Model selected

SST +SSH +SAL

8553.15 0.330

733

734 735 736

Fig. S.1. Maps of the standard deviation of tuna occurrence probability for different months using catch-independent data.

34

737 738 739

Fig. S.2. Maps of the standard deviation of tuna occurrence probability for different months using catch-dependent data.

35

740 741 742

Fig. S.3. Maps of the posterior mean of tuna occurrence probability for different months for each of the pseudo-absence creation tests.

36

This manuscript describes an original work and is not under consideration by any other journal. All authors approved the manuscript and this submission. The authors have no conflicts of interest to declare.