Impacts of predictor variables and species models on simulating Tamarix ramosissima distribution in Tarim Basin, northwestern China

Data collection

The presence/absence data of T. ramosissima vegetation were calculated from the newest 1:1 000 000 vegetation map of China edited by Zhang (2008) and used as the response variable in models. Digital elevation model (DEM) data and 12 climatic and 14 edaphic environmental parameters were used as predictors to establish the model.

The DEM data and soil data were obtained from Void-filled seamless SRTM data V1 (International Centre for Tropical Agriculture (CIAT), 2004, CGIAR-CSI SRTM 90m Database, http://srtm.csi.cgiar.org) and the Harmonized World Soil Database produced by IIASA (2008), respectively. Both datasets were made available by WESTDC (Environmental and Ecological Science Data Center for West China, National Natural Science Foundation of China, http://westdc.westgis.ac.cn) and were resampled to 10 × 10 km from 1 × 1 km resolution. The original soil data were produced from a series of soil maps covering the extent of China at a scale of 1:1 million based on the Second National Soil Survey of China and were transformed to a digital format by the Institute of Soil Science, Chinese Academy of Sciences, Nanjing. The edaphic factors used as predictors in this study included the content of gravel, sand, silt and clay, organic carbon, pH, Electrical Conductivity (ECE) and bulk density within topsoil (0–30 cm) and subsoil (30–100 cm). Among these environment variables, soil organic matter is the main nutrient source for plants, and soil pH and ECE exert direct physiological limitations on plants, while elevation, soil bulk density, soil gravel, sand, slit and clay content would affect the availability of nutrients, water and heat to plant.

Climatic factors used in the models were MAT, mean annual precipitation (MAP), CI (Kira cold index), WI (Kira warm index), mean temperature in the growing season from April to September (GST), mean precipitation in the growing season from April to September (GSP), PET (potential evapotranspiration from the United Nation’s Food and Agriculture Organization, Allen et al. 1998), range of annual temperature (ATR), Holdridge’s biotemperature (BT), mean temperatures in July (JulT), mean temperature in January (JanT) and the Arid index (AI, AI = PET/AMP). Among the selected climate variables, MAT, GST, ATR, BT, WI and CI reflect the heat condition and energy supply for plant growth and development; JulT and JanT reflect the extreme temperatures that plants can endure and survive; MAP, GST, PET and AI reflect the water supply and the degree of dryness tolerable for plant growth and survival. Because there are only 14 climate stations in the Tarim Basin, these indexes were calculated by interpolating data recorded at 752 standard climate stations over China with 10 × 10 km resolution employing the kriging method. The resampling and interpolation of the spatial data were processed with Arcgis9.3 software.

Statistical models

The GLM was introduced by Austin et al. (1984) to model the presence/absence data of the tree species. The GLM method provides a less restrictive form than classic multiple regressions by providing error distributions for the dependent variable other than normal and non-constant variance functions (McCullagh and Nelder 1989). If the response with a predictor variable is not linear, then a transformation can be included; polynomial terms are allowed for the simulation of skewed and bimodal responses (Guisan et al. 1999), β functions (Austin and Gaywood 1994) or hierarchical sets of models (Huisman et al. 1993). The associated shortcoming of GLM is that the nature of the relationship between species and environmental gradients has to be known a priori. Furthermore, the GLM cannot deal with complex response curves (Yee and Mitchell 1991).

CART was developed by Breiman et al. (1984). Rather than trying to identify and model a general relationship between predictor variables and responses, CART recursively partitions the multidimensional space defined by the predictor variables into zones that are as homogeneous as possible in terms of response. The tree is built by repeatedly splitting the data, defined by a simple rule based on a single explanatory variable. At each split, the data are partitioned into two exclusive groups, each of which is as homogeneous as possible (Thuiller 2003). CART is less commonly used than GLM methods but is accurate and useful to describe hierarchical interactions between species (Franklin 1998; Thuiller et al. 2003). The main drawback of CART model is that the generated models can be extremely complex and difficult to interpret when used to predict organism distributions, with more than just a handful of predictor variables or cases to classify (Muñoz and Felicísimo 2004).

Random Forests modelling is an ensemble learning technique that generates many classification trees that are aggregated, based on majority voting, to classify (Breiman 2001; Breiman et al. 1984). Bootstrap samples are drawn to construct multiple trees, each tree is grown with a randomized subset of total number of predictors and a large number of trees are grown. Observations in the original dataset that do not occur in a bootstrap sample are called out-of-bag observations (OOB) and can be used to calculate an unbiased error rate and variable importance, eliminating the need for a test set or cross-validation. The trees are grown to maximum size without pruning, and each is used to predict the OOB. The predicted class of an observation is calculated by majority vote of the OOB for that observation. Random Forests produces a limiting value of the generalization error, which means that no overfitting is possible, a very useful feature for prediction (Breiman 2001; Prasad et al. 2006).

CART, GLM and Random Forests models were established to predict the presence/absence of T.ramosissma using four datasets. The four datasets were (i) climatic variables, (ii) climatic variables, edaphic variables and the DEM, (iii) PCA axes of climatic variables, (iv) PCA axes of climatic variables, edaphic variables and the DEM. The presence/absence data of T.ramosissma were randomly divided into two groups, which were then used to split four environmental datasets into two groups, where 80% of data were used to build the model and 20% of data were used to calibrate and evaluate the model. Each model was constructed using the BIOMOD platform (Thuiller 2003; Thuiller et al. 2009). The stepwise procedure of the GLM based on Akaike's information criterion. The number of repetitions in BIOMOD was set to three. The area under the receiver operating characteristic (ROC) curve (AUC) was used to evaluate the model performance. The other options of BIOMOD were set to default. The differences among the three models was tested by one-way ANOVA using R 2.12.1 software (R Development Core Team 2010).

RESULTS

Effects of chosen environmental variables and PCA management of predictor variables on the model performance

The effects of using different environment variables on the model are illustrated in Fig. 2. In the case of the CART model, the performance was better for the dataset of only climate variables than for the dataset of climate, soil and DEM data. However, this was not the case when using PCA-based data. In the case of the GLM, the performance with climate, soil and DEM data was better than that with only climate variables, but the use of PCA-based data did not result in significantly better performance than the use of the original data. In the case of the Random Forests model, there was no significant difference in performances between only with climate variables and with climate, soil and DEM data, while the performance achieved by using the PCA-based dataset of climate, soil and DEM data was better than that using the PCA-based dataset of only climate variables. Additionally, the original datasets performed better than that of PCA-based data did. However, none of the above described different was significant.

Figure 2:

effects of the choice of the predictor variables and the PCA management of predictor variables on the model. The abbreviations in this figure are the same as in Table 1.

Table 1:

Open in new tab

average AUC for each Tamarix ramosissma model

	CART	GLM	RF
All	0.755	0.859	0.951
Cli	0.781	0.738	0.956
prcli	0.744	0.741	0.878
prall	0.690	0.874	0.929

‘cli’ is the dataset of climate variables, ‘all’ is the dataset of soil, climate and DEM variables, ‘prcli’ is the PCA-based dataset of climate variables ‘prall’ is the PCA-based dataset of soil, climate and DEM variables and ‘RF’ is the Random Forests model.

Table 1:

Open in new tab

average AUC for each Tamarix ramosissma model

	CART	GLM	RF
All	0.755	0.859	0.951
Cli	0.781	0.738	0.956
prcli	0.744	0.741	0.878
prall	0.690	0.874	0.929

‘cli’ is the dataset of climate variables, ‘all’ is the dataset of soil, climate and DEM variables, ‘prcli’ is the PCA-based dataset of climate variables ‘prall’ is the PCA-based dataset of soil, climate and DEM variables and ‘RF’ is the Random Forests model.

Best model for predicting the potential distribution of T. ramosissima

The performances of the different models are shown in Fig. 3 and the average AUC for each T. ramosissima model can be seen in Table 1. In light of AUC, the performance of Random Forests model was the best and followed by the GLM and then the CART model. When comparing different models within different datasets, the Random Forests moldel outperformed than GLM and CART with CART having the lowest performance (Table 1). The Random Forests model built by the dataset of climate variables had the highest AUC (0.956). Thus, this model was considered as the best model for predicting the potential distribution of T. ramosissima.

Figure 3:

mean AUC for different models. The abbreviations in this figure are the same as in Table 1.

Potential distribution of T. ramosissma in the Tarim Basin

The potential distribution of T. ramosissima in the Tarim Basin was predicted by the Random Forests model only with climate data (Fig. 4). The predicted result was close to the original distribution. Compared with the actual distribution of T. ramosissma from the vegetation map to the predicted distribution area from the model, for 19.6% of the predicted inhabited area, T. ramosissima was not reported in datasets. For 15% of the inhabited area reported in datasets, the model failed to predict the presence of the plant. The predicted potential distribution area of T. ramosissima was ∼3.57 × 10⁴ km². Tamarix ramosissima is distributed mainly around the borders of the Taklamakan desert and along the Tarim River, especially the northern Tarim Basin.

Figure 4:

Tamarix ramosissima desert vegetation distribution and prediction map.

DISCUSSION

The missing of key predictor variables is considered to be the main source of uncertainty (Barry and Elith 2006; Guisan and Harrell 2000). For all the models, results in the current study showed that the performance of the dataset with climate, soil and the DEM variables was better than that of the dataset with climate variables alone (Fig. 1). Previous researchers have shown that it is not wise to use too many predictor variables in model (Barry and Elith 2006; Guisan and Harrell 2000). Our results indicate that sometimes a greater number of predictor variables result in poor performance of the model (Fig. 1). More attention should be paid to the selection of predictors before establishing the model.

Tamarix ramosissima as an azonal vegetation is affected by groundwater, flood inundation and soil salt (Gries et al. 2003; Yang et al. 2004). However, data on the groundwater table are scarce. In this study, the study area spanned the Tarim Basin plain, which is flat, and the sedimentary characteristic generally is consistent because the whole basin sits on the Tarim platform. Therefore, the altitude could be considered to represent the groundwater table here. Since T. ramosissima could distribute in sand, loam habitat and even in the infertile Gobi desert and has high salt resistance, therefore, none of the soil particles’ composition, salinity and nutrient could be the main factors to limit its distribution. In addition, the resolution of soil data was low. Consequently, the edaphic factors in this study did not improve the prediction accuracy.

PCA is generally used to avoid the collinearity of correlated predictor variables (Dormann et al. 2008; Elith et al. 2011; Mellin et al. 2010; Rotenberry et al. 2006; Townsend et al. 2007) and to reduce the number of variables (Guisan et al. 1998). Our results indicate that the differences between the models constructed by PCA-based data and the original data were not significant. In PCA, each principal component reduces the remaining variance in the matrix of environmental data, and all variables contribute to all axes of PCA (Dormann et al. 2008). The outcome could differ for different models, e.g. Elith et al. (2011) argued that MaxEnt (a species distribution model technique) did not require PCA to avoid collinearity. Our results also demonstrated that whether PCA is required to reduce the effect of correlated predictor variables depends on the predictor variables used. Alternatively, the number of correlated predictors can be reduced before model processing.

Different models have different predictive powers (Austin 2007; Elith and Leathwick 2009; Elith et al. 2006; Guisan and Harrell 2000). The GLM and CART are generally considered as good techniques with high prediction power (Austin 2007; Elith et al. 2006; Guisan and Harrell 2000; Muñoz and Felicísimo 2004). Thuiller et al. (2003) pointed out that classification tree analysis is less accurate than the generalized methods, especially at finer scales. The results in this study are similar to those of Thuiller et al. (2003). Lawler et al. (2006) found that random forest consistently outperformed the GLM, generalized additive models, CART, Genetic Algorithm for Rule Set Production and Artificial Neural Networks techniques. Prasad et al. (2006) found that Random Forests models and bagging (a tree-based model-averaging approach) consistently outperformed multivariate adaptive regression splines and regression trees in predicting the distributions of four tree species. Broennimann et al. (2007) modelled the distribution of Centaurea maculosa with BIOMOD tool and found that the performance rank, from best to worst, was Random Forests, GLM and CART. Jeschke and Strayer (2008) pointed out that new techniques, e.g. Random Forests, outperform more established methods. The results of this study indicate that the prediction precision of the Random Forests model is better than that of the GLM and CART models.

The different modelling techniques applied in this study make different assumptions about the relationships between species and their environments (Guisan and Zimmermann 2000). The choice of methods always depends on the species, dataset and question. However, the newest techniques often achieve the most accurate predictions (Jeschke and Strayer 2008). The strength of Random Forests likely lies in the power derived from averaging hundreds of different models (Breiman 2001; Lawler et al. 2006). In addition to providing a method for modelling complex interactions without having to specify them a priori, tree-based models allow the relationships between the response and the predictors to vary over the domain of the study. Therefore, we recommend using Random Forests to model species distributions because of its higher predictive power.

Different models have different assumptions to suit to different species, while different species are characterized by different environmental factors. Thus, the uncertainty and the performance of different models for different species are very complex. There are only three models, one species and three sets of environmental variables in this study, which might be insufficient to completely explain the uncertainty and the performance of species distribution models. Therefore, more models, more species and more environmental variables are still needed to the comparison work, especially at a global scale.

Because of their huge area, drylands provide a huge potential to mitigate global warming through vegetation restoration, which would increase carbon sequestration (Lal 2001, 2009). In this study, the predicted potential distribution area of T. ramosissima was ∼3.57 × 10⁴ km². Annual aboveground productivity including wood and assimilation organs ranged from 1.55 to 1.74 Mg/ha (based on total ground area) or from 3.10 to 7.15 Mg/ha (in homogenous stands) for Tamarix vegetation (Gries et al. 2005). It could be inferred that the potential biomass production of T. ramosissima in the Tarim Basin is huge; therefore, there is great potential to mitigate global warming and produce bioenergy through restoration of T. ramosissima in the Tarim Basin.

CONCLUSIONS

The predictive variables for species distribution models should be chosen carefully, as the use of too many predictors might reduce the prediction power. Using PCA to reduce the correlation among predictors and enhance the accuracy of species distribution model depends on the predictor variables and the models. From the comparison of models with and without PCA-based predictors, reducing the number of correlated predictors before model processing is recommended. Among the GLM, CART and Random Forests, the best model for predicting the T. ramosissima distribution was Random Forests with climate variables. The soil variables considered in this study did not increase the predictive performance of the model. The Random Forests model was more precise than the GLM and CART models. The predicted potential distribution area of T. ramosissima was ∼3.57 × 10⁴ km² in the Tarim Basin. In order to entirely figure out the uncertainty and the performance of different models with different species, studies with more species, more models and more data are still needed.

FUNDING

National Basic Research Program of China (973 Program) (No. 2010CB951303 and No. 2009CB421106).

The author thanks Dr Guofang Liu at Institute of Botany of CAS for processing the climate data. We would also like to thank Dr Christine Verhille at the University of British Columbia and Dr Yongbo Liu at Chinese Research Academy of Environmental Sciences for their assistance with English language and grammatical editing of the manuscript. The soil and DEM dataset were provided by the Environmental and Ecological Science Data Center for West China, National Natural Science Foundation of China.

References

Abbott

I

Le Maitre

D

,

Monitoring the impact of climate change on biodiversity: the challenge of megadiverse mediterranean climate ecosystems

,

Austral Ecol

,

2010

, vol.

35

(pg.

406

-

22

)

Abideen

Z

Ansari

R

Khan

MA

,

Halophytes: potential source of ligno-cellulosic biomass for ethanol production

,

Biomass Bioenergy

,

2011

, vol.

35

(pg.

1818

-

22

)

Allen

RG

Pereira

LS

Raes

D

et al. ,

Crop Evapotranspiration—Guidelines for Computing Crop Water Requirements

,

1998

Rome, Italy

Food & Agriculture Organization of the UN

Araujo

MB

New

M

,

Ensemble forecasting of species distributions

,

Trends Ecol Evol

,

2007

, vol.

22

(pg.

42

-

7

)

Austin

M

,

Species distribution models and ecological theory: a critical assessment and some possible new approaches

,

Ecol Modell

,

2007

, vol.

200

(pg.

1

-

19

)

Austin

MP

Cunningham

RB

Fleming

PM

,

New approaches to direct gradient analysis using environmental scalars and statistical curve-fitting procedures

,

Plant Ecol

,

1984

, vol.

55

(pg.

11

-

27

)

Austin

MP

Gaywood

MJ

,

Current problems of environmental gradients and species response curves in relation to continuum theory

,

J Veg Sci

,

1994

, vol.

5

(pg.

473

-

82

)

Barry

S

Elith

J

,

Error and uncertainty in habitat models

,

J Appl Ecol

,

2006

, vol.

43

(pg.

413

-

23

)

Breiman

L

,

Random forests

,

Mach Learn

,

2001

, vol.

45

(pg.

5

-

32

)

Breiman

L

Friedman

JH

Olshen

RA

et al. ,

Classification and Regression Trees

,

1984

New York, NY

Chapman and Hall

Broennimann

O

Treier

UA

Muller-Scharer

H

et al. ,

Evidence of climatic niche shift during biological invasion

,

Ecol Lett

,

2007

, vol.

10

(pg.

701

-

9

)

Cleverly

JR

Smith

SD

Sala

A

et al. ,

Invasive capacity of Tamarix ramosissima in a Mojave Desert floodplain: the role of drought

,

Oecologia

,

1997

, vol.

111

(pg.

12

-

8

)

De'ath

G

Fabricius

KE

,

Classification and regression trees: a powerful yet simple technique for ecological data analysis

,

Ecology

,

2000

, vol.

81

(pg.

3178

-

92

)

Dormann

CF

Purschke

O

Marquez

JRG

et al. ,

Components of uncertainty in species distribution analysis: a case study of the great grey shrike

,

Ecology

,

2008

, vol.

89

(pg.

3371

-

86

)

Elith

J

Graham

CH

Anderson

RP

et al. ,

Novel methods improve prediction of species' distributions from occurrence data

,

Ecography

,

2006

, vol.

29

(pg.

129

-

51

)

Elith

J

Leathwick

JR

,

Species distribution models: ecological explanation and prediction across space and time

,

Annu Rev Ecol Evol Syst

,

2009

, vol.

40

(pg.

677

-

97

)

Elith

J

Phillips

SJ

Hastie

T

et al. ,

A statistical explanation of maxent for ecologists

,

Diver Distributions

,

2011

, vol.

17

(pg.

43

-

57

)

Eshel

A

Zilberstein

A

Alekparov

C

et al. ,

Biomass production by desert halophytes: alleviating the pressure on food production

,

In: Rosen MA, Perryman R, Dodds S, et al. (ed). Recent Advances in Energy & Environment: Proceedings of the 5th IASME/WSEAS international conference on Energy & environment (EE' 10). Stevens Point. WI: WSEAS Press,

,

2010

(pg.

362

-

7

)

Evangelista

PH

Stohlgren

TJ

Morisette

JT

et al. ,

Mapping invasive tamarisk (Tamarix) a comparison of single-scene and time-series analyses of remotely sensed data

,

Remote Sens

,

2009

, vol.

1

(pg.

519

-

33

)

Feagin

RA

,

Heterogeneity versus homogeneity: a conceptual and mathematical theory in terms of scale-invariant and scale-covariant distributions

,

Ecol Complex

,

2005

, vol.

2

(pg.

339

-

56

)

Feng

L

,

Halophytes promising for biomass energy resources in china

,

J Biotechnol

,

2008

, vol.

136

pg.

271

Ferrier

S

,

Mapping spatial pattern in biodiversity for regional conservation planning: where to from here?

,

Syst Biol

,

2002

, vol.

51

(pg.

331

-

63

)

Franklin

J

,

Predicting the distribution of shrub species in southern California from climate and terrain-derived variables

,

J Veg Sci

,

1998

, vol.

9

(pg.

733

-

48

)

Garzon

MB

Blazek

R

Neteler

M

et al. ,

Predicting habitat suitability with machine learning models: the potential area of pinus sylvestris l. in the Iberian Peninsula

,

Ecol Modell

,

2006

, vol.

197

(pg.

383

-

93

)

Gries

D

Foetzki

A

Arndt

SK

et al. ,

Production of perennial vegetation in an oasis-desert transition zone in NW china—allometric estimation, and assessment of flooding and use effects

,

Plant Ecol

,

2005

, vol.

181

(pg.

23

-

43

)

Gries

D

Zeng

F

Foetzki

A

et al. ,

Growth and water relations of Tamarix ramosissima and populus euphratica on taklamakan desert dunes in relation to depth to a permanent water table

,

Plant Cell Environ

,

2003

, vol.

26

(pg.

725

-

36

)

Guisan

A

Harrell

FE

,

Ordinal response regression models in ecology

,

J Veg Sci

,

2000

, vol.

11

(pg.

617

-

26

)

Guisan

A

Theurillat

JP

Kienast

F

,

Predicting the potential distribution of plant species in an alpine environment

,

J Veg Sci

,

1998

, vol.

9

(pg.

65

-

74

)

Guisan

A

Weiss

SB

Weiss

AD

,

GLM versus CCA spatial modeling of plant species distribution

,

Plant Ecol

,

1999

, vol.

143

(pg.

107

-

22

)

Guisan

A

Zimmermann

NE

,

Predictive habitat distribution models in ecology

,

Ecol Modell

,

2000

, vol.

135

(pg.

147

-

86

)

Hamann

A

Wang

TL

,

Potential effects of climate change on ecosystem and tree species distribution in British Columbia

,

Ecology

,

2006

, vol.

87

(pg.

2773

-

86

)

Huisman

J

Olff

H

Fresco

LMF

,

A hierarchical set of models for species response analysis

,

J Veg Sci

,

1993

, vol.

4

(pg.

37

-

46

)

Ibanez

I

Silander

JA

Wilson

AM

et al. ,

Multivariate forecasts of potential distributions of invasive plant species

,

Ecol Appl

,

2009

, vol.

19

(pg.

359

-

75

)

Jeschke

JM

Strayer

DL

,

Usefulness of bioclimatic models for studying climate change and invasive species

,

Ann N Y Acad Sci

,

2008

, vol.

1134

(pg.

1

-

24

)

Jones

CC

Acker

SA

Halpern

CB

,

Combining local- and large-scale models to predict the distributions of invasive plant species

,

Ecol Appl

,

2010

, vol.

20

(pg.

311

-

26

)

Lal

R

,

Potential of desertification control to sequester carbon and mitigate the greenhouse effect

,

Clim Change

,

2001

, vol.

51

(pg.

35

-

72

)

Lal

R

,

Sequestering carbon in soils of arid ecosystem

,

Land Degrad Dev

,

2009

, vol.

20

(pg.

441

-

54

)

Larssen

T

Hogasen

T

Cosby

BJ

,

Impact of time series data on calibration and prediction uncertainty for a deterministic hydrogeochemical model

,

Ecol Modell

,

2007

, vol.

207

(pg.

22

-

33

)

Lawler

JJ

White

D

Neilson

RP

et al. ,

Predicting climate-induced range shifts: model differences and model reliability

,

Global Change Biol

,

2006

, vol.

12

(pg.

1568

-

84

)

Li

X

Huang

Y

Gong

J

et al. ,

A study of the development of bio-energy resources and the status of eco-society in china

,

Energy

,

2009

, vol.

35

(pg.

4451

-

6

)

Liu

MT

,

Synthesis Study and Expanding Application for Plants from Genus of Tamarix L. [in Chinese]

,

1995

Lanzhou, China

Lanzhou University Press

McCullagh

P

Nelder

JA.

,

Generalized Linear Models

,

1989

London, UK: Chapman & Hall/CRC

Mckenney

DW

Pedlar

JH

Lawrence

K

et al. ,

Potential impacts of climate change on the distribution of North American trees

,

Bioscience

,

2007

, vol.

57

(pg.

939

-

48

)

Mellin

C

Bradshaw

CJA

Meekan

MG

et al. ,

Environmental and spatial predictors of species richness and abundance in coral reef fishes

,

Global Ecol Biogeogr

,

2010

, vol.

19

(pg.

212

-

22

)

Muñoz

J

Felicísimo

ÁM

,

Comparison of statistical methods commonly used in predictive modelling

,

J Veg Sci

,

2004

, vol.

15

(pg.

285

-

92

)

Pearson

RG

Dawson

TP

,

Predicting the impacts of climate change on the distribution of species: are bioclimate envelope models useful?

,

Global Ecol Biogeogr

,

2003

, vol.

12

(pg.

361

-

71

)

Peters

J

Verhoest

NEC

Samson

R

et al. ,

Uncertainty propagation in vegetation distribution models based on ensemble classifiers

,

Ecol Modell

,

2009

, vol.

220

(pg.

791

-

804

)

Phillips

DL

Marks

DG

,

Spatial uncertainty analysis: propagation of interpolation errors in spatially distributed models

,

Ecol Modell

,

1996

, vol.

91

(pg.

213

-

29

)

Prasad

AM

Iverson

LR

Liaw

A

,

Newer classification and regression tree techniques: bagging and random forests for ecological prediction

,

Ecosystems

,

2006

, vol.

9

(pg.

181

-

99

)

R Development Core Team

,

R: A Language and Environment for Statistical Computing

,

2010

Vienna, Austria

R Foundation for Statistical Computing

Randin

CF

Engler

R

Normand

S

et al. ,

Climate change and plant distribution: local models predict high-elevation persistence

,

Global Change Biol

,

2009

, vol.

15

(pg.

1557

-

69

)

Ray

N

Burgman

MA

,

Subjective uncertainties in habitat suitability maps

,

Ecol Modell

,

2006

, vol.

195

(pg.

172

-

86

)

Retuerto

R

Carballeira

A

,

Estimating plant responses to climate by direct gradient analysis and geographic distribution analysis

,

Plant Ecol

,

2004

, vol.

170

(pg.

185

-

202

)

Rotenberry

JT

Preston

KL

Knick

ST

,

Gis-based niche modeling for mapping species' habitat

,

Ecology

,

2006

, vol.

87

(pg.

1458

-

64

)

Stromberg

JC

Lite

SJ

Marler

R

et al. ,

Altered stream-flow regimes and invasive plant species: the Tamarix case

,

Global Ecol Biogeogr

,

2007

, vol.

16

(pg.

381

-

93

)

Tang

Y

Xie

J-S

Geng

S

,

Marginal land-based biomass energy production in china

,

J Integr Plant Biol

,

2010

, vol.

52

(pg.

112

-

21

)

Thuiller

W

,

Biomod—optimizing predictions of species distributions and projecting potential future shifts under global change

,

Global Change Biol

,

2003

, vol.

9

(pg.

1353

-

62

)

Thuiller

W

Araujo

MB

Lavorel

S

,

Generalized models vs. classification tree analysis: predicting spatial distributions of plant species at different scales

,

J Veg Sci

,

2003

, vol.

14

(pg.

669

-

80

)

Thuiller

W

Lafourcade

B

Engler

R

et al. ,

Biomod—a platform for ensemble forecasting of species distributions

,

Ecography

,

2009

, vol.

32

(pg.

369

-

73

)

Townsend

PA

Pape

M

Eaton

M

,

Transferability and model evaluation in ecological niche modeling: a comparison of garp and maxent

,

Ecography

,

2007

, vol.

30

(pg.

550

-

60

)

van Horssen

PW

Pebesma

EJ

Schot

PP

,

Uncertainties in spatially aggregated predictions from a logistic regression model

,

Ecol Modell

,

2002

, vol.

154

(pg.

93

-

101

)

Van Niel

KP

Austin

MP

,

Predictive vegetation modeling for conservation: impact of error propagation from digital elevation data

,

Ecol Appl

,

2007

, vol.

17

(pg.

266

-

80

)

Yang

WK

Yin

LK

Zhang

DY

et al. ,

Study on ecological types and habitat similarity of Tamarix L. in Xinjiang [in Chinese]

,

Arid Land Geogr

,

2004

, vol.

27

(pg.

186

-

92

)

Yates

CJ

Elith

J

Latimer

AM

et al. ,

Projecting climate change impacts on species distributions in megadiverse South African Cape and Southwest Australian Floristic Regions: opportunities and challenges

,

Austral Ecol

,

2010

, vol.

35

(pg.

374

-

91

)

Yee

TW

Mitchell

ND

,

Generalized additive-models in plant ecology

,

J Veg Sci

,

1991

, vol.

2

(pg.

587

-

602

)

Zhang

Q

,

Simulation the potential geographical distribution and evaluation the restoration of Tamarix vegetation in Tarim Basin [in Chinese]

,

2011

Ph.D. Thesis. Graduate University of Chinese Academy of Sciences. Beijing

Zhang

XM

Runge

M

,

Ecological Basis for Sustainable Managing the Vegetation in the Fringe of Taklimakan Desert [in Chinese]

,

2006

Beijing, China

Science Press

Zhang

XS

,

China 1:100 Million Vegetation Map [in Chinese]

,

2008

Beijing, China

Geological Publishing House of China