Arid
DOI10.1016/j.geoderma.2014.09.019
Machine learning for predicting soil classes in three semi-arid landscapes
Brungard, Colby W.1; Boettinger, Janis L.1; Duniway, Michael C.2; Wills, Skye A.3; Edwards, Thomas C., Jr.4
通讯作者Brungard, Colby W.
来源期刊GEODERMA
ISSN0016-7061
EISSN1872-6259
出版年2015
卷号239页码:68-83
英文摘要

Mapping the spatial distribution of soil taxonomic classes is important for informing soil use and management decisions. Digital soil mapping (DSM) can quantitatively predict the spatial distribution of soil taxonomic classes. Key components of DSM are the method and the set of environmental covariates used to predict soil classes. Machine learning is a general term for a broad set of statistical modeling techniques. Many different machine learning models have been applied in the literature and there are different approaches for selecting covariates for DSM. However, there is little guidance as to which, if any, machine learning model and covariate set might be optimal for predicting soil classes across different landscapes.


Our objective was to compare multiple machine learning models and covariate sets for predicting soil taxonomic classes at three geographically distinct areas in the semi-arid western United States of America (southern New Mexico, southwestern Utah, and northeastern Wyoming). All three areas were the focus of digital soil mapping studies. Sampling sites at each study area were selected using conditioned Latin hypercube sampling (cLHS). We compared models that had been used in other DSM studies, including clustering algorithms, discriminant analysis, multinomial logistic regression, neural networks, tree based methods, and support vector machine classifiers. Tested machine learning models were divided into three groups based on model complexity: simple, moderate, and complex. We also compared environmental covariates derived from digital elevation models and Landsat imagery that were divided into three different sets: 1) covariates selected a priori by soil scientists familiar with each area and used as input into cLHS, 2) the covariates in set 1 plus 113 additional covariates, and 3) covariates selected using recursive feature elimination.


Overall, complex models were consistently more accurate than simple or moderately complex models. Random forests (RF) using covariates selected via recursive feature elimination was consistently the most accurate, or was among the most accurate, classifiers between study areas and between covariate sets within each study area. We recommend that for soil taxonomic class prediction, complex models and covariates selected by recursive feature elimination be used.


Overall classification accuracy in each study area was largely dependent upon the number of soil taxonomic classes and the frequency distribution of pedon observations between taxonomic classes. Individual subgroup class accuracy was generally dependent upon the number of soil pedon observations in each taxonomic class. The number of soil classes is related to the inherent variability of a given area. The imbalance of soil pedon observations between classes is likely related to cLHS. Imbalanced frequency distributions of soil pedon observations between classes must be addressed to improve model accuracy. Solutions include increasing the number of soil pedon observations in classes with few observations or decreasing the number of classes. Spatial predictions using the most accurate models generally agree with expected soil landscape relationships. Spatial prediction uncertainty was lowest in areas of relatively low relief for each study area. (C) 2014 Elsevier B.V. All rights reserved.


英文关键词Digital soil mapping Machine learning Recursive feature elimination Random forests Brier score
类型Article
语种英语
国家USA
收录类别SCI-E
WOS记录号WOS:000347768000007
WOS关键词SPATIAL PREDICTION ; RANDOM FORESTS ; CLASSIFICATION ; VARIABLES ; KNOWLEDGE ; MODEL ; IMAGE ; AREAS ; SELECTION ; SCALES
WOS类目Soil Science
WOS研究方向Agriculture
来源机构United States Geological Survey
资源类型期刊论文
条目标识符http://119.78.100.177/qdio/handle/2XILL650/187466
作者单位1.Utah State Univ, Dept Plants Soils & Climate, Logan, UT 84322 USA;
2.US Geol Survey, Southwest Biol Sci Ctr, Moab, UT 84532 USA;
3.Nat Resources Conservat Serv, Natl Soil Survey Ctr, USDA, Lincoln, NE 68508 USA;
4.Utah State Univ, Dept Wildland Resources, Utah Cooperat Fish & Wildlife Res Unit, US Geol Survey, Logan, UT 84322 USA
推荐引用方式
GB/T 7714
Brungard, Colby W.,Boettinger, Janis L.,Duniway, Michael C.,et al. Machine learning for predicting soil classes in three semi-arid landscapes[J]. United States Geological Survey,2015,239:68-83.
APA Brungard, Colby W.,Boettinger, Janis L.,Duniway, Michael C.,Wills, Skye A.,&Edwards, Thomas C., Jr..(2015).Machine learning for predicting soil classes in three semi-arid landscapes.GEODERMA,239,68-83.
MLA Brungard, Colby W.,et al."Machine learning for predicting soil classes in three semi-arid landscapes".GEODERMA 239(2015):68-83.
条目包含的文件
条目无相关文件。
个性服务
推荐该条目
保存到收藏夹
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Brungard, Colby W.]的文章
[Boettinger, Janis L.]的文章
[Duniway, Michael C.]的文章
百度学术
百度学术中相似的文章
[Brungard, Colby W.]的文章
[Boettinger, Janis L.]的文章
[Duniway, Michael C.]的文章
必应学术
必应学术中相似的文章
[Brungard, Colby W.]的文章
[Boettinger, Janis L.]的文章
[Duniway, Michael C.]的文章
相关权益政策
暂无数据
收藏/分享

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。