Mapping the major soil-landscape resources of the Ethiopian Highlands using random forest


Geospatially explicit information on the soil-landscape resources of Ethiopia is lacking or fragmented for much of the country. Large volumes of soil data were collected recently; however, these are limited to properties related to soil fertility and are valid for the topsoil only. Understanding the country's soil-landscape resources, including their qualities and constraints beyond the topsoil, remains key to the systematic and reliable scaling up of evidence-based agricultural best practices, including soil fertility management recommendations. The objective of this study was to produce a coherent dataset of the major soil-landscape resources of 30 highland woredas (districts), contributing to the Agricultural Growth Program of the Government of Ethiopia. The study started with an exploratory survey to identify the major (most common) soils occurring across the landscapes, followed by a full survey to assess the distribution of the identified major soils. Representative soil profiles were characterized from soil pits and classified as Reference Soil Groups (RSGs), with prefix qualifiers (PQs), according to the World Reference Base for soil resources (WRB). A large number of additional soil profiles were classified from auger observations. Observed soil classes at both RSG and RSG + PQ level were combined with spatial explanatory variables (covariates) representing the soil forming factors in the landscapes, and their relationships were modeled and validated by random forest. A multitude of tree models was trained, with each profile used for calibration in approximately two thirds of the models and for cross-validation in approximately one third. Cross-validation showed that RSGs were predicted with a reasonable overall purity of 0.58 and RSGs + PQ with a purity of 0.48. The most relevant covariate in the models was the Geomorphology and Soils map of Ethiopia at 1:1 M scale, disaggregated into soil-landscape facets.
Next, the models were used to predict soil classes across the woredas, which resulted in a 250 m resolution raster map of the most probable major soils. This raster map was generalised into a polygon map of major soil-landscape resources. The purity of this final map was estimated to be 0.54 for RSGs and 0.45 for RSGs + PQ. Soil properties relevant for agricultural interpretation, such as depth, drainage, texture, pH, CEC, and organic carbon and nutrient contents, were mapped according to the RSGs depicted on the soil-landscape resources map, with an RMSE/mean ratio of 42% on average. We conclude that soil expert knowledge and conventional soil-landscape survey, combined with random forest modelling, result in an attractive hybrid approach. The approach proves cost-effective and sufficiently accurate, and can be used to inform the scaling up of evidence-based agricultural best practices. Read the full report here.
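For readers unfamiliar with the validation measures quoted above, the following is a minimal sketch and not the authors' code. It assumes that "purity" means the share of validation sites where the predicted class equals the observed class, that the RMSE/mean ratio is the root mean square error divided by the mean of the observed values, and that each tree is trained on a standard bootstrap sample (drawn with replacement, same size as the dataset), which would explain the roughly one third of models in which a profile is left out. All function names are hypothetical.

```python
import math


def oob_fraction(n_profiles: int) -> float:
    """Expected share of trees in which a given profile is NOT drawn,
    assuming each tree samples n_profiles profiles with replacement.
    Tends to 1/e (about 0.37) as n_profiles grows."""
    if n_profiles < 1:
        raise ValueError("n_profiles must be at least 1")
    return (1.0 - 1.0 / n_profiles) ** n_profiles


def overall_purity(observed, predicted) -> float:
    """Share of sites where the predicted soil class matches the observed one."""
    if len(observed) != len(predicted) or len(observed) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    hits = sum(1 for o, p in zip(observed, predicted) if o == p)
    return hits / len(observed)


def rmse_mean_ratio(observed, predicted) -> float:
    """Root mean square error divided by the mean of the observed values."""
    if len(observed) != len(predicted) or len(observed) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    n = len(observed)
    rmse = math.sqrt(sum((o - p) ** 2 for o, p in zip(observed, predicted)) / n)
    return rmse / (sum(observed) / n)
```

With made-up labels, `overall_purity(["Vertisols", "Nitisols"], ["Vertisols", "Luvisols"])` gives 0.5; a reported purity of 0.58 would, under these assumptions, mean that 58% of cross-validated profiles received the correct RSG.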

J.G.B. Leenaars a, E. Elias b, J.H.M. Wösten c, M. Ruiperez-González a, B. Kempen a

a ISRIC – World Soil Information, PO Box 353, Wageningen 6700 AJ, The Netherlands

b College of Natural and Computational Sciences, Centre for Environmental Science, Addis Ababa University, Addis Ababa, Ethiopia

c Wageningen Environmental Research, PO Box 47, Wageningen 6700 AA, The Netherlands
