ePrints@IIScePrints@IISc Home | About | Browse | Latest Additions | Advanced Search | Contact | Help

Using carbonate absorbance peak to select the most suitable regression model before predicting soil inorganic carbon concentration by mid-infrared reflectance spectroscopy

Gomez, C and Chevallier, T and Moulin, P and Arrouays, D and Barthès, BG (2022) Using carbonate absorbance peak to select the most suitable regression model before predicting soil inorganic carbon concentration by mid-infrared reflectance spectroscopy. In: Geoderma, 405 .

[img] PDF
geo_405_2022.pdf - Published Version
Restricted to Registered users only

Download (2MB) | Request a copy
Official URL: https://doi.org/10.1016/j.geoderma.2021.115403

Abstract

Mid-Infrared reflectance spectroscopy (MIRS, 4000�400 cm�1) is being considered to provide accurate estimations of soil inorganic carbon (SIC) contents, based on prediction models when the test dataset is well represented by the calibration set, with similar SIC range and distribution and pedological context. This work addresses the case where the test dataset, here originating from France, is poorly represented by the calibration set, here originating from Tunisia, with different SIC distributions and pedological contexts. It aimed to demonstrate the usefulness of 1) classifying test samples according to SIC level based on the height of the carbonate absorbance peak at 2510 cm�1, and then 2) selecting a suitable prediction model according to SIC level. Two regression methods were tested: Linear Regression using the height of the carbonate peak at 2510 cm�1, called Peak-LR model; and Partial Least Squares Regression using the entire MIR spectrum, called Full-PLSR model. First, our results showed that Full-PLSR was 1) more accurate than Peak-LR on the Tunisian validation set (R2val = 0.99 vs. 0.86 and RMSEval = 3.0 vs. 9.7 g kg�1, respectively), but 2) less accurate than Peak-LR when applied on the French dataset (R2test = 0.70 vs. 0.91 and RMSEtest = 13.7 vs. 4.9 g kg�1, respectively). Secondly, on the French dataset, predictions on SIC-poor samples tended to be more accurate using Peak-LR, while predictions on SIC-rich samples tended to be more accurate using Full-PLSR. Thirdly, the height of the carbonate absorbance peak at 2510 cm�1 might be used to discriminate SIC-poor and SIC-rich test samples (<5 vs. > 5 g kg�1): when this height was > 0, Full-PLSR was applied; otherwise Peak-LR was applied. Coupling Peak-LR and Full-PLSR models depending on the carbonate peak yielded the best predictions on the French dataset (R2test = 0.95 and RMSEtest = 3.7 g kg�1). This study underlined the interest of using a carbonate peak to select suitable regression approach for predicting SIC content in a database with different distribution than the calibration database. © 2021 Elsevier B.V.

Item Type: Journal Article
Publication: Geoderma
Publisher: Elsevier B.V.
Additional Information: The copyright for this article belongs to Elsevier B.V.
Keywords: Calibration; Carbonation; Classification (of information); Forecasting; Least squares approximations; Linear regression; Reflection; Soils; Spectroscopy; Statistical tests, Absorbance peak; Carbon content; Infrared reflectance spectroscopy; Mid-infrared reflectance spectroscopy; Mid-infrared reflectances; National dataset; Partial least square regression; Prediction model; Soil inorganic carbons; Test samples, Infrared devices
Department/Centre: Division of Mechanical Sciences > Civil Engineering
Date Deposited: 01 Dec 2021 14:43
Last Modified: 01 Dec 2021 14:43
URI: http://eprints.iisc.ac.in/id/eprint/70051

Actions (login required)

View Item View Item