Benchmarking Machine Learning Models on a Dielectric Constant Database for Bandgap Prediction

Mohammad Hadi Yazdani, Paulo S. Branicio, and Ken-ichi Nomura

Volume 15, Issue 1 (March 2024), pp. 10–12

https://doi.org/10.22369/issn.2153-4136/15/1/2

PDF icon Download PDF

BibTeX
@article{jocse-15-1-2,
  author={Mohammad Hadi Yazdani and Paulo S. Branicio and Ken-ichi Nomura},
  title={Benchmarking Machine Learning Models on a Dielectric Constant Database for Bandgap Prediction},
  journal={The Journal of Computational Science Education},
  year=2024,
  month=mar,
  volume=15,
  issue=1,
  pages={10--12},
  doi={https://doi.org/10.22369/issn.2153-4136/15/1/2}
}
Copied to clipboard!

In this study, we investigate the performance of several regression models by utilizing a database of dielectric constants. First, the database is processed using the Matminer Python library to create features, and then divided into training, validation, and testing subsets. We evaluate several models: Linear Regression, Random Forest, Gradient Boosting, XGBoost, Support Vector Regression, and Feedforward Neural Network, with the objective of predicting the bandgap values. The results indicate superior performance of tree-based ensemble models over Linear Regression and Support Vector Regression. Additionally, a Feedforward Neural Network with two hidden layers demonstrates comparable proficiency in capturing the relationship between the features generated by Matminer and the bandgap target values.