Predicting the percentage of student placement: A comparative study of machine learning algorithms

Creative Commons License

Çakıt E., Dağdeviren M.

EDUCATION AND INFORMATION TECHNOLOGIES, vol.27, no.1, pp.997-1022, 2022 (SSCI)

  • Publication Type: Article / Article
  • Volume: 27 Issue: 1
  • Publication Date: 2022
  • Doi Number: 10.1007/s10639-021-10655-4
  • Journal Indexes: Social Sciences Citation Index (SSCI), Emerging Sources Citation Index (ESCI), Scopus, EBSCO Education Source, Educational research abstracts (ERA), ERIC (Education Resources Information Center), INSPEC
  • Page Numbers: pp.997-1022
  • Gazi University Affiliated: Yes


In recent years, there has been an increase in the demand for higher education in Turkey, where the demand, as in most other countries, exceeds what is available. The main purpose of this research is to develop machine learning algorithms for predicting the percentage of student placement based on the data related to the university’s academic reputation, opportunities of the city where the university is located, facilities and cultural opportunities of the university. When the model accuracy was evaluated on the basis of performance metrics, the Extreme Gradient Boosting (XGBoost) algorithm showed greater predictive accuracy than other machine learning approaches. A sensitivity analysis was performed using the extreme gradient boosting machines algorithm to identify the degree to which the input variables contribute to the determination of the output variable. Five input variables, namely the percentage of student placement at year t-1, the university scientific document score, university Phd programme score, university faculty member/student score, and the percentage of student placement at year t-2 were found to be the most effective parameters. Prediction and sensitivity analysis results obtained in this study can be used in many different ways, such as determining the quotas for universities, allocating resources, and making new regulations.