Data-Driven Analysis of Senior High Students’ Sentiments and Strand Selection Using Machine Learning

Authors

  • James E. Rosales Department of Education
  • Cristopher C. Abalorio Caraga State University - Main Campus, Butuan City, Philippines

Keywords:

Cross-Validation, Senior High Student, Sentiment, Support Vector Machine

Abstract

Senior high school students’ sentiments play a vital role in shaping educational strategies that promote mental health, well-being, and learner protection as mandated by the Department of Education’s child-friendly school system. This study investigates the relationship between students’ sentiments and their chosen academic strand using machine learning algorithms. A total of 580 students participated, providing feedback through a guided support session conducted after the first quarter of the school year. The sentiments that were collected, which are code-mixing data, were preprocessed and vectorized using Term Frequency–Inverse Document Frequency (TF-IDF). A Support Vector Machine (SVM) classifier with four-fold cross-validation was employed to categorize sentiments into positive, neutral, and negative classes. The model achieved an average accuracy of 72.59% with a standard deviation of 3.91%, indicating a statistically significant relationship between students’ sentiments and their chosen strands. Findings suggest the need for a dedicated technical working group to monitor and interpret sentiment trends per strand, enabling timely interventions. Future research is encouraged to explore alternative machine learning algorithms, such as Multinomial Naive Bayes, Logistic Regression, and Gradient Boosting (LightGBM), as well as advanced techniques like semantic embeddings and sarcasm detection, to improve classification accuracy. The results of this study contribute to data-driven decision-making and provide actionable insights for educational policy frameworks aimed at enhancing student welfare and learning outcomes.

References

Madasu, A., & Sivasankar, E. (2020). Efficient feature selection techniques for sentiment analysis. arXiv preprint arXiv:1911.00288. https://arxiv.org/abs/1911.00288

Ahmad, S. R., Yusop, N. M. M., Asri, A. M., & Amran, M. F. M. (2021). A review of feature selection algorithms in sentiment analysis for drug reviews. International Journal of Advanced Computer Science and Applications, 12(12). https://doi.org/10.14569/IJACSA.2021.0121217

Mujawar, S. S. ., & Bhaladhare, P. R. . (2023). Effective Feature Selection Methods for User Sentiment Analysis using Machine Learning. International Journal on Recent and Innovation Trends in Computing and Communication, 11(3s), 37–45. https://doi.org/10.17762/ijritcc.v11i3s.6153

Sharma, A., & Dey, S. (2012). A comparative study of feature selection and machine learning techniques for sentiment analysis. In Proceedings of the 2012 ACM Research in Applied Computation Symposium (RACS ’12) (pp. 1–7). Association for Computing Machinery. https://doi.org/10.1145/2401603.2401605

Pooja, Bhalla R. A Review Paper on the Role of Sentiment Analysis in Quality Education. SN Comput Sci. 2022;3(6):469. doi: 10.1007/s42979-022-01366-9. Epub 2022 Sep 9. PMID: 36106178; PMCID: PMC9462624.

Lee, K. M., & Kim, K. (2025). An analysis of educational satisfaction using sentiment analysis: correlation analysis between Likert-scale evaluation and descriptive feedback. Human Resource Development International, 1–25. https://doi.org/10.1080/13678868.2025.2563353

Baragash, R. S., Aldowah, H., & Umar, I. N. (2022). Students’ perceptions of e-learning in Malaysian universities: Sentiment analysis based machine learning approach. Journal of Information Technology Education: Research, 21, 439-463. https://doi.org/10.28945/5024

Saraswathi, N., Sasi Rooba, T., & Chakaravarthi, S. (2023). Improving the accuracy of sentiment analysis using a linguistic rule-based feature selection method in tourism reviews. Measurement: Sensors, 29, 100888. https://doi.org/10.1016/j.measen.2023.100888

P. H. Prastyo, I. Ardiyanto and R. Hidayat, "A Review of Feature Selection Techniques in Sentiment Analysis Using Filter, Wrapper, or Hybrid Methods," 2020 6th International Conference on Science and Technology (ICST), Yogyakarta, Indonesia, 2020, pp. 1-6, doi: 10.1109/ICST50505.2020.9732885.

Mehrabi, N., Morstatter, F., Saxena, N., Lerman, K., & Galstyan, A. (2021). A survey on bias and fairness in machine learning. ACM computing surveys (CSUR), 54(6), 1-35.

Downloads

Published

2026-01-02

How to Cite

Rosales, J. E., & Abalorio, C. C. (2026). Data-Driven Analysis of Senior High Students’ Sentiments and Strand Selection Using Machine Learning. Advances in Engineering and Information Sciences, 1(2). Retrieved from https://journals.carsu.edu.ph/AEIS/article/view/205