DOI : 10.5281/zenodo.20536176
- Open Access

- Authors : Prof. Gade Somanath Ashok, Mr. Ugale Saurabh Sopan, Mr. Rasave Pralhad Maroti, Miss. Kale Renuka Bhausaheb, Miss. Khare Bhavana Sandip
- Paper ID : IJERTV15IS060023
- Volume & Issue : Volume 15, Issue 06 , June – 2026
- Published (First Online): 04-06-2026
- ISSN (Online) : 2278-0181
- Publisher Name : IJERT
- License:
This work is licensed under a Creative Commons Attribution 4.0 International License
Predicting Best Learning Strategies through Facial Expression Analysis
Prof. Gade Somanath Ashok (1) , Mr. Ugale Saurabh Sopan (2) , Mr. Rasave Pralhad Maroti (3), Miss. Kale Renuka Bhausaheb (4) , Miss. Khare Bhavana Sandip (5)
S.N.D. College of Engineering & Research Center Yeola, Nashik, Maharashtra, India 423401
ABSTRACT – The rapid growth of artificial intelligence in education has enabled new opportunities for enhancing student engagement and personalized learning. Traditional e-learning systems and classroom instruction often overlook the affective state of students, which plays a critical role in learning effectiveness. This research proposes an emotion-aware adaptive learning system that leverages facial emotion detection to suggest or predict the most suitable learning model for each student. Using real-time image capture, preprocessing, and deep learning-based emotion recognition, the system identifies students emotional states such as happiness, confusion, boredom, or frustration. These insights are integrated with the learners profile and past performance to recommend adaptive strategies, such as adjusting content complexity, pacing, or teaching methodology. The approach aims to improve student engagement, knowledge retention, and overall academic performance by bridging the gap between affective computing and personalized learning environments.
INTRODUCTION
Education has increasingly shifted towards digital platforms and AI-driven systems that provide customized learning experiences. However, most adaptive learning systems primarily consider cognitive factors such as knowledge levels and learning styles, while ignoring emotional states that significantly influence student performance and motivation. Emotions such as confusion, frustration, or boredom can act as barriers to learning if not addressed in real time. Facial emotion recognition has emerged as a promising method for detecting students affective states in both classroom and online settings. By analyzing facial expressions using computer vision and deep learning techniques, it is possible to capture valuable insights into student engagement. Integrating these insights into adaptive learning systems allows for personalized interventions, such as simplifying content for confused learners, providing motivational feedback for disengaged students, or accelerating pace for confident learners. This research aims to design an intelligent architecture that combines facial emotion detection, student modeling, and adaptive feedback to predict or suggest the most effective learning model for each student. Such a system has the potential to improve not only academic performance but also emotional well-being and engagement in the learning process. Facial expression recognition plays an important role in the field of education, which is mainly reflected in two
aspects: firstly, it helps teachers to assess students emotional states and attention levels in real time [1], further adjusting teaching strategies. Secondly, it supports personalized teaching; specifically, teachers can gain insights into their learning situation and needs through analyzing students facial expressions and emotional states to provide targeted teaching and counseling [2]. In recent years, facial emotion recognition (FER) has emerged as a powerful technique for understanding and monitoring learners emotional states in real-time. Facial expressions are among the most natural and reliable indicators of human emotions. Advances in computer vision, image processing, and deep learning algorithms have enabled automatic detection and classification of facial expressions from images and video streams with high accuracy. By analyzing facial features, muscle movements, and expression patterns, FER systems can identify emotions such as happiness, sadness, anger, surprise, fear, disgust, confusion, and neutrality. These emotional insights provide valuable information about a learners engagement level, concentration, and overall learning experience.
LITERATURE SURVEY
|
Sr. No. |
Author & Year |
Technique Used |
Advantages |
Limitations |
|
1 |
Mohammed Aly et al. (2025) |
ResNet-50 + CBAM + 3D CNN + AGTO Optimization |
Achieved high emotion recognition accuracy; supports real-time learner engagement monitoring; improves adaptive learning systems. |
Requires high computational resources and large training datasets. |
|
2 |
Zhi-Rui Li et al. (2025) |
Ada-DF++ with Global-Aware Spatial Attention and Squeeze-and-Excitation (SE) Attention |
Enhances recognition of subtle facial expressions; improves robustness against background noise. |
Performance decreases in highly occluded facial images and complex classroom environments. |
|
Sr. No. |
Author & Year |
Technique Used |
Advantages |
Limitations |
|
3 |
Xiaoyu Tang et al. (2024) |
Facial Expression Recognition (FER) with Emotional Engagement Analysis |
Helps evaluate students emotional engagement during science learning activities; supports personalized teaching. |
Limited generalization across different educational domains and age groups. |
EXISTING SYSTEM
Current e-learning and adaptive learning platforms primarily focus on students’ cognitive performance, such as quiz scores, assignment results, learning speed, and course completion rates. These systems recommend learning materials and instructional strategies based on academic performance and predefined learning paths. Popular Learning Management Systems (LMS) and Intelligent Tutoring Systems (ITS) use machine learning algorithms to personalize content; however, they generally overlook the emotional state of learners during the learning process.
Several recent studies have incorporated Facial Emotion Recognition (FER) techniques into educational environments to monitor student engagement and emotional responses. These systems utilize computer vision and deep learning models such as Convolutional Neural Networks (CNNs), ResNet, VGGNet, and Vision Transformers (ViTs) to classify emotions including happiness, sadness, anger, surprise, confusion boredom, and neutrality from facial expressions captured through webcams or cameras. Existing FER-based educational systems can detect student emotions and provide basic analytics to instructors regarding learner engagement. Some systems generate reports showing emotional trends during lectures, while others classify students as attentive or inattentive based on facial cues. A few advanced systems integrate emotion recognition with adaptive learning mechanisms to recommend additional learning resources or modify content difficulty levels. Despite these advancements, most existing systems operate independently and focus only on emotion detection without combining emotional analysis with student performance metrics, learning history, and behavioral patterns.
PROPOSED SYSTEM
Proposes an emotion-aware adaptive learning system that leverages facial emotion detection to suggest or predict the most suitable learning model for each student. Using real-time image capture, preprocessing, and deep learning-based
emotion recognition, the system identifies students emotional states such as happiness, confusion, boredom, or frustration. The process begins wth the student interacting with the system. The learners input could be facial expressions, gestures, or responses captured through a camera, microphone, or direct interaction with the platform. This stage collects raw data from the student. For facial emotion recognition, the primary input is a video stream or image captured using a webcam. Other possible inputs may include voice (tone, pitch) or behavioral data (e.g., mouse clicks, typing speed The detected emotions and behavioral patterns are mapped into a student profile. This profile represents both cognitive and emotional states of the learner (e.g., confused but engaged or bored and disengaged). Over time, it builds a history of the students learning patterns, challenges, and preferences.
Fig 1. System Architecture
METHODOLOGY / ALGORITHM METHODOLOGY
The proposed system integrates Facial Emotion Recognition (FER), Student Performance Analysis, and Machine Learning-based Recommendation techniques to provide personalized learning experiences. The system continuously monitors students’ facial expressions during learning sessions, identifies their emotional states, analyzes academic performance, and recommends the most suitable learning strategy or content.
Step 1: Data Acquisition
The system captures student facial images or video frames through a webcam, mobile camera, or online learning platform. Along with facial data, academic information such as quiz scores, assignment marks, attendance, and learning progress is collected.
Input Data
-
Facial images/video frames
-
Student profile information
-
Quiz and examination scores
-
Learning history
-
Attendance records
Step 2: Image Preprocessing
The captured images are preprocessed to improve quality and enhance facial feature extraction.
Preprocessing Operations
-
Image resizing
-
Noise removal
-
Normalization
-
Face detection using OpenCV
-
Facial region extraction
Step 3: Facial Emotion Recognition
A Deep Learning model (CNN/ResNet/Vision Transformer) analyzes facial expressions and classifies emotions.
Detected Emotions
-
Happy
-
Sad
-
Angry
-
Fear
-
Surprise
-
Neutral
-
Confused
-
Bored
The emotion with the highest confidence score is selected as the student’s current emotional state.
Step 4: Student Performance Analysis
The system evaluates student academic performance using collected learning data.
Performance Parameters
-
Quiz Scores
-
Assignment Scores
-
Attendance Percentage
-
Learning Completion Rate
-
Previous Academic Records
-
Students are categorized into:
-
High Performer
-
Average Performer
-
Low Performer
Step 5: Student Modeling
The emotional state and academic performance are combined to create a dynamic student profile.
Student Model Attributes
-
Current Emotion
-
Engagement Level
-
Learning Speed
-
Academic Performance
-
Historical Learning Behavior
Step 6: Learning Recommendation Engine
A Machine Learning classifier analyzes the student profile and recommends the most suitable learning strategy.
Recommendation Examples
Emotion
Performance
Recommended Action
Confused
Low
Provide simplified content and video tutorials
Bored
High
Increase difficulty level and advanced exercises
Neutral
Average
Continue standard learning path
Frustrated
Low
Provide additional examples and personalized support
Happy
High
Accelerate learning pace
Step 7: Adaptive Feedback Generation
The system dynamically adjusts learning content according to recommendations.
Feedback Types
-
Personalized learning materials
-
Motivational messages
-
Additional practice exercises
-
Video tutorials
-
Interactive quizzes
Step 8: Continuous Monitoring
The system continuously monitors student emotions and performance throughout the learning session and updates recommendations in real time.
ALGORITHM
Algorithm: Emotion-Based Adaptive Learning Recommendation
Input:
-
Student Facial Image (I)
-
Academic Performance Data (P)
Output:
-
Personalized Learning Recommendation (R)
Steps:
-
Start
-
Capture facial image/video frame.
-
Preprocess image.
-
Detect face using OpenCV Face Detector.
-
Extract facial features.
-
Apply CNN/ResNet model for emotion classification.
-
Identify dominant emotion (E).
-
Retrieve student performance data (P).
-
Analyze academic performance.
-
Generate student profile S = {E, P}.
-
Apply Recommendation Model.
-
Determine optimal learning strategy (R).
-
Display personalized recommendation.
-
Update learning content dynamically.
-
Repeat steps 214 until session ends.
-
Stop.
Mathematical Representation
Student Profile:
S = {E, P, H}
Where:
-
E = Emotion State
-
P = Academic Performance
-
H = Learning History Recommendation Function:
R = f(E, P, H)
Where:
-
R = Recommended Learning Model
-
f = Machine Learning Decision Function
-
Technologies Used
-
OpenCV Face Detection
-
CNN / ResNet50 / Vision Transformer (ViT) Emotion Recognition
-
TensorFlow / PyTorch Deep Learning Framework
-
Python Backend Processing
-
Flask/Django Web API
-
MySQL Student Data Storage
-
React/Android User Interface
The proposed methodology enables real-time emotion-aware adaptive learning by combining affective computing and educational data analytics to improve student engagement, learning efficiency, and academic performance.
RESULTS AND DISCUSSION
Loin
This image shows a Login Page for a multi-role educational management system where Students, Teachers, and
Administrators can access their respective dashboards using a single authentication interface
Fig 1.Login Page
Admin Dashboard
The Fig shows the Admin Dashboard of an Adaptive Learning Platform. This dashboard enables administrators to monitor users, courses, examinations, and emotion-based learning analytics in a centralized interface.
Fig 2 Admin Dashboard
Monitor platform performance, Track learning analytics. Manage educational resources.
Teacher Dashboard
The image shows the Teacher Dashboard of the Adaptive Learning Platform. This dashboard allows teachers to manage courses, monitor student engagement, track emotion-based learning analytics, and provide personalized academic support.
Fig 3.Teacher Dashboard
Student Dashboard
The fig shows the Student Dashboard (Student Learning Hub) of the Adaptive Learning Platform. This dashboard provides students with access to courses, live lectures, study materials, examinations, and personalized learning resources.
Fig.4.Student Dashboard
Student Dashboard acts as an intelligent learning portal that integrates:
-
Course Management
-
Live Learning
-
Study Materials
-
Examination System
-
Emotion Recognition
-
Adaptive Learning Analytics
Face Emotion Detection
The image shows the Live Lecture Module of the Adaptive Learning Platform, where students attend online lectures
while the system continuously monitors their emotions and provides adaptive learning support in real time.
Fig 5.Emotion Detection
APPLICATIONS:
-
Online Learning Platforms
The system can be integrated into e-learning platforms to monitor students’ emotions during virtual classes. Based on detected emotions such as confusion, boredom, or frustration, the system can recommend personalized learning materials and adaptive interventions.
-
Learning Management Systems (LMS)
The proposed system can be integrated with Learning Management Systems such as Moodle, Blackboard, and Google Classroom to provide emotion-based learning analytics and recommendations.
-
Distance Education Programs
Students learning remotely often face challenges in maintaining attention and motivation. The system helps identify emotional difficulties and provides timely support.
ADVANTAGES:
Real-Time Intervention
Actionable: Instantly adjusts content to prevent student frustration or confusion.
Deep Personalization
Tailored: Customizes the learning path based on the student’s emotional state.
Enhanced Outcomes
Focus-Driven: Maximizes retention by sustaining the student’s optimal “Flow State.
Objective Engagement Data
Measurable: Provides quantifiable, non-verbal metrics for educators to improve lessons.
Simple & Scalable
Practical: Integrates easily and affordably using standard webcam technology.
CONCLUSION
We propose a student facial expression recognition model that integrates multi scale feature fusion with fine-grained attention enhancement. The model captures facial expression information at different scales and fuses them through a multi scale dual-pooling aggregation module, enabling a more comprehensive and robust representation of facial features. To further refine this process, a key region-oriented attention mechanism is introduced, allowing the system to focus on subtle variations in facial expressions. Unlike traditional window-based cross-attention mechanisms, this approach dynamically identifies and emphasizes critical regions within the image without being restricted to a fixed window, thereby enhancing the overall expressiveness and accuracy of facial feature representation. The proposed student facial expression recognition model effectively integrates multi-scale feature fusion with fine-grained attention enhancement to achieve a more comprehensive and discriminative representation of facial features. By employing a multi-scale dual-pooling aggregation module, the model captures expression details across multiple spatial resolutions, ensuring robustness against variations in lighting, pose, and occlusion. Additionally, the introduction of a key region-oriented attention mechanism enables the system to dynamically identify and emphasize critical facial regions without relying on fixed window constraints, thus improving the models sensitivity to subtle emotional variations. Experimental analysis demonstrates that these enhancements contribute to more accurate and expressive feature representations. This capability provides a reliable foundation
for predicting optimal learning strategies through facial recognition analysis. By correlating emotion-driven insights such as attentiveness, confusion, and motivation with learning outcomes, the proposed framework facilitates adaptive, personalized learning experiences.
REFERENCES
-
Goldberg, P.; Sumer, ¨ O.; St ¨ urmer, K.; Wagner, W.; G ¨ ollner, R.; Gerjets, P.; Kasneci, ¨ E.; Trautwein, U. Attentive or not? Toward a machine learning approach to assessing students visible engagement in classroom instruction. Educ. Psychol. Rev. 2021, 33, 2749. [Google Scholar] [CrossRef]
-
Munna, A.S.; Kalam, M.A. Teaching and learning process to enhance teaching effectiveness: A literature review. Int. J. Humanit. Innov. (IJHI) 2021, 4, 14. [Google Scholar] [CrossRef]
-
Castiblanco Jimenez, I.A.; Olivetti, E.C.; Vezzetti, E.; Moos, S.; Celeghin, A.; Marcolin, F. Effective affective EEG-based indicators in emotion-evoking VR environments: An evidence from machine learning. Neural Comput. Appl. 2024. [Google Scholar] [CrossRef]
-
Mar´n-Morales, J.; Higuera-Trujillo, J.L.; Greco, A.; Guixeres, J.; Llinares, C.; Scilingo, E.P.; Alcaniz, M.; Valenza, G. Affective computing in virtual reality: Emotion recognition from brain and heartbeat dynamics using wearable sensors. Sci. Rep. 2018, 8, 13657. [Google Scholar] [CrossRef] [PubMed]
-
Saurav, S.; Saini, R.; Singh, S. Fast facial expression recognition using boosted histogram of oriented gradient (BHOG) features. Pattern Anal. Appl. 2023, 26, 381402. [Google Scholar] [CrossRef]
-
Shi, Y.; Lv, Z.; Bi, N.; Zhang, C. An improved SIFT algorithm for robust emotion recognition under various face poses and illuminations. Neural Comput. Appl. 2020, 32, 92679281. [Google Scholar] [CrossRef]
-
Niu, B.; Gao, Z.; Guo, B. Facial expression recognition with LBP and ORB features. Comput. Intell. Neurosci. 2021, 2021, 8828245. [Google Scholar] [CrossRef]
-
Lakshmi, D.; Ponnusamy, R. Facial emotion recognition using modified HOG and LBP features with deep stacked autoencoders. Microprocess. Microsyst. 2021, 82, 103834. [Google Scholar] [CrossRef]
-
Goldberg, P.; Sumer, ¨ O.; St ¨ urmer, K.; Wagner, W.; G ¨ ollner, R.; Gerjets, P.; Kasneci, ¨ E.; Trautwein, U. Attentive or not? Toward a machine learning approach to assessing students visible engagement in classroom instruction. Educ. Psychol. Rev. 2021, 33, 2749. [Google Scholar] [CrossRef]
-
Munna, A.S.; Kalam, M.A. Teaching and learning process to enhance teaching effectiveness: A literature review. Int. J. Humanit. Innov. (IJHI) 2021, 4, 14. [Google Scholar] [CrossRef]
-
Castiblanco Jimenez, I.A.; Olivetti, E.C.; Vezzetti, E.; Moos, S; Celeghin, A.; Marcolin, F. Effective affective EEG-based indicators in emotion-evoking VR environments: An evidence from machine learning. Neural Comput. Appl. 2024. [Google Scholar] [CrossRef
-
Mar´n-Morales, J.; Higuera-Trujillo, J.L.; Greco, A.; Guixeres, J.; Llinares, C.; Scilingo, E.P.; Alcaniz, M.; Valenza, G. Affective computing in virtual reality: Emotion recognition from brain and heartbeat dynamics using wearable sensors. Sci. Rep. 2018, 8, 13657. [Google Scholar] [CrossRef] [PubMed]
