Ijertv13n10 46withibthal-0.5
Ijertv13n10 46withibthal-0.5
Ijertv13n10 46withibthal-0.5
net/publication/345197059
CITATIONS READS
5 520
2 authors:
All content following this page was uploaded by Mohammed Abdullah Al-Hagery on 02 November 2020.
2895
International Journal of Engineering Research and Technology. ISSN 0974-3154, Volume 13, Number 10 (2020), pp. 2895-2908
© International Research Publication House. http://www.irphouse.com
The motivation for doing this research is to help the college of One of the common methods that can be employed in this field
CBE to find useful solutions that help in achieving quality in is the decision tree, using the decision tree model as a classifier
the educational process. In addition to searching for the reasons or predictor for students' academic data can help to analyse the
that led to the level weakness of some students and their low data and to study student performance and the discovery of their
academic achievement, or searching for outstanding students in achievements [8], [26].
its various departments to benefit from their experiences in
Besides, applying the Data Mining Tools can constitute a
achieving high academic performance.
practical guide for decision-makers and teachers in higher
2. RELATED WORK education institutions, to identify hidden problems related to
student success and failure [27]. Furthermore, the classification
Research in EDM is an interesting domain for academics and
techniques are useful to predict a student's career [28].
researchers, especially in educational institutions. The research
in this area generates useful knowledge related to students, The use of association rules algorithms can be extensively used
instructors, courses and the educational management system, as in studies related to EDM alongside the other algorithms. The
a whole. Since the knowledge from data collected in benefit of association rules extraction is to find frequent
educational systems is a veritable gold mine, it is important to patterns in databases and to explore the relationship between
make accurate decisions in achieving the requirements for this the various attributes that affect the academic achievement of
work, as it helps raise the educational process, in addition to students [29], [30]. Furthermore, revelation the useful
increasing the quality of the educational institution and information from behavioral data for students by using
reducing failure. association rules. Additionally, by the association rules, we can
obtain frequent patterns of behaviors that have a significant
Data mining can be used in the area of education for a better
impact on student performance and students' Failures cases can
understanding of the learning process and acquiring practical
be identified. This may help educational institutions understand
knowledge. This, in turn, helps identify problems facing
and improve students' behavior and also make the appropriate
students and reduce failure in academic performance [9]. Data
decisions, besides, the use of the association rule method that
mining in the educational area is called Educational Data
offers insight into improving admissions planning [14], [28],
Mining (EDM). It has contributed significantly to the
[31].
measurement of student academic performance and preventing
dropouts, and to better understanding failure [7]. The EDM is a In this paper, we selected the most significant tools to analysis
research field that assists in discovering ways to enhance the students' historical dataset from the CBE to identify aspects of
quality of education [10], [11]. It is a computer-based learning student failure, success and predict their academic performance
method that helps discover new patterns of data sets in using these technologies, which include classification and
educational institutions and represents one particular field of regression, Outlier Analysis. Where Outlier Analysis are
data mining [8]. representing the anomalies cases. Also, the use of technologies
that help discover students' achievements and find out the
EDM includes various sets of users or members, including the
reasons behind some students' failure by using association rules.
educational institution's administrators, teaching staff, students,
Also, this paper contributes to the search for anomalies
curriculum developers, and planners [10], [12]. Since 1993,
detection that may be distinct cases of the college that help in
many research works have employed EDM, with the number of
making the appropriate decisions in the interest of those
these studies growing significantly since then [13], [14]. A
students.
research works focused on extracting knowledge from student
data, predicting performance, evaluating student performance
in specific courses or finding an association between courses
3. METHODOLOGY
using various data mining techniques [15].
The proposed method uses several various techniques to focus
Some related works have obtained their data from the learning
on student performance analysis of the CBE. The overall
management system (LMS) known as kalboard 360 [16]–[18].
architecture of the proposed method is shown in Figure 1. In
Whereas many studies relied on the analysis of real data from
this study, we used the Orange data mining platform as open-
different environments of institutions, such as colleges,
source software for data mining and machine learning [32]. The
universities, or schools using common classification methods,
data mining techniques include Linear Regression, Association
like collected data sets from the College of Computer
Rules, Decision Tree, Naive Bayes, and Random Forest. The
Applications in India, also, from the National Defense
classification and regression techniques were used to predict
University in Malaysia. Some of the datasets were not enough
students' performance. Whereas, the association rules
[19]–[23].
technique was used for detecting frequents items among
Additionally, some of the previous works utilised limited students' records; to understand the reasons for their failure.
methods such as the classification and regression methods in
their study [24], [25].
2896
International Journal of Engineering Research and Technology. ISSN 0974-3154, Volume 13, Number 10 (2020), pp. 2895-2908
© International Research Publication House. http://www.irphouse.com
2897
International Journal of Engineering Research and Technology. ISSN 0974-3154, Volume 13, Number 10 (2020), pp. 2895-2908
© International Research Publication House. http://www.irphouse.com
# CONFIRMED_MARK Class
1 >=60 P (Pass)
2 <60 F (Fail)
2898
International Journal of Engineering Research and Technology. ISSN 0974-3154, Volume 13, Number 10 (2020), pp. 2895-2908
© International Research Publication House. http://www.irphouse.com
performance of models. In this study, our focus is on when the generated rules with values of confidence and support
Classification Accuracy (CA), Precision, F1-score and Recall, for itemsets is lower than the predefined minimum value, these
as seen in Equations (1) to (4) [2]. itemsets are not accepted as a frequent itemset; consequently,
𝑇𝑃+𝑇𝑁 the generated rules will be rejected [30], [41], [42]. The
CA= (1) Equation of support and confidence measures are given in
𝑇𝑃+𝐹𝑃+𝐹𝑁+𝑇𝑁
Equation (9) and Equation (10), respectively. A and B are
𝑇𝑃
Precision= (2) frequent itemsets, P is the probability [2].
𝑇𝑃+𝐹𝑃
𝑓𝑟𝑒𝑞(𝐴,𝐵)
F1-score =
2× 𝑃𝑟𝑒𝑐𝑖𝑠𝑖𝑜𝑛 × 𝑅𝑒𝑐𝑎𝑙𝑙
(3) Support(𝐴 ⟹ 𝐵) = 𝑃(𝐴 ∪ 𝐵) = (9)
𝑁
𝑃𝑟𝑒𝑐𝑖𝑠𝑖𝑜𝑛+𝑅𝑒𝑐𝑎𝑙𝑙
𝑇𝑃 𝑓𝑟𝑒𝑞(𝐴,𝐵)
Recall=
𝑇𝑃+𝐹𝑁
(4) Confidence (𝐴 ⟹ 𝐵) = 𝑃(𝐴|𝐵) = (10)
𝑓𝑟𝑒𝑞(𝐴)
Linear Regression (LR) is a predictive model used to predict 4.1 The General Analysis of Student Performance
the value of the dependent variable (y) based on the value of Students' performance was analysed through the Orange
the independent variable (x) [10], [38]. LR can produce platform. We used the Distribution widget to shows the values
accurate predictions and is considered one of the easiest for Class_GPA based on five years of study. We compared the
techniques to apply. In the LR model, the two-dimensional data performance of students over five years to determine the
is represented as dots falling into a straight line, where the X- possibility of failure and excellence. Table 5 shows the
axis is the predictor and the Y-axis is the target [39]. The probability of failure and excellence for five years and the total
performance of the regression model is evaluated based on four number of student records in each semester. It also displays the
of the most popular metrics: Mean Square Error (MSE), Root percentage of students who excel and fail.
Mean Square Error (RMSE), Mean Absolute Error (MAE), and
coefficient of determination (R-squared) [40]. The MSE, In the next part, we compared the failure and excellence rates,
RMSE, MAE, and R-squared are presented below, from where the records were divided into ten semesters. The goal
Equation (5) to Equation (8). Where n is the total number of was to search for the semester that comprised large numbers of
observation/ rows, yi represents the actual values, ŷi represents students failing and excelling. Table 6 focusses on students
predicted/ estimated values, y̅i is the mean of the actual yi who excel and fail and compares this to the rates of excellence
values and the i value ranging from 1 to n. and failure throughout the ten semesters.
𝑛
1 2
Students' GPA was analysed based on Major_Name, to know
𝑀𝑆𝐸 = ∑(𝑦𝑖 − 𝑦̂)
𝑖 (5) which majors include the most significant number of excellent
𝑛
𝑖=1
and failed students. Table 7 shows data on excellent and failed
students based on Majors. Since the number of students
∑𝑖(𝑦𝑖 − 𝑦̂)
𝑖
2
influences the failure or excellence rate, the total number of
𝑅𝑀𝑆𝐸 = √ (6)
𝑛 students in each major is calculated, as shown in the following
Table 7.
𝑛
1
𝑀𝐴𝐸 = ∑ |𝑦𝑖 − 𝑦̂|
𝑖 (7) In the next part, the data will be analysed based on gender, to
𝑛
𝑖=1 identify which gender more often fails to achieve a high CGPA.
Table 8 presents data on failed and excellent students based on
∑𝑛𝑖=1(𝑦𝑖 − 𝑦̂)
𝑖
2
𝑅2 = 1 − 𝑛 (8) gender, where the number of failed male students' records was
∑𝑖=1(𝑦𝑖 − 𝑦̅)𝑖
2
2,289, whereas the number of failed female students' records
was 798. The table also shows the probability of failure and
excellence in student records. From this table, it became clear
(3) Association Rules that female students earned higher percentages of distinction.
2899
International Journal of Engineering Research and Technology. ISSN 0974-3154, Volume 13, Number 10 (2020), pp. 2895-2908
© International Research Publication House. http://www.irphouse.com
2900
International Journal of Engineering Research and Technology. ISSN 0974-3154, Volume 13, Number 10 (2020), pp. 2895-2908
© International Research Publication House. http://www.irphouse.com
Female 27177 3380 798 85.40% 25.85% 0.124 0.004 0.029 0.002
Male 25230 578 2289 14.60% 74.15% 0.023 0.002 0.091 0.004
The data set was divided, with 75% training data and 25% test
data, with Class_GPA as the target variable. Table 9 presents
the evaluation results of DT, RF, and NB. The table contains
the results of CA, F1-score, Precision, and Recall. Figure 4
shows the results of the evaluation of the three models.
2901
International Journal of Engineering Research and Technology. ISSN 0974-3154, Volume 13, Number 10 (2020), pp. 2895-2908
© International Research Publication House. http://www.irphouse.com
MAJOR_NA
Class_GPA=
9 ME= Pre- 4.4% 74.9%
Fail
Major
MAJOR_NA
Class_GPA=
10 ME= Pre- 3.4% 31.5%
Acceptable
Major
MAJOR_NA
Class_GPA=
11 ME= 2.4% 36.3%
Very_Good
Economics
MAJOR_NA
Class_GPA=
12 ME= 2.3% 30.4%
Excellent
Accounting
MAJOR_NA
Class_GPA=
13 ME= 2.2% 32.8%
Good
Economics
2902
International Journal of Engineering Research and Technology. ISSN 0974-3154, Volume 13, Number 10 (2020), pp. 2895-2908
© International Research Publication House. http://www.irphouse.com
2903
International Journal of Engineering Research and Technology. ISSN 0974-3154, Volume 13, Number 10 (2020), pp. 2895-2908
© International Research Publication House. http://www.irphouse.com
the probability of a high GPA in female students' records was squared was 80.1%, which indicates that the model explains
between 0.128 and 0.120, whereas the probability of a high 80.1 % of the variability in the CUM_GPA. The result of
GPA in male students' records was between 0.025 and 0.021. evaluating the models' performance has shown us that the DT
Overall, the results indicated that female students' model is good and is better than LR, as the error rate in the DT
outperformed male students and that they are less likely to fail is less than LR.
than male students. Besides, the probability of male students
obtaining a failed GPA is 7% higher than the probability of The association's rules were analysed based on three
excelling. As for female students, the probability of superiority approaches. The first approach is the impact of a major on the
is 9.7% higher than the probability of failing. CGPA; we observed from the first and third rule that students
in the BA category are most likely to obtain a good GPA, with
Accordingly, these results lead us to the fact that female 43.4% confidence. They are also most likely to obtain an
students are more diligent in obtaining high rates and avoiding acceptable GPA, with 38.5% confidence. Furthermore, we
failure in their academic performance. These results help the noted from the second and fourth rule that the vast majority
college to try to search for the reasons that led to the failure of who obtain a good GPA, with 36.1% confidence and an
male students in their academic performance, educate students acceptable GPA, with 46.6% confidence are BA students.
by setting up seminars that support them in raising their
academic performance, the search behind the reasons that led As in the fifth, sixth and seventh rule, Accounting, Finance, and
to their failure and take the crucial decisions to reduce this MIS students are more likely to get a good GPA with 40.2%,
failure in the coming years 46.1%, and 46.1% confidence, respectively. It was noted in the
eighth and ninth rules, the Pre-Major students, often get a GPA
On the other hand, the evaluation results of classification to fail with 40.6% confidence. Also, that the failed students
methods showed that RFs achieved the highest scoring 71.3% most probably belong to the Pre-Major with 74.9% confidence.
on CA and Recall, 71.5% on Precision and 71.2% on F1-score. As the 10th rule states, students of Pre-Major may obtain an
The next algorithm was the DT with 69.7% on F1-score, 69.8% acceptable GPA, with 31.5% confidence.
on CA and Recall, and showed slight increases on Precision by
0.1%, which means it scored 69.9%. Meanwhile, the NB As for Economics students, the 11th and 13th rules show that
appeared to be the worst algorithm, obtaining 59.4% on CA and they are more likely to have a very good GPA, with 36.3%
Recall, 59.5% on F1-score, and 60.5% on Precision. We can confidence, and a good GPA with 32.8% confidence. As for the
conclude from these findings that the performance of the RF 12th rule, they are the lowest in confidence value, 30.4%; this
algorithm on this type of data set is excellent. Therefore, one of rule says that if the GPA class belongs to the excellent group,
the points to be taken into account is that the principle of RF then the major will be Accounting. This rule indicates that most
and the ensemble learning method is proportional to our data students who excel the most belong to the Accounting major.
set, which is structured data. Where the basic principle of RF is
that a group of weak learners can be combined to form a strong Through these thirteen rules, it is clear to us that the highest
collective learner, this principle helped to obtain an adequate confidence obtained was 74.9%, which shows that failure rates
evaluation in the classification of student performance. often occur in the Pre-Major. As we mentioned previously, the
Furthermore, we found that the DT was lower by 1.5% on CA Pre-Major is a major that is taken before specialisation and
than RF; this indicates that the RF is more accurate with results comprises courses from all majors. We surmise that its students
than the DT, and the DT built according to IF-THEN rules [2]. often fail because some of their courses are from disciplines
Accordingly, we concluded from this assessment that a rule- they do not like.
based classifier is proportional to the data set used in this study.
We noticed the next rule that scores the highest confidence, at
Finally, according to the results, RFs have outperformed the 46.6%, states that if the GPA class belongs to "acceptable,"
other algorithms in all evaluation measures. This can be used then the major is a BA. The BA is dominated by an acceptable
to meet the requirements of the university in achieving quality GPA, and it is the most popular specialisation in the CBE with
and discovering weak students, as well as finding students who 17,433 records. This discovery may indicate that most students
show excellent and exceptional capabilities tend to belong to this specialisation due to the belief among
many that courses tend to be easy. This may also be due to the
As for the results of the evaluation of regression models, the popularity of this major, which provides jobs for graduates at
value of the average of the squared of the errors (MSE) was many companies and organisations.
estimated at 15.4% in LR, whereas in DT it was estimated at
13.5%. The value of the differences between the actual values The second approach is the impact of a major on the students'
and the values predicted by the LR (RMSE) was 39.3%, marks; we noticed that the rule with the highest confidence,
whereas the DT was 36.8%. Also, the value of the average of 97.8%, is the seventh rule. This rule shows that if a major is in
the absolute values decided, calculating the differences among Economics, it is more likely that it will obtain a mark of "P",
predicted and actual values (MAE), was estimated at 31.6% in which means that Economics students will likely pass all
LR, whereas in DT it was 28.8%. Moreover, the value of the courses. This is followed by the fourth rule, with a 93%
proportion of variance of the dependent variable explained by confidence. This rule clarifies that if the major is Finance, they
the independent variable (R-squared) was 77.3% in LR, which are likely to pass the courses. The fifth rule, with 92%
indicates that the model shows 77.3 % of the variability in the confidence, indicated that if the major is Management
CUM_GPA (the target variable). Whereas, in DT, the R- Information Systems, the marks will constitute a pass. The next
2904
International Journal of Engineering Research and Technology. ISSN 0974-3154, Volume 13, Number 10 (2020), pp. 2895-2908
© International Research Publication House. http://www.irphouse.com
rule is the third rule, with 91.2% confidence, denoting that if of the college is that students must complete the courses of the
the major is Accounting, then they are likely to pass the courses first three levels before specialisation, with a second condition
too. The last rule with high confidence is the first rule, at 88.8%, being the obligation to obtain a GPA higher than 2, conditions
which shows the student who belongs to the BA group is most that led some students to be shut out of specialisations. So, they
likely to obtain a pass mark. On the other hand, if the major in have to add these general courses to finish previous courses or
Pre-Major, then they will pass the courses with a confidence of raise their GPA. We did note that the rules with the F mark did
65.7%, as in the sixth rule. Whereas the ninth rule states that if not appear in this analysis under the measures' selected values.
the major in Pre-Major, then it is likely that the failure of a We conclude from this that there were more instances of
course will be obtained with a confidence of 34.3%. The second success than failure.
rule says that if the class mark is "P," it is likely that the major
is a BA, with a 33.4% confidence. Whereas the eighth and 10 th Furthermore, after the outliers' analysis, we noticed that
rules state that if the class of marks is "F" this indicates that the significant anomalous data appeared in the records of students
major is a BA or Pre-Major, with little confidence, 32.3% and of Pre-Major. The anomaly was due to the weak SGPA and
32.2%, respectively. Finally, we have concluded that these CGPA. In addition to their course failures. Student failure at the
majors, Economics, Finance, MIS and Accounting, are more first level was often due to several reasons, such as the
likely to get the pass in courses, with high confidence, over difficulty of the courses, the difference in the methods of
90%. This demonstrates that excellent and interested students lecturers teaching the courses or the standardisation of
always belong to these majors. Furthermore, when students questions (standardised test) between the female and male
belong to the fields they prefer, they give their best. students department. Also, there may be personal reasons
related to the student's social life. Consequently, a strategic plan
The third approach is the impact of courses on the students' must be designed to understand difficulties and problems
marks; the resulting rules show that a student who registered in experienced by the students of the first level, and then practical
the course "Feasibility analysis of projects" is most likely to decisions could be made that are appropriate to these problems,
obtain a pass mark, with a 99.4% confidence, as in the first rule. to avoid students failing in future years. This brings up the
We also noted in the fifth and sixth rules, that with a 98.2% and necessity of the academic advisor, especially for Pre-Major
98% confidence, and if the course is "Saudi Commercial Law" students, to guide them in the continuation of their studies and
and the course is "Strategic Management," then students will to overcome difficulties. We have observed the problem of
most likely pass this course. The fourth rule states that if the "academic separation" in the academic cases of most Pre-Major
course is "Organisational Behavior," then students will pass students. The terms "dropout," "discontinuation of study" and
this course, with a 95.2% confidence, as the third rule, with a "termination" were also read. We also discovered a group of
92.6% confidence. If the course is "Introduction to observed anomalies that serve the college in many respects,
Management Information Systems," students are more likely to especially in obtaining high-quality standards in the education
pass this course. Moreover, we have two rules where we see process. Where a group of students was found who have a high
less confidence than 92.6% by almost 13%, which are the CGPA at all levels of study, they nevertheless graduated with
second and seventh rules, with 79.6% and 72%, respectively. an excellent CGPA. The college should, in turn, realize that the
As the second rule states, that if the course is "Operations excellent students' experience leads to organised volunteer
Management," students will succeed in this course. As for the courses. These could be offered by the students who excel, and
seventh rule, it appears that if the course is "Principles of that can assist students of the same major. Those students'
Management Accounting," students will also pass this course. experiences may be used to provide advice to those who wish
Finally, according to our experience in the CBE courses, to join this major and could be achieved through social media.
"Feasibility analysis of projects," "Saudi Commercial Law" and
"Organisational Behavior" are general education courses in the
five departments: Management Information Systems, 6. CONCLUSION AND FUTURE WORK
Accounting, Finance, Economics and Business Administration,
the "Strategic Management" course is a general education The purpose of this study was to analyse student data in the
course in Management Information Systems, Accounting, CBE by extracting new patterns and features from their
Economics and Business Administration. These general academic data. It additionally sought to detect anomaly cases.
education courses aim to expand the scope of students' It did this by predicting the academic performance of students
understanding by adding courses from different specialisations, over the last five years, from 2014 to 2018, using data mining
for the student to graduate with knowledge of majors different techniques. Moreover, it identified the students' weaknesses
than the one they primarily studied. and failures and explored the knowledge that helps to improve
the educational process. Furthermore, it tried to find the reasons
The research findings suggest that the knowledge obtained for the students' repeated failure in a particular course.
from the third approach means that students often pass general
education courses. Many students intentionally add these This study explored, through the application of data analysis,
courses, either to raise their GPA due to the course being easy first, that the probability of excellence and failure was in the
or because of the cooperation they are felt with the lecturer. fifth year more than in the fourth year (in the first and second
Also, these courses may be added to fill the gap of the academic semesters of 2018). We found through these results that the rate
schedule because some students prefer to not have too much of excellence in the last year exceeded the failure rate by 2.7%.
free time in their schedules. Furthermore, one of the conditions Second, the probability of increasing excellence among
2905
International Journal of Engineering Research and Technology. ISSN 0974-3154, Volume 13, Number 10 (2020), pp. 2895-2908
© International Research Publication House. http://www.irphouse.com
students of the department of Economics was the highest 2014 International Conference on Engineering and
among other majors by more than 18.9%. On the other hand, Technology (ICET), 2014, pp. 1–6.
the probability of increased failure in a Pre-Major was
[2] M. K. Jiawei Han Jian Pei, Data Mining: Concepts and
estimated to be more than 39.3%. Third, the probability of
Techniques, 3rd ed. Elsevier, 2012.
excellence in the records of female students was estimated
between 12.8% and 12%, whereas the probability of excellence [3] R. Lawrance and V. Shanmugarajeshwari, “An assay
among the records of male students was estimated to be of teachers’ attainmentusing decision tree based
between 2.5% and 2.1%. Therefore, the analysis leads us to the classification techniques,” in Proceedings of IEEE
following conclusion: male students and Pre-Major students are International Conference on Circuit, Power and
more likely to fail and therefore need, in this period, to follow Computing Technologies, ICCPCT 2017, 2017.
up with academic advisors.
[4] K. N. Shah, M. R. Patel, N. V Trivedi, P. N. Gadariya,
Additionally, according to the results of classification, RF has R. H. Shah, and N. Adhvaryu, “Study of Data Mining
outperformed the other algorithms in all evaluation measures, in Higher Education-A Review,” International
with 71.3% of CA and Recall, F1-score 71.2%, and Precision Journal of Computer Science and Information
71.5%. Furthermore, as a result of evaluating the performance Technologies, vol. 6, no. 1, pp. 455–458, 2015.
of the regression models, we have noticed that the DT model is [5] U. Fayyad, G. Piatetsky-Shapiro, and P. Smyth, “From
not only good but is better than LR, as the error rate in the DT Data Mining to Knowledge Discovery in Databases,”
is less than LR. On this basis, we conclude from this study that AI Magazine, vol. 17, no. 3, pp. 37–54, Mar. 1996.
the best classification model is RF, and the best regression
model is DT. [6] B. Guo, R. Zhang, G. Xu, C. Shi, and L. Yang,
“Predicting Students Performance in Educational Data
Moreover, the results that we reached through the association's Mining,” in 2015 International Symposium on
rules indicate that the knowledge obtained from the first Educational Technology (ISET), 2015, pp. 125–128.
approach was that failure rates often appeared in the Pre-Major [7] B. Kumar and S. Pal, “Mining Educational Data to
with a 74.9% estimated confidence. The results also showed Analyze Students Performance,” International Journal
that if the GPA class belongs to "acceptable," then the major is of Advanced Computer Science and Applications, vol.
BA with an estimated 46.6% confidence. Students from the 2, no. 6, pp. 63–69, 2011.
following majors, Economics, Finance, MIS and Accounting,
are more likely to get the pass marks in the courses, with an [8] A. I. Adekitan and E. Noma-Osaghae, “Data mining
over-90% confidence. Our findings from the third approach approach to predicting the performance of first year
suggest that the knowledge obtained shows that students often student in a university using the admission
pass general education courses. It was clear through research requirements,” Education and Information
and submitted questions to the officials in the CBE that many Technologies, vol. 24, no. 2, pp. 1527–1543, Mar. 2018.
students intentionally add these courses either to raise their
[9] L. A. Buschetto Macarini, C. Cechinel, M. F. Batista
GPA due to the course being easy or because of the cooperation
Machado, V. Faria Culmant Ramos, and R. Munoz,
they felt with the lecturer. There is also passion and curiosity “Predicting Students Success in Blended Learning—
felt by some students, who enroll in these courses and obtain Evaluating Different Interactions Inside Learning
valuable information that will benefit them in future. Management Systems,” Applied Sciences, vol. 9, no.
24, p. 5523, Dec. 2019.
In future, researchers would need additional data for the
analysis, to increase the accuracy of the prediction. They may [10] S. Angra and S. Ahuja, “Implementation of data mining
also want to focus on features that have a substantial impact on algorithms on student’s data using rapid miner,” in
student performance, such as high school rate, absences, the 2017 International Conference on Big Data Analytics
number of notifications and the number of failures in a course. and Computational Intelligence (ICBDAC), 2017, pp.
Additional models, such as traditional Neural Networks and 387–391.
deep learning, could be employed.
[11] R. S. J. Baker, “Data Mining for Education,” 3rd
Editio., vol. 7, 2010, pp. 112–118.
ACKNOWLEDGMENT [12] C. Romero and S. Ventura, “Educational Data Mining:
A Review of the State of the Art,” IEEE Transactions
The authors would like to thank the College of Business and
on Systems, Man, and Cybernetics, Part C
Economics at Qassim University that provided the data
(Applications and Reviews), vol. 40, no. 6, pp. 601–618,
required for this research.
Nov. 2010.
[13] S. Roy and A. Garg, “Predicting academic performance
REFERENCES of student using classification techniques,” 2017 4th
IEEE Uttar Pradesh Section International Conference
[1] R. M. Damin, M. A. Kadry, and E. M. Hamed, “An
on Electrical, Computer and Electronics, UPCON
investigation into the use of the education Management
2017, vol. 2018-Janua, pp. 568–572, 2018.
Information System (EMIS) in Iraq: Case study,” in
2906
International Journal of Engineering Research and Technology. ISSN 0974-3154, Volume 13, Number 10 (2020), pp. 2895-2908
© International Research Publication House. http://www.irphouse.com
[14] S. Ahmed, R. Paul, and A. S. M. L. Hoque, MEC International Conference on Big Data and Smart
“Knowledge discovery from academic data using City (ICBDSC), 2019, pp. 1–4.
Association Rule Mining,” in 2014 17th International
[25] H. Mousa and A. Maghari, “School Students ’
Conference on Computer and Information Technology
Performance Predication Using Data Mining
(ICCIT), 2014, pp. 314–319.
Classification,” International Journal of Advanced
[15] M. Hasibur Rahman and M. Rabiul Islam, “Predict Research in Computer and Communication
Student’s Academic Performance and Evaluate the Engineering, vol. 6, no. 8, pp. 136–141, 2017.
Impact of Different Attributes on the Performance
[26] A. Al Mazidi and E. Abusham, “Study of general
Using Data Mining Techniques,” 2nd International
education diploma students’ performance and
Conference on Electrical and Electronic Engineering,
prediction in Sultanate of Oman, based on data mining
ICEEE 2017, no. December, pp. 1–4, 2018.
approaches,” International Journal of Engineering
[16] B. Kapur, N. Ahluwalia, and S. R, “Comparative Study Business Management, vol. 10, pp. 1–11, 2018.
on Marks Prediction using Data Mining and
[27] K. Sunday, P. Ocheja, S. Hussain, S. S. Oyelere, B. O.
Classification Algorithms,” International Journal of
Samson, and F. J. Agbo, “Analyzing Student
Advanced Research in Computer Science, vol. 8, no. 3,
Performance in Programming Education Using
pp. 632–636, Apr. 2017.
Classification Techniques,” International Journal of
[17] C. Jalota and R. Agrawal, “Analysis of Educational Emerging Technologies in Learning (iJET), vol. 15, no.
Data Mining using Classification,” in Proceedings of 02, p. 127, Jan. 2020.
the International Conference on Machine Learning,
[28] P. Rojanavasu, “Educational data analytics using
Big Data, Cloud and Parallel Computing: Trends,
association rule mining and classification,” in ECTI
Prespectives and Prospects, COMITCon 2019, 2019,
DAMT-NCON 2019 - 4th International Conference on
pp. 243–247.
Digital Arts, Media and Technology and 2nd ECTI
[18] J. H. Sharp and L. A. Sharp, “A comparison of student Northern Section Conference on Electrical,
academic performance with traditional, online, and Electronics, Computer and Telecommunications
flipped instructional approaches in a C# programming Engineering, 2019, pp. 142–145.
course,” Journal of Information Technology Education:
[29] S. Kotsiantis and D. Kanellopoulos, “Association
Innovations in Practice, vol. 16, no. 1, pp. 215–231,
Rules Mining: A Recent Overview,” GESTS
2017.
International Transactions on Computer Science and
[19] V. Shanmugarajeshwari and R. Lawrance, “Analysis of Engineering, vol. 32(1), pp. 71–82, 2006.
students’ performance evaluation using classification
[30] V. Nida Uzel, S. Sevgi Turgut, and S. Ayse Ozel,
techniques,” 2016 International Conference on
“Prediction of Students’ Academic Success Using Data
Computing Technologies and Intelligent Data
Mining Methods,” in 2018 Innovations in Intelligent
Engineering, ICCTIDE 2016, pp. 1–7, 2016.
Systems and Applications Conference (ASYU), 2018,
[20] S. B. Rahayu, N. D. Kamarudin, and Z. Zainol, “Case pp. 1–5.
Study of UPNM Students Performance Classification
[31] A. F. Meghji, N. Ahmed Mahoto, M. A. Unar, and M.
Algorithms,” International Journal of Engineering and
Akram Shaikh, “Analysis of Student Performance
Technology, vol. 7, no. December 2018, pp. 285–289,
using EDM Methods,” in 2018 5th International Multi-
2018.
Topic ICT Conference (IMTIC), 2018, pp. 1–7.
[21] R. Hasan, S. Palaniappan, A. R. A. Raziff, S. Mahmood,
[32] A. Naik and L. Samant, “Correlation Review of
K. U. Sarker, and A. Rafi, “Student Academic
Classification Algorithm Using Data Mining Tool:
Performance Prediction by using Decision Tree
WEKA, Rapidminer, Tanagra, Orange and Knime,”
Algorithm,” in 2018 4th International Conference on
Procedia Computer Science, vol. 85, pp. 662–668, Jan.
Computer and Information Sciences (ICCOINS), 2018,
2016.
pp. 1–5.
[33] M. A. Al-Hagery, “Classifiers’ Accuracy Based on
[22] A. Marwaha and A. Singla, “A study of factors to
Breast Cancer Medical Data and Data Mining
predict at-risk students based on machine learning
Techniques,” International Journal of Advanced
techniques,” in Advances in Intelligent Systems and
Biotechnology and Research, vol. 7, no. 2, pp. 760–772,
Computing, 2020, vol. 989, pp. 133–141.
2016.
[23] A. I. Adekitan and O. Salau, “Toward an improved
[34] K. Limsathitwong, K. Tiwatthanont, and T.
learning process: the relevance of ethnicity to data
Yatsungnoen, “Dropout prediction system to reduce
mining prediction of students’ performance,” SN
discontinue study rate of information technology
Applied Sciences, vol. 2, no. 1, pp. 1–15, Jan. 2020.
students,” in 2018 5th International Conference on
[24] S. S. Al-Nadabi and C. Jayakumari, “Predict the Business and Industrial Research (ICBIR), 2018, pp.
selection of mathematics subject for 11 th grade 110–114.
students using Data Mining technique,” in 2019 4th
2907
International Journal of Engineering Research and Technology. ISSN 0974-3154, Volume 13, Number 10 (2020), pp. 2895-2908
© International Research Publication House. http://www.irphouse.com
2908