- Open Access
Influence of algorithmic abstraction and mathematical knowledge on rates of dropout from Computing degree courses
Journal of the Brazilian Computer Societyvolume 24, Article number: 10 (2018)
This paper presents a study of rates of dropout from Brazilian degree courses, based on data provided by the National Institute for Educational Studies and Research “Anísio Teixeira” (INEP) and a case study carried out at the University of Brasilia (UnB). Dropout was calculated by tracking the status of each student between 2010 and 2014 in the eight major areas according to the classification of the Organisation for Economic Co-operation and Development (OECD), for the major area of Science, Mathematics, and Computing, and for the area of Computing. Data were analyzed in order to check for potential evidence regarding the influence on dropout of factors such as algorithmic abstraction, number of applicants per place, or the gender of students. A survey was also performed using online questionnaires for circumvented students from the courses of Bachelor of Computer Science, Degree in Computing, and Computer Engineering between 2005 and 2015. This survey revealed the influence on dropout of several factors and particularly institutional and vocational factors; it is clear that difficulties in algorithmic abstraction and mathematical knowledge influence rates of dropout from computing courses.
The number of enrolled students contributes to the budget matrix of an institution, and dropout generates a negative budget impact. “Student dropout in higher education is an international problem affecting the result of educational systems. Losses of students who begin but do not conclude their courses are social, academic, and economical wastes” . In public institutions, for instance, an understanding of this phenomenon may contribute significantly to reducing wastage of resources. In private institutions, dropout and revenue loss are clearly interrelated. In Brazil, higher education in the area of Science, Mathematics, and Computing shows high dropout rates, as can be observed from the data from the National Institute for Educational Studies and Research “Anísio Teixeira” (INEP). Dropout also arises from the late diagnosis of several students at risk of circumvention, thus making it difficult to combat and reverse the problem. Dropout analysis is an important step in the search for subsidies for understanding this phenomenon in order to mitigate it. The following hypotheses were therefore investigated in the present study:
The number of applicants per place in a course is related to student dropout.
Courses in the major areas of Science, Mathematics, and Computing requiring higher levels of mathematics and algorithmic abstraction show higher dropout rates.
The general purpose of the present study was to analyze dropout from Brazilian Computing degree courses, based on data provided by INEP and a case study carried out at the University of Brasilia (UnB), in which a student was considered circumvented when “disengage[d] from the course for any reason different from degree obtainment”  (except in the case of death).
Dropout rates were analyzed in order to check a possible correlation with the number of applicants per place, the influence of gender, and a possible relationship between the requirements of algorithmic abstraction and mathematical knowledge and rates of dropout from courses in the major areas of Science, Mathematics, and Computing (and particularly from Computing degree courses). Non-parametric statistical tests were performed in these analyses since some variables did not show a normal distribution. A case study was also carried out at the UnB through the application of questionnaires to circumvented students of Computing degree courses to investigate their reasons for dropping out. Non-parametric tests, a Cronbach’s alpha test, and an analysis of the content of spontaneous replies were used for data analysis.
This paper is organized as follows: The “Background” section presents the background providing the context for the development of this study. The “Methodology” section describes the methodology and the case study carried out at UnB. The “Analyses and results” section describes the analyses and results, and the “Conclusions” section presents the conclusions.
In a study performed by the Special Commission for Studies of Dropout in Brazilian Public Universities , the concepts of course dropout (disengagement only from the course, with no disengagement from the institution), institutional dropout (disengagement from the institution in which the students were enrolled), and system dropout (disengagement from higher education) were presented. The commission recommended that the most suitable concept of dropout should be selected based on the purpose of each study.
Dropout from Computing courses is not restricted to the Brazilian education system. According to Kinnunen and Malmi , courses in the area of Computer Science at Helsinki University of Technology (Finland) have 500–600 entries every year, and dropout rates have ranged from 30 to 50%.
According to Barroso and Falcão , conditions leading to school dropout can be classified into three groups: (i) economic issues, i.e., the impossibility of continuing the course due to socioeconomic reasons; (ii) vocational issues, i.e., a lack of identification with the course; and (iii) institutional issues, i.e., disengagement due to failure in the initial courses, early disabilities of previous contents, a mismatch with the methods of study, and difficulties in the relationships with institutional colleagues or members. Xenos et al.  identified the reasons for dropout from Computer Science degree courses at the Hellenic Open University in Patras, Greece, as follows (the student may have indicated more than one reason): professional (62.1%), academic (46.2%), family (17.8%), personal (8.9%), and health-related (9.5%).
For courses in the major areas of Science, Mathematics, and Computing, the occurrence of students with difficulties in subjects requiring algorithmic abstraction has been reported by several authors; according to Piva and Freitas , “the difficulty shown by students as to algorithmic representation is due to their difficulty to extract details composing a problem situation”. Regarding Computing students, Paula et al.  state: “Students have difficulties to build mental representations in fact abstracting a problem completely, so it is necessary to seek means to stimulate them to develop such an ability.” A study carried out at the AL-AQSA University (Palestine) by Abu-Oda and El-Halees  involving 1290 student records between 2005 and 2011 and using data mining techniques showed that success in algorithm-related initial subjects reduces the probability of disengagement by students. Difficulties regarding algorithmic abstraction, i.e., difficulties in abstraction related to problem-solving translation in software, have therefore been identified by authors as being a particular problem for Computing students.
Several studies of the difficulties experienced by students of mathematical subjects have been published. Souza et al.  evaluated dropout from Brazilian degree courses over a period of 10 years, based on academic studies of this issue during this period, and observed that failures in subjects requiring mathematical knowledge are among the most frequently seen. In a study of African students of Computer Science, Gruner  detected a low percentage of students who wished to become researchers or scientists. This author also stated that Mathematics is becoming less fully covered in the curricula of such courses. Regarding the field of Computing, Barcelos and Silveira  found that “the lack of a proper domain of mathematical knowledge by students is a possible factor explaining the lack of interest and dropout in courses of this area.”
The studies described in the “Analyses and results” section of the current work required dropout values to be statistically tested, with parametric and non-parametric tests as options. Regarding non-parametric tests, Maroco  stated that “such tests do not require a known variable (normal).” Although parametric tests have often been considered more powerful than non-parametric tests, this effect is only observed for large samples and those with equivalent size; thus, non-parametric tests are suitable for small samples and those with changeable sizes . According to Vieira , when data normality and homoscedasticity cannot be confirmed and the sample size is reduced, non-parametric testing becomes a viable option. Since data normality is the main assumption in parametric tests, non-parametric testing must be used when normality is not detected.
In studies involving the application of questionnaires, it is necessary to check whether the items used are suitable for the purpose of the study, thus ensuring the reliability of the questionnaire. In the “Analyses and results” section, a Cronbach’s alpha test was one of the mechanisms used to analyze the answers. According to Hora et al. , “alpha measures the correlation among answers in a questionnaire through their profile analysis. It refers to a mean correlation among questions.” According to Maroco and Garcia-Marques , “the α index estimates how evenly items contribute with the unweighted sum of the instrument, ranging from 0 to 1.”
The methodologies adopted in the present study are described below. These consisted of an analysis of national data (the “National data” section) and a case study carried out at UnB (the “Case study—UnB” section).
Quantitative analyses were performed using micro-data from the Higher Education Census between 2010 and 2014, provided by INEP (2017). INEP began to make the individual data of students from 2009 available, whereas the data were provided in aggregate prior to 2009. Changes in the methodology were made between 2009 and 2010, as observed in the technical summary of the Higher Education Census in 2010 . Thus, data from 2010 were considered in the present study.
The micro-data information for each year was stored in ASCII (micro-data of 2010, 2011, and 2012) and CSV (micro-data of 2013 and 2014) files. These files contained information on students, courses, higher education institutions, professors, and places offered; files related to students and courses were used in the current work.
Before calculating dropout rates, the data were cleaned, retaining the information on the status of the students on the course (transferred, disengaged, died, attending, enrolment blocked, or graduated), the student code (unique to each student and provided by INEP), gender, the name of the course, its general area, its specific area according to the classification by the Organisation for Economic Co-operation and Development (OECD) , and the maintenance entity (public or private).
After cleaning, the data were transferred to a MySQL databank, from which dropout was calculated. Statistical tests were performed using SPSS Statistics version 22 software. Only non-parametric tests were used since data normality was not detected for some variables. The Spearman correlation coefficient was used to check for a possible correlation between dropout and competition. The p value (significance level) was set at 0.05, so that p < 0.05 was considered acceptable when evaluating the significance of the obtained results. A box plot diagram was used to compare dropout rates among knowledge areas.
The number of circumvented students per year was calculated as the number of enrolments of disengaged students plus the number of enrolments of transferred students in that year, for each course. Total enrolments were obtained from the sum of the number of enrolments of active, graduated, transferred, disengaged, and deceased students per year per course. Dropout rates were obtained from the ratio between the number of circumvented students and the total number of enrolments. In this study, transfers are considered as course dropouts, even in the absence of institutional dropout.
The analyzed data comprising the dropout between 2010 and 2014 were separated into tables according to the maintenance entity (public or private institutions) and gender (male and female).
A case study was carried out at UnB to investigate the causes of student dropout from Computing courses. The total number of circumvented students between 2005 and 2015 from the degree courses offered at the Darcy Ribeiro Campus (Bachelor of Computer Science degree in Computing and Computer Engineering) was obtained from the Secretariat of Academic Records.
Several items were included in the study in order to evaluate the influence of each of them on the decision of circumvented students to disengage from the course. The prevalence of dropout-related reasons was unknown. However, a prevalence of 50% was used for sample size calculation. The following criteria were used to define the minimum number of questionnaires to be answered: 10% error margin, 90% confidence level, and 50% prevalence. Equation 1 was used for sample size calculation:
where p = 50.0%, Za/2 = 1.645 represents the critical value corresponding to a 90% confidence level, and E = 10% represents the error margin used. After obtaining n0, a correction was made for a finite population according to Eq. 2, in which N corresponds to the total of circumvented students per course. The calculation below was made per course, thus defining the sample size for each course:
Table 1 shows the number of circumvented students per course, the minimum sample size, and the number of questionnaires answered.
In order to contact students who circumvented between 2005 and 2015, their names and e-mail addresses were obtained from the Secretariat of Academic Records at UnB. Students were considered circumvented when they disengaged from the course before its conclusion, for any reason except death. Each student received an e-mail explaining the purposes of this study and a link to the questionnaire, which was available online via Google Forms. It must be emphasized that this questionnaire did not request the student’s identity.
To investigate reasons for dropout, 21 items were made available using a Likert scale questionnaire in which students were required to answer, on a scale from 0 to 5 (six levels), how strongly each item influenced their disengagement, where 0 indicated zero influence and 5 indicated a strong influence. Students could also report other reasons for their dropout if these were not mentioned in the scaled questionnaire. They could also report which subjects they considered the most difficult during the course and their reasons for choosing to attend a degree course in the field of Computing.
The Likert scale questionnaire contained items grouped into socioeconomic, vocational, and institutional issues, according to the classification proposed by Barroso and Falcão . Other classifications were also used for items which did not fit into these categories, such as family, personal, health, or other issues.
The values obtained for each item of the scaled questionnaire were compared using the Kruskal-Wallis test, and the post-hoc Dunn test was used to obtain homogenous groups, with a 95% confidence interval. A non-parametric test was adopted since data normality was not detected according to the Kolmogorov-Smirnov test, in which the p value was < 0.05; thus, the null hypothesis in which the data originated from a normal distribution was rejected. Items belonging to the homogeneous group with the highest score per course were tested using Spearman’s rank correlation (Spearman’s rho). These tests were performed separately for each course and for all courses in general.
Other factors investigated were gender, mode of entry, period of disengagement, possible receipt of financial support, and whether the course was the first degree course of the student.
Analyses and results
This section describes the analyses, and the results of quantitative studies involving micro-data from INEP are shown (the “Analysis of micro-data from INEP” section) and the analyses and results of studies involving questionnaires applied to circumvented students of UnB (the “Analysis of questionnaires applied to circumvented students of UnB”).
Analysis of micro-data from INEP
Table 2 shows the dropout rates and competition (number of applicants per place) of Brazilian courses composing the eight major knowledge areas, according to the OECD . Municipal, state, and federal public institutions were considered. Private institutions included both profit-seeking and non-profit establishments. In Table 3, N = number of tested samples.
Table 2 shows higher dropout rates in private institutions than in public ones, in all major areas. The competition (number of applicants per place) was higher in public institutions than in private ones.
Table 3 presents the Spearman correlation coefficient used to check for a possible statistical correlation between competition and dropout. Values were − 0.485 for private institutions and − 0.487 for public ones. Forty values (N = 40) were tested in each group, corresponding to the dropout rates and a number of applicants per place in each major area (eight in total) over 5 years (2010–2014). According to Dancey and Reidy , correlation coefficients can be classified as follows: 0.10–0.30 (weak), 0.40–0.6 (moderate), and 0.70–1 (strong). A negative value indicates an inverse correlation, and the values range from − 1 to 1. Thus, competition and dropout showed a moderate inverse correlation between 2010 and 2014 in private and public institutions. The Special Commission for Studies of Dropout from Brazilian Public Universities  found that “low remuneration in job market decreases the search for courses, affects the training quality of students, and increases the propensity to drop out.” Silva Filho et al.  evaluated the correlation between dropout and the number of applicants per place in general areas of knowledge in 2005 and observed that major areas showing larger numbers of applicants per place also experienced lower mean annual dropout rates than the national average (22%). These major areas were Health and Social Welfare, Agriculture and Veterinary Medicine, Engineering, Production, and Construction. The highest dropout rate between 2001 and 2005 (services) corresponded to the lowest number of applicants per place.
In Figs. 1 and 3, the x-axis of the box plot diagram shows the major knowledge areas according to OECD . In Figs. 2 and 4, the x-axis shows the grouping of the areas or courses per gender. The y-axis shows dropout rates. In the boxes, the lower value is represented by the lower bar, while the upper value is represented by the upper bar. The black line inside the box represents the median value. Some boxes are larger than the others, indicating a higher variation in dropout rates than in the smaller boxes. Elements outside the box (represented by a circle or an asterisk) are referred to as outliers, which correspond to unusual values in relation to the other datasets. These boxes facilitate a comparison of dropout rates between areas, since those located in an upper position in the graph correspond to higher dropout rates in relation to those located in a lower position. In Figs. 1 and 2, the major areas of knowledge are represented as follows: 1—Education; 2—Humanities and Arts; 3—Social Sciences, Business, and Law; 4—Science, Mathematics, and Computing; 5—Engineering, Production, and Construction; 6—Agriculture and Veterinary Medicine; 7—Health and Social Welfare; 8—Services. Box plot diagrams were generated using SPSS Statistics software version 22.
Figure 1 was generated from the data shown in Table 2. The highest dropout rate between 2010 and 2014 was observed for the areas of Science, Mathematics, and Computing and the area of Services, in public and private institutions. Dropout rates in these two areas were similar since their boxes are around the same height on the graph. The variability of dropout rates was higher for Services (larger box) than Science, Mathematics, and Computing. Thus, these major areas experienced higher dropout rates than others. Silva Filho et al.  also carried out a study of dropout from the general areas of knowledge between 2001 and 2005. The authors investigated the mean dropout from the public and private institutions during this period and observed higher values for Services (29%) and Science, Mathematics, and Computing (28%) than for those of other major areas and the national mean over the same period (22%). This result was similar to that observed in the present study, in which the abovementioned major areas also showed higher dropout rates between 2010 and 2014.
Figure 2 shows the box plot diagrams generated from the data in Tables 4 and 5 for private and public institutions, respectively. Each box is numbered with the identification of the major areas in Tables 4 and 5. The x-axis is divided into male and female students. Boxes related to male students are located in a higher position than those of the female ones, thus indicating a generally higher dropout rate for male students in the major areas of private and public institutions.
Table 6 shows the dropout rates for areas composing the major area of Science, Mathematics, and Computing, based on the classification of the OECD adopted by INEP. Thus, the area of Computing comprised the areas of Computer Science, Computer Use, and Information Processing. Degree courses were not included, since INEP classifies training courses of Computing professors in the major area of Education, according to the classification of the OECD .
Figure 3 shows the box plot diagrams generated from the data in Tables 5 and 6 in order to compare the dropout from the courses composing the major area of Science, Mathematics, and Computing. The area of Life Sciences was excluded from Fig. 3 due to the low rates of enrolment between 2010 and 2014. The area of Physical Sciences was excluded from the graph of private institutions since enrolments were only detected in 2010. For private institutions, the boxes are located in a higher position, thus indicating higher dropout rates for Physics, Mathematics, Statistics, and Computing. Since these areas require greater mathematical and algorithmic abstraction knowledge than the other courses, the hypothesis was confirmed for private institutions, thus proving hypothesis ii (the “Introduction” section). For public institutions, higher dropout rates were detected in the areas of Physics and Mathematics, which also require a high level of mathematics knowledge. Dropout rates from Computing courses were similar to those of Physical Sciences, Statistics, Chemistry, and Environmental Sciences.
Table 7 shows the dropout from Computing courses at private institutions per gender, while Table 8 shows the dropout from public institutions. The courses presented in these tables were the same as those mentioned by Nunes  in statistical reports of the Brazilian Computer Society (SBC). Training courses of Computing professors were considered to be degree courses, according to the nomenclature provided by the OECD. Table 7 does not show the dropout from courses in Software Engineering in 2010, 2011, and 2012, since no enrolment was detected in the micro-data from INEP over this period.
Figure 4 shows the box plot diagrams generated from the data in Tables 7 and 8. The area of Software Engineering was excluded from the diagram of private institutions since no enrolment was detected in 2010, 2011, or 2012. One the x-axis, the boxes are separated per gender. Boxes for both genders are located at a similar height, which indicates no significant differences in dropout per gender, with some minor variations, e.g., in the course of Degree in Computing in private institutions, which shows a higher dropout of male students. Figure 4 does not indicate a relationship between dropout from Computing courses and the gender of students.
Analysis of questionnaires applied to circumvented students of UnB
This section presents an analysis of the answers from circumvented students on the degree courses of Computer Science, Computer Engineering, and Degree in Computing, offered by UnB. Table 9 shows the sample demographics.
As mentioned in the “Case study—UnB” section, the questionnaire applied to circumvented students of UnB to identify their reasons for dropout consisted of 21 items. The respondents were asked to indicate how far each item influenced their decision to circumvent the course. Each item ranged from 0 to 5 (where 0 indicated zero influence and 5 indicated a strong influence on dropout). Items were organized according to Table 10, which contains their classification and description, and the position of the question in the sequence in which they were answered by the respondent. Items were chosen according to those fitting into the reasons for student dropout presented by Barroso and Falcão  (socioeconomic, vocational, and institutional issues); options added were family/personal/health reasons and other issues.
Cronbach’s alpha test was used to check for correlations between the classifications used in the present study. Table 11 shows the values of Cronbach’s alpha for each classification. A total of 231 cases were tested, corresponding to the number of respondents from Computer Science (93 cases), Degree in Computing (85 cases), and Computer Engineering (53 cases).
According to Hair et al.  regarding the values of Cronbach’s alpha, 0.60 represents the lower limit of acceptability for exploratory studies, while 0.70 is the limit for other types of study. The higher the value, the higher the reliability of the indicator. The present study is considered to be exploratory. According to Godoy , a study is exploratory if it aims to improve the knowledge of the target object. Table 11 shows that all classifications except “Other issues” had values of Cronbach’s alpha higher than 0.60. Since the classification of “Other issues” includes several items not fitting into the other classifications, it is understandable that its value is low. However, the more specific classifications had suitable values for an exploratory study, thus indicating that, in the general analysis of all respondents, reasons for dropout can be classified according to the system presented in Table 10.
Table 12 shows the results of the Kruskal-Wallis test for subsets of homogenous items for the course of Computer Science. A total of 1953 values were tested (93 answers in each item, multiplied by 21 items). The chi-square value was 335.607, with 20 degrees of freedom and p value < 0.05, thus indicating statistically significant differences between the items. The largest homogeneous subset observed in Table 12 is subset 9, composed of items 3, 6, 12, 1, 11, 4, 5, 9, 14, and 7; of these, item 1 is classified as socioeconomic, while the others are classified as vocational or institutional. This shows that vocational, institutional, and socioeconomic issues were predominant among the reasons for dropout from the course of Bachelor of Computer Science at UnB.
Table 13 shows the results of the Spearman correlation test between item 9 (difficulties in mathematics-related subjects) and the other items in the largest homogenous subset. A significant correlation was detected between item 9 and items 7 (criteria for evaluation of students were unsuitable or very rigid), 11 (lack of information on subjects covered by the course), 12 (unsuitable physical infrastructure within the institution), and 14 (difficulties in understanding lecturers). According to Dancey and Reidy , the correlation coefficient can be classified as follows: 0.10–0.30 (weak), 0.40–0.6 (moderate), and 0.70–1 (strong). Based on this classification, a moderate correlation was detected for items 7, 11, and 14 (all of which are classified as institutional), thus indicating that mathematics-related difficulties were correlated to institutional factors, according to the respondents.
Table 14 shows the results of the Kruskal-Wallis test for the subsets of homogenous items for the course of Degree in Computing. A total of 1785 values were tested (85 answers for each item multiplied by 21 items). The chi-square value was 278.360, with 20 degrees of freedom and p value < 0.05, thus indicating statistically significant differences between the items. Item 1 showed the highest score. It should be emphasized that the Degree in Computing is offered at UnB in the evenings, allowing students to work during the day. Thus, difficulty in reconciling work and study was a very important factor affecting the dropout of students. Since item 1 is alone in subset 10, subset 9 was also considered for analysis. Subset 9 was composed of items 4, 12, 11, 5, 14, 9, and 7, which were classified as vocational or institutional. This indicates that vocational, institutional, and socioeconomic (as described by item 1) issues were predominant among the reasons for dropout from the course of Degree in Computing at UnB, as for the course of Bachelor of Computer Science.
Table 15 shows the results of the Spearman correlation test between item 9 (difficulties in mathematics-related subjects) and other items in the largest homogenous subset. Significant correlation was detected between item 9 and items 4 (lack of affinity with the course), 5 (dedication to other studies), 7 (criteria for the evaluation of students were unsuitable or very rigid), 11 (lack of information on subjects covered by the course), 12 (unsuitable physical infrastructure within the institution), and 14 (difficulties in understanding lecturers). According to the classification proposed by Dancey and Reidy , this correlation can be classified as moderate for items 7 and 11 and strong for item 14 (all of which are classified as institutional), as observed for the course in Bachelor of Computer Science, thus indicating that mathematics-related difficulties were correlated with institutional factors, according to the respondents.
Table 16 shows the results of the Kruskal-Wallis test for the subsets of homogenous items in the course of Computer Engineering. A total of 1113 values were tested (53 answers for each item multiplied by 21 items). The chi-square value was 292.670, with 20 degrees of freedom and p value < 0.05, thus indicating statistically significant differences between the items. The largest homogenous subset observed in Table 16 was subset 9, composed of items 8, 13, 7, 3, 11, 14, 6, and 4, all of which are classified as vocational or institutional. This indicates that vocational and institutional issues were predominant among the reasons for dropout from the course in Computer Engineering at UnB.
Table 17 shows the results of the Spearman correlation test between item 8 (difficulties in subjects related to programming and algorithms) and the other items in the largest homogenous subset. A significant correlation was detected between item 8 and items 3 (change to a course in different area), 4 (lack of affinity with the course), and 6 (lack of identification with professionals in this area). Based on the classification proposed by Dancey and Reidy , this correlation can be classified as moderate for item 3 (vocational), thus indicating that the difficulty with algorithms was correlated with vocational issues, according to the respondents.
Table 18 shows the results of the Kruskal-Wallis test for the subsets of homogenous items considering all results for the three evaluated courses. A total of 4851 values were tested (231 answers for each item multiplied by 21 items). The chi-square value was 787.352, with 20 degrees of freedom and p value < 0.05, thus indicating statistically significant differences between the items. The largest homogenous subset observed in Table 18 was subset 7, composed of items 6, 1, 5, 4, 11, 9, 14, and 7, which are classified as vocational (4, 5, 6), institutional (7, 9, 11, 14), and socioeconomic (item 1).
Table 19 shows the results of the Spearman correlation test between item 9 (difficulties in mathematics-related subjects) and other items in the largest homogenous subset. Significant correlation was detected between item 9 and items 4 (lack of affinity with the course), 5 (dedication to other studies), 6 (lack of identification with professionals in this area), 7 (criteria for the evaluation of students were unsuitable or very rigid), 11 (lack of information on subjects covered by the course), and 14 (difficulty in understanding lecturers). Based on the classification proposed by Dancey and Reidy , this correlation can be classified as moderate for the institutional items 7, 11, and 14, thus indicating that the difficulties in mathematics are correlated with institutional factors, according to the respondents.
Circumvented students also had the option to add other relevant reasons for their disengagement from the three courses at UnB, in the case where the elements described in the scale questionnaire did not reflect all their reasons for dropout. Of the 93 respondents for Computer Science, 26 added reasons for dropout; of the 85 respondents for the Degree in Computing, 27 added reasons for dropout; and of the 53 respondents for Computer Engineering, 20 added reasons for dropout. These answers were classified as institutional, vocational, socioeconomic, and other issues, and institutional issues were widely cited, as shown in Table 20. Widely cited issues included an outdated curriculum and an unsuitable choice of courses, lack of suitability of the course with respect to the job market, lack of educational background, and successive failures.
When giving reasons for choosing a Computing course, more than 50% of students from each course attributed their motivation as being mainly due to affinity, curiosity, and an interest in computers or the field of computing.
According to the answers from the respondents, most of these students disengaged from the course during the first four periods: 61.28% for Computer Science, 58.82% for Degree in Computing, and 64.14% for Computer Engineering.
The analysis of micro-data from INEP between 2010 and 2014 based on non-parametric tests gave the following results: the number of applicants per place was inversely proportional to dropout; rates of dropout were different for the major knowledge areas, with the highest values in the major areas of Services and Science, Mathematics, and Computing; for courses in the major area of Science, Mathematics, and Computing, those requiring a greater knowledge of mathematics and algorithmic abstraction had higher dropout rates in private institutions. On average over all the major areas, dropout was higher for male students than for females. For Computing, dropout rates were similar for both genders.
Based on the case study carried out at UnB, institutional, vocational, and socioeconomic issues were the reasons most cited as contributing significantly to dropout from the courses of Computer Science and Degree in Computing. For Computer Engineering, institutional and vocational factors were also often cited. For the courses of Computer Science and Degree in Computing, difficulty in mathematics-related subjects was one of the institutional factors most often cited. This factor was directly correlated with institutional factors (criteria for the evaluation of students were unsuitable or very rigid, lack of information subjects covered by the course, difficulty in understanding lecturers), thus indicating that in addition to a difficulty in mathematics representing a relevant factor for dropout, students see a relationship between their difficulties in these subjects and institutional issues. For the course of Computer Engineering, difficulty in subjects related to programming and algorithms was one of the most often cited institutional factors. This factor was significantly and moderately correlated with the vocational item 3 (change to a course in a different area). It must be noted that when giving their reasons for choosing a Computing degree course, most of the circumvented students attributed their main reasons as affinity, curiosity, and an interest in computers and in the field of computing. However, vocational aspects were widely cited by respondents as the reasons for dropout. Regarding this divergence, it should be emphasized that the item “Lack of information on subjects covered by the course: I expected to find different subjects and contents from those offered” was one of the most often cited in the questionnaires answered by students from all courses, thus indicating that many students had no idea of what the course covered, although they had already had some contact with computing. Family/personal/health and other issues were not cited as the strongest reasons for dropout from the evaluated courses.
Although the present study cannot reach the overall conclusions for such a complex and vast subject, it does provide some evidence for the influence of requirements regarding knowledge of mathematics and algorithmic abstraction on the dropout of students from Computing degree courses. It should be emphasized that the survey of circumvented students was performed in a public institution (UnB). Thus, the reasons for dropout are slightly different than for private institutions. For instance, it is reasonable that socioeconomic issues would have a higher impact on dropout in private institutions than in public ones. It can also be observed that students associate their difficulties in mathematics- and algorithm-related subjects with institutional factors such as classes taught by professors and evaluation criteria. Thus, the evidence points to a need for investment in teacher training.
An understanding of dropout from Computing degree courses has a great importance in the search for subsidies to mitigate this problem. Education managers can benefit from such information. In the face of the reasons highlighted in the present study, which mainly focus on institutional and vocational issues, measures can be suggested to decrease dropout rates from Computing degree courses: motivating students to form study groups with periodic meetings; stimulation of monitoring works with granting of benefits; teaching of reinforcement classes early in the course for students experiencing difficulties in algorithm-based and calculation subjects; with the aid of psychology professionals, students could be monitored in order to check their affinity with the course, since students often begin a degree course full of uncertainty, and professional guidance can help them to change courses (if applicable) within the same institution; encouraging students to participate as collaborators in scientific and extension projects, where possible associating theory and practice, and encouraging students to work in a team, which is an essential requirement in the job market.
Although difficulties in mathematics-related subjects and in algorithmic abstraction are not the only reasons for dropout from computing courses and are associated with other reasons, their relevance was observed in terms of dropout. An understanding of these difficulties within computing courses can contribute towards a decrease in dropout rates.
American Standard Code for Information Interchange
National secondary education examination. The ENEM is a non-compulsory, standardized Brazilian national exam, managed by the Ministry of Education’s National Institute for Educational Studies and Research (INEP). The ENEM is used as an admission exam for enrollment in many public and private universities in Brazil
National Institute for Educational Studies and Research “Anísio Teixeira”
Organisation for Economic Co-operation and Development
Program of Serial Evaluation. PAS is a form of entrance into higher education in Brazil adopted by several universities, divided into three exams done at the end of each year of high school
Brazilian Computer Society
The Unified Selection System is a federal government program that uses students’ ENEM scores to classify candidates for openings in public universities without taking the Vestibular
University of Brasilia
Abu-Oda GS, El-Halees AM (2015) Data mining in higher education: university student dropout case study. Int J Data Min Knowl Manage Proc 5:15–27
Barcelos TS, Silveira IF (2012) Pensamento computacional e educação matemática: relações para o ensino de computação na educação básica. In: XX Workshop sobre Educação em Computação, Curitiba. Anais do XXXII Congresso da Sociedade Brasileira de Computação, Mackenzie Presbyterian University, Curitiba, pp 16–19
Barroso MF, Falcão EB (2004) Evasão universitária: o caso do Instituto de Física da UFRJ. In: Anais do 9° encontro nacional de pesquisa em ensino de física. Federal University of Rio de Janeiro, Jaboticatubas, pp 26–30
Brasil (2016) SESu/MEC. Diplomação, Retenção e Evasão nos Cursos de Graduação em Instituições de Ensino Superior Públicas. http://www.andifes.org.br/wp-content/files_flutter/Diplomacao_Retencao_Evasao_Graduacao_em_IES_Publicas-1996.pdf. Accessed 22 May 2017.
Dancey C, Reidy J (2006) Estatística sem matemática para psicologia: usando SPSS para Windows. Artmed, Porto Alegre
Gaioso NPL (2005) Evasão discente na educação superior: a perspectiva dos dirigentes e dos alunos. Dissertação, Universidade Católica de Brasília.
Godoy AS (1995) Introdução à pesquisa qualitativa e suas possibilidades. Rev Adm Empres 35:57–63
Gruner S (2015) On the future of computer science in South Africa: a survey amongst students at university. In: Proceedings of the 44th annual Southern African Computer Lecturers Association, University of the Witwatersrand, Johannesburg, 1–3 July 2015.
Hair JF Jr, Black WC, Babin BJ, Anderson RE, Tatham RL (2005) Multivariate data analysis. Prentice Hall, New Jersey
Hora HRM, Monteiro GTR, Arica J (2010) Confiabilidade em questionários para qualidade: um estudo com o coeficiente alfa de Cronbach. Produto Produção 11:85–103
INEP (Instituto Nacional de Estudos e Pesquisas Educacionais Anísio Teixeira) (2012). Resumo técnico censo da educação superior de 2010. http://download.inep.gov.br/download/superior/censo/2010/resumo_tecnico_censo_educacao_superior_2010.pdf. Accessed 02 Jan 2017.
Kinnunen P, Malmi L (2006) Why students drop out CS1 course? In: Proceedings of the 2nd International Workshop on Computing Education Research, Canterbury, United Kingdom, 9–10 September 2006
Maroco J (2014) Análise estatística com o SPSS Statistics. Report Number, Pêro Pinheiro.
Maroco J, Garcia-Marques T (2013) Qual a fiabilidade do alfa de Cronbach? Questões antigas e soluções modernas? Laboratório de Psicologia 4:65–90
Nunes DJ (2008) Estatísticas da educação superior: área da computação. Relatório Técnico, Sociedade Brasileira de Computação
OCDE (Organização para a Cooperação e Desenvolvimento Econômico) (2016) Classificação internacional EUROSTAT/UNESCO/OCDE. http://download.inep.gov.br/download/superior/2009/Tabela.OCDE.2009.pdf. Accessed 3 Mar 2016.
Paula LQD, Piva D Jr, Freitas RL (2009) A leitura e a abstração do problema no processo de formação do raciocínio lógico-abstrato em alunos de computação. Reverte 1:124–133
Piva D Jr, Freitas RL (2011) Estratégias para melhorar os processos de abstração na disciplina de algoritmos. In: Anais of the Simpósio Brasileiro de Informática na Educação of the 31st Congresso da Sociedade Brasileira de Computação, Federal University of Pernambuco, Pernambuco, 23–26 November 2010
Silva Filho RLL, Motejunas PR, Hipólito O, Lobo MBCM (2007) A evasão no ensino superior brasileiro. Cad Pesqui 37:641–659
Souza C, Petró C, Gessinger R (2012) Um estudo sobre evasão no ensino superior do Brasil nos últimos dez anos: as possíveis causas e fatores que influenciam no abandono. Prevendo o risco do abandono. In: Anais of the 2nd Conferência Latinoamericana Sobre el Abandono em la Educacion Superior, Pontifical Catholic University of Rio Grande do Sul Porto Alegre, 8–9 November 2012
Vieira S (2003) Bioestatística - tópicos avançados. Editora Campus, Rio de Janeiro
Xenos M, Pierrakeas C, Pintelas P (2002) A survey on student dropout rates and dropout causes concerning the students in the course of informatics of the Hellenic Open University. Comput Educ 39:361–377
We thank the Secretariat of Academic Records at UnB, which kindly and rapidly provided student records, thus enabling them to be contacted for the present study.
Availability of data and materials
The data from INEP can be downloaded from http://portal.inep.gov.br/microdados.
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.