Introduction
The process of European convergence in higher education, in its efforts to improve the quality of teaching-learning processes and their results, has made a strong commitment to a learning-centered model, oriented towards development and acquisition of competencies (Arco & Fernández, 2011). In this context, students are forced to face their learning in an autonomous, intentional, and effective way, with a great dose of commitment and involvement (Arco & Fernández, 2011; Fernández et al., 2013; Panadero & Alonso, 2014). However, some studies suggest that the students' preparation to face this demand is not adequate enough, as they present important deficiencies in several aspects related to the use of learning strategies and the control of certain variables involved in learning (Carbonero et al., 2013; Roces & Sierra, 2017; Rosário et al., 2015), besides, we cannot forget there aren't many teachers spending time on their students' autonomous learning training (Fernández et al., 2013).
In this sense, there is a clear need for university institutions to implement initiatives and measures aimed at improving the processes and quality of their students' learning (Arco & Fernández, 2011; Arco et al., 2020; Carbonero et al., 2013; Cerezo et al., 2015; Fernández et al., 2013; Roces & Sierra, 2017), and to that end, it is essential to consider research on the self-regulated learning, given its significant explanatory potential for the key processes involved in learning and academic success (Bernardo et al., 2016; García & Pérez, 2011; Jansen et al., 2019; Richardson et al., 2012; Rosário et al., 2014; Schneider & Preckel, 2017; Sitzmann & Ely, 2011). This construct includes a number of theoretical models that, despite their differences, allow it to be conceived as an active process in which the student, through different strategies, controls, monitors and regulates the cognitive, metacognitive, affective-emotional, motivational, contextual and behavioral variables involved in learning, primarily in order to achieve the established objectives that guide their learning (Hernández & Camargo, 2017; Jansen et al., 2019; Panadero, 2017; Panadero & Alonso, 2014; Rosário et al., 2014).
These self-regulated learning processes and strategies can be optimized with certain intervention programs, which in turn promote improvements in learning and performance (Fernández et al., 2013; Jansen et al., 2019; Núñez et al., 2011; Zimmerman, 2015), which has led some universities to offer this type of training to their students (Roces & Sierra, 2017), using a variety of strategies, resources and materials (e.g., from minimal adjustments to classroom processes, using rubrics, charts, protocols, etc., to specific courses or subjects, face-to-face or virtual) (Hernández & Camargo, 2017). Nevertheless, a high percentage of these programs do not usually establish any type of evaluation and too often adopt qualitative, pre-experimental or quasi-experimental designs with nonequivalent control groups (Broadbent & Poon, 2015; Cerezo et al., 2010; Hernández & Camargo, 2017), which forces us to question the strength of the available evidence on the efficacy and effectiveness of many of them.
The Peer-Tutoring Program (PTP), in its version 2.0 presented in this research was implemented in the 2017/2018 academic year and incorporates a series of elements that aim to correct these limitations, such as improving the degree of experimentation of these interventions, the fidelity of their implementation and impact. PTP is peer-learning based (i.e., peers of different age and academic year with high levels of self-regulated learning, after a training process, provide help and support with a fixed role in pairs to other students to improve their self-regulated learning) and has been implemented in a previous edition, in which it has already demonstrated its capacity to improve the work and study habits of the participating students, as shown by the results of its evaluation (Arco & Fernández, 2011). However, these results also showed the need to implement changes in certain elements and characteristics of its design, implementation, and evaluation (e.g., extension of sessions throughout the academic year, increase in monitoring activities, increase in the number of control variables, etc.) (Arco & Fernández, 2011), modifications that have been included in this version 2.0.
Thus, the purpose of this research was to test the effectiveness of PTP 2.0 in improving self-regulated learning among freshmen, identifying in parallel the effects of participation in the program on tutor students' self-regulated learning. The hypotheses that were established were: (1) as a result of participation in the program, there will be a statistically significant improvement in the self-regulated learning of freshmen in the experimental group compared to students in the control group in the post-test phase; (2) as a consequence of their participation in PTP 2.0, there will be a statistically significant improvement in freshmen' self-regulated learning in the experimental group in the post-test phase with respect to the pre-test phase; and (3) in the case of tutor students, as a result of the program, a statistically significant improvement in self-regulated learning will be observed in the post-test phase with respect to the pre-test phase.
Method
Participants
The sample had a total of 152 students from the University of Granada, 102 freshmen and 50 seniors.
Freshmen was divided into two groups: (a) experimental group, composed of 51 students, 43 females and 8 males, with an average age of 18 years old (SD = 0.00), and a distribution by degrees of 16 students from the Degree in Pharmacy, 4 from the Degree in Economics, 24 from the Degree in Psychology and 7 from the Degree in Business Administration and Management.; and (b) control group, consisting of 51 students, with the same distribution by degree and sex, and identical mean and age range as the experimental group.
The average age of the student tutors was 22.62 years, with a range between 20 and 32 years. The distribution by degree was 16 students from the Degree in Pharmacy, 4 from the Degree in Economics, 23 from the Degree in Psychology and 7 from the Degree in Administration. By sex, 41 were women and 9 were men, of Spanish nationality.
The sampling was non-probabilistic, of convenience, and implied the following actions: (a) selection of participating degrees, considering the declared interest of their institutional heads, together with the authorization of the ethics committee (150/CEIH/2016), as well as the objectives and resources of the program.; (b) execution of the dissemination plan among the students, i.e., 32 group sessions were held in their respective groups-classrooms during the first 10 minutes of the large group theory class, in which they were verbally informed of the conditions and benefits of the program, in addition to requesting their participation and summoning them to a group recruitment session to formalize their enrollment in the program; (c) implementation of the recruitment plan, i.e., 2 group sessions in the aula magna of each of the participating university centers, in the mornings and afternoons, in which, after providing more information to the students attending and resolving their doubts and queries, 269 freshmen and 141 seniors voluntarily enrolled in the program, for which they had to sign the Participation agreements (document with rights and duties of participants), complete the Participation protocols (self-report with demographic, academic and interest information of participants) (Arco & Fernández, 2011) and the Motivated Strategies for Learning Questionnaire (MSLQ) (Albert, 2017; García & Pérez, 2011; Roces et al., 1995), and provide a copy of their academic report; and (d) final selection of the sample.
In this regard, from the 269 freshmen, 83 associated pairs (N = 166) were created based on two groups of control variables (Ato et al., 2013) associated with the processes and quality of learning (Barbera et al., 2017; Bernardo et al., 2016; Garcia & Perez, 2011; Richardson et al., 2012; Schneider & Preckel, 2017): (a) academic: degree, year, group, subjects and number of credits enrolled, baccalaureate branch of origin, number of times the student has taken the university entrance exam, place of choice of degree in pre-enrolment, average high school grade, average university entrance exam grade, previous dropout and change of studies, repetition of previous courses, self-regulated learning, and effort/commitment; and (b) sociodemographic data: age, sex, marital status, nationality, employment status, with whom they live, socioeconomic status and financing of studies.
The remaining 103 freshmen were excluded due to not having an appropriate associated pair. In addition, the minimum size of incoming freshmen required for program evaluation, total (N = 102) and per group (N = 51), was calculated, so 32 of the 83 associated pairs were randomly discarded.
On the other hand, from the 141 final year students enrolled in the program, the tutor students were selected (N = 50), considering the following criteria: having an average grade higher than 7 points, presenting a centile score higher than 30 points in the subscales of the MSLQ, and having a score higher than 30 points in the MSLQ subscales (Albert, 2017; García & Pérez, 2011; Roces et al., 1995), to have time available in the required time windows, to attend the three training sessions and to pass the different practical exercises carried out during the training.
Instruments
MSLQ (Albert, 2017; García & Pérez, 2011; Roces et al., 1995). Like the original version (Pintrich et al., 1993), it is a Likert scale consisting of 81 items with 7 response alternatives (i.e., 1: does not describe me at all, up to 7: describes me completely) grouped into 15 subscales and 5 components: (a) cognitive and metacognitive strategies: rehearsal, elaboration, organization, critical thinking, and metacognitive self-regulation (e.g., “64. When I read subject content, I try to relate new content to the knowledge I already possessed”); (b) resource regulation strategies: time and environment management, effort management, peer learning and help-seeking (e.g., “68. When I do not understand some subject content, I ask a classmate for help”); (c) affective motivation: anxiety (e.g., “8. When I am taking a test, I think of those test questions that I cannot answer”); (d) motivation expectations: self-efficacy and control beliefs about learning (e.g., “29. I am convinced that I can master the skills and techniques taught in the subjects”); and (e) value motivation: intrinsic and extrinsic goal orientation, and task value (e.g., “24. When I have the opportunity, I choose jobs that I can learn from, even if they do not guarantee me a good grade”). This instrument was chosen because it is the most widely used to measure self-regulated learning in university students (Roth et al., 2016), in addition to presenting adequate reliability (Cronbach's alpha of .82 and .89 for the motivation and learning strategies components, respectively, and between .48 and .86 points for the subscales) and validity (factor analysis demonstrating its factorial structure).
Procedure
The methodological design adopted for hypotheses 1 and 2 was quasi-experimental with non-equivalent control group enhanced with statistical control techniques, while for hypothesis 3 it was pre-experimental pre-test-post-test (Ato et al., 2013).
As for the procedure, once the sampling was performed, the freshmen experimental and control groups were configured, i.e., each of the members of the 51 selected associated pairs was randomly assigned to the experimental or control condition. (https://www.randomizer.org). It was then confirmed that the two groups were equivalent with respect to the control variables, given that: (a) some of them had only one value, such as age (18 years old), grade (first year), number of times he/she had taken the university entrance exam (1 time), nationality (Spanish), previous dropout and change of studies (0 times), previous repetition of courses (0 times), marital status (single) and employment status (not working); (b) other showed the same proportion in both groups, such as degree (Degree in Pharmacy = 31.38%, Degree in Economics = 7.84%, Degree in Psychology = 47.06%, Degree in Business Administration and Management = 13.72%), group (morning = 78.40%, afternoon = 21.60%), place of choice of degree in pre-registration (first = 90.20%, second = 5.90%, third = 3.90%), baccalaureate branch of origin (sciences = 45.10%, humanities and social sciences = 54.90%) and sex (male = 15.70%, female = 84.30%); and (c) the non-parametric contrasts carried out revealed no statistically significant differences between the two groups in number of credits enrolled (U = 1300.50, p > .05, d = 0.00), average high school grade (U = 955.50, p > .05, d = 0.02), average university entrance exam grade (U = 445.00, p > .05, d = 0. 03), average university entrance score (U = 1292.00, p > .05, d = 0.01), self-regulated learning (see Table 1), effort/engagement (U = 1116.50, p > .05, d = 0.01), with whom he/she lives (χ2 = 0.21, p > .05), socioeconomic status (χ2 = 2.06, p > .05) and study financing (χ2 = 0.04, p > .05).
In the terms mentioned above, the student tutors had to undergo training aimed at acquiring the necessary skills to perform their functions in the program effectively and efficiently. To that end, during three three-hour sessions, through an active and participatory methodology (i.e., expository teaching, direct instruction and problem-based learning), the following contents were worked on: (a) session 1: presentation of participants and training plan, program rationale, self-regulated learning model and intervention measures (Arco et al., 2009); (b) session 2: use of the Workbooks (a set of materials in which each of the tutoring sessions was presented in a structured way) (Arco et al., 2009; Fernández & Arco, 2009a, 2009b), and execution of tasks of the first tutoring session; and (c) session 3: needs assessment of freshmen (preparation and analysis of self-registration), establishment of objectives, strategies and tasks (Arco et al., 2009; Fernández & Arco, 2009a, 2009b), and analysis and solution of possible difficulties during the tutoring sessions.
Subsequently, we proceeded to the pairing, as recommended by the specialized literature (i.e., Durán, 2002; Fernández, 2007; Tindall & Black, 2009; Topping, 2015; Topping et al., 2017), assigning to the student tutors, based on their equivalence in degree and time availability, the number of students in the experimental group that requested tutoring (one student tutor requested two students and the rest one student), to then implement the tutoring sessions.
The tutoring sessions were implemented in a fixed-role format of a couple of different ages and academic year, extending throughout the academic year, with a weekly frequency and a duration of 90 minutes. In conclusion, for each student in the assigned experimental group, the student tutors carried out 20 tutoring sessions at the times (i.e., between 9-13h. and 16-20h.) and places (i.e., library work and study rooms and student delegation meeting rooms) arranged in each of the participating university centers. These sessions were structured and sequenced in the Workbooks (Arco et al., 2009; Fernández & Arco, 2009a, 2009b) to facilitate the work of the student tutors, although they had to adapt and specify certain actions to the characteristics, needs and progress of their students, thus awarding them a greater protagonist and level of responsibility and control in the course of these sessions, as recommended by the literature (Topping, 2015; Topping et al., 2017).
In this sense, in order to work on dispositional and supportive learning strategies, the tutor students and their corresponding freshmen carried out the following actions in session 1: (a) presentation and guided tour of the Faculty, describing its services and providing relevant academic information; (b) recording in Workbooks contact information, schedules and locations of sessions, academic information (subjects, professors and tutorials) and relevant dates (e.g., exams, assignments, etc.); (c) reading and comments on their rights and duties; (d) description of university services; (e) completion of a brief description of what was learned in this session; and (f) assignment of tasks to the freshmen for the following session (preparation of self-registration). In tutoring session 2, regarding the metacognitive strategies of planning and regulation, the student tutors and their freshmen jointly carried out the following actions: review of pending tasks, exploration of the self-regulated learning model included in the Workbooks, needs assessment of the freshmen (analysis of self-registration, as well as the selection, recording and graphic representation of relevant variables: hours of sleep, hours of study that hurts and does not hurt, and hours of leisure), establishment of objectives, strategies and tasks related to the environmental conditions of study and time planning (analysis and changes in study conditions, analysis of academic tasks, elaboration of weekly study plan and establishment of rewards, taking as a reference the information provided in the Workbooks), a brief description of what has been learned and a list of the tasks for the next session (implementation of the study plan and recording of the degree of compliance). On the other hand, the aspects worked on by the student tutors and their freshmen on metacognitive strategies of planning, regulation and evaluation in session 3 were: review of set tasks, comparative analysis between the weekly study plan prepared and its degree of compliance, with recording and graphic representation of variables, establishment of objectives, strategies and tasks associated with time planning and procrastination (making adjustments to the study plan and self-administration of rewards, based on the information provided in the Workbooks), brief description of what was learned in this session and assignment of tasks for the next session (implementation of the study plan, with the adjustments made, and recording of its degree of compliance).
Regarding the rest of the sessions, the structure and sequence of action was common to session 3, and was aimed at perfecting and reinforcing the freshmen' study plan and their degree of compliance (metacognitive strategies for planning, regulation and evaluation), but also establishing parallel objectives, strategies and tasks related to healthy habits (sleep hygiene in session 4 and nutrition in session 5), cognitive strategies for rehearsal, organization and elaboration (techniques for searching, collecting and selecting information in sessions 6 and 12, repetition techniques in session 7, and techniques for establishing relationships when working with information in sessions 13, 14 and 15), and dispositional and support strategies (reduction of public speaking anxiety in sessions 16 and 17), always taking as a reference the information provided in the Workbook. However, the three sessions prior to the exam periods (sessions 8, 9 and 10, and 18, 19 and 20) were characterized by a work aimed at the freshmen: (a) to examine and reinforce their study plan, respecting class time, meals and sleep; (b) to increase review, providing more time to consolidate (understanding and memorizing) knowledge, and simulated evaluation; (c) to control their anxiety before exams; and (d) to make coherent and realistic decisions when prioritizing subjects. In addition, the first session of the second semester (session 11), was devoted to the joint assessment of the results achieved in the first semester, specifying attributions, in order to establish objectives, strategies and tasks related to metacognitive strategies of planning and regulation (analysis of tasks on new subjects and study plan), also stimulating positive attitudes towards new teachers and subjects.
In addition, in parallel to the intervention plan, a monitoring plan was developed to monitor possible deviations from the program: (a) a passive participant observer was included in the training with the tutoring students; (b) 3 individual monitoring sessions between those responsible for the program and the tutoring students after tutoring sessions 2, 5 and 15, in which the actions carried out in these tutoring sessions were evaluated and the appropriate recommendations were issued with a view to establishing objectives, strategies and tasks in the following sessions; and (c) 2 monitoring group sessions after tutoring sessions 10 and 20, in which the program, the difficulties in its development and possible solutions were globally assessed. Finally, with the outcome evaluation plan, measures were taken of the dependent variable, before and after the implementation of the program, to verify the presence or absence of statistically and educationally significant effects.
Data analysis
Power and sample size analyses were calculated considering the expected effect size (0.50), the associated probability (.05) and the desired statistical power levels (0.80) (Soper, 2021).
Likewise, after checking the absence of univariate (Kolmogorov-Smirnov and Shapiro-Wilk tests) and multivariate (Mardia coefficients) normality in the distribution of the scores, and the absence of outliers, missing and influential values (Mahalanobis distance), the data were analyzed using the tests: (a) Mann-Whitney U test for two independent samples and Pearson χ2 to determine the equivalence of the experimental and control groups in the control variables; (b) Mann-Whitney U test for two independent samples for hypothesis 1; and (c) Wilcoxon z test for hypotheses 2 and 3. In addition, Cohen's d value was calculated, while the error rate per family, resulting from the problem of multiple comparisons, given the impossibility of performing multivariate contrasts, was controlled with the Bonferroni correction.
Results
The significance level for each of the multiple comparison tests was adjusted with the Bonferroni correction, being .003 for the different hypotheses (.05/15).
The results derived from the intergroup comparisons in the pre-test phase reveal no statistically significant differences in self-regulated learning, while in the post-test phase, hypothesis 1, statistically significant differences in favor of the experimental group were observed in self-efficacy, control beliefs about learning, metacognitive self-regulation, and time and environment management (Table 1).
Regarding hypothesis 2, pre-test-post-test comparisons in the experimental group show a statistically significant increase in the post-test phase with respect to the pre-test phase in self-efficacy, control beliefs about learning, metacognitive self-regulation, time and environment management, peer learning, and help-seeking (Table 2). These same comparisons in the control group only show statistically significant differences in favor of the pre-test phase in self-efficacy (Table 2).
Finally, the results of the intragroup comparisons in the tutor students, hypothesis 3, reveal statistically significant differences in favor of the post-test phase in task value, metacognitive self-regulation, time and environment management, and effort management (Table 3).
Discussion and conclusions
The purpose of this research was to test the complementary capacity of PTP 2.0 to improve the level of self-regulated learning among university students in comparison with its previous edition. Therefore, considering the results obtained, the following conclusions can be drawn: (1) PTP 2.0 had a statistically significant impact in the post-test phase in favor of the experimental group on certain indicators of self-regulated learning (self-efficacy, control beliefs about learning, metacognitive self-regulation, and time and environment management), so hypothesis 1 can be partially accepted; (2) since statistically significant differences are observed in the experimental group in favor of the post-test phase in certain indicators of self-regulated learning (self-efficacy, control beliefs about learning, metacognitive self-regulation, time and environment management, peer learning, and help-seeking) as a result of the program, hypothesis 2 can also be partially accepted; (3) given that there are statistically significant differences in favor of the post-test phase with respect to the pre-test phase in some indicators of self-regulated learning (task value, metacognitive self-regulation, time and environment management, and effort management), hypothesis 3 can also be partially accepted.
These results based on statistical significance, despite the overly conservative nature of the Bonferroni correction, clearly show that participating in PTP 2.0 has a positive and statistically significant effect on various indicators of self-regulated learning. Moreover, if the hypotheses are tested using tests that provide answers on their practical significance, as recommended by the literature (Ledesma et al., 2008), the effect size achieved for most indicators was medium-large (Cohen, 1988), i.e., the intergroup differences generated can be detected by observation (Coe, 2002). Moreover, it is also necessary to highlight the deterioration over time observed in most of the indicators in the case of the control group, without forgetting the improvements seen in the student tutors.
In sum, the results obtained confirm the effectiveness of PTP 2.0 in improving the levels of self-regulated learning, which also reinforces the idea that this competence can be optimized with appropriate training (Roces & Sierra, 2017). Likewise, PTP 2.0 has demonstrated greater effectiveness than its previous version (Arco & Fernández, 2011) and even other programs with different strategies, resources, and materials (e.g., learning to learn courses, audiovisual workshops, web applications, rubrics, and letters) (Hernández & Camargo, 2017; Roces & Sierra, 2017), although it is also necessary to highlight its greater efficiency. Of course, this increased effectiveness and efficiency of PTP 2.0 can be attributed to the incorporation of peer-tutoring as an intervention strategy, but also to the key changes and improvements revealed by the evaluation of the previous edition.
Nevertheless, when interpreting the results obtained, it is also necessary to consider certain methodological limitations such as, firstly, (a) the possible sample selection bias, which means that the program is only suitable for students who are willing to benefit from this training, (b) the possible measurement bias, by using a single instrument to measure the dependent variable, or (c) the possible grouping effects, which may mean that the results in new students are the result of the level of competence of the tutor students (Arco et al., 2020). Secondly, mention should be made of the absence of statistically significant differences in some indicators of self-regulated learning, as well as the size of the effect achieved, since it was of a small magnitude. However, these values do not jeopardize the importance and contribution of this research, mainly because effect sizes in educational research are usually much smaller than in other disciplines, and values around 0.30 are already considered of important practical relevance (Hattie, 2009). Thirdly, it should be pointed out that, in this edition of the program, despite the efforts made, it has again been impossible to complement the tasks performed by the student tutors with the tutorial action of the teaching staff. And, fourthly, in similar terms to the previous edition of the program, certain difficulties have been identified linked to the behavior of the participating students (e.g., lack of punctuality, attendance to sessions without materials, etc.).
In this sense, in future work, it would be appropriate to evaluate the program in other university degrees and institutions, and even in previous educational stages, of course, after making the appropriate adjustments and adaptations. Obviously, it is always possible to increase the size and representativeness of the sample, the number of measurement instruments and increase the degree of experimentation of any applied research, as well as to modify those elements and characteristics of the program that continue to be susceptible to improvement, but this type of improvement will always require greater economic resources (Bettinger & Baker, 2014). It would also be desirable to confirm whether the effects of the program are maintained over time, and even if these effects lead to improvements in the academic performance of students, as this is one of the main criticisms and challenges of these interventions (Arco & Fernandez, 2011; Roces & Sierra, 2017).
This research provides empirical evidence on the power and validation of the PTP 2.0 causal model of intervention to generate improvements in the processes and quality of university student learning, in addition to providing higher education institutions with an efficient, very low-cost tool or initiative to do so (Arco et al., 2020). In this sense, and following international standards of quality in educational innovations, it can be stated that this is an example of an evidence-based practice or program (Slavin, 2017). The scarcity of works with these characteristics and methodological rigor (Hernández & Camargo, 2017) recommends continuing to support the systematic evaluation of this type of interventions, in order to accumulate more evidence and improve their impact.