Poster Topic: Assessment
101 - Analysis of the Relationship of Medical Student Personal Distress and Emotion Regulation using Machine Learning

102 - Analyzing the Impact of Personality on Autonomy and the Mediator Role of Motivation
The purpose of this study is to analyze the impact of personality factors on medical student motivation as mediated by having purpose.
METHODS: In 2017/18, 205/500 M-1/M-2 medical students voluntarily completed the Five Factor Personality Inventory (IPIP-50; scale: 1 = very inaccurate, 5 = very accurate), the RS-25 Resilience Scale (scale: 1 = strongly disagree, 7 = strongly agree) to measure purpose, and the Trait-Emotional Intelligence Questionnaire (TEIQue-sf; scale: 1 = completely disagree, 7 = completely agree) to measure motivation. Pearson correlations and multivariate linear regression were generated via IBM® SPSS® 26.0. This research was approved by the institution’s IRB.
RESULTS: Motivation (alpha=0.7) mean scores were significantly (p

103 - AWARD NOMINEE - Can formative assessments enhance student engagement and learning in a pandemic environment?
Use of Guided and Frequent Formative Assessments to Enhance First Year Medical Student Engagement and Learning in a Virtual Environment.
AUTHOR
Cindy Funk, PhD, Burrell College of Osteopathic Medicine, Las Cruces, NM
PURPOSE
The value of formative assessments in education is well documented. Due to the pandemic, many medical curricula adopted virtual delivery. To engage students in this environment, we sought to examine the effectiveness of guided formative assessments on medical student learning.
METHODS
During the COVID pandemic, the didactic curriculum at Burrell College of Osteopathic Medicine was delivered in a virtual, asynchronous format. In the Musculoskeletal I system, a series of online formative quizzes on upper limb anatomy was delivered via Learning Catalytics. Quizzes were composed of multiple-choice, short-answer, matching, and identification questions designed to provide active learning, feedback, and knowledge gap identification. “In-session” quizzes were provided during virtual lectures at 15-minute intervals, targeting key concepts. “Post-session” quizzes were delivered after lectures to test higher-order knowledge. Quizzes provided immediate, written feedback. ANOVA and correlation statistics were used to determine whether quiz participation impacted summative exam performance.
RESULTS
Quiz usage was robust; students completed an average of 8 of 11 quizzes. To compare outcomes, students were divided into three groups: low engagement (0-3 quizzes completed; 42 students), moderate engagement (4-7 quizzes; 38 students), and high engagement (8-11 quizzes; 92 students). ANOVA revealed significant differences among these groups in summative exam performance (F = 9.416). A t-test revealed a significant difference in summative exam performance between the high (84.8%) and low (78.75%) engagement groups. A positive correlation between summative exam performance and the number of quizzes completed was also found.
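For readers who want to reproduce this style of analysis, the following Python sketch (not the author's code; the abstract does not name the software used) runs the same three tests on hypothetical score arrays shaped like the groups above. The mean for the moderate group is assumed, since the abstract reports only the high and low group means.

import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
# Hypothetical summative exam scores; high/low means from the abstract, moderate assumed.
low = rng.normal(78.75, 8, 42)        # 0-3 quizzes completed
moderate = rng.normal(81.5, 8, 38)    # 4-7 quizzes completed
high = rng.normal(84.8, 8, 92)        # 8-11 quizzes completed

# One-way ANOVA across the three engagement groups.
f_stat, p_anova = stats.f_oneway(low, moderate, high)

# Follow-up t-test between the high and low engagement groups.
t_stat, p_ttest = stats.ttest_ind(high, low)

# Correlation between quizzes completed and exam score (pooled data).
quizzes = np.concatenate([rng.integers(0, 4, 42), rng.integers(4, 8, 38), rng.integers(8, 12, 92)])
scores = np.concatenate([low, moderate, high])
r, p_corr = stats.pearsonr(quizzes, scores)
print(f"ANOVA F={f_stat:.2f} (p={p_anova:.4f}); t={t_stat:.2f} (p={p_ttest:.4f}); r={r:.2f}")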
CONCLUSIONS
Adapting to virtual environments has been difficult for many medical students, creating isolation and a lack of academic direction. We demonstrate positive engagement and feedback with formative assessments, indicating their value in engaging students, supporting learning, and preparing for summative assessments.

104 - OSCE: Objectively Scoring in the COVID Era
We did not feel recording the OSCE sessions was paramount given the logistical barriers imposed by the government restrictions in place at the time of assessment. Our critical concern, in order to maintain the fidelity of the exam, was conducting the sessions so as to minimize the possibility of learners receiving advance notice of the case. Future directions include assessing the OSCE formats we utilized, comparing and contrasting the methods in terms of safety in the COVID era and validity for quality assurance, given that learner assessment at a distance will likely continue into the next academic year.

105 - Exploring students' perception of online OSCE: A qualitative study

106 - AWARD NOMINEE - Determining The Relationship Between Gap Years, First Year Medical School Performance, and Academic Burnout
Second-year medical students participated in a two-part survey. The primary survey collected general information, including gap year participation and pre-health clinical experiences. The secondary survey consisted of the Maslach Burnout Inventory-Student Survey (MBI-SS). Student GPAs were de-identified and provided by the school's faculty.
Of the 60 responses, 37 (62%) medical students had taken a gap year after their undergraduate education, while the remaining 23 (38%) matriculated directly into medical school. Pre-health clinical experiences were acquired by 40 (67%) students. We were unable to detect a difference in mean GPAs between students with one or more gap years (3.4, SD 0.59) and students with no gap years (3.3, SD 0.48) (P = 0.55). Even when accounting for the wide range of gap years by separating students into three groups (0 gap years, 1-2 gap years, 3+ gap years), no differences in GPAs were found (P = 0.83). No correlations were found between gap years and any component of the MBI-SS: exhaustion (r = 0.103, P = 0.593), cynicism (r = 0.055, P = 0.775), or academic efficacy (r = 0.166, P = 0.387).
There is variation in the number of years students spend between their undergraduate education and medical school. Deferring medical school is likely explained by an increased demand for clinical experience and by admission refusals. Our study explored the effect of education deferment on medical school academic performance and burnout, and it was unable to identify differences in students’ GPAs or burnout risk associated with gap years. Our work highlights opportunities to better understand and further evaluate the impact of student burnout at the medical school level.

107 - The Impact of Gender on Resident Evaluations of Faculty Performance
Methods: A mixed methods analysis will be employed to examine 14,669 teaching evaluations of faculty by residents at a single academic center between 2017 and 2020. Anonymized ratings of male and female faculty on a five-point Likert scale will be compared using the chi-square test. Free-text narrative evaluations will be analyzed using grounded theory to examine them for gendered language.
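As a hedged illustration of the planned quantitative comparison, the Python sketch below applies a chi-square test to a contingency table of Likert ratings by faculty gender; the counts are hypothetical placeholders, since the study is still in progress.

import numpy as np
from scipy.stats import chi2_contingency

# Rows: faculty gender; columns: counts of Likert ratings 1..5 (hypothetical).
ratings = np.array([
    [120, 340, 1100, 2900, 3500],   # male faculty
    [110, 360, 1050, 2500, 2700],   # female faculty
])
chi2, p, dof, expected = chi2_contingency(ratings)
print(f"chi2({dof}) = {chi2:.2f}, p = {p:.4f}")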
Results: We expect to report a quantitative comparison between evaluations of faculty with respect to gender. We plan to perform subset analyses based on academic rank. We anticipate reporting qualitative outcomes in the form of major themes which emerge during analysis as well as a comparison of narrative evaluations between male and female faculty. Based on prior literature on gender differences and their impact on teaching evaluations, we expect to find gender differences in quantitative ratings of faculty as well as differences in qualitative analysis with respect to learner expectations and cited areas of strength for faculty.
Conclusion: Implicit gender bias has previously been shown to affect evaluations of faculty. These disparities negatively impact promotion and tenure for female faculty. If gender bias is found in the assessment of female academic faculty, evaluator training to mitigate implicit bias can be pursued to close the gender gap in academic emergency medicine.

108 - How to measure "Fit": Standardized comparison of program and applicant alignment of values and priorities

109 - Implications of using an SJT in admissions for predicting future professionalism issues

110 - Toward the Development and Construct Validity of the 7Ps Inventory of Self-Regulated Learning to Identify Student Academic Success
Daria Ellis PhD, Priyadarshini Dattathreya MBBS, MD, and Maureen Hall MD, MEd, BSc
PURPOSE: Self-regulated learning has been identified as a key factor that determines academic success. Using the principles of self-regulated learning, we developed the 7Ps Inventory of metacognitive strategies, which breaks the task of ‘learning’ down into strategies including planning and organizing, self-monitoring, and evaluating academic progress through self-reflection. The purpose of this study is to establish the construct validity of the 7Ps Inventory.
METHODS: We conducted an initial psychometric validation of the 7Ps Inventory with 500 medical students. Exploratory (EFA) and confirmatory factor analyses (CFA) were conducted to assess the latent structure of the 7Ps Inventory. Findings highlighted areas where the 7Ps Inventory required revision. Following revision, we conducted another psychometric validation with an additional sample of 191 first-year medical students. We used EFA and CFA to assess construct validity, and we assessed reliability using Cronbach’s measure of internal consistency.
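The following Python sketch illustrates the general EFA-plus-reliability workflow described above, using the factor_analyzer package and Cronbach's alpha computed directly from its definition. The response matrix and the 4-item subscale are hypothetical stand-ins, not the study's data.

import numpy as np
from factor_analyzer import FactorAnalyzer

def cronbach_alpha(items):
    # items: (n_respondents, n_items) matrix for one subscale.
    k = items.shape[1]
    item_vars = items.var(axis=0, ddof=1).sum()
    total_var = items.sum(axis=1).var(ddof=1)
    return (k / (k - 1)) * (1 - item_vars / total_var)

rng = np.random.default_rng(1)
responses = rng.integers(1, 6, size=(191, 26)).astype(float)   # hypothetical five-point data

# Extract seven factors (Plan ... Pause) with an oblique rotation,
# since the 7Ps components are plausibly correlated.
efa = FactorAnalyzer(n_factors=7, rotation="oblimin")
efa.fit(responses)
print(efa.loadings_.round(2))                    # item-by-factor loading matrix

# Internal consistency for one hypothetical 4-item subscale.
print(f"alpha = {cronbach_alpha(responses[:, :4]):.2f}")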
RESULTS: The final version of the 7Ps Inventory comprised 26 items (rated on a five-point response scale) corresponding to seven discrete components: Plan, Prepare, Participate, Process, Practice, Performance, and Pause. The models for the final revised scales had good fit, and the internal reliability of these scales was marginal to excellent, with Cronbach’s α ranging from 0.52 to 0.86.
CONCLUSION:
Our preliminary evidence suggests that the 7Ps Inventory, designed to help students reflect on their learning and assess their use of specific learning strategies, is a psychometrically robust tool.

111 - AWARD NOMINEE - Comparing Online and In-Person Educational Workshops for Canadian Occupational Therapists: Exploring the Learning Experience
The Do-Live-Well (DLW) framework is a health promotion approach that many occupational therapists (OTs) are interested in learning about. Although online education has become increasingly popular among health care professionals, studies of its effectiveness and learners’ experience have been limited in occupational therapy education. The objectives of this study were to compare the effectiveness of the online and in-person DLW workshop for Canadian OTs and to explore participant experiences in both types of workshops.
Methods
An explanatory sequential mixed-methods study design was used. In the quantitative phase, descriptive and inferential statistics were used to compare the effectiveness of the two educational methods at three time points (pre, post, and 6-month follow-up). The primary outcome was knowledge change, and the secondary outcomes were changes in factors influencing the use of DLW in practice, satisfaction with the workshops, and actual use of DLW. In the qualitative phase, an interpretative description methodology was used. Semi-structured one-on-one interviews conducted at follow-up were transcribed and analyzed using a six-step analysis process.
Results
There were no statistically significant differences between groups in knowledge change at any of the three time points (p values 0.57-0.99). There were statistically significant differences between groups in factors influencing DLW adoption (p < 0.001) and in satisfaction with the workshop (p < 0.0005) at the post-test. Five themes were identified in relation to learners’ workshop experience: (1) synchronous in-person interaction, (2) flexibility in online learning, (3) ease of access to learning, (4) comfortable learning environment, and (5) relevance to practice and interest.
Conclusion
There were no statistically significant differences between the groups in most of the quantitative data, and participants identified each method’s benefits and challenges. The findings indicate online learning can be as effective as in-person learning. However, combining both methods’ positive aspects may improve learners’ educational experiences.

112 - Do graduating US students have the skills to perform the Association of American Medical Colleges (AAMC) Core Entrustable Professional Activities for entering residency (Core EPAs): Analysis of the national AAMC 2019 Graduation Questionnaire (GQ)

113 - Medical Students' Perceptions on Changing Osteopathic Manipulative Medicine Lab Practical Assessment Styles
PURPOSE
With the development of the Osteopathic Core Competencies and Core Entrustable Professional Activities (EPAs), there has been a shift toward competency-based curricula in osteopathic medical education. Medical students at the Edward Via College of Osteopathic Medicine-Auburn campus (VCOM-Auburn) currently use the traditional, randomized Osteopathic Manipulative Medicine (OMM) practical testing style; they were surveyed on their preferences regarding OMM practical assessment modalities to determine how receptive students would be to curricular change. The study evaluated the learning and assessment preferences of first- and second-year osteopathic medical students (OMS I and OMS II) at VCOM-Auburn.
METHODS
Participants, regardless of sex, age, race, or academic achievement, were recruited using class announcements and emails. A 6-question anonymous, voluntary survey was conducted via iClicker to evaluate perceptions of and readiness for change in OMM curriculum and assessment formats.
RESULTS
Out of the 308 enrolled first- and second-year students, 243 responded (78.9%). OMS I and OMS II students selected similar choices for each question, and most students preferred the current traditional OMM practical testing style over competency-based testing. However, there was a significant difference in satisfaction and testing preference between the OMS I and OMS II classes; satisfaction with the current practical setup decreased from 82% among OMS I students to 68% among OMS II students, χ2(1) = 5.114, p = 0.024.
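A minimal sketch of the reported class-year comparison, assuming raw satisfaction counts; the abstract reports only percentages, so the counts below are hypothetical and the printed statistic will not exactly match the reported value.

from scipy.stats import chi2_contingency

#          satisfied  not satisfied
table = [[100, 22],    # OMS I  (~82% satisfied, hypothetical counts)
         [82, 39]]     # OMS II (~68% satisfied, hypothetical counts)
chi2, p, dof, expected = chi2_contingency(table, correction=False)
print(f"chi2({dof}) = {chi2:.3f}, p = {p:.3f}")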
CONCLUSIONS
Data suggest that OMS I and OMS II students at VCOM-Auburn are satisfied with the current traditional practical assessment, with satisfaction decreasing significantly as seniority increases. Previous experience with traditional assessment may be a factor. The growing preference for competency-based learning as medical education progresses suggests that students and residents with more medical education experience acknowledge the importance of a more interactive and flexible curriculum. This is therefore a relevant consideration given the changes under the Single Accreditation System for Graduate Medical Education.

114 - Effects of organ-system courses of the first two years of medical school on performance of COMLEX-USA Level 2

115 - Early Prediction of the Risk of Scoring Lower than 500 on the COMLEX 1: A Study of Pre-Matriculation MCAT scores and Pre-Clinical Grades at an American Osteopathic Medical School
The Comprehensive Osteopathic Medical Licensing Examination of the United States (COMLEX-USA) Level 1 and Level 2-Cognitive Evaluation (CE) are board examinations that every osteopathic medical student must pass as part of licensure requirements. Numerical scores on Level 2-CE are also important in the competitive residency match.
Objectives:
Our goal is to identify the earliest predictors of performance on COMLEX Level 1 and Level 2-CE.
Methods:
Data from six cohorts of medical students who matriculated at Rocky Vista University College of Osteopathic Medicine from 2012 to 2017 were collected. Independent variables were performances in each course from the first two years; dependent variables were scores on COMLEX Level 1 or Level 2-CE. Predictive models were built with multiple linear regression and backward stepwise regression using SPSS. Models for COMLEX Level 1 were based on performances in the first three semesters’ courses, and models for COMLEX Level 2-CE used performances in the first four semesters’ courses.
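The authors used SPSS; as a hedged illustration of the same logic, the Python sketch below implements backward stepwise elimination with statsmodels on synthetic course scores. The course names, column names, and outcome relationship are assumptions for demonstration only.

import numpy as np
import pandas as pd
import statsmodels.api as sm

rng = np.random.default_rng(2)
courses = ["renal2", "cardio2", "neuro", "gi", "endocrine"]   # assumed names
df = pd.DataFrame(rng.normal(80, 6, size=(300, 5)), columns=courses)
df["comlex1"] = 300 + 2.5 * df["renal2"] + rng.normal(0, 15, 300)  # synthetic outcome

def backward_stepwise(data, outcome, alpha=0.05):
    predictors = [c for c in data.columns if c != outcome]
    while predictors:
        model = sm.OLS(data[outcome], sm.add_constant(data[predictors])).fit()
        pvals = model.pvalues.drop("const")
        if pvals.max() <= alpha:            # all remaining predictors significant
            return model, predictors
        predictors.remove(pvals.idxmax())   # drop the least significant course
    return None, []

model, kept = backward_stepwise(df, "comlex1")
print(kept, round(model.rsquared, 2))       # retained predictors and variance explained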
Results:
We found that performances in the third-semester Renal System II and Cardiovascular System II courses had the highest correlations with scores on COMLEX-USA Level 1 (r = 0.7) and Level 2-CE (r = 0.64-0.65), respectively. Performance in either the Renal System II or the Cardiovascular System II course explains 49% of the variance in COMLEX-USA Level 1 scores and 41-42% of the variance in COMLEX-USA Level 2 scores. The predictive regression models confirmed that Renal II and Cardiovascular II scores are significant predictors of performance on COMLEX Level 1 and Level 2-CE.
Conclusion:
Students who perform poorly in third-semester Renal System II and Cardiovascular System II courses are at high risk of lower performance or failure on COMLEX Level 1 and 2-CE. The results may allow earlier interventions to improve students’ learning and performances.

116 - Influence of MCAT Retesting on Performance in Preclinical Medicine and on COMLEX-USA Level 1 and Level 2-CE
The Medical College Admission Test (MCAT) has been used by medical school admissions committees in the United States as a preadmission variable in the selection of students since 1928. Students are permitted to retake the MCAT up to three times in one calendar year and four times across two calendar years, with a maximum of seven lifetime attempts, in order to maximize their score. The MCAT score is used as a predictor of how well a student will perform in medical school; extensive research has investigated the relationship between MCAT scores and preclinical performance as well as medical board examinations, yet sparse research has focused on the effects of retaking the MCAT. Furthermore, the influence of retesters’ MCAT scores and number of attempts on COMLEX Level 1 and Level 2-CE has not been explored in the literature.
Objectives:
Our goal was to investigate whether MCAT retaking affects the performance of preclinical courses and board examinations.
Methods:
Data from 904 students who matriculated at Rocky Vista University College of Osteopathic Medicine during 2012-2017 included MCAT scores for the first through fourth attempts, preclinical course scores, and first-attempt scores on COMLEX Level 1 and Level 2-CE. One-way ANOVA, χ2 tests, and Pearson correlation coefficients were used.
Results:
The analysis revealed that, compared to non-retesters, retesters had significantly lower first-time and average MCAT scores, with the lowest scores seen in those who retook the exam four times. In addition, COMLEX Level 1 scores of retesters who took the MCAT four times were significantly lower than those of non-retesters.
Conclusion:
Increased attempts of the MCAT negatively influenced performance of COMLEX Level 1 and Level 2-CE.

117 - The Role of Examination Rankings in Medical Students' Experiences of the Impostor Phenomenon
The term ‘impostor phenomenon’, used to “designate an internal experience of intellectual phoniness”, was coined by Clance and Imes in 1978. Those who experience it have profound thoughts of fraudulence regarding their professional or intellectual activities. This perception of illegitimacy causes sufferers to credit their success to error, blocking high achievers from acknowledging their successes and hindering the development of self-esteem.
METHODS
This research aimed to uncover and explore the relationship between medical students and the impostor phenomenon. An ethics-approved action research project was completed at The University of Sheffield, using a pragmatic approach which integrated quantitative and qualitative data from a questionnaire, focus groups and interviews. The main quantitative measure was the Clance Impostor Phenomenon Scale (CIPS), which produces scores between 20-100.
RESULTS
There were 191 questionnaire responses, and 19 students joined a focus group or interview. With a mean CIPS score of 65.81 ± 13.72, the average student had “frequent” impostor experiences. “Clinically significant” CIPS scores were recorded in 65.4% of students, and on average females scored 9.15 points more than males (p

118 - Do School-Based USMLE Testing Centers Provide a Home-Field Advantage?
The onset of the COVID-19 pandemic forced commercial testing centers to close worldwide, leaving medical learners unable to complete their USMLE Step exams. In response, six U.S. medical schools were selected to create secure spaces and train staff to administer medical licensure exams. This abstract describes examinee feedback from learners who took their USMLE Step exam at the University of South Florida (USF) regional testing center.
METHODS
A four-question survey was sent to 175 examinees who tested at the USF regional testing site. Questions evaluated the following: whether the testing center and staff met examinees’ expectations, whether the option to test at the USF center lowered examinee stress, and whether examinees perceived that testing at the USF center improved their performance. Response options were yes, no, or neutral. A chi-squared test of independence assessed differences in survey responses (p

119 - Using Discriminant Analysis to Assess the Validity of a Predictive Regression Model for Identifying Students at Risk of Failing USMLE Step 1
Andrea Vallevand, Brooke Shipley and Yenya Hu, Wake Forest School of Medicine, Winston-Salem, NC 27101 USA.
PURPOSE
The accurate detection of students at academic risk can permit the deployment of interventions, such as the deliberate use of question banks, to identify and patch knowledge gaps. A priori detection is particularly critical for licensing examinations, where failure may impact residency aspirations. The current research explores the validity of a regression model employed to predict the risk of Step 1 failure.
METHODS
Preclinical Customized Assessment System examination and Step 1 scores were collected from three cohorts. Regression analysis was conducted, and a roster of predicted Step 1 scores was calculated for the subsequent cohort. The USMLE passing score and documented standard error of measurement (194 and ±6, respectively) informed an “at risk range”. Students identified as “at risk” were offered additional academic coaching during the dedicated Step 1 study period. Discriminant analysis, employed retrospectively, investigated Step 1 pass/fail results and the accuracy of the regression analysis.
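The sketch below illustrates the two-stage logic on synthetic data: a regression model flags students whose predicted scores fall at or below the passing score plus the standard error of measurement, and a discriminant analysis retrospectively classifies pass/fail outcomes against the same predictors. All data and variable names are assumptions, not the authors' implementation.

import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis

PASS_SCORE, SEM = 194, 6
rng = np.random.default_rng(3)

# Synthetic stand-ins for Customized Assessment System exam scores.
X_prior = rng.normal(75, 8, size=(300, 5))                      # three prior cohorts
step1_prior = 90 + 1.6 * X_prior.mean(axis=1) + rng.normal(0, 8, 300)
X_current = rng.normal(75, 8, size=(127, 5))                    # subsequent cohort
step1_current = 90 + 1.6 * X_current.mean(axis=1) + rng.normal(0, 8, 127)

# Stage 1: predict Step 1 and flag students whose predicted score falls
# at or below the passing score plus the standard error of measurement.
reg = LinearRegression().fit(X_prior, step1_prior)
predicted = reg.predict(X_current)
at_risk = predicted <= PASS_SCORE + SEM
print(f"{at_risk.sum()} students flagged at risk")

# Stage 2 (retrospective): discriminant analysis of actual pass/fail
# outcomes against the same predictors to validate the flagging.
passed = step1_current >= PASS_SCORE
lda = LinearDiscriminantAnalysis().fit(X_current, passed)
print(f"classification accuracy: {lda.score(X_current, passed):.2f}")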
RESULTS
The regression analysis predicted scores, categorized as “at risk”, for 8/127 (6.3%) students. Predicted scores ranged between 186 and 203. Of these eight students, four took academic leave, three failed Step 1 and one passed.
Among the 117 students who took Step 1 during the designated examination cycle, five (4.3%) failed. The discriminant analysis accurately identified the five students who failed. Three of these students had initially been flagged as Step 1 risks by the regression analysis, with predicted scores ranging between 186 and 199.
CONCLUSION
Regression analysis has provided our academic coaching program with a frame of reference for where along the pass/fail continuum students are located at the start of the dedicated Step 1 study period. Discriminant analysis is used to retrospectively validate these initial predictions, particularly when students do not engage in academic coaching.

120 - Effect of the COVID-19 Pandemic on COMLEX Level 1 Performance

122 - Convergent Validity of a Revised Teamwork Assessment Tool
Teams are a foundational component of healthcare delivery. Having a reliable, valid, efficient, and effective method of evaluating team function is essential to improving team performance. As part of an International Association of Medical Science Educators’ educational grant, we attempted to develop a quantitative measurement suite for assessing teamwork.
METHODS
We employed a quasi-experimental pre-/post-intervention comparison design to assess interprofessional student teams participating in the Student Operating Room Team Training (SORTT) curriculum. Teams of nurse anesthesia, senior medical, and senior undergraduate nursing students completed a dual-scenario session with immediate after-action debriefing focusing on team-based competencies. Evaluation of team performance involved both quantitative measurement and observer-based evaluation using the Quick Teamwork Assessment Scales (Q-TAS), a 5-item, 3-subscale tool using a 6-point Likert-type scale (1 = definitely no to 6 = definitely yes). Changes in quantitative measurements from scenario 1 to scenario 2 were determined and compared to mean item changes in Q-TAS ratings.
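As one plausible illustration of the scenario 1 versus scenario 2 comparison, the Python sketch below runs paired t-tests on hypothetical team-level Q-TAS subscale ratings; the subscale names and the paired-test choice are assumptions, as the abstract does not specify either.

import numpy as np
from scipy.stats import ttest_rel

rng = np.random.default_rng(4)
subscales = ["communication", "coordination", "situation awareness"]   # assumed names
for name in subscales:
    scenario1 = rng.uniform(3.0, 5.0, 7)                # team-level ratings, scenario 1
    scenario2 = scenario1 + rng.uniform(0.2, 1.0, 7)    # improvement after debriefing
    t, p = ttest_rel(scenario2, scenario1)
    print(f"{name}: t={t:.2f}, p={p:.3f}")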
RESULTS
In 2020, 49 students divided into 7 simulated OR teams were evaluated with the Q-TAS. Statistically significant improvements were present in all three subscale ratings. Because the sociometric badges included in the original protocol were unavailable, an alternative quantitative measurement suite was developed incorporating dosimeters, radio frequency identification badges, and video recordings. Data analysis has proven challenging due to limitations in how the instruments collect and record data. In addition, the sheer volume of data has proven difficult to manage. Work continues on overcoming these issues.
CONCLUSION
The SORTT program is effective in improving student team performance. The successful creation of a quantitative measurement suite for team function must take into account how data are collected and presented and the volume of data involved.

123 - Affirming Institutional Assessment Equity with Differential Item Functioning (DIF)
The purpose of this study was to revise the eligibility criteria for Phase-I Clinical Science Domain Letters of Distinction using a methodology such that they are available to all students who meet the defined criteria. More specifically, we sought to classify student performance distinction based on demonstrated competencies against established benchmarks, rather than on performance in comparison to peers.
Background
Research on performance evaluation highlights the importance of using multiple measures to develop accurate and reliable profiles of student performance. The VTCSOM Clinical Science Domain assesses students in Phase I by creating compensatory composite domain scores consisting of multiple weighted measures of student performances, including Multiple Choice Assessments; Interview and Physical Exam Performance; Communication and Interpersonal Skills Performance; Written Presentation Skills; Clinical Reasoning Skills.
Methods
In this study, we conducted a review of ten standard-setting methods, comparing their advantages and disadvantages. Next, we selected a standard-setting method and conducted a standard-setting study with subject-matter experts. Prior to conducting the Hofstee standard-setting study, we aggregated and reviewed the cumulative distribution functions of three years of clinical science compensatory composite domain scores. Subject matter experts used knowledge of these distributions and the prescribed compromises to ultimately determine Clinical Science Letters of Distinction thresholds. After establishing the thresholds, we applied the results of our standard-setting study to classify performance for VTCSOM Phase-I Clinical Science Domain Distinction.
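For readers unfamiliar with the Hofstee compromise, the Python sketch below shows the core computation on synthetic composite scores: the cut score is taken where the line joining (minimum acceptable cut, maximum acceptable failure rate) and (maximum acceptable cut, minimum acceptable failure rate) crosses the observed cumulative distribution. All parameters here are hypothetical; the study's actual thresholds were set by subject-matter experts, and for a distinction threshold the same compromise applies with the proportion above the cut in place of the failure rate.

import numpy as np

rng = np.random.default_rng(5)
scores = rng.normal(82, 6, 500)          # synthetic composite domain scores

k_min, k_max = 70.0, 85.0                # judges' acceptable cut-score range (assumed)
f_min, f_max = 0.02, 0.20                # judges' acceptable failure-rate range (assumed)

cuts = np.linspace(k_min, k_max, 301)
observed_fail = np.array([(scores < c).mean() for c in cuts])
# Hofstee line: failure rate falls linearly from f_max at k_min to f_min at k_max.
hofstee_line = f_max + (f_min - f_max) * (cuts - k_min) / (k_max - k_min)

cut = cuts[np.argmin(np.abs(observed_fail - hofstee_line))]
print(f"Hofstee cut score: {cut:.1f}")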
Discussion
Standards are more credible if they produce appropriate classification information and are sensitive to candidate performance and content. Standards must also be statistically sound and identify the “true” standard. Given the need to establish several standards across the curriculum, standards should be relatively easy to implement and compute. The review of standard setting methods helped us establish which of the empirically-based standard-setting methods was appropriate to apply to the Clinical Science Domain composite scores.
