Links for fulltext (may require subscription):
- Publisher website (DOI): https://doi.org/10.1007/s40593-024-00408-y
- Scopus: eid_2-s2.0-85197722280
- Web of Science: WOS:001264576500002
Article: How Can I Get It Right? Using GPT to Rephrase Incorrect Trainee Responses
| Title | How Can I Get It Right? Using GPT to Rephrase Incorrect Trainee Responses |
|---|---|
| Authors | Lin, Jionghao; Han, Zifei; Thomas, Danielle R.; Gurung, Ashish; Gupta, Shivang; Aleven, Vincent; Koedinger, Kenneth R. |
| Keywords | ChatGPT; Feedback; GPT-4; Large language models; Tutoring training |
| Issue Date | 2024 |
| Citation | International Journal of Artificial Intelligence in Education, 2024 |
| Abstract | One-on-one tutoring is widely acknowledged as an effective instructional method, conditioned on qualified tutors. However, the high demand for qualified tutors remains a challenge, often necessitating the training of novice tutors (i.e., trainees) to ensure effective tutoring. Research suggests that providing timely explanatory feedback can facilitate the training process for trainees. However, it presents challenges due to the time-consuming nature of assessing trainee performance by human experts. Inspired by the recent advancements of large language models (LLMs), our study employed the GPT-4 model to build an explanatory feedback system. This system identifies trainees’ responses in binary form (i.e., correct/incorrect) and automatically provides template-based feedback with responses appropriately rephrased by the GPT-4 model. We conducted our study using the responses of 383 trainees from three training lessons (Giving Effective Praise, Reacting to Errors, and Determining What Students Know). Our findings indicate that: 1) using a few-shot approach, the GPT-4 model effectively identifies correct/incorrect trainees’ responses from three training lessons with an average F1 score of 0.84 and AUC score of 0.85; and 2) using the few-shot approach, the GPT-4 model adeptly rephrases incorrect trainees’ responses into desired responses, achieving performance comparable to that of human experts. |
| Persistent Identifier | http://hdl.handle.net/10722/354340 |
| ISSN | 1560-4292 (2023 Impact Factor: 4.7; 2023 SCImago Journal Rankings: 1.842) |
| ISI Accession Number ID | WOS:001264576500002 |
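
The abstract above describes a two-step pipeline: a few-shot GPT-4 classifier labels each trainee response as correct or incorrect, and GPT-4 then rephrases incorrect responses for use in template-based explanatory feedback. As a rough illustration of the first step, the sketch below shows a few-shot binary classification call. The prompt wording, the few-shot examples, and the use of the OpenAI Chat Completions API are assumptions for illustration only; the paper's actual prompts and configuration are not given in this record.

```python
# Illustrative sketch, not the authors' published setup.
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

# Hypothetical few-shot examples for the "Giving Effective Praise" lesson.
FEW_SHOT = [
    {"role": "user", "content": "Trainee response: Good job!"},
    {"role": "assistant", "content": "incorrect"},
    {"role": "user", "content": "Trainee response: I like how you kept "
                                 "trying different strategies on that problem."},
    {"role": "assistant", "content": "correct"},
]

def classify_response(trainee_response: str) -> str:
    """Label a trainee response as 'correct' or 'incorrect' (binary)."""
    messages = [
        {"role": "system", "content": (
            "You grade tutor-trainee responses. Effective praise is specific "
            "and effort-focused. Answer with exactly one word: correct or incorrect."
        )},
        *FEW_SHOT,
        {"role": "user", "content": f"Trainee response: {trainee_response}"},
    ]
    reply = client.chat.completions.create(model="gpt-4", messages=messages)
    return reply.choices[0].message.content.strip().lower()

print(classify_response("You worked really hard on showing each step of your solution."))
```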
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Lin, Jionghao | - |
| dc.contributor.author | Han, Zifei | - |
| dc.contributor.author | Thomas, Danielle R. | - |
| dc.contributor.author | Gurung, Ashish | - |
| dc.contributor.author | Gupta, Shivang | - |
| dc.contributor.author | Aleven, Vincent | - |
| dc.contributor.author | Koedinger, Kenneth R. | - |
| dc.date.accessioned | 2025-02-07T08:48:00Z | - |
| dc.date.available | 2025-02-07T08:48:00Z | - |
| dc.date.issued | 2024 | - |
| dc.identifier.citation | International Journal of Artificial Intelligence in Education, 2024 | - |
| dc.identifier.issn | 1560-4292 | - |
| dc.identifier.uri | http://hdl.handle.net/10722/354340 | - |
| dc.description.abstract | One-on-one tutoring is widely acknowledged as an effective instructional method, conditioned on qualified tutors. However, the high demand for qualified tutors remains a challenge, often necessitating the training of novice tutors (i.e., trainees) to ensure effective tutoring. Research suggests that providing timely explanatory feedback can facilitate the training process for trainees. However, it presents challenges due to the time-consuming nature of assessing trainee performance by human experts. Inspired by the recent advancements of large language models (LLMs), our study employed the GPT-4 model to build an explanatory feedback system. This system identifies trainees’ responses in binary form (i.e., correct/incorrect) and automatically provides template-based feedback with responses appropriately rephrased by the GPT-4 model. We conducted our study using the responses of 383 trainees from three training lessons (Giving Effective Praise, Reacting to Errors, and Determining What Students Know). Our findings indicate that: 1) using a few-shot approach, the GPT-4 model effectively identifies correct/incorrect trainees’ responses from three training lessons with an average F1 score of 0.84 and AUC score of 0.85; and 2) using the few-shot approach, the GPT-4 model adeptly rephrases incorrect trainees’ responses into desired responses, achieving performance comparable to that of human experts. | - |
| dc.language | eng | - |
| dc.relation.ispartof | International Journal of Artificial Intelligence in Education | - |
| dc.subject | ChatGPT | - |
| dc.subject | Feedback | - |
| dc.subject | GPT-4 | - |
| dc.subject | Large language models | - |
| dc.subject | Tutoring training | - |
| dc.title | How Can I Get It Right? Using GPT to Rephrase Incorrect Trainee Responses | - |
| dc.type | Article | - |
| dc.description.nature | link_to_subscribed_fulltext | - |
| dc.identifier.doi | 10.1007/s40593-024-00408-y | - |
| dc.identifier.scopus | eid_2-s2.0-85197722280 | - |
| dc.identifier.eissn | 1560-4306 | - |
| dc.identifier.isi | WOS:001264576500002 | - |
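
For the second step reported in the abstract (rephrasing incorrect trainee responses into desired ones at a quality comparable to human experts), a similar sketch follows. The system prompt and the one-shot demonstration here are likewise invented for illustration and are not taken from the paper.

```python
# Illustrative sketch, not the authors' published prompts.
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

def rephrase_incorrect(trainee_response: str, lesson: str) -> str:
    """Rewrite an incorrect trainee response into a desired response,
    keeping as much of the trainee's original wording as possible, so it
    can be embedded in template-based explanatory feedback."""
    messages = [
        {"role": "system", "content": (
            f"You help train novice tutors on the lesson '{lesson}'. "
            "Rephrase the trainee's incorrect response into a desired response, "
            "changing as little of the original wording as possible."
        )},
        # Hypothetical one-shot demonstration of the desired rewrite.
        {"role": "user", "content": "Good job!"},
        {"role": "assistant", "content": "Good job sticking with that problem "
                                         "and trying a new strategy!"},
        {"role": "user", "content": trainee_response},
    ]
    reply = client.chat.completions.create(model="gpt-4", messages=messages)
    return reply.choices[0].message.content.strip()

rewritten = rephrase_incorrect("You're so smart!", "Giving Effective Praise")
print(f"Feedback template: Consider praising effort instead, e.g.: {rewritten}")
```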
