This proof-of-concept study evaluated the feasibility of using large language models (LLMs) to support clinicians in identifying defense mechanisms in psychodynamic psychotherapy transcripts, comparing LLM assessments with those of psychodynamic therapists in both solo and supervision group settings. The first two sessions of a psychoanalytic case were analyzed. The transcribed sessions were segmented into excerpts and coded by two psychodynamic clinicians and two LLMs (ChatGPT and Gemini) guided with a structured prompt. Coding followed an adapted version of DMRS comprising 12 defenses organized into three maturity levels. For each excerpt, defense mechanisms were identified, and an overall defensive functioning index (ODF) was calculated. Inter-rater agreement was estimated using the Intraclass Correlation Coefficient (ICC), considering human-human, LLM-LLM, human-LLM, and integrated versions (clinical supervision vs. LLM integration) pairs. LLMs tended to assign a higher number of defenses, including more immature ones, producing profiles with lower average ODFs than clinicians, who instead provided more selective and, overall, more mature ratings. However, some of the defenses reported only by LLMs were subsequently recognized by supervised clinicians as clinically relevant and worthy of inclusion. Clinicians showed good to excellent inter-rater reliability, while ChatGPT and Gemini showed low inter-rater reliability, suggesting that the two models are not interchangeable. Agreement between clinicians and LLMs was low across the sessions. Therefore, the combined use of LLMs among clinicians is technically feasible and could potentially enhance understanding of patients’ defensive functioning, especially as a support for case formulation and supervision. Currently, LLMs cannot replace clinical judgment or the therapeutic relationship, but they might be considered as a support tool for clinical practice under human professional guidance.

Antichi, L., Sajno, E., Rossi, C., Frisone, F., De Salve, F., Oasi, O., Riva, G., Using LLMs as a clinician support for detecting defense mechanisms from psychotherapy transcripts: a proof-of-concept study, <<CURRENT PSYCHOLOGY>>, 2026; 45 (12): 1-14. [doi:10.1007/s12144-026-09685-3] [https://hdl.handle.net/10807/341415]

Using LLMs as a clinician support for detecting defense mechanisms from psychotherapy transcripts: a proof-of-concept study

Antichi, Lorenzo
Primo
;
Sajno, Elena;Rossi, Chiara;Frisone, Fabio;De Salve, Francesca;Oasi, Osmano;Riva, Giuseppe
2026

Abstract

This proof-of-concept study evaluated the feasibility of using large language models (LLMs) to support clinicians in identifying defense mechanisms in psychodynamic psychotherapy transcripts, comparing LLM assessments with those of psychodynamic therapists in both solo and supervision group settings. The first two sessions of a psychoanalytic case were analyzed. The transcribed sessions were segmented into excerpts and coded by two psychodynamic clinicians and two LLMs (ChatGPT and Gemini) guided with a structured prompt. Coding followed an adapted version of DMRS comprising 12 defenses organized into three maturity levels. For each excerpt, defense mechanisms were identified, and an overall defensive functioning index (ODF) was calculated. Inter-rater agreement was estimated using the Intraclass Correlation Coefficient (ICC), considering human-human, LLM-LLM, human-LLM, and integrated versions (clinical supervision vs. LLM integration) pairs. LLMs tended to assign a higher number of defenses, including more immature ones, producing profiles with lower average ODFs than clinicians, who instead provided more selective and, overall, more mature ratings. However, some of the defenses reported only by LLMs were subsequently recognized by supervised clinicians as clinically relevant and worthy of inclusion. Clinicians showed good to excellent inter-rater reliability, while ChatGPT and Gemini showed low inter-rater reliability, suggesting that the two models are not interchangeable. Agreement between clinicians and LLMs was low across the sessions. Therefore, the combined use of LLMs among clinicians is technically feasible and could potentially enhance understanding of patients’ defensive functioning, especially as a support for case formulation and supervision. Currently, LLMs cannot replace clinical judgment or the therapeutic relationship, but they might be considered as a support tool for clinical practice under human professional guidance.
2026
Inglese
Antichi, L., Sajno, E., Rossi, C., Frisone, F., De Salve, F., Oasi, O., Riva, G., Using LLMs as a clinician support for detecting defense mechanisms from psychotherapy transcripts: a proof-of-concept study, <<CURRENT PSYCHOLOGY>>, 2026; 45 (12): 1-14. [doi:10.1007/s12144-026-09685-3] [https://hdl.handle.net/10807/341415]
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10807/341415
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact