MentraSuite: Post-Training Large Language Models for Mental Health Reasoning and Assessment

arXiv ID: 2512.09636v1

Published: 2025-12-10

Authors: Mengxi Xiao, Kailai Yang, Pengde Zhao, Enze Zhang, Ziyan Kuang, Zhiwei Liu, Weiguang Han, Shu Liao, Lianting Huang, Jinpeng Hu, Min Peng, Qianqian Xie, Sophia Ananiadou

Categories: cs.CL

Relevance Score: 0.98 / 1.00

View on arXiv Download PDF

Summary

MentraSuite addresses the critical need for reliable Large Language Models (LLMs) in mental health by proposing a unified framework that enhances clinically aligned reasoning. It introduces MentraBench, a comprehensive benchmark for evaluating LLM performance and reasoning quality, and Mindora, a post-trained LLM optimized through a hybrid SFT-RL framework with inconsistency detection. Mindora achieves the highest average performance on MentraBench and demonstrates superior reasoning reliability across complex mental health scenarios.

Medical Relevance

This research is crucial for addressing the global burden of mental health disorders by providing more reliable, clinically aligned, and accessible digital support through Large Language Models. It directly improves the trustworthiness and safety of AI tools in mental healthcare, making them more suitable for sensitive applications like assessment and intervention planning.

AI Health Application

The research develops and evaluates Large Language Models (LLMs) to provide scalable and accessible assistance in mental health settings. Specifically, it focuses on improving the reliability and clinical alignment of LLMs for tasks such as mental health appraisal, diagnosis support, and intervention planning, aiming to enhance the quality of AI-driven tools in mental healthcare.

Key Points

Existing psychological LLMs often lack step-wise, clinically aligned reasoning necessary for appraisal, diagnosis, and intervention planning, primarily focusing on emotional understanding or knowledge recall.
MentraSuite is introduced as a unified framework to advance reliable mental-health reasoning in LLMs, aiming to mitigate risks from incomplete, inconsistent, or ungrounded AI outputs.
MentraBench is a comprehensive benchmark encompassing five core reasoning aspects, six tasks, and 13 datasets, evaluating both task performance and reasoning quality across five dimensions: conciseness, coherence, hallucination avoidance, task understanding, and internal consistency.
Mindora is a novel post-trained LLM optimized using a hybrid Supervised Fine-Tuning (SFT) and Reinforcement Learning (RL) framework, incorporating an inconsistency-detection reward to enforce faithful and coherent reasoning.
High-quality reasoning trajectories for Mindora's training were constructed using a novel strategy involving strategic filtering of difficult samples and a structured, consistency-oriented rewriting process.
Mindora achieved the highest average performance among 20 evaluated LLMs on MentraBench, demonstrating remarkable performance in reasoning reliability, particularly for complex mental-health scenarios.
The research aims to enhance the trustworthiness and clinical utility of LLMs for mental health support, information, and assessment by focusing on robust, clinically aligned reasoning.

Methodology

The authors introduced MentraSuite, a unified framework for advancing mental-health reasoning. They developed MentraBench, a comprehensive benchmark comprising five reasoning aspects, six tasks, and 13 datasets, along with five dimensions for evaluating reasoning quality. They then proposed Mindora, a post-trained LLM optimized using a hybrid Supervised Fine-Tuning (SFT) and Reinforcement Learning (RL) framework, specifically employing an inconsistency-detection reward. High-quality training trajectories for Mindora were generated through strategic sample filtering and a consistency-oriented rewriting process. Finally, 20 diverse LLMs, including Mindora, were evaluated on MentraBench.

Key Findings

Mindora achieved the highest average performance across 20 evaluated Large Language Models on the MentraBench benchmark. Critically, it demonstrated remarkable performance in reasoning reliability, confirming its effectiveness for navigating and providing sound reasoning in complex mental-health scenarios, which often pose challenges for other LLMs.

Clinical Impact

This work has the potential to significantly enhance the clinical utility and safety of AI in mental healthcare. By providing more reliable and clinically aligned reasoning, LLMs like Mindora can assist healthcare professionals in appraisal, diagnosis, and intervention planning, making these processes more efficient and consistent. It enables the development of scalable and accessible AI-powered mental health tools that offer grounded information and support, thereby reducing the risks of incomplete or inconsistent advice that could otherwise compromise patient care.

Limitations

The abstract primarily highlights the general risks associated with deploying existing LLMs in mental health settings (e.g., incomplete, inconsistent, ungrounded reasoning) as the problem MentraSuite aims to solve. It does not explicitly state specific limitations or caveats of the MentraSuite framework or Mindora model itself within the scope of this presented research.

Future Directions

The abstract does not explicitly mention specific future research directions for MentraSuite or Mindora.

Medical Domains

Psychiatry Clinical Psychology Mental Healthcare Digital Therapeutics Public Health

Keywords

Mental Health Large Language Models (LLMs) Clinical Reasoning Benchmarking Post-training Reinforcement Learning Psychiatric Assessment AI Reliability

Abstract

Mental health disorders affect hundreds of millions globally, and the Web now serves as a primary medium for accessing support, information, and assessment. Large language models (LLMs) offer scalable and accessible assistance, yet their deployment in mental-health settings remains risky when their reasoning is incomplete, inconsistent, or ungrounded. Existing psychological LLMs emphasize emotional understanding or knowledge recall but overlook the step-wise, clinically aligned reasoning required for appraisal, diagnosis, intervention planning, abstraction, and verification. To address these issues, we introduce MentraSuite, a unified framework for advancing reliable mental-health reasoning. We propose MentraBench, a comprehensive benchmark spanning five core reasoning aspects, six tasks, and 13 datasets, evaluating both task performance and reasoning quality across five dimensions: conciseness, coherence, hallucination avoidance, task understanding, and internal consistency. We further present Mindora, a post-trained model optimized through a hybrid SFT-RL framework with an inconsistency-detection reward to enforce faithful and coherent reasoning. To support training, we construct high-quality trajectories using a novel reasoning trajectory generation strategy, that strategically filters difficult samples and applies a structured, consistency-oriented rewriting process to produce concise, readable, and well-balanced trajectories. Across 20 evaluated LLMs, Mindora achieves the highest average performance on MentraBench and shows remarkable performances in reasoning reliability, demonstrating its effectiveness for complex mental-health scenarios.