SFT for Mental Health Question-Answering

Published:

Fine-tuned Llama2-7b for chat and question-answering on mental health datasets as part of research at the Kelley Data Science and AI Lab, Indiana University.

Approach:

  • Supervised fine-tuning using Hugging Face Transformers with QLoRA for parameter-efficient training
  • Deployed models using vLLM for efficient inference
  • Built a Streamlit-based evaluation platform for end-user interaction, feedback collection, and pairwise comparison evaluation across model responses
  • Platform served 50+ researchers with automated ELO-based model ranking

Tech: Python, Transformers, QLoRA, vLLM, Streamlit, SQLite