SFT for Mental Health Question-Answering
Published:
Fine-tuned Llama2-7b for chat and question-answering on mental health datasets as part of research at the Kelley Data Science and AI Lab, Indiana University.
Approach:
- Supervised fine-tuning using Hugging Face Transformers with QLoRA for parameter-efficient training
- Deployed models using vLLM for efficient inference
- Built a Streamlit-based evaluation platform for end-user interaction, feedback collection, and pairwise comparison evaluation across model responses
- Platform served 50+ researchers with automated ELO-based model ranking
Tech: Python, Transformers, QLoRA, vLLM, Streamlit, SQLite
