Data Analyst at The Weather Company of IBM | Enrolling in the Master of Computer and Information Technology program at the University of Pennsylvania
- New York, NY
- http://www.linkedin.com/in/dawn-chen-ling/
Pinned
- Psy-Qwen-SFT (Public)
  AI Psychological Counselor (Qwen3.5-0.8B Fine-Tuning Project)
Shell 1
- Psy-Qwen-DPO (Public)
  DPO alignment of a Chinese psychological counselor LLM (Qwen3.5-0.8B). 74.9% win rate vs. the SFT baseline on 202 held-out prompts, evaluated by DeepSeek V4-Flash with 2-way position-bias mitigation.
Python