Mental Health Information
This content is for informational purposes only. AI therapy tools are not substitutes for professional mental health treatment. Always consult a licensed mental health professional.
In crisis? Call 988 (Suicide & Crisis Lifeline) or text HOME to 741741 — free, confidential, 24/7.
AI Therapy Clinical Evidence Scorecard
Most AI therapy tools claim to be "clinically validated" or "evidence-based." But what does that actually mean? We graded every platform in our database by the quality and quantity of their published clinical evidence. The disparity is striking.
Published: April 4, 2026 | Last updated: April 4, 2026
Our Evidence Rating System
- Gold: 5+ randomized controlled trials (RCTs) with control groups, published in peer-reviewed journals
- Silver: 1-4 peer-reviewed studies or large observational data with meaningful effect sizes
- Bronze: Internal data, third-party reviews, or validated assessment instruments used (but no independent RCTs about the platform itself)
- Unrated: No published clinical evidence about the platform's effectiveness
| Platform | Audience | Evidence Rating | Published Research | Regulatory Status |
|---|---|---|---|---|
| Wysa | B2C | Gold | 30+ peer-reviewed papers including RCTs in JMIR | CE-marked Class I (EU), NHS endorsed (UK) |
| Woebot | B2B (consumer app shut down) | Gold | 14+ RCTs — most validated AI chatbot globally | FDA pathway attempted (cited as reason for B2C exit) |
| Lyssn | B2B | Gold | 60+ peer-reviewed publications, 17+ years research | HIPAA compliant |
| Youper | B2C | Silver | Stanford study (N=4,517) — observational, not RCT | None |
| Elomia | B2B+B2C | Silver | Active clinical trial (NCT06725147); BMC Psychology study | None |
| MindDoc | B2C | Bronze | Uses validated instruments (PHQ-9, GAD-7); no platform-specific RCTs | EU Class I medical device |
| Talkiatry | B2C | Bronze | Internal outcomes data: 87% anxiety patients improve after 2 visits | None (licensed psychiatrists, not the platform) |
| Replika | B2C | Bronze | Mixed: some loneliness studies; also harm documentation. FTC complaint. | None — not a clinical tool |
| Bloom | B2C (discontinued) | Unrated | Content by licensed therapists; no independent studies | Discontinued Feb 2025 |
| Blueprint | B2B | Unrated | Measurement-based care validated; no AI-specific studies | HIPAA compliant |
| Mentalyc | B2B | Unrated | UC Berkeley-backed; SOC 2 Type II; no clinical studies | SOC 2 Type II, HIPAA |
| Upheal | B2B | Unrated | No published research; user reviews only | HIPAA + BAA |
| Freed | B2B | Unrated | No mental health-specific studies; general scribe | HIPAA, SOC 2 |
| Alma | B2B+B2C | Unrated | No published research | BBB accredited |
| SimplePractice | B2B | Unrated | No AI-specific research; most-used therapy EHR | HIPAA + BAA, HITRUST |
Key Takeaways
- Consumer apps have more evidence than therapist tools. Wysa (30+ papers) and Woebot (14+ RCTs) invested heavily in clinical validation. None of the B2B AI note tools (Upheal, Mentalyc, Blueprint, SimplePractice, Freed) have published peer-reviewed studies about their AI accuracy.
- Lyssn is the B2B exception. With 60+ peer-reviewed publications and 17+ years of research, Lyssn has the strongest evidence base of any B2B tool — but it's a training/QI tool, not a documentation tool.
- "Clinically validated" is often marketing. Many platforms claim clinical validation without peer-reviewed RCTs. Using validated assessment instruments (PHQ-9, GAD-7) is not the same as validating the platform itself.
- Regulatory status varies dramatically. Wysa has CE-mark and NHS endorsement. MindDoc has EU Class I classification. Most other platforms have no regulatory approval of any kind.
- Evidence does not equal safety. HIPAA compliance, BAA availability, and SOC 2 certification are separate from clinical effectiveness research. A tool can be secure without being clinically validated.
Why This Matters
For a YMYL (Your Money or Your Life) topic like mental health, the distinction between evidence-based and evidence-free matters. Consumers choosing therapy apps deserve to know that Wysa has 30+ papers while Replika has an FTC complaint. Therapists choosing documentation tools deserve to know that none of them have published accuracy studies.
This scorecard is updated quarterly. If any platform publishes new clinical research, we will update their rating. Read our full methodology for how we evaluate clinical evidence.