LLM Benchmarking for Healthcare

Our mission is to advance rigorous evaluation and performance assessment of large language models (LLMs) to ensure they deliver accurate, reliable, and clinically relevant outputs that improve patient care and support clinicians.

See benchmarks

Benchmarking models from

Google DeepSeek Qwen Anthropic OpenAI Mistral AI Meta

Our Mission

We are committed to fostering innovation in AI technology while prioritizing patient safety, data privacy, and equitable access to these transformative solutions.

Through research, education, collaboration, and advocacy, we aim to ensure that medical AI evolves as a trusted partner in healthcare, empowering professionals and enhancing outcomes for all.

We embrace open source principles to accelerate knowledge sharing, enable collaborative development, and promote transparency. We hope to create a foundation for responsible AI that will help advance healthcare and improve patient outcomes.

Our Team

Led by experienced professionals in AI and healthcare, committed to transforming patient care through ethical innovation.

Elie Toubiana

Elie Toubiana

CEO, ScribeMD.ai

An AI engineer and entrepreneur working on helping clinical practices through innovative tools. His developments currently serve over 10,000 physicians worldwide.

Maxime Cohen

Maxime Cohen

Scale AI Chair Professor, McGill University & Chief AI Strategy, CIUSSS West-Central Montreal

A distinguished leader bridging the gap between theoretical advancement and practical implementation of AI in healthcare.

Eddy Hage-Youssef

Eddy Hage-Youssef

Research Assistant

A 4th year Computer Science & Statistics student at McGill University, researching AI agents across various applications under Professor Cohen.