Blog - AI Chat Tools

Deep Dives

Training mRNA Language Models Across 25 Species for $165

OpenMed built a...

VAKRA: A New Benchmark That Shows How Bad AI Agents Are at Real Work

IBM Research's ...

QIMMA: The Arabic LLM Leaderboard That Actually Checks Its Homework

QIMMA is a new ...

Google’s AMIE Tried Taking Patient Histories in a Real Clinic. Here’s What Happened.

Google Research...

TurboQuant: Google’s New Trick to Squeeze AI Models Without the Usual Trade-offs

Google Research...

Google’s AI Takes on Breast Cancer Screening: What the New NHS Studies Really Show

Google Research...

Can LLMs Actually Help Physicists Figure Out Superconductivity?

Google Research...

ConvApparel: Why Your AI User Simulator Probably Sucks

Google's new Co...

Google’s New Framework Tests Whether LLMs Actually Behave Like Humans

Google Research...

Forest vs. Tree: Google Research on How Many Raters You Actually Need for AI Benchmarks

Google Research...

Simula: Why Google Thinks Synthetic Data Needs Better Architecture, Not Just More Prompts

Google Research...

Synthetic Neurons Are Making Brain Mapping a Whole Lot Faster

Google Research...