HaluMem: Evaluating Hallucinations in Memory Systems of Agents Paper • 2511.03506 • Published about 1 month ago • 93
BixBench: a Comprehensive Benchmark for LLM-based Agents in Computational Biology Paper • 2503.00096 • Published Feb 28 • 2 • 1
Synthetic Observational Health Data with GANs: from slow adoption to a boom in medical research and ultimately digital twins? Paper • 2005.13510 • Published May 27, 2020
jeremygf/distilbert-base-uncased-finetuned-clinc Text Classification • 67.1M • Updated Jan 24, 2024 • 5