ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 151
Article Model2Vec: Distill a Small Fast Model from any Sentence Transformer Oct 14, 2024 • 99
🪐 SmolLM Collection A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated May 5 • 239
sentence-transformers/msmarco-bert-base-dot-v5 Sentence Similarity • 0.1B • Updated Mar 6 • 354k • 18
sentence-transformers/multi-qa-mpnet-base-dot-v1 Sentence Similarity • 0.1B • Updated Aug 19 • 5.09M • 184
Medical QA Datasets Collection A collection of medical question answering (QA) datasets • 23 items • Updated Feb 22 • 46
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery Paper • 2408.06292 • Published Aug 12, 2024 • 126