tc lin's picture

22 215

tc lin

stuser2023

·

https://github.com/stuser

stuser

AI & ML interests

None yet

Recent Activity

liked a model 7 days ago

deepseek-ai/DeepSeek-OCR

liked a model 15 days ago

FreeSEED-AI/gpt-oss-20b-mandarin-thinking

upvoted an article about 2 months ago

Introducing AI Sheets: a tool to work with datasets using open AI models!

View all activity

Organizations

None yet

upvoted an article about 2 months ago

Article

Introducing AI Sheets: a tool to work with datasets using open AI models!

Aug 8

• 101

upvoted a paper 3 months ago

A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published Jul 17 • 257

upvoted a paper 5 months ago

The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text

Paper • 2506.05209 • Published Jun 5 • 46

upvoted an article 5 months ago

Article

The Common Pile v0.1

By

and 2 others •

Jun 6

• 51

upvoted a collection 6 months ago

🧠 Traditional Chinese Reasoning Datasets

A curated collection of datasets designed to evaluate and train reasoning capabilities in Traditional Chinese across various domains. • 3 items • Updated 14 days ago • 8

upvoted an article 8 months ago

Article

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

Mar 4

• 77

upvoted 2 collections 8 months ago

PaliGemma 2 Mix

13 items • Updated Jul 10 • 62

Breeze 2 Family

Llama-Breeze2 is a multi-modal language model family specifically intended for Traditional Chinese use. BreezyVoice is a Taiwan Mandarin TTS • 6 items • Updated Feb 26 • 19

upvoted 2 collections 11 months ago

Cosmos-Tokenizer

A suite of image and video tokenizers • 13 items • Updated 6 days ago • 41

AIMv2

A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint. • 19 items • Updated Aug 25 • 81

upvoted a collection 12 months ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated May 5 • 291

upvoted an article about 1 year ago

Article

Efficient Deep Learning: A Comprehensive Overview of Optimization Techniques 👐 📚

By

•

Aug 26, 2024

• 77

upvoted a paper over 1 year ago

AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases

Paper • 2407.12784 • Published Jul 17, 2024 • 51

upvoted a collection over 1 year ago

🪐 SmolLM

A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated May 5 • 236

upvoted an article over 1 year ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16, 2024

• 420

upvoted 2 collections over 1 year ago

Nemotron 4 340B

Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated 6 days ago • 162

Llama3-ChatQA-1.5

Llama3-ChatQA-1.5 models excel at conversational question answering (QA) and retrieval-augmented generation (RAG). • 6 items • Updated 6 days ago • 44

upvoted an article over 1 year ago

Article

CodeGemma - an official Google release for code LLMs

Apr 9, 2024

• 103

upvoted a paper over 1 year ago

YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Paper • 2402.13616 • Published Feb 21, 2024 • 49

upvoted a collection over 1 year ago

Gemma release

Groups the Gemma models released by the Google team. • 40 items • Updated Jul 10 • 344