zuijiang

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

upvoted a paper 5 months ago

Reinforcement Pre-Training

upvoted a paper 5 months ago

GraphOmni: A Comprehensive and Extendable Benchmark Framework for Large Language Models on Graph-theoretic Tasks

Organizations

Papers 5

arxiv:2504.00502

arxiv:2503.18034

arxiv:2502.04675

arxiv:2502.02458

Models 1

zuijiang/llava-qwen1.5-14B-chat

Text Generation • 15B • Updated Jul 1, 2024 • 1

Datasets 3

zuijiang/alpaca-alpaca-clean

Viewer • Updated Aug 26, 2024 • 51.8k • 10

zuijiang/mistral-alpaca-clean

Viewer • Updated Aug 25, 2024 • 51.8k • 10

zuijiang/ocr_vqa

Viewer • Updated May 30, 2024 • 208k • 20