5 25 128

QinyuanCheng

Cqy2019

AI & ML interests

None yet

Recent Activity

liked a model 7 days ago

microsoft/VibeVoice-Realtime-0.5B

liked a dataset 8 days ago

RMSnow/SpeechJudge-Data

upvoted a paper 16 days ago

Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs

View all activity

Organizations

upvoted a paper 16 days ago

Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs

Paper • 2512.07525 • Published 17 days ago • 55

upvoted a paper 17 days ago

Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction

Paper • 2512.04987 • Published 21 days ago • 75

upvoted a paper about 1 month ago

Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO

Paper • 2511.16669 • Published Nov 20 • 31

upvoted 4 papers about 2 months ago

UniAVGen: Unified Audio and Video Generation with Asymmetric Cross-Modal Interactions

Paper • 2511.03334 • Published Nov 5 • 52

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published Nov 6 • 210

OS-Sentinel: Towards Safety-Enhanced Mobile GUI Agents via Hybrid Validation in Realistic Workflows

Paper • 2510.24411 • Published Oct 28 • 71

Sparser Block-Sparse Attention via Token Permutation

Paper • 2510.21270 • Published Oct 24 • 24

upvoted a paper 2 months ago

LIBERO-Plus: In-depth Robustness Analysis of Vision-Language-Action Models

Paper • 2510.13626 • Published Oct 15 • 45

upvoted 3 papers 3 months ago

Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models

Paper • 2510.05034 • Published Oct 6 • 48

MOSS-Speech: Towards True Speech-to-Speech Models Without Text Guidance

Paper • 2510.00499 • Published Oct 1 • 19

VStyle: A Benchmark for Voice Style Adaptation with Spoken Instructions

Paper • 2509.09716 • Published Sep 9 • 11

upvoted 3 papers 4 months ago

upvoted 4 papers 6 months ago

Pre-Trained Policy Discriminators are General Reward Models

Paper • 2507.05197 • Published Jul 7 • 39

InstructTTSEval: Benchmarking Complex Natural-Language Instruction Following in Text-to-Speech Systems

Paper • 2506.16381 • Published Jun 19 • 2

LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs

Paper • 2506.14429 • Published Jun 17 • 44

Beyond Homogeneous Attention: Memory-Efficient LLMs via Fourier-Approximated KV Cache

Paper • 2506.11886 • Published Jun 13 • 20

upvoted an article 9 months ago

Article

Welcome to the Falcon 3 Family of Open Models!

Dec 17, 2024

•

129

upvoted a paper 10 months ago

World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning

Paper • 2503.10480 • Published Mar 13 • 55

QinyuanCheng

AI & ML interests

Recent Activity

Organizations

Cqy2019's activity

Welcome to the Falcon 3 Family of Open Models!