AI & ML interests

A large-scale, real-robot benchmark platform for embodied intelligence

Recent Activity

AdinaY 
posted an update 4 days ago
HunyuanWorld-Mirror 🔥 a versatile feed-forward model for universal 3D world reconstruction by Tencent

tencent/HunyuanWorld-Mirror

✨ Any prior in → 3D world out
✨ Mix camera, intrinsics, depth as priors
✨ Predict point clouds, normals, Gaussians & more in one pass
✨ Unified architecture for all 3D tasks
AdinaY 
posted an update 9 days ago
PaddleOCR-VL 🔥 0.9B multilingual VLM by Baidu

PaddlePaddle/PaddleOCR-VL

✨ Ultra-efficient NaViT + ERNIE-4.5 architecture
✨ Supports 109 languages 🤯
✨ Accurately recognizes text, tables, formulas & charts
✨ Fast inference and lightweight for deployment
AdinaY 
posted an update 13 days ago
Ring-1T 🔥 the trillion-parameter thinking model released by Ant Group, the company behind Alipay

inclusionAI/Ring-1T

✨ 1T params (50B active) - MIT license
✨ 128K context (YaRN)
✨ RLVR, Icepop, and ASystem make trillion-scale RL stable
AdinaY 
posted an update 17 days ago
KAT-Dev-72B-Exp 🔥 a new open model for software engineering from Kuaishou (the company behind Kling AI)

Kwaipilot/KAT-Dev-72B-Exp

✨ 72B - Apache 2.0
✨ Redesigned attention kernel & training engine for efficient context-aware RL
✨ 74.6% accuracy on SWE-Bench Verified
AdinaY 
posted an update 18 days ago
At the close of the National Holiday 🇨🇳, Ant Group drops a new SoTA model.

Ling-1T 🔥 the trillion-parameter flagship of the Ling 2.0 series.

inclusionAI/Ling-1T

✨ 1T total / 50B active params per token
✨ 20T+ reasoning-dense tokens (Evo-CoT)
✨ 128K context via YaRN
✨ FP8 training: 15%+ faster, same precision as BF16
✨ Hybrid Syntax-Function-Aesthetics reward for front-end & visual generation
AdinaY 
posted an update 23 days ago
New release from Ant Group 🔥

inclusionAI/ming-v2-68ddea4954413c128d706630

✨ MingTok (Vision & Audio): continuous unified tokenizer, no quantization, preserves semantic & perceptual fidelity, enables faster convergence.

✨ Ming-UniVision: MLLM unifying image understanding + generation, supports multi-round editing & visualized CoT.

✨ Ming-UniAudio: unified speech LLM for ASR, TTS & free-form editing, integrates semantic + acoustic features for high-fidelity audio.
AdinaY 
posted an update 25 days ago
🔥 September highlights from the Chinese open-source community

zh-ai-community/september-2025-china-open-source-highlights-68b55c9e757c439ad9dd6aba

✨ Massive releases from the two tech giants

- At the Alibaba Cloud Summit, Qwen dropped at least 7 new model series (some are not open source).
- Since June, Tencent has doubled down on open source, especially after Hunyuan gained traction

✨ Some of the community’s hottest models come from startups.

- Kimi K2-0905
- GLM-4.6
- OpenBMB MiniCPM 4.1

✨ New players are pushing hard!

- Baidu ERNIE & Qianfan: enterprise-ready focus
- Ant Group: MoE + low-activation; from small to trillion, from core to reasoning fast track
- Xiaomi MiMo: stands out with Any-to-Any audio models

✨ Robotics is joining the open-source wave

- Unitree released its first open-source model
- BAAI launched RoboBrain-X0, an open-source robotics model + dataset

👀 Each month brings cooler models. After the 8-day National Holiday, expect another wave before the end of the year.

Stay tuned!
AdinaY 
posted an update 26 days ago
GLM-4.6 is here🚀

zai-org/GLM-4.6

✨ 200K context window
✨ Superior coding & polished UI generation
✨ Stronger reasoning & tool use
✨ More capable agents & agent frameworks
AdinaY 
posted an update 27 days ago
MOSS-Speech 🔊 bilingual native speech-to-speech model, from Fudan University.

fnlp/moss-speech-68dbab23bc98501afede0cd3

✨ Supports Chinese & English
✨ Layer-splitting architecture + frozen pretraining
✨ Preserves tone, emotion & prosody
AdinaY 
posted an update 27 days ago
RoboBrain-X0-Preview 🤖 a unified cross-embodiment VLA model from BAAI.

BAAI/robobrain-x0-68db67d3542e04c5d99f31f9

✨ Zero-shot generalization across heterogeneous robots
✨ Complex task decomposition & embodied reasoning
✨ Unified Action Vocabulary + OmniSAT tokenizer
✨ End-to-end: perception > reasoning > execution
✨ Full version coming soon 🔥
AdinaY 
posted an update 27 days ago
Ring-1T-preview 🔥 1T thinking model released by Ant Group.

inclusionAI/Ring-1T-preview

✨ MoE architecture + 20T tokens + RLVR via ASystem
✨ Strong natural language reasoning (AIME’25: 92.6, close to GPT-5)
✨ IMO tests: advanced problem-solving & reasoning
AdinaY 
posted an update about 1 month ago
Ring-mini-linear-2.0 🔥 a hybrid-attention MoE model released by Ant Group

inclusionAI/Ring-mini-linear-2.0

✨ Hybrid linear + standard attention
✨ 16.4B total, only 1.6B activated
✨ 512K context window via YaRN
✨ Faster than same-size MoE