Stoney Kang's picture

563 26

Stoney Kang

sikang99

·

AI & ML interests

Remote Control based on Vision

Recent Activity

upvoted a paper about 12 hours ago

SUCCESS-GS: Survey of Compactness and Compression for Efficient Static and Dynamic Gaussian Splatting

upvoted a paper about 12 hours ago

TrackingWorld: World-centric Monocular 3D Tracking of Almost All Pixels

upvoted a paper about 12 hours ago

Ground Slow, Move Fast: A Dual-System Foundation Model for Generalizable Vision-and-Language Navigation

View all activity

Organizations

upvoted 3 papers about 12 hours ago

SUCCESS-GS: Survey of Compactness and Compression for Efficient Static and Dynamic Gaussian Splatting

Paper • 2512.07197 • Published 3 days ago • 2

TrackingWorld: World-centric Monocular 3D Tracking of Almost All Pixels

Paper • 2512.08358 • Published 1 day ago • 2

Ground Slow, Move Fast: A Dual-System Foundation Model for Generalizable Vision-and-Language Navigation

Paper • 2512.08186 • Published 2 days ago • 3

upvoted a paper about 13 hours ago

Visionary: The World Model Carrier Built on WebGPU-Powered Gaussian Splatting Platform

Paper • 2512.08478 • Published 1 day ago • 65

upvoted 4 papers 1 day ago

Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction

Paper • 2512.04987 • Published 6 days ago • 71

VideoVLA: Video Generators Can Be Generalizable Robot Manipulators

Paper • 2512.06963 • Published 3 days ago • 2

ReCamDriving: LiDAR-Free Camera-Controlled Novel Trajectory Video Generation

Paper • 2512.03621 • Published 8 days ago • 8

Joint 3D Geometry Reconstruction and Motion Generation for 4D Synthesis from a Single Image

Paper • 2512.05044 • Published 6 days ago • 15

upvoted 2 papers 5 days ago

NeuralRemaster: Phase-Preserving Diffusion for Structure-Aligned Generation

Paper • 2512.05106 • Published 6 days ago • 15

Generative Neural Video Compression via Video Diffusion Prior

Paper • 2512.05016 • Published 6 days ago • 8

upvoted 9 papers 6 days ago

SIMA 2: A Generalist Embodied Agent for Virtual Worlds

Paper • 2512.04797 • Published 7 days ago • 17

DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling

Paper • 2512.03000 • Published 8 days ago • 34

SeeNav-Agent: Enhancing Vision-Language Navigation with Visual Prompt and Step-Level Policy Optimization

Paper • 2512.02631 • Published 9 days ago • 7

On GRPO Collapse in Search-R1: The Lazy Likelihood-Displacement Death Spiral

Paper • 2512.04220 • Published 7 days ago • 11

SR-GRPO: Stable Rank as an Intrinsic Geometric Reward for Large Language Model Alignment

Paper • 2512.02807 • Published 8 days ago • 7

RELIC: Interactive Video World Model with Long-Horizon Memory

Paper • 2512.04040 • Published 7 days ago • 22

Qwen3-VL Technical Report

Paper • 2511.21631 • Published 14 days ago • 119

Steering Vision-Language-Action Models as Anti-Exploration: A Test-Time Scaling Approach

Paper • 2512.02834 • Published 8 days ago • 39

GoRL: An Algorithm-Agnostic Framework for Online Reinforcement Learning with Generative Policies

Paper • 2512.02581 • Published 9 days ago • 13

upvoted a paper 7 days ago

Visual Sync: Multi-Camera Synchronization via Cross-View Object Motion

Paper • 2512.02017 • Published 9 days ago • 3