ReSum: Unlocking Long-Horizon Search Intelligence via Context Summarization Paper • 2509.13313 • Published Sep 16 • 77
WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic Data and Scalable Reinforcement Learning Paper • 2509.13305 • Published Sep 16 • 88
MMTok: Multimodal Coverage Maximization for Efficient Inference of VLMs Paper • 2508.18264 • Published Aug 25 • 25
LiveMCP-101: Stress Testing and Diagnosing MCP-enabled Agents on Challenging Queries Paper • 2508.15760 • Published Aug 21 • 46
WebWatcher: Breaking New Frontier of Vision-Language Deep Research Agent Paper • 2508.05748 • Published Aug 7 • 137
Instant Facial Gaussians Translator for Relightable and Interactable Facial Rendering Paper • 2409.07441 • Published Sep 11, 2024 • 11
CAST: Component-Aligned 3D Scene Reconstruction from an RGB Image Paper • 2502.12894 • Published Feb 18 • 18
MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning Paper • 2401.10727 • Published Jan 19, 2024 • 2
TransRAC: Encoding Multi-scale Temporal Correlation with Transformers for Repetitive Action Counting Paper • 2204.01018 • Published Apr 3, 2022
Weakly Supervised Video Representation Learning with Unaligned Text for Sequential Videos Paper • 2303.12370 • Published Mar 22, 2023
Efficient Post-Training Refinement of Latent Reasoning in Large Language Models Paper • 2506.08552 • Published Jun 10 • 1
Teaching Time Series to See and Speak: Forecasting with Aligned Visual and Textual Perspectives Paper • 2506.24124 • Published Jun 30 • 1
Leveraging Large Language Models for Effective Label-free Node Classification in Text-Attributed Graphs Paper • 2412.11983 • Published Dec 16, 2024 • 1
Intelligent System for Automated Molecular Patent Infringement Assessment Paper • 2412.07819 • Published Dec 10, 2024 • 1
Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields Paper • 2503.20776 • Published Mar 26 • 10
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions Paper • 2412.09596 • Published Dec 12, 2024 • 98
Art-Free Generative Models: Art Creation Without Graphic Art Knowledge Paper • 2412.00176 • Published Nov 29, 2024 • 9
Dual-level Adaptive Self-Labeling for Novel Class Discovery in Point Cloud Segmentation Paper • 2407.12489 • Published Jul 17, 2024
Efficient Detection of Toxic Prompts in Large Language Models Paper • 2408.11727 • Published Aug 21, 2024 • 13