ShanghaiTech University

university

Verified

https://www.shanghaitech.edu.cn/

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

kkzsocute authored a paper 27 days ago

Kairos: Towards Adaptive and Generalizable Time Series Foundation Models

HowieYan authored a paper about 2 months ago

CAST: Component-Aligned 3D Scene Reconstruction from an RGB Image

HowieYan authored a paper about 2 months ago

P3-SAM: Native 3D Part Segmentation

View all activity

zhaoyd

authored 2 papers about 1 month ago

ReSum: Unlocking Long-Horizon Search Intelligence via Context Summarization

Paper • 2509.13313 • Published Sep 16 • 77

WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic Data and Scalable Reinforcement Learning

Paper • 2509.13305 • Published Sep 16 • 88

Ironieser

authored 2 papers 2 months ago

MMTok: Multimodal Coverage Maximization for Efficient Inference of VLMs

Paper • 2508.18264 • Published Aug 25 • 25

LiveMCP-101: Stress Testing and Diagnosing MCP-enabled Agents on Challenging Queries

Paper • 2508.15760 • Published Aug 21 • 46

zhaoyd

authored a paper 3 months ago

WebWatcher: Breaking New Frontier of Vision-Language Deep Research Agent

Paper • 2508.05748 • Published Aug 7 • 137

ZarkLngeW

authored 3 papers 3 months ago

Instant Facial Gaussians Translator for Relightable and Interactable Facial Rendering

Paper • 2409.07441 • Published Sep 11, 2024 • 11

CAST: Component-Aligned 3D Scene Reconstruction from an RGB Image

Paper • 2502.12894 • Published Feb 18 • 18

BANG: Dividing 3D Assets via Generative Exploded Dynamics

Paper • 2507.21493 • Published Jul 29 • 64

Ironieser

authored 5 papers 3 months ago

MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning

Paper • 2401.10727 • Published Jan 19, 2024 • 2

TransRAC: Encoding Multi-scale Temporal Correlation with Transformers for Repetitive Action Counting

Paper • 2204.01018 • Published Apr 3, 2022

Weakly Supervised Video Representation Learning with Unaligned Text for Sequential Videos

Paper • 2303.12370 • Published Mar 22, 2023

Efficient Post-Training Refinement of Latent Reasoning in Large Language Models

Paper • 2506.08552 • Published Jun 10 • 1

Teaching Time Series to See and Speak: Forecasting with Aligned Visual and Textual Perspectives

Paper • 2506.24124 • Published Jun 30 • 1

lannester

authored 2 papers 6 months ago

Leveraging Large Language Models for Effective Label-free Node Classification in Text-Attributed Graphs

Paper • 2412.11983 • Published Dec 16, 2024 • 1

Intelligent System for Automated Molecular Patent Infringement Assessment

Paper • 2412.07819 • Published Dec 10, 2024 • 1

rhfeiyang

authored a paper 7 months ago

Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields

Paper • 2503.20776 • Published Mar 26 • 10

XinyueZhang1997

authored a paper 11 months ago

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Paper • 2412.09596 • Published Dec 12, 2024 • 98

rhfeiyang

authored 2 papers 11 months ago

Art-Free Generative Models: Art Creation Without Graphic Art Knowledge

Paper • 2412.00176 • Published Nov 29, 2024 • 9

Dual-level Adaptive Self-Labeling for Novel Class Discovery in Point Cloud Segmentation

Paper • 2407.12489 • Published Jul 17, 2024

kirkyDig

authored a paper about 1 year ago

Efficient Detection of Toxic Prompts in Large Language Models

Paper • 2408.11727 • Published Aug 21, 2024 • 13