Reverse-Engineered Reasoning
- TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling
  Paper • 2508.17445 • Published • 80
- m-a-p/TreePO-Qwen2.5-7B
  Text Generation • 8B • Updated • 54 • 2
- m-a-p/TreePO_data
  Viewer • Updated • 3.12k • 115
- m-a-p/TreePO-Qwen2.5-7B_fixed-div
  8B • Updated • 60
All 1.3B & 340M hybrid linear-attention experiments.
This is the collection of COIG-P's models
- m-a-p/Infinity-Instruct-3M-0625-Llama3-8B-COIG-P
  Text Generation • 8B • Updated • 56
- m-a-p/Qwen2.5-Instruct-7B-COIG-P
  Text Generation • 8B • Updated • 58
- m-a-p/Infinity-Instruct-3M-0625-Mistral-7B-COIG-P
  Text Generation • 7B • Updated • 60
- m-a-p/Qwen2-Instruct-7B-COIG-P
  Text Generation • 8B • Updated • 57
YuE: Open Full-song Generation Foundation Model
- m-a-p/YuE-s1-7B-anneal-en-cot
  Text Generation • 6B • Updated • 29.4k • 435
- m-a-p/YuE-s1-7B-anneal-en-icl
  Text Generation • 6B • Updated • 950 • 52
- m-a-p/YuE-s1-7B-anneal-jp-kr-cot
  Text Generation • 6B • Updated • 444 • 21
- m-a-p/YuE-s1-7B-anneal-jp-kr-icl
  Text Generation • 6B • Updated • 123 • 11
The checkpoints for the MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training.
Dataset and Models of Chinese Open Instruction Generalist Series.
Datasets, Benchmark and Models of ChatMusician: Understanding and Generating Music Intrinsically with LLM
Neo
https://arxiv.org/abs/2406.13923
CriticLean
Data and model collection for MARBLE: https://github.com/a43992899/MARBLE/
This is the collection of COIG-P's datasets
- m-a-p/COIG-P
  Viewer • Updated • 1.01M • 230 • 28
- m-a-p/COIG-P-CRM
  Viewer • Updated • 484k • 66 • 4
- m-a-p/COIG-CRBench
  Viewer • Updated • 1.04k • 29 • 2
- COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values
  Paper • 2504.05535 • Published • 44
MuPT
- m-a-p/OpenCodeInterpreter-DS-1.3B
  Text Generation • 1B • Updated • 82 • 25
- m-a-p/OpenCodeInterpreter-DS-6.7B
  Text Generation • 7B • Updated • 18.3k • 135
- m-a-p/OpenCodeInterpreter-DS-33B
  Text Generation • Updated • 188 • 148
- m-a-p/OpenCodeInterpreter-CL-7B
  Text Generation • Updated • 109 • 11
- MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training
  Paper • 2306.00107 • Published • 4
- MusiLingo: Bridging Music and Text with Pre-trained Language Models for Music Captioning and Query Response
  Paper • 2309.08730 • Published • 2
- ChatMusician: Understanding and Generating Music Intrinsically with LLM
  Paper • 2402.16153 • Published • 60
- CMMMU: A Chinese Massive Multi-discipline Multimodal Understanding Benchmark
  Paper • 2401.11944 • Published • 27
These are the checkpoints and datasets of MusiLingo: Bridging Music and Text with Pre-trained Language Models for Music Captioning and Query Response