Yang Lee
innovation64
AI & ML interests
AGI
Recent Activity
upvoted a paper 3 days ago
Self-Distilled RLVR
upvoted an article 3 days ago
Welcome Gemma 4: Frontier multimodal intelligence on device
liked a model 11 days ago
Qwen/Qwen3.5-122B-A10B
Organizations
RAG
RAG research
- Beyond Chain-of-Thought: A Survey of Chain-of-X Paradigms for LLMs
  Paper • 2404.15676 • Published
- How faithful are RAG models? Quantifying the tug-of-war between RAG and LLMs' internal prior
  Paper • 2404.10198 • Published • 8
- RAFT: Adapting Language Model to Domain Specific RAG
  Paper • 2403.10131 • Published • 72
- FaaF: Facts as a Function for the evaluation of RAG systems
  Paper • 2403.03888 • Published
papaer selecting
- Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping
  Paper • 2402.14083 • Published • 47
- Linear Transformers are Versatile In-Context Learners
  Paper • 2402.14180 • Published • 7
- Training-Free Long-Context Scaling of Large Language Models
  Paper • 2402.17463 • Published • 24
- The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
  Paper • 2402.17764 • Published • 628
models 24
innovation64/gemma-2-2B-it-thinking-function_calling-V0
Updated
innovation64/llama3.1-nli
Updated
innovation64/llama3.1-8B-instruct-4bit-ruozhiba-4bit
Text Generation • 8B • Updated
innovation64/llama3.1-8B-instruct-4bit-ruozhiba-GGUF
8B • Updated • 38
innovation64/llama3.1-8B-instruct-4bit-ruozhiba-lora
Updated
innovation64/llama3.1-8B-instruct-4bit-ruozhiba-16
Text Generation • 8B • Updated
innovation64/speecht5_finetuned_voxpopuli_sl
Text-to-Speech • Updated • 3
innovation64/whisper-tiny-dv
Automatic Speech Recognition • Updated • 1
innovation64/distilhubert-finetuned-gtzan
Audio Classification • Updated • 14
innovation64/poca-aSoccerTwos
Reinforcement Learning • Updated • 8
datasets 0
None public yet