Distill Compatibility for PC w/ Ryzen 7 Pro 8840HS w/ 780M Graphics 2x32GB RAM 1TB DDR5 SSD
1
#115 opened 9 months ago
by
arzx
Upload gitattributes.txt
#114 opened 9 months ago
by
SafeerChalil
Introducing Deepseek's TinyZero
❤️
1
1
#113 opened 9 months ago
by
DeepSeekModerator
Create Kuch v
1
#112 opened 9 months ago
by
gamerdowntown
Request: DOI
#111 opened 9 months ago
by
Hassanabbas2975
quantization fp8 error occuring while using pipeline approach or transformer based approach
1
#110 opened 9 months ago
by
neethuvm
Deepseek-R1
#109 opened 9 months ago
by
KudanTao
deepseek-r1 源码中采用 MLA 架构的 KV Cache 压缩存储策略的实现似乎与文中说的不一致,这是为什么?代码中似乎没实现这个大优化
👍
3
2
#108 opened 9 months ago
by
Darkdust
Eating food in a car
#106 opened 9 months ago
by
Ayinbaby1313
Update README.md
#103 opened 9 months ago
by
jungvaclav
error while downloading model
👍
11
8
#102 opened 9 months ago
by
heikhama1982
Upload IMG_20250112_172711.jpg
#101 opened 9 months ago
by
aamir1
help from italy
5
#100 opened 9 months ago
by
MMPPIIAA
R1 distill to Mistral Small?
❤️
10
4
#99 opened 9 months ago
by
nfunctor
Running this model on Google Colab?
👍
1
3
#98 opened 9 months ago
by
Zakia
请问下deepseek的同学,能不能train出一个 stable 的 moe model?
#97 opened 9 months ago
by
tflchina
How to download DeepSeek-R1 7B parameters
1
#96 opened 9 months ago
by
barqawiz
HuggingFace version does NOT use efficient MLA caching
2
#95 opened 9 months ago
by
Avelina
Found a bug
#93 opened 9 months ago
by
amalgunatilake
Let's Give Credit Where It’s Due: Adding Source Links to AI Responses
🔥
👀
1
3
#88 opened 9 months ago
by
Munis01
When will this be available in Transformers library?
👍
2
#87 opened 9 months ago
by
solwol
cannot regenerate (blank respone)
#86 opened 9 months ago
by
pluhong
A Bug using hugging face API
3
#85 opened 9 months ago
by
Kevin355
Do we need an authorization access to use this ?
#84 opened 9 months ago
by
Natwar
where is the source code for this Model ? - what does they prodoudly say by open-source models?
1
#83 opened 9 months ago
by
tstarksys
智王发布deepseek-r1懒人包,解压即用Deepseek-r1 Lazy Package, easy to decompress and use
1
#81 opened 9 months ago
by
zwpython
model-00078-of-000163.safetensors not marked safe?
2
#80 opened 9 months ago
by
aborst
Create Dare
#79 opened 9 months ago
by
Dara996
problem with using serverless inference
1
#78 opened 9 months ago
by
manju2345
Some weird sensorship on unsensitive topic. 对非敏感话题的奇怪审查。
8
#77 opened 9 months ago
by
junnanwu
Upload dkfoEtm3H4bMcaI0KEJbq.1023.jpeg
#76 opened 9 months ago
by
luckysalami089
Update README.md
#75 opened 9 months ago
by
NuoNb
🚩 Report: Ethical issue(s)
#74 opened 9 months ago
by
Typeofprototype
Deepseek-R1 falls: ZW demon redesigns' Nine Birds' Deepseek-R1沦陷:zw魔改版“九只鸟”
#73 opened 9 months ago
by
zwpython
Consistency, can Deepseek pass?一致性,deepseek能及格吗?
#71 opened 9 months ago
by
zwpython
Does this model support text insertion (fill in middle)?
2
#70 opened 9 months ago
by
AayushShah
Thoughts on deepseek-r1. Correct me if I'm wrong
🚀
🔥
5
1
#69 opened 9 months ago
by
pkms
ImportError: cannot import name 'is_torch_greater_or_equal_than_1_13' from 'transformers.pytorch_utils'
➕
24
11
#67 opened 9 months ago
by
bashir-abubakar
e-currency
3
#63 opened 9 months ago
by
Zhendaxie
Meet PEEPSEEK, the first meme made by DeepSeek r1
👀
1
1
#61 opened 9 months ago
by
deepseeker3b56
鲸 Logo transparent
#60 opened 9 months ago
by
DorianDarko2525
Meet Finley, the Whale of DeepSeek!
👍
❤️
5
#59 opened 9 months ago
by
deepseekjanus
最近的炒作和硬币
#58 opened 9 months ago
by
Chester1111
Official DeepThink Crypto Currency
1
#56 opened 9 months ago
by
qwen-llm
Congrats, this is the by far the best open source model! Just a few steps until complete domination (feedback)
🔥
2
1
#54 opened 9 months ago
by
Dampfinchen
deepseek
#53 opened 9 months ago
by
denizkaya2022
Modify abbreviations in benchmark images into full name to avoid confusion
👍
1
#52 opened 9 months ago
by
karminski
How to deploy DeepSeek-R1 witn LMDeploy ?
#48 opened 9 months ago
by
vansin