Text Generation
Transformers
Safetensors
English
gpt_oss
esper
esper-3.1
esper-3
valiant
valiant-labs
gpt
gpt-oss
gpt-oss-20b
openai
20b
reasoning
code
code-instruct
python
javascript
dev-ops
jenkins
terraform
ansible
docker
kubernetes
helm
grafana
prometheus
shell
bash
azure
aws
gcp
cloud
scripting
powershell
problem-solving
architect
engineer
developer
creative
analytical
expert
rationality
conversational
chat
instruct
File size: 3,473 Bytes
f07f325 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 |
---
language:
- en
library_name: transformers
pipeline_tag: text-generation
tags:
- esper
- esper-3.1
- esper-3
- valiant
- valiant-labs
- gpt
- gpt-oss
- gpt-oss-20b
- openai
- 20b
- reasoning
- code
- code-instruct
- python
- javascript
- dev-ops
- jenkins
- terraform
- ansible
- docker
- jenkins
- kubernetes
- helm
- grafana
- prometheus
- shell
- bash
- azure
- aws
- gcp
- cloud
- scripting
- powershell
- problem-solving
- architect
- engineer
- developer
- creative
- analytical
- expert
- rationality
- conversational
- chat
- instruct
base_model: openai/gpt-oss-20b
datasets:
- sequelbox/Tachibana3-Part1-DeepSeek-V3.1-Terminus
- sequelbox/Tachibana3-Part2-DeepSeek-V3.2
- sequelbox/Titanium3-DeepSeek-V3.1-Terminus
- sequelbox/Mitakihara-DeepSeek-R1-0528
license: apache-2.0
---
**[Support our open-source dataset and model releases!](https://huggingface.co/spaces/sequelbox/SupportOpenSource)**

Esper 3.1: [Qwen3-4B-Thinking-2507](https://huggingface.co/ValiantLabs/Qwen3-4B-Thinking-2507-Esper3.1), [gpt-oss-20b](https://huggingface.co/ValiantLabs/gpt-oss-20b-Esper3.1)
Esper 3.1 is a coding, architecture, and DevOps reasoning specialist built on gpt-oss-20b.
- Your dedicated DevOps expert: Esper 3.1 maximizes DevOps and architecture helpfulness, powered by [high-difficulty DevOps and architecture data](https://huggingface.co/datasets/sequelbox/Titanium3-DeepSeek-V3.1-Terminus) generated with DeepSeek-V3.1-Terminus!
- Improved coding performance: challenging code-reasoning datasets stretch [DeepSeek-V3.1-Terminus](https://huggingface.co/datasets/sequelbox/Tachibana3-Part1-DeepSeek-V3.1-Terminus) and [DeepSeek-V3.2](https://huggingface.co/datasets/sequelbox/Tachibana3-Part2-DeepSeek-V3.2) to the limits, allowing Esper 3.1 to tackle harder coding tasks!
- AI to build AI: our [high-difficulty AI expertise data](https://huggingface.co/datasets/sequelbox/Mitakihara-DeepSeek-R1-0528) boosts Esper 3.1's MLOps, AI architecture, AI research, and general reasoning skills.
- Small model sizes allow running on local desktop and mobile, plus super-fast server inference!
## Prompting Guide
Esper 3.1 uses the [gpt-oss-20b](https://huggingface.co/openai/gpt-oss-20b) prompt format.
Esper 3.1 is a reasoning finetune; **reasoning level high is generally recommended.**
**NOTE: This release of Esper 3.1 uses bf16 for all parameters. Consider quantized models if you're not looking to use bf16.**
Example inference script provided by [gpt-oss-20b](https://huggingface.co/openai/gpt-oss-20b) to get started:
```python
from transformers import pipeline
import torch
model_id = "ValiantLabs/gpt-oss-20b-Esper3.1"
pipe = pipeline(
"text-generation",
model=model_id,
torch_dtype="auto",
device_map="auto",
)
messages = [
{"role": "user", "content": "Design a serverless architecture for a real-time image processing application using AWS Lambda and Amazon S3."},
]
outputs = pipe(
messages,
max_new_tokens=15000,
)
print(outputs[0]["generated_text"][-1])
```

Esper 3.1 is created by [Valiant Labs.](http://valiantlabs.ca/)
[Check out our HuggingFace page to see all of our models!](https://huggingface.co/ValiantLabs)
We care about open source. For everyone to use.
|