|
22489 | 22489 | - filename: PokeeAI.pokee_research_7b.Q4_K_M.gguf |
22490 | 22490 | sha256: 4cf6e52b51f3595631b6e17ad2d5a5c8499d2e646cf9452dd32fb5ff7bbb37dd |
22491 | 22491 | uri: huggingface://DevQuasar/PokeeAI.pokee_research_7b-GGUF/PokeeAI.pokee_research_7b.Q4_K_M.gguf |
| 22492 | +- !!merge <<: *gptoss |
| 22493 | + name: "gpt-oss-20b-esper3.1-i1" |
| 22494 | + urls: |
| 22495 | + - https://huggingface.co/mradermacher/gpt-oss-20b-Esper3.1-i1-GGUF |
| 22496 | + description: | |
| 22497 | + **Model Name:** gpt-oss-20b-Esper3.1 |
| 22498 | + **Repository:** [ValiantLabs/gpt-oss-20b-Esper3.1](https://huggingface.co/ValiantLabs/gpt-oss-20b-Esper3.1) |
| 22499 | + **Base Model:** openai/gpt-oss-20b |
| 22500 | + **Type:** Instruction-tuned, reasoning-focused language model |
| 22501 | + **Size:** 20 billion parameters |
| 22502 | + **License:** Apache 2.0 |
| 22503 | + |
| 22504 | + --- |
| 22505 | + |
| 22506 | + ### 🔍 **Overview** |
| 22507 | + gpt-oss-20b-Esper3.1 is a specialized, instruction-tuned variant of the 20B open-source GPT model, developed by **Valiant Labs**. It excels in **advanced coding, software architecture, and DevOps reasoning**, making it ideal for technical problem-solving and AI-driven engineering tasks. |
| 22508 | + |
| 22509 | + ### ✨ **Key Features** |
| 22510 | + - **Expert in DevOps & Cloud Systems:** Trained on high-difficulty datasets (e.g., Titanium3, Tachibana3, Mitakihara), it delivers precise, actionable guidance for AWS, Kubernetes, Terraform, Ansible, Docker, Jenkins, and more. |
| 22511 | + - **Strong Code Reasoning:** Optimized for complex programming tasks, including full-stack development, scripting, and debugging. |
| 22512 | + - **High-Quality Inference:** Uses `bf16` precision for full-precision performance; quantized versions (e.g., GGUF) available for efficient local inference. |
| 22513 | + - **Open-Source & Free to Use:** Fully open-access, built on the public gpt-oss-20b foundation and trained with community datasets. |
| 22514 | + |
| 22515 | + ### 📌 **Use Cases** |
| 22516 | + - Designing scalable cloud architectures |
| 22517 | + - Writing and optimizing infrastructure-as-code |
| 22518 | + - Debugging complex DevOps pipelines |
| 22519 | + - AI-assisted software development and documentation |
| 22520 | + - Real-time technical troubleshooting |
| 22521 | + |
| 22522 | + ### 💡 **Getting Started** |
| 22523 | + Use the standard `text-generation` pipeline with the `transformers` library. Supports role-based prompting (e.g., `user`, `assistant`) and performs best with high-reasoning prompts. |
| 22524 | + |
| 22525 | + ```python |
| 22526 | + from transformers import pipeline |
| 22527 | + |
| 22528 | + pipe = pipeline("text-generation", model="ValiantLabs/gpt-oss-20b-Esper3.1", torch_dtype="auto", device_map="auto") |
| 22529 | + messages = [{"role": "user", "content": "Design a Kubernetes cluster for a high-traffic web app with CI/CD via GitHub Actions."}] |
| 22530 | + outputs = pipe(messages, max_new_tokens=2000) |
| 22531 | + print(outputs[0]["generated_text"][-1]) |
| 22532 | + ``` |
| 22533 | + |
| 22534 | + --- |
| 22535 | + |
| 22536 | + > 🔗 **Model Gallery Entry**: |
| 22537 | + > *gpt-oss-20b-Esper3.1 – A powerful, open-source 20B model tuned for expert-level DevOps, coding, and system architecture. Built by Valiant Labs using high-quality technical datasets. Perfect for engineers, architects, and AI developers.* |
| 22538 | + overrides: |
| 22539 | + parameters: |
| 22540 | + model: gpt-oss-20b-Esper3.1.i1-Q4_K_M.gguf |
| 22541 | + files: |
| 22542 | + - filename: gpt-oss-20b-Esper3.1.i1-Q4_K_M.gguf |
| 22543 | + sha256: 079683445913d12e70449a10b9e1bfc8adaf1e7917e86cf3be3cb29cca186f11 |
| 22544 | + uri: huggingface://mradermacher/gpt-oss-20b-Esper3.1-i1-GGUF/gpt-oss-20b-Esper3.1.i1-Q4_K_M.gguf |
0 commit comments