DeepSeek Cybersecurity Model v1.1
This is a fine-tuned version of Unsloth’s DeepSeek-R1-Distill-Llama-8B model, adapted for cybersecurity-focused text generation tasks.
Overview
- Base model:
unsloth/DeepSeek-R1-Distill-Llama-8B
- Developer: yanmyoaung
- Fine-tuned with: Unsloth + Hugging Face TRL
- Merged Weights: LoRA adapter merged into base model (16-bit)
- License: Apache 2.0
Model Purpose
This model is optimized for generating and understanding cybersecurity-related content, such as:
- Threat intelligence summaries
- Vulnerability analysis
- Incident response suggestions
- Cybersecurity Q&A and explanation generation
Inference Example
from transformers import AutoModelForCausalLM, AutoTokenizer
model = AutoModelForCausalLM.from_pretrained("yanmyoaung04/deepseek-cybersecurity-model-v1.1")
tokenizer = AutoTokenizer.from_pretrained("yanmyoaung04/deepseek-cybersecurity-model-v1.1")
prompt = "Explain what a buffer overflow vulnerability is."
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=150)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
License
Apache 2.0 — free for academic and commercial use.
- Downloads last month
- 175
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
🙋
Ask for provider support
Model tree for yanmyoaung04/deepseek-cybersecurity-model-v1.1
Base model
deepseek-ai/DeepSeek-R1-Distill-Llama-8B
Finetuned
unsloth/DeepSeek-R1-Distill-Llama-8B