metadata
base_model: unsloth/DeepSeek-R1-Distill-Llama-8B
tags:
- text-generation
- transformers
- unsloth
- deepseek
- llama
- trl
license: apache-2.0
language:
- en
DeepSeek Cybersecurity Model v1.1
This is a fine-tuned version of Unsloth’s DeepSeek-R1-Distill-Llama-8B model, adapted for cybersecurity-focused text generation tasks.
Overview
- Base model:
unsloth/DeepSeek-R1-Distill-Llama-8B
- Developer: yanmyoaung
- Fine-tuned with: Unsloth + Hugging Face TRL
- Merged Weights: LoRA adapter merged into base model (16-bit)
- License: Apache 2.0
Model Purpose
This model is optimized for generating and understanding cybersecurity-related content, such as:
- Threat intelligence summaries
- Vulnerability analysis
- Incident response suggestions
- Cybersecurity Q&A and explanation generation
Inference Example
from transformers import AutoModelForCausalLM, AutoTokenizer
model = AutoModelForCausalLM.from_pretrained("yanmyoaung04/deepseek-cybersecurity-model-v1.1")
tokenizer = AutoTokenizer.from_pretrained("yanmyoaung04/deepseek-cybersecurity-model-v1.1")
prompt = "Explain what a buffer overflow vulnerability is."
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=150)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
License
Apache 2.0 — free for academic and commercial use.