Models and datasets from my book. All the code is freely available at https://github.com/PacktPublishing/LLM-Engineers-Handbook
Maxime Labonne PRO
mlabonne
AI & ML interests
Post-training, model editing, quantization
Recent Activity
liked
a dataset
1 day ago
data-agents/jupyter-agent-dataset
Organizations
👿 Daredevil-8B
Fine-tuned abliterated merge of the best Llama 3 8B model. Highest MMLU score in its category.
-
mlabonne/NeuralDaredevil-8B-abliterated
Text Generation • 8B • Updated • 14.6k • • 223 -
mlabonne/Daredevil-8B-abliterated
Text Generation • 8B • Updated • 9.97k • • 51 -
mlabonne/Daredevil-8B
Text Generation • 8B • Updated • 74 • 42 -
mlabonne/NeuralLlama-3-8B-Instruct-abliterated
Text Generation • 8B • Updated • 1.17k • • 10
🔮 Mixture of Experts
MoE done using mergekit and LazyMergekit: https://colab.research.google.com/drive/1obulZ1ROXHjYLn6PPZJwRR6GzgQogxxb#scrollTo=d5mYzDo1q96y
🐶 Beagle
Merges done using mergekit and LazyMergekit: https://colab.research.google.com/drive/1obulZ1ROXHjYLn6PPZJwRR6GzgQogxxb#scrollTo=d5mYzDo1q96y
🥼 DrMistral
Mistral and Llama models trained on a corpus of French and English data to act as a medical chatbot and ace exams.
🦙 Llama 2 Guanaco
Set of models fine-tuned using QLoRA on Google Colab with the Guanaco dataset.
✂️ Abliteration
Uncensored models using abliteration. See this article for more information: huggingface.co/blog/mlabonne/abliteration
-
mlabonne/gemma-3-27b-it-abliterated
Image-Text-to-Text • 27B • Updated • 5.26k • • 197 -
mlabonne/gemma-3-27b-it-abliterated-GGUF
Image-Text-to-Text • 27B • Updated • 28.6k • 152 -
mlabonne/gemma-3-12b-it-abliterated-v2
Image-Text-to-Text • 12B • Updated • 1.41k • 10 -
mlabonne/gemma-3-4b-it-abliterated-v2
Image-Text-to-Text • 4B • Updated • 2.68k • 9
👑 Monarch
Family of 7B models that combine excellent reasoning and conversational abilities.
-
mlabonne/AlphaMonarch-7B
Text Generation • 7B • Updated • 10.9k • • 148 -
Running on Zero2727
AlphaMonarch-7B
👑Generate text based on user messages and a chat history
-
mlabonne/NeuralMonarch-7B
Text Generation • 7B • Updated • 27.3k • • 12 -
Sleeping66
NeuralMonarch 7B GGUF Chat
👑Chat with NeuralMonarch-7B
🔀 Phixtral
The first Mixture of Experts with phi-2 models.
🧠 NeuralHermes-2.5
Models and code related to the DPO fine-tuned OpenHermes-2.5-Mistral-7B
💻 CodeLlama
Llama and CodeLlama models trained to improve the performance in terms of code generation.
📙 LLM Engineer's Handbook
Models and datasets from my book. All the code is freely available at https://github.com/PacktPublishing/LLM-Engineers-Handbook
✂️ Abliteration
Uncensored models using abliteration. See this article for more information: huggingface.co/blog/mlabonne/abliteration
-
mlabonne/gemma-3-27b-it-abliterated
Image-Text-to-Text • 27B • Updated • 5.26k • • 197 -
mlabonne/gemma-3-27b-it-abliterated-GGUF
Image-Text-to-Text • 27B • Updated • 28.6k • 152 -
mlabonne/gemma-3-12b-it-abliterated-v2
Image-Text-to-Text • 12B • Updated • 1.41k • 10 -
mlabonne/gemma-3-4b-it-abliterated-v2
Image-Text-to-Text • 4B • Updated • 2.68k • 9
👿 Daredevil-8B
Fine-tuned abliterated merge of the best Llama 3 8B model. Highest MMLU score in its category.
-
mlabonne/NeuralDaredevil-8B-abliterated
Text Generation • 8B • Updated • 14.6k • • 223 -
mlabonne/Daredevil-8B-abliterated
Text Generation • 8B • Updated • 9.97k • • 51 -
mlabonne/Daredevil-8B
Text Generation • 8B • Updated • 74 • 42 -
mlabonne/NeuralLlama-3-8B-Instruct-abliterated
Text Generation • 8B • Updated • 1.17k • • 10
👑 Monarch
Family of 7B models that combine excellent reasoning and conversational abilities.
-
mlabonne/AlphaMonarch-7B
Text Generation • 7B • Updated • 10.9k • • 148 -
Running on Zero2727
AlphaMonarch-7B
👑Generate text based on user messages and a chat history
-
mlabonne/NeuralMonarch-7B
Text Generation • 7B • Updated • 27.3k • • 12 -
Sleeping66
NeuralMonarch 7B GGUF Chat
👑Chat with NeuralMonarch-7B
🔮 Mixture of Experts
MoE done using mergekit and LazyMergekit: https://colab.research.google.com/drive/1obulZ1ROXHjYLn6PPZJwRR6GzgQogxxb#scrollTo=d5mYzDo1q96y
🔀 Phixtral
The first Mixture of Experts with phi-2 models.
🐶 Beagle
Merges done using mergekit and LazyMergekit: https://colab.research.google.com/drive/1obulZ1ROXHjYLn6PPZJwRR6GzgQogxxb#scrollTo=d5mYzDo1q96y
🧠 NeuralHermes-2.5
Models and code related to the DPO fine-tuned OpenHermes-2.5-Mistral-7B
🥼 DrMistral
Mistral and Llama models trained on a corpus of French and English data to act as a medical chatbot and ace exams.
💻 CodeLlama
Llama and CodeLlama models trained to improve the performance in terms of code generation.
🦙 Llama 2 Guanaco
Set of models fine-tuned using QLoRA on Google Colab with the Guanaco dataset.