Finetune of Qwen-2.5-7B model on a dump of DTF posts and comments.
Nikita Sushko
chameleon-lizard
AI & ML interests
NLP, Multilingual Models, Multiagent Systems
Recent Activity
upvoted
a
paper
12 days ago
nablaNABLA: Neighborhood Adaptive Block-Level Attention
upvoted
a
paper
19 days ago
RiemannLoRA: A Unified Riemannian Framework for Ambiguity-Free LoRA
Optimization