Asankhaya Sharma's picture

Asankhaya Sharma PRO

codelion

·

http://asankhaya.github.io/

AI & ML interests

Creator of OptiLLM, OpenEvolve, Adaptive Classifier, and PTS. Pioneering a new category in AI infrastructure: inference-time compute for LLMs.

Recent Activity

reacted to their post with ❤️ 1 day ago

Over 40 percent of AI-generated code contains security vulnerabilities. We recently worked on a LoRA to write secure code by default using automated Semgrep analysis and GRPO, achieving 97 percent reduction in vulnerabilities without requiring security-specific prompts. Technical Approach: Automated security training pipeline combining Semgrep vulnerability detection with preference learning. Generate multiple solutions with varying security awareness, automatically analyze for vulnerabilities, create preference pairs based on security scores, train using GRPO with multi-factor scoring. Scoring System (100 points total): - Functionality: 40 points - Does the code work correctly - Security patterns: 40 points - Uses secure coding practices - Low vulnerabilities: 20 points - Semgrep score below threshold This balanced scoring prevents reward hacking where models generate empty functions to avoid vulnerabilities. Real Transformation Examples: Database query before: query = f"SELECT * FROM products WHERE name = '{name}'" Database query after: query = "SELECT * FROM products WHERE name = ?" db.execute(query, (name,)) Password hashing before: password_hash = hashlib.md5(password).hexdigest() Password hashing after: salt = bcrypt.gensalt(rounds=12) password_hash = bcrypt.hashpw(password.encode('utf-8'), salt) Model: https://huggingface.co/codelion/Qwen2.5-Coder-0.5B-Instruct-security-grpo-lora Notebook: https://github.com/codelion/ellora/blob/main/Ellora_Recipe_5_Secure_Code_Generation_LoRA.ipynb Repository: https://github.com/codelion/ellora

reacted to their post with 👀 1 day ago

Over 40 percent of AI-generated code contains security vulnerabilities. We recently worked on a LoRA to write secure code by default using automated Semgrep analysis and GRPO, achieving 97 percent reduction in vulnerabilities without requiring security-specific prompts. Technical Approach: Automated security training pipeline combining Semgrep vulnerability detection with preference learning. Generate multiple solutions with varying security awareness, automatically analyze for vulnerabilities, create preference pairs based on security scores, train using GRPO with multi-factor scoring. Scoring System (100 points total): - Functionality: 40 points - Does the code work correctly - Security patterns: 40 points - Uses secure coding practices - Low vulnerabilities: 20 points - Semgrep score below threshold This balanced scoring prevents reward hacking where models generate empty functions to avoid vulnerabilities. Real Transformation Examples: Database query before: query = f"SELECT * FROM products WHERE name = '{name}'" Database query after: query = "SELECT * FROM products WHERE name = ?" db.execute(query, (name,)) Password hashing before: password_hash = hashlib.md5(password).hexdigest() Password hashing after: salt = bcrypt.gensalt(rounds=12) password_hash = bcrypt.hashpw(password.encode('utf-8'), salt) Model: https://huggingface.co/codelion/Qwen2.5-Coder-0.5B-Instruct-security-grpo-lora Notebook: https://github.com/codelion/ellora/blob/main/Ellora_Recipe_5_Secure_Code_Generation_LoRA.ipynb Repository: https://github.com/codelion/ellora

reacted to their post with 🚀 1 day ago

Over 40 percent of AI-generated code contains security vulnerabilities. We recently worked on a LoRA to write secure code by default using automated Semgrep analysis and GRPO, achieving 97 percent reduction in vulnerabilities without requiring security-specific prompts. Technical Approach: Automated security training pipeline combining Semgrep vulnerability detection with preference learning. Generate multiple solutions with varying security awareness, automatically analyze for vulnerabilities, create preference pairs based on security scores, train using GRPO with multi-factor scoring. Scoring System (100 points total): - Functionality: 40 points - Does the code work correctly - Security patterns: 40 points - Uses secure coding practices - Low vulnerabilities: 20 points - Semgrep score below threshold This balanced scoring prevents reward hacking where models generate empty functions to avoid vulnerabilities. Real Transformation Examples: Database query before: query = f"SELECT * FROM products WHERE name = '{name}'" Database query after: query = "SELECT * FROM products WHERE name = ?" db.execute(query, (name,)) Password hashing before: password_hash = hashlib.md5(password).hexdigest() Password hashing after: salt = bcrypt.gensalt(rounds=12) password_hash = bcrypt.hashpw(password.encode('utf-8'), salt) Model: https://huggingface.co/codelion/Qwen2.5-Coder-0.5B-Instruct-security-grpo-lora Notebook: https://github.com/codelion/ellora/blob/main/Ellora_Recipe_5_Secure_Code_Generation_LoRA.ipynb Repository: https://github.com/codelion/ellora

View all activity

Organizations

liked a dataset 13 days ago

codelion/gemma-3-270m-icm-dpo

Viewer • Updated 2 days ago • 1.11k • 71 • 1

liked a model 15 days ago

google/gemma-3-270m-it

Text Generation • 0.3B • Updated 19 days ago • 162k • 370

liked 2 datasets about 1 month ago

OctoThinker/MegaMath-Web-Pro-Max

Viewer • Updated Jul 6 • 69.2M • 8.33k • 35

codelion/Llama-3.2-1B-Instruct-magpie-tool-calling

Viewer • Updated Jul 18 • 1.2k • 41 • 1

liked a Space about 2 months ago

ThinkSound

Generate audio for a video using captions and descriptions

liked a dataset about 2 months ago

codelion/Qwen3-0.6B-magpie

Viewer • Updated Jul 12 • 735 • 39 • 1

liked a model about 2 months ago

codelion/Qwen3-0.6B-accuracy-recovery-lora

Text Generation • Updated Jul 13 • 14 • 1

liked 2 datasets about 2 months ago

codelion/Qwen3-0.6B-pts-thought-anchors

Viewer • Updated Jul 10 • 148 • 37 • 2

sumuks/essential-web-v1.0-sample-10M

Viewer • Updated Jul 3 • 18.3k • 22 • 1

liked a dataset 2 months ago

allenai/ZebraLogicBench

Viewer • Updated Jul 11, 2024 • 4.26k • 1.86k • 20

liked a model 2 months ago

MemChainAI/adaptive-sentiment-classifier

Text Classification • Updated Jun 25 • 64 • 5

liked a Space 2 months ago

MLX My Repo

Convert and upload Hugging Face models to MLX format

liked 2 datasets 2 months ago

codelion/Qwen3-0.6B-icm

Viewer • Updated Jul 18 • 500 • 19 • 1

TheFinAI/FinCoT

Viewer • Updated Jul 27 • 9.19k • 1.05k • 7

liked 3 datasets 3 months ago

kensho/bizbench

Viewer • Updated Jun 3, 2024 • 19.1k • 232 • 6

AI-Secure/adv_glue

Viewer • Updated Jan 9, 2024 • 738 • 650 • 7

SakanaAI/ALE-Bench

Updated Jun 17 • 5.34k • 9

liked 3 models 3 months ago

google/gemma-3-1b-it

Text Generation • 1.0B • Updated Apr 4 • 3.23M • 595

unsloth/Magistral-Small-2506-unsloth-bnb-4bit

Text Generation • 13B • Updated Jun 10 • 1.75k • 4

mlx-community/Qwen3-0.6B-bf16

Text Generation • 0.6B • Updated Apr 28 • 1.8k • 5