Satya Saurabh Mishra
saurabhmishra9
AI & ML interests
Data Science, Machine Learning, AI etc
Organizations
Prompting and RAG
- Don't Do RAG: When Cache-Augmented Generation is All You Need for Knowledge Tasks
  Paper • 2412.15605 • Published • 2
- Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection
  Paper • 2310.11511 • Published • 78
- Adaptive-RAG: Learning to Adapt Retrieval-Augmented Large Language Models through Question Complexity
  Paper • 2403.14403 • Published • 7
- Compressed Chain of Thought: Efficient Reasoning Through Dense Representations
  Paper • 2412.13171 • Published • 36
LLM Models
- meta-llama/Llama-3.1-8B-Instruct
  Text Generation • 8B • Updated • 13.3M • 4.42k
- meta-llama/Llama-3.2-3B-Instruct
  Text Generation • 3B • Updated • 1.99M • 1.64k
- meta-llama/Llama-3.3-70B-Instruct
  Text Generation • 71B • Updated • 395k • 2.46k
- meta-llama/Llama-4-Scout-17B-16E-Instruct
  Image-Text-to-Text • 109B • Updated • 759k • 1.04k
models
0
None public yet
datasets
0
None public yet