PhysicsWallahAI
/

Aryabhata-1.0

@@ -1,5 +1,11 @@
 ---
 license: cc-by-nc-4.0
 tags:
 - small-language-model
 - jee
@@ -12,14 +18,8 @@ tags:
 - mathematics
 - ai4education
 - physicswallah
-language:
-- en
-model_name: PhysicsWallah/Aryabhata-1.0
 model_creator: Physics Wallah AI Research
 model_type: Causal decoder-based model
-base_model: Qwen/Qwen2.5-Math-7B
-pipeline_tag: text-generation
-library_name: transformers
 ---
 # Aryabhatta 1.0 : An exam-focused language model for JEE Math
@@ -146,7 +146,8 @@ model = AutoModelForCausalLM.from_pretrained(model_id)
 # Define stop strings
-stop_strings = ["<|im_end|>", "<|end|>", "<im_start|>", "⁠```python\n", "⁠<|im_start|>", "]}}]}}]"]
 def strip_bad_tokens(s, stop_strings):
     for suffix in stop_strings:
@@ -192,7 +193,8 @@ llm = LLM(model="PhysicsWallahAI/Aryabhata-1.0")
 query = 'Find all the values of \\sqrt[3]{1}'
 messages = [{'role': 'system', 'content': 'Think step-by-step; put only the final answer inside \\boxed{}.'},
             {'role': 'user', 'content': query}]
-sampling_params = SamplingParams(temperature=0.0, max_tokens=4*1024, stop=["<|im_end|>", "<|end|>", "<im_start|>", "⁠```python\n", "⁠<|im_start|>", "]}}]}}]"])
 # Run inference
 results = llm.chat(messages, sampling_params)
@@ -203,7 +205,7 @@ print(results[0].outputs[0].text.strip())
 ---
-Read more about Aryabhata 1.0 in our [Technical Report](https://arxiv.org/abs/2508.08665)
 ---
@@ -226,4 +228,5 @@ If you use this model, please cite:
   author = {Physics Wallah AI Research},
   year = {2025},
   note = {\url{https://huggingface.co/PhysicsWallahAI/Aryabhata-1.0}},
-}

 ---
+base_model: Qwen/Qwen2.5-Math-7B
+language:
+- en
+library_name: transformers
 license: cc-by-nc-4.0
+model_name: PhysicsWallah/Aryabhata-1.0
+pipeline_tag: text-generation
 tags:
 - small-language-model
 - jee
 - mathematics
 - ai4education
 - physicswallah
 model_creator: Physics Wallah AI Research
 model_type: Causal decoder-based model
 ---
 # Aryabhatta 1.0 : An exam-focused language model for JEE Math
 # Define stop strings
+stop_strings = ["<|im_end|>", "<|end|>", "<im_start|>", "⁠```python
+", "⁠<|im_start|>", "]}}]}}]"]
 def strip_bad_tokens(s, stop_strings):
     for suffix in stop_strings:
 query = 'Find all the values of \\sqrt[3]{1}'
 messages = [{'role': 'system', 'content': 'Think step-by-step; put only the final answer inside \\boxed{}.'},
             {'role': 'user', 'content': query}]
+sampling_params = SamplingParams(temperature=0.0, max_tokens=4*1024, stop=["<|im_end|>", "<|end|>", "<im_start|>", "⁠```python
+", "⁠<|im_start|>", "]}}]}}]"])
 # Run inference
 results = llm.chat(messages, sampling_params)
 ---
+Read more about Aryabhata 1.0 in our [Technical Report](https://arxiv.org/abs/2508.08665) and find the code on [GitHub](https://github.com/PhysicsWallahAI/Aryabhata-1.0).
 ---
   author = {Physics Wallah AI Research},
   year = {2025},
   note = {\url{https://huggingface.co/PhysicsWallahAI/Aryabhata-1.0}},
+}
+```