43 55 176

Di Zhang

di-zhang-fdu

https://scholar.google.com/citations?user=vxAO250AAAAJ&hl=en

AI & ML interests

AI4Chem, LLM, Green LLM

Recent Activity

updated a model about 8 hours ago

di-zhang-fdu/dev-8-31-1.7b-siglip2-patch14-sink-sliding

published a model about 8 hours ago

di-zhang-fdu/dev-8-31-1.7b-siglip2-patch14-sink-sliding

updated a model about 13 hours ago

di-zhang-fdu/dev-8-14-stage2-step900

View all activity

Organizations

upvoted a paper 4 days ago

CMPhysBench: A Benchmark for Evaluating Large Language Models in Condensed Matter Physics

Paper • 2508.18124 • Published 5 days ago • 45

upvoted an article 10 days ago

Article

NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks

and 4 others •

19 days ago

• 68

upvoted a paper 12 days ago

AdaLomo: Low-memory Optimization with Adaptive Learning Rate

Paper • 2310.10195 • Published Oct 16, 2023 • 3

upvoted 2 papers 17 days ago

IAG: Input-aware Backdoor Attack on VLMs for Visual Grounding

Paper • 2508.09456 • Published 18 days ago • 7

Mol-R1: Towards Explicit Long-CoT Reasoning in Molecule Discovery

Paper • 2508.08401 • Published 19 days ago • 42

upvoted a collection 21 days ago

VLM-R1

Collection

Multimodal Reasoning Dataset for Large Scale Training with DeepSeek-R1 thoughts style • 18 items • Updated Apr 14 • 2

upvoted 2 collections 2 months ago

RefGPT Datasets

Collection

A large-scale dialogue dataset with references. • 6 items • Updated May 17, 2024 • 4

RLVR

Collection

Model and data for 'Expanding RL with Verifiable Rewards Across Diverse Domains' • 3 items • Updated Mar 31 • 13

upvoted an article 2 months ago

Article

Welcome the NVIDIA Llama Nemotron Nano VLM to Hugging Face Hub

and 11 others •

Jun 27

• 28

upvoted 5 papers 3 months ago

Control-R: Towards controllable test-time scaling

Paper • 2506.00189 • Published May 30 • 6

AV-Reasoner: Improving and Benchmarking Clue-Grounded Audio-Visual Counting for MLLMs

Paper • 2506.05328 • Published Jun 5 • 20

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published May 30 • 135

MOOSE-Chem3: Toward Experiment-Guided Hypothesis Ranking via Simulated Experimental Feedback

Paper • 2505.17873 • Published May 23 • 31

Visual Planning: Let's Think Only with Images

Paper • 2505.11409 • Published May 16 • 57

upvoted 5 papers 4 months ago

AlignRAG: An Adaptable Framework for Resolving Misalignments in Retrieval-Aware Reasoning of RAG

Paper • 2504.14858 • Published Apr 21 • 4

Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models

Paper • 2504.15271 • Published Apr 21 • 66

upvoted a paper 5 months ago

S1-Bench: A Simple Benchmark for Evaluating System 1 Thinking Capability of Large Reasoning Models

Paper • 2504.10368 • Published Apr 14 • 21

Di Zhang

AI & ML interests

Recent Activity

Organizations

di-zhang-fdu's activity

NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks

Welcome the NVIDIA Llama Nemotron Nano VLM to Hugging Face Hub