About
I am HE ZHU (朱赫) (CV), an M.Phil. student in Smart City and Big Data at Peking University, advised by Prof. Wenjia Zhang and Prof. Guanhua Chen. I previously received my B.E. in Computer Science from the Southern University of Science and Technology, where I was advised by Prof. Zipei Fan and Prof. Xuan Song. My research centers on large language models for urban intelligence, focusing on instruction data synthesis, supervised fine-tuning (SFT) mechanisms, and domain-specific deployment. I lead the PlanGPT series and am currently a research intern at Microsoft Research Asia (GENAI Group).
My research interests broadly lie in the areas of Machine Learning (ML), Natural Language Processing (NLP), and Large Language Models (LLM). More specifically, I focus on data-centric LLM research and domain-specific deployment, including:
- (i) High-Quality Data Selection: AlignDiff, InstructDiff
- (ii) Scalable Data Synthesis: FANNO, Tag-Instruct
- (iii) Fine-Tuning Mechanisms: ASFT
- (iv) Domain-Specific Foundation Models (Urban Intelligence): PlanGPT, PlanGPT-VL
🔥 News
-
Jan 2026
ASFT was accepted to ICLR 2026.
-
Oct 2025
PlanGPT-VL was accepted to EMNLP Industry 2025.
-
Aug 2025
PlanGPT was selected as the oral paper of ACL Industry 2025. (Top 1%)
-
May 2025
Three First-Author papers (FANNO, Tag-Instruct and PlanGPT) were accepted by ACL 2025.
Education
-
Peking University — M.S. Smart City & Big Data, Advisor: Prof. Wenjia Zhang (Sep 2024 – Present). -
Southern University of Science and Technology — B.E. Computer Science, Advisor: Prof. Xuan Song (Sep 2020 – Jul 2024, GPA 90.2/100, top 10%).
Projects
I am the first author and technical lead of PlanGPT, the first systematic study of LLMs for urban planning. The suite includes:
Learn more: plangpt.github.io
- PlanGPT — Text LLM for planning scheme generation, powering Urban-Thinking, CoPlanner, and AuditPlanner.
- PlanGPT-VL — Vision-language model for planning maps with hallucination control and hybrid training. Outperforms larger general VLMs on PlanBench-V.
- PlanGPT-R1 — Reasoning-enhanced model for long-form planning documents.
- UP-Bench & PlanBench-V — Benchmarks covering text, vision, and multimodal planning tasks.
First Author Paper
-
Anchored Supervised Fine-Tuning ICLR 2026
-
PlanGPT: Enhancing Urban Planning with Tailored Language Model ACL Industry 2025 · Oral
-
Augmenting High-Quality Instruction Data with Open-Sourced LLMs Only ACL 2025 (Findings)
-
Tag-Instruct: A Scalable Framework for Controlled Instruction-Tuning Data Synthesis ACL 2025 (Findings)
-
PlanGPT-VL: Vision-Language Model for Urban Planning Maps EMNLP Industry 2025
-
Towards Fair and Comprehensive Evaluation of Routers in Collaborative LLM Systems ACL 2026 Submission
-
InstructDiff: Domain-Adaptive Data Selection via Differential Entropy for Efficient LLM Fine-Tuning ACL 2026 Submission
-
AlignDiff: Exploiting Model-Intrinsic Information for Better Data Selection ICML 2026 Submission
Experience
-
Microsoft Research Asia · GENAI Group — Research Intern, Prof. Li Dong (Oct 2025 – Present). -
Shanghai AI Laboratory · OpenData Lab — Researcher, foundation language models (May 2025 – Oct 2025). -
SenseTime · Foundation Language Model Center — Researcher, foundation models (Jun 2024 – Sep 2024). -
LocationMind, Tokyo — Project Assistant, autonomous driving algorithms (Jun 2023 – Dec 2023).
Research Appointments
-
SUSTech-NLP Group — Research Assistant, Prof. Guanhua Chen (Nov 2023 – Present). -
Center for Spatial Information Science, UTokyo — Research Assistant (Feb 2022 – May 2022; Jul 2023 – Sep 2023). -
School of Computing, NUS — Visiting Student (May 2022 – Jul 2022).
Honors & Service
- Outstanding Graduate, Dept. of CS, SUSTech (Top 5%, 2024)
- Outstanding Graduate, SUSTech (Top 5%, 2024)
- Outstanding Thesis, SUSTech (Top 10%, 2024)
- Annual Outstanding Student, SUSTech (2021, 2022, 2023)
Service: Reviewer, ARR (ACL Rolling Review).
Leadership: Youth League Secretary, Peking University — led the branch to Five-Star recognition.