RESUME
Zhichao Zhao
AI Application Developer · Agent Systems Engineer · LLM/RAG Builder
Email: 220246355@seu.edu.cn
Phone: +86 188xxxx7217
---
About
AI-focused software engineer and graduate researcher with hands-on experience in multi-agent systems, retrieval-augmented generation (RAG), LLM alignment, and domain-specific intelligent applications. My work centers on building AI systems that are not only capable, but also explainable, controllable, and production-oriented.
Currently pursuing an M.S. in Software Engineering at Southeast University after recommendation-based admission. I am especially interested in AI application development, agent systems, LLM engineering, and intelligent product prototyping.
---
Education
Southeast University
M.S. in Software Engineering
Sep 2024 – Jun 2027
- Admitted through recommendation-based graduate admission
- Relevant coursework: Algorithms Design and Analysis, Pattern Recognition, Digital Image Processing
China University of Mining and Technology
B.Eng. in Data Science and Big Data Technology
Sep 2020 – Jun 2024
- Relevant coursework: Deep Learning, Machine Learning, Recommender Systems, Big Data Storage and Management
---
Honors
- Co-author of an SCI-indexed paper
- International Second Prize, Mathematical Contest in Modeling (MCM/ICM)
- Provincial Second Prize, “Black Tech” Special Competition, 18th Challenge Cup National Student Extracurricular Academic and Technology Competition
---
Projects
Intelligent Travel Planning Multi-Agent System
Tech Stack: LangGraph, LangChain, Streamlit, RAG, ChromaDB, MCP, SQLite
Built a production-style multi-agent travel planning system for complex user requests, tool orchestration, and personalized multi-turn interaction.
- Architected an explainable multi-agent workflow with five coordinated roles: Main Agent, Planner, Executor, Summarizer, and Feedback Agent
- Designed a dual execution framework with Simple ReAct for lightweight queries and Plan-then-Execute for multi-city, constraint-heavy scenarios
- Combined Qwen for autonomous tool usage with DeepSeek-R1 for structured planning and execution
- Built a hybrid RAG plus MCP architecture integrating local knowledge retrieval with real-time external tools
- Improved Hit@5 from 70% to 84% on a self-built travel evaluation dataset using DashScope Embeddings, ChromaDB, and Cross-Encoder reranking
- Implemented TXT/PDF ingestion, chunking, persistence, similarity retrieval, and MD5-based file deduplication
- Designed a four-layer memory framework covering short-term state memory, long-term user preference memory, persistent session history, and context compression
- Reduced average token consumption in multi-turn conversations by 30%
Intelligent Medical QA and Assisted Triage System
Tech Stack: DeepSeek-R1-Distill-Qwen-1.5B, LLaMA-Factory, LoRA, DPO, SearXNG, ChromaDB
Developed a domain-specific medical assistant system focused on safe generation, retrieval grounding, and preference alignment.
- Designed a full data cleaning pipeline for the Huatuo-26M corpus using regex filtering, heuristic rules, and perplexity-based quality screening
- Curated 100,000 high-quality multi-turn medical QA samples for supervised fine-tuning
- Distilled 2,000 pairwise preference samples and defined an annotation framework favoring clarity, structured output, and medical disclaimers
- Conducted two-stage fine-tuning with LoRA-based SFT and DPO alignment on a DeepSeek-1.5B backbone
- Achieved a 90% safe refusal rate in high-risk diagnosis and treatment-inducing scenarios
- Built a Web-RAG pipeline using AnythingLLM, bge-large-zh, ChromaDB, and SearXNG to integrate up-to-date medical references and reduce hallucination
---
Research
Heart Rate Prediction from Millimeter-Wave Radar Vital Sign Signals
First Author
Manuscript pending acceptance at Biomedical Signal Processing and Control (CAS Q2)
Tech Stack: PyTorch, Temporal Attention, Bi-LSTM, TCN, Millimeter-Wave Radar Signal Processing
- Designed ERNet, a deep spatiotemporal architecture for robust heart rate prediction under noisy sequential conditions
- Combined TimeDistributed CNN for spatial representation learning with Bi-LSTM for long-range temporal dependency modeling
- Introduced a temporal attention mechanism to suppress abrupt local artifacts and improve robustness
- Proposed a PCA-based adaptive fusion strategy to address multi-channel imbalance
- Built a complete end-to-end PyTorch pipeline for training, inference, and evaluation
- Achieved 97.6% prediction accuracy, outperforming baseline methods
---
Technical Skills
Programming: Python, C++
Frameworks and Tools: PyTorch, LangChain, LangGraph, ChromaDB, LLaMA-Factory, Streamlit
LLM Engineering: RAG, LoRA, DPO, MCP, multi-agent orchestration, retrieval systems, preference alignment, context compression
Systems: Linux, MySQL, SQLite
English: CET-6
---
Career Interests
- AI Application Development
- Agent System Development
- LLM Engineering
- Intelligent Product Prototyping
- RAG and Tool-Augmented AI Systems
---
Contact
Email: 220246355@seu.edu.cn
Phone: +86 188xxxx7217