Xianyu39/README.md

Hi, I'm Westwoods 👋

🎓 Master's student in Software Engineering
🔬 Researching LLM reasoning enhancement and reliability
⚙️ Interested in LLM systems, RAG architectures, and inference infrastructure

I enjoy building systems that make large language models more reliable, scalable, and efficient.


🔧 Technical Interests

  • LLM Systems & Infrastructure

    • vLLM inference optimization
    • high-throughput LLM serving
    • GPU inference platforms
  • Retrieval-Augmented Generation (RAG)

    • GraphRAG architectures
    • knowledge graph retrieval
    • multi-hop reasoning
  • AI Engineering

    • distributed inference services
    • experiment automation
    • ML system performance optimization

🚀 Selected Projects

GraphRAG Vertical QA System

A domain-specific QA system for food health recommendations.

Key features:

  • Knowledge graph built from collected food standard data
  • GraphRAG multi-hop retrieval
  • Embedding + Rerank entity matching pipeline
  • Significant improvement in entity matching accuracy
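
As a rough illustration of the Embedding + Rerank idea above, here is a minimal two-stage sketch. It is not the project's actual code: the character-bigram `embed` function stands in for a real embedding model, `rerank_score` (based on `difflib`) stands in for a rerank model, and the entity list is made up for the example.

```python
import math
from collections import Counter
from difflib import SequenceMatcher


def embed(text):
    """Toy embedding: character-bigram counts (assumed stand-in for an embedding model)."""
    t = text.lower()
    return Counter(t[i:i + 2] for i in range(len(t) - 1))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def rerank_score(mention, candidate):
    """Toy reranker: sequence similarity (assumed stand-in for a rerank model)."""
    return SequenceMatcher(None, mention.lower(), candidate.lower()).ratio()


def match_entity(mention, entities, top_k=3):
    # Stage 1: cheap vector recall keeps only the top_k closest entities.
    query_vec = embed(mention)
    recalled = sorted(entities, key=lambda e: cosine(query_vec, embed(e)), reverse=True)[:top_k]
    # Stage 2: the costlier scorer reorders just the recalled candidates.
    return max(recalled, key=lambda e: rerank_score(mention, e))


# Hypothetical knowledge-graph entity names, for illustration only.
entities = ["whole milk", "skim milk", "milk chocolate", "soy sauce", "brown rice"]
best = match_entity("skimmed milk", entities)
```

The split matters because the first stage can scan the whole entity set cheaply, while the second stage spends its extra cost on only a handful of candidates.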

Tech stack:

Python · FastAPI · Neo4j · Milvus · RAG


LLM Inference Platform for Research Lab

Designed and maintained a shared LLM inference platform for 30+ researchers.

Highlights:

  • Centralized inference service architecture
  • vLLM + Ollama model serving
  • Prometheus monitoring and alert system
  • Optimized 32B model inference using 8-bit quantization + tensor parallelism
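
To show why 8-bit quantization and tensor parallelism help a 32B model fit on GPUs, here is a back-of-the-envelope estimate of weight memory. It is only a sketch: it counts model weights alone (no KV cache, activations, or runtime overhead), assumes weights shard evenly across GPUs, and uses a tensor-parallel degree of 4 as a purely hypothetical value that is not taken from the platform described above.

```python
def weight_memory_gb(num_params, bits_per_param, tensor_parallel_size=1):
    """Approximate per-GPU memory for model weights only, in GB (10^9 bytes)."""
    total_bytes = num_params * bits_per_param / 8
    return total_bytes / tensor_parallel_size / 1e9


params_32b = 32e9  # 32B parameters

fp16_single = weight_memory_gb(params_32b, 16)  # 16-bit weights on one GPU
int8_single = weight_memory_gb(params_32b, 8)   # 8-bit weights on one GPU
# Hypothetical tensor-parallel degree of 4 (an assumption for illustration).
int8_tp4 = weight_memory_gb(params_32b, 8, tensor_parallel_size=4)

print(fp16_single, int8_single, int8_tp4)
```

Under these assumptions the weights take about 64 GB at 16-bit, 32 GB at 8-bit, and 8 GB per GPU once split four ways, leaving more room for the KV cache and larger batches.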

Performance improvement:

  • Throughput increased from ~60 TPS to 600+ TPS

Tech stack:

vLLM · Docker · Linux · Prometheus · GPU servers


📚 Research Interests

Currently exploring methods to improve the reliability of LLM reasoning using structured knowledge.

Topics of interest:

  • reasoning-aware retrieval
  • proposition graphs
  • knowledge-grounded reasoning chains

🏆 Achievements

  • National Third Prize — Challenge Cup Competition
  • MCM/ICM Mathematical Contest in Modeling — Honorable Mention
  • Annual Author (technology writing platform)

📫 Contact

Email: fanghejin@qq.com
GitHub: https://github.com/Xianyu39

Popular repositories

  1. obsidian-just-write Public

    An article-writing application customized on top of Obsidian.

    54 7

  2. Projects-Kanban Public

    A template library for project management in Obsidian.

    34 2

  3. obsidian-knowledge-tree Public

    Python 5

  4. Xianyu39 Public

    Config files for my GitHub profile.

    1

  5. SCP-Management-System Public

    An ordinary assignment for a university programming course.

    C++

  6. Xianyu-LearningDS Public

    This repository includes some data structure demos. Each one has its own introduction file, 'ReadMe.md'. Have a nice trip.

    Makefile