hhh2210/README.md

Larry Hao (Zhuoyuan Hao)

I work on LLM reasoning, reinforcement learning, and practical AI systems. My current research focus is how reasoning models use prompt echoes, attention refocusing, probabilistic anchors, and inference-time reminders during multi-step reasoning.

Featured Research

Echoes as Anchors: Probabilistic Costs and Attention Refocusing in LLM Reasoning

ICLR 2026 paper and codebase on LLM reasoning. The project studies Echo of Prompt behavior, in which a model repeats or rephrases the original user question inside its reasoning trajectory. The work asks whether prompt echoes are only supervised fine-tuning artifacts, or whether they can act as anchors for attention refocusing and better reasoning.

What I Build

  • LLM reasoning and evaluation: mechanisms, prompting, probes, and analysis tools for reasoning trajectories.
  • Agent and developer tooling: practical systems around AI coding, token-efficient command output, local model workflows, and Codex/Claude-style agent environments.
  • Applied AI products: research ideas that can survive product constraints, user growth, and real usage.

Selected Projects

  • echoes-as-anchors - code for the ICLR 2026 paper on Echo of Prompt, probabilistic costs, and attention refocusing in LLM reasoning.
  • rtk - a Rust CLI proxy that reduces LLM agent token consumption by filtering noisy command output before it enters model context.
  • auto-skill - a few-shot skill induction prototype for turning examples into reusable AI-agent skills.
  • CodexBar - a macOS menu bar utility for OpenAI Codex and Claude Code usage visibility.
  • Date Match - zero-budget cold start of a Gen Z relationship-matching product, reaching 100K users in the first 10 days and 170K+ completed tests through organic sharing.

Collaboration

I am interested in LLM reasoning, reinforcement learning, agent systems, evaluation, and applied AI products. Reach out for research collaboration, prototype building, or practical AI system work.

Pinned

  1. echoes-as-anchors (Public)

    ICLR 2026 code for Echoes as Anchors: Echo-of-Prompt, attention refocusing, and probabilistic analysis of LLM reasoning.

    Python · 41 stars · 5 forks

  2. Live-GalGame/LiveGalGame (Public)

    Fixes the real-world bug where conversations with the opposite sex come with no dialogue options.

    JavaScript · 2.5k stars · 74 forks

  3. papercopilot/paperlists (Public)

    Processed / Cleaned Data for Paper Copilot

    Python · 928 stars · 46 forks

  4. House-prices-regression (Public)

    Top 0.7% Kaggle House Prices regression solution using XGBoost, Optuna, and practical feature engineering.

    Python 1