Skip to content
View Peiyang-Song's full-sized avatar

Highlights

  • Pro

Organizations

@ucsb @lean-dojo @psychology-of-AI

Block or report Peiyang-Song

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Peiyang-Song/README.md

Welcome to my Github 👋

I am an incoming Ph.D. student at Carnegie Mellon University (CMU), affiliated with the Language Technologies Institute (LTI) in the School of Computer Science (SCS), and the newly founded Institute for Computer-Aided Reasoning in Mathematics (ICARM). I am working on verifiable reasoning agents at Apodex, mentored by Dr. Kaiyu Yang.

I received my B.S. in Computer Science from California Institute of Technology (Caltech), with a minor in Robotics, advised by Prof. Steven Low and Prof. Günter Niemeyer.

I study machine reasoning, in both formal and informal worlds: how AI systems can derive conclusions, act on them, evolve as tasks drift, and remain reliable when settings become complex.

(1) Formal reasoning. In mathematics, code, and other formalizable domains, verifiability gives us a rare foundation for building dependable reasoning systems. I develop models and infrastructure [LeanDojo] [Lean Copilot] that form a bidirectional loop between formal environments and ML systems, enabling efficient human-AI collaboration in theorem proving [Human-AI Formalization]. Building on this loop, I move beyond static models toward adaptive agents that evolve with growing libraries [LeanAgent] and expanding contexts [LeanProgress].

(2) Informal reasoning. In open-ended natural-language settings, I continue to study how agents adapt [Adaptation] [Steer2Adapt] and generalize [Generalization] [Idiom]. These goals, however, become harder to pursue with a lack of symbolic verifiers or reliable reward signals. One way forward is to develop agentic evaluations that provide rich behavioral signals while remaining meaningful across diverse scenarios [Interactive Evaluation] [Personality Illusion] [Rethinking Psychometrics]. Then at runtime, the complementary challenge is to equip reasoning systems to identify, mitigate, and resist failures as they inevitably arise [Reasoning Failures] [A-Not-B].

Earlier, I also worked on making reasoning systems more efficient through novel architecture design [Delay Space] [DelayNet].

My long-term research goal is to build AI systems whose reasoning is as creative as human intuition and as dependable as formal logic.

You can find more about me and my work from my Personal Website and Google Scholar Profile.

Should we share any common interest, feel free to email me at psong2@andrew.cmu.edu.

[Last Updated: June 2026]

Pinned Loading

  1. lean-dojo/LeanCopilot lean-dojo/LeanCopilot Public

    LLMs as Copilots for Theorem Proving in Lean

    C++ 1.3k 126

  2. lean-dojo/LeanDojo lean-dojo/LeanDojo Public

    Tool for data extraction and interacting with Lean programmatically.

    Python 812 116

  3. lean-dojo/ReProver lean-dojo/ReProver Public

    Retrieval-Augmented Theorem Provers for Lean

    Python 327 72

  4. Awesome-LLM-Reasoning-Failures Awesome-LLM-Reasoning-Failures Public

    Repo for "Large Language Model Reasoning Failures"

    205 23

  5. pat-jj/Awesome-Adaptation-of-Agentic-AI pat-jj/Awesome-Adaptation-of-Agentic-AI Public

    Repo for "Adaptation of Agentic AI"

    669 60

  6. psychology-of-AI/Personality-Illusion psychology-of-AI/Personality-Illusion Public

    [ICML26] The Personality Illusion: Revealing Dissociation Between Self-Reports & Behavior in LLMs.

    Jupyter Notebook 73 6