Ruiyang's Homepage

Ruiyang (Ryan) Sun

Peking University; AI Researcher

Avatar

: +86 18609715159

For Work or Collaboration:

Personal Life and ๐Ÿถ Adorable Puppy:

I'm Ruiyang Sun (/ruฬฏeษชฬฏ jษ‘ล‹ swษ™n/, or Ryan Sun; ๅญ™็ฟ้˜ณ). I'm a ๐ŸŽ“ senior undergraduate student pursuing a double major in ๐Ÿงฒ Physics and ๐Ÿค– Artificial Intelligence (AI) at ๐Ÿ›๏ธ Peking University.

I have previously worked in the areas of ๐Ÿ”’ Safe Reinforcement Learning (Safe RL) and ๐Ÿค–๐Ÿงญ LLM Alignment, under the guidance of Prof. Yaodong Yang at the PKU Pair Lab.

Currently, my research is focused on ๐ŸŒฑ Emergent Socio-Dynamic Behavior and ๐Ÿงฉ Alignment Issues in ๐Ÿค–๐Ÿ‘ฅ AI-Human Societies, particularly focusing on advanced AI systems such as ๐Ÿ—ฃ๏ธ Large Language Models (LLMs), ๐Ÿ–ผ๏ธ Large Multimodal Models (LMMs), and ๐Ÿค–๐Ÿ’ผ LLM-powered Autonomous Agents. I aim for my research to contribute to the safer, more harmonious, and dignified integration of ๐Ÿค– AI systems into ๐Ÿ‘จโ€๐Ÿ‘ฉโ€๐Ÿ‘งโ€๐Ÿ‘ฆ human society. Here are some of the key research questions I'm exploring:

  1. When ๐Ÿค– AI systems are treated as human-like entities in AI-human mixed ecosystems, can we observe the ๐ŸŒŸ emergence of human-like socio-dynamic behaviors from these AI systems?

  2. In what ways do the behaviors of ๐Ÿค– AI systems diverge from those of ๐Ÿ‘ฅ humans, and how can these distinctions inform our understanding of human-AI ๐Ÿค interactions?

  3. How can emergent behaviors (e.g., ๐Ÿค social learning, ๐Ÿค— cooperation) enhance the intelligence of ๐Ÿค– AI systems and the collective ๐Ÿง  intelligence of AI-human societies?

  4. What forms of โš ๏ธ misalignment might arise between ๐Ÿ‘ฅ human and ๐Ÿค– AI systems in AI-human mixed ecosystems, and what strategies can be used to ๐Ÿ› ๏ธ mitigate these misalignments effectively?

I believe that to develop more ๐Ÿค– intelligent and ethical AI systems, we need to draw insights not only from ๐Ÿ’ป Computer Science but also from fields like ๐Ÿง  Psychology, ๐Ÿ‘ฅ Sociology, and ๐Ÿง  Cognitive Science. I welcome discussions from people with diverse perspectives! ๐ŸŒ

Besides research, I'm also a passionate ๐ŸŽ Apple developer, and I am currently working on creating a new ๐Ÿค– AI-powered teamwork research toolkit. I'd love to discuss or collaborate on this with anyone interested! ๐Ÿค

Selected Publications

2023
Safe RLHF: Safe Reinforcement Learning from Human Feedback
Safe RLHF: Safe Reinforcement Learning from Human Feedback

Spotlight Presentation; Github 1.1k+ stars

Josef Dai*, Xuehai Pan*, Ruiyang Sun*, Jiaming Ji*, Xinbo Xu, Mickel Liu, Yizhou Wang, Yaodong Yang

The Twelfth International Conference on Learning Representations (ICLR 2024)

2023
BEAVERTAILS: towards improved safety alignment of llm via a human-preference dataset
BEAVERTAILS: towards improved safety alignment of llm via a human-preference dataset

Jiaming Ji, Mickel Liu, Juntao Dai, Xuehai Pan, Chi Zhang, Ce Bian, Boyuan Chen, Ruiyang Sun, Yizhou Wang, Yaodong Yang

Proceedings of the 37th International Conference on Neural Information Processing Systems (NeurIPS 2023)

2023
OmniSafe: An Infrastructure for Accelerating Safe Reinforcement Learning Research
OmniSafe: An Infrastructure for Accelerating Safe Reinforcement Learning Research

Github 900+ stars

Jiaming Ji, Jiayi Zhou, Borong Zhang, Juntao Dai, Xuehai Pan, Ruiyang Sun, Weidong Huang, Yiran Geng, Mickel Liu, Yaodong Yang

Journal of Machine Learning Research

BibTeX citation copied to clipboard!
ยฉ Copyright 2024-2025 Ruiyang Sun & GPT-4o & Cursor AI. Last updated: 4/3/2025.