GamingAgent
โญ 843LLM/VLM gaming agents and model evaluation through games. Evaluates long-horizon reasoning, memory & perception in Doom, Sokoban, Tetris, and Pokรฉmon Red.
View Project โ
Hi, I'm
Research Assistant at UCSD Hao AI Lab
M.S. Computer Science Student โข San Diego, CA
class Researcher:
def __init__(self):
self.focus = [
"LLM Systems",
"Agent Evaluation",
"GPU Infrastructure"
]
self.tools = [
"vLLM", "SGLang",
"NeMo RL", "Ray"
]
def build(self):
return "๐ Innovation"
"A journey of a thousand miles begins with a single step." โ Confucius
I work on LLM systems, evaluation, and GPU-accelerated ML infrastructure. Currently an M.S. Computer Science student at UC San Diego (GPA: 4.0), previously B.A. in Computer Science & Applied Mathematics from UC Berkeley (GPA: 3.86).
LLM/VLM gaming agents and model evaluation through games. Evaluates long-horizon reasoning, memory & perception in Doom, Sokoban, Tetris, and Pokรฉmon Red.
View Project โBenchmark for scientific correctness in text-to-video models. Evaluates physics & chemistry concepts using VLM-as-Judge scoring.
View Project โBuild RL environments for LLM training. Integrating Sokoban & Tetris for scalable RL training, reward profiling, and GRPO.
View Project โLLM environment framework for interactive evaluation. Standardized interfaces for game-based agent testing.
View Project โFeel free to reach out for collaborations, discussions, or just to say hi!
๐ง Recruiters: Feel free to reach out at yixinhuang48@gmail.com
๐ฌ Open an issue or discussion on any of my repositories!