Yaxin9Luo

Luo-Yaxin Yaxin9Luo

ML PhD at MBZUAI

12 followers · 44 following

MBZUAI
Abu Dhabi
https://yaxin9luo.github.io/
https://scholar.google.com/citations?user=tEaSCzYAAAAJ&hl=en

Yaxin9Luo/README.md

My long-term goal is to develop intelligent machines capable of perceiving, understanding, and creating multimodal content, such as videos.

Pinned

MetaAgentX/OpenCaptchaWorld (Public)

The first web-based benchmark and platform for evaluating the visual reasoning and interaction capabilities of MLLM-powered agents through diverse and dynamic CAPTCHA puzzles.

JavaScript · 36 stars

Gamma-MOD (Public)

[ICLR 2025] γ-MOD: Mixture-of-Depth Adaptation for Multimodal Large Language Models

Python · 38 stars · 3 forks

APL (Public)

Python · 4 stars · 1 fork

De-Diffusion (Public)

My implementation of the model described in the paper "De-Diffusion Makes Text a Strong Cross-Modal Interface".

Python · 9 stars