Change the repository type filter
All
Repositories list
20 repositories
Agentic-ADK
PublicOvis
PublicA novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.Pixelle-MCP
PublicAn Open-Source Multimodal AIGC Solution based on ComfyUI + MCP + LLM https://pixelle.aiCHATS
PublicMarco-Voice
Public- Awesome Unified Multimodal Models
Marco-Bench-MIF
Public- An unified model that seamlessly integrates multimodal understanding, text-to-image generation, and image editing within a single powerful framework.
- TeEFusion: Blending Text Embeddings to Distill Classifier-Free Guidance (ICCV 2025)
UNIC-Adapter
PublicParrot
Public🎉 The code repository for "Parrot: Multilingual Visual Instruction Tuning" in PyTorch.Marco-o1
PublicTG-LLaVA
PublicWings
PublicThe code repository for "Wings: Learning Multimodal LLMs without Text-only Forgetting" [NeurIPS 2024]M3Bench
PublicMeissonic
PublicAutoGPTQ
Public