Former Staff Research Scientist at ByteDance (Seed Multimodal & World Model team). Co-author of LLaVA-NeXT, LLaVA-OneVision, and LLaVA-Video. Formerly at Microsoft Research.