Anh Nguyen profile photo

Anh Nguyen / Aengus

Incoming ECE Ph.D. student at Johns Hopkins University in Fall 2026. Currently, I am a Predoctoral Research Resident at Qualcomm AI Research, advised by Principal Scientist Dr. Anh Tran.
Summer 2027: Open to research internships; interested in collaborations on generative models across pre-training, distillation, and controllable multimodal applications.
Contact: aengus.ng8@gmail.com
I work on efficient, scalable, and controllable generative modeling as a principled route to machine intelligence beyond human levels.
Research Statement
My long-term goal is to build systems capable of understanding, reasoning, planning, and acquiring physical intuition about the world.

My current research focuses on Efficient & Robust Multimodal Intelligence, aiming to resolve the trade-offs in foundation models through two core pillars: (1) Efficiency & Scalability to minimize training and inference costs, and (2) Robustness & Controllability to enforce alignment and reliability.

Most recently, my work on One-step Generative Modeling & Distillation (NeurIPS & ICCV 2025) collapses iterative inference into real-time, high-fidelity synthesis, while my research on Multimodal Representation (ICCV 2025) leverages internal semantics for zero-shot, fine-grained controllability.

Research Readiness: I can independently lead the entire research lifecycle for top-tier conferences, driving projects from problem formulation and experimentation to final publication.
Outside the Lab
I enjoy the combination of mathematics, coding, and intuition. Away from the keyboard, you can find me clearing my mind on long-distance runs 🏃‍♂️

news

Jun 18, 2026 Cross-Space Distillation: Teaching One-Step Students with Modern Diffusion Teachers got accepted at ECCV. This work removes a hidden constraint in fast diffusion distillation: Teacher and Student no longer need to live in the same latent space. We formalize Cross-Space Distillation and introduce Bridge, a lightweight latent-space interface that makes standard one-step distillation possible across mismatched resolutions, VAEs, architectures, and diffusion/flow paradigms.
Feb 22, 2026 Anti-I2V: Safeguarding your photos from malicious image-to-video generation got accepted at CVPR. This paper introduces a novel defense against unauthorized human image-to-video generation. Instead of relying on the standard RGB space, Anti-I2V optimizes noise in both the L*a*b* and frequency domains to improve robustness and target salient pixels. It introduces two tailored training objectives: Internal Representation Collapse (IRC) and Internal Representation Anchor (IRA). Together, these effectively degrade temporal coherence and generation fidelity to prevent model misuse.
Jan 26, 2026 Revisit Visual Prompt Tuning: The Expressiveness of Prompt Experts got accepted at ICLR. This work reframes VPT through a Mixture-of-Experts lens: prompts are not just tokens, but new experts injected into attention. We show that standard VPT is limited because these prompt experts are static, and introduce VAPT, which turns them into input-adaptive experts with stronger expressiveness, fewer parameters, and optimal sample efficiency.
Oct 6, 2025 🏆 I am honored to receive the Outstanding Resident in Research and Applied Demo Award 2025! The award is part of the 2025 Recognition Awards from the Qualcomm AI Residency Program, which honors “the exceptional achievements of our residents this year.”
Sep 18, 2025 Improved Training Technique for Shortcut Models got accepted at NeurIPS. This paper tackle the five core issues that held shortcut models back: the hidden flaw of compounding guidance, inflexible fixed guidance, frequency bias, divergent self-consistency, and curvy flow trajectories. Our method achieves state-of-the-art FID scores, making shortcut models a viable class of generative models capable of one-step, few-step, and multi-step sampling.

selected publications [full list]

(*) denotes equal contribution
  1. ECCV
    Cross-Space Distillation: Teaching One-Step Students with Modern Diffusion Teachers
    Anh Nguyen*, Ngan Nguyen*, Duc Vu*, Trung DaoViet Nguyen, Quan Dao, Kien Nguyen, Chi Tran, Phong Nguyen, Khoi NguyenCuong Pham, Dimitris Metaxas, Vishal M Patel, and Anh Tran
    In European Conference on Computer Vision, 2026
  2. NeurIPS
    Improved Training Technique for Shortcut Models
    Anh Nguyen*, Viet Nguyen*, Duc Vu, Trung Dao, Chi Tran, Toan Tran, and Anh Tran
    In The Thirty-nine Annual Conference on Neural Information Processing Systems, 2025
  3. CVPR
    Anti-I2V: Safeguarding your photos from malicious image-to-video generation
    Duc Vu, Anh Nguyen, Chi Tran, and Anh Tran
    In IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2026
  4. ICLR
    Revisit Visual Prompt Tuning: The Expressiveness of Prompt Experts
    Minh Le*, Anh Nguyen*, Huy Nguyen, Chau Nguyen, Anh Tran, and Nhat Ho
    In International Conference on Learning Representations, 2026
  5. ICCV
    Supercharged One-step Text-to-Image Diffusion Models with Negative Prompts
    Viet Nguyen*, Anh Nguyen*, Trung DaoKhoi NguyenCuong PhamToan Tran, and Anh Tran
    In International Conference on Computer Vision, 2025