Anh Nguyen profile photo

Anh Nguyen / Aengus

Incoming Ph.D. student at Johns Hopkins University in Fall 2026. Currently, I am a Predoctoral Research Resident at Qualcomm AI Research, advised by Principal Scientist Dr. Anh Tran.
Summer 2027: Open to research internships; interested in collaborations on generative models across pre-training, distillation, and controllable multimodal applications.
Contact: aengus.ng8@gmail.com
I work on efficient, scalable, and controllable generative modeling as a principled route to machine intelligence beyond human levels.
Research Statement
My long-term goal is to build systems capable of understanding, reasoning, planning, and acquiring physical intuition about the world.

My current research focuses on Efficient & Robust Multimodal Intelligence, aiming to resolve the trade-offs in foundation models through two core pillars: (1) Efficiency & Scalability to minimize training and inference costs, and (2) Robustness & Controllability to enforce alignment and reliability.

Most recently, my work on One-step Generative Modeling & Distillation (NeurIPS & ICCV 2025) collapses iterative inference into real-time, high-fidelity synthesis, while my research on Multimodal Representation (ICCV 2025) leverages internal semantics for zero-shot, fine-grained controllability.

Research Readiness: I can independently lead the entire research lifecycle for top-tier conferences, driving projects from problem formulation and experimentation to final publication.
Outside the Lab
I enjoy the combination of mathematics, coding, and intuition. Away from the keyboard, you can find me clearing my mind on long-distance runs 🏃‍♂️

news

Feb 22, 2026 Anti-I2V: Safeguarding your photos from malicious image-to-video generation got accepted at CVPR 2026. This paper introduces a novel defense against unauthorized human image-to-video generation. Instead of relying on the standard RGB space, Anti-I2V optimizes noise in both the L*a*b* and frequency domains to improve robustness and target salient pixels. It introduces two tailored training objectives: Internal Representation Collapse (IRC) and Internal Representation Anchor (IRA). Together, these effectively degrade temporal coherence and generation fidelity to prevent model misuse.
Jan 26, 2026 Revisit Visual Prompt Tuning: The Expressiveness of Prompt Experts got accepted at ICLR 2026. By formalizing the connection between Attention and Mixture of Experts (MoE), we identify a key limitation in standard VPT: the restricted expressiveness of static prompts. To resolve this, we propose Visual Adaptive Prompt Tuning (VAPT), which conditions prompt experts on the input instance. This formulation is theoretically proven to achieve optimal sample efficiency and yields substantial performance gains, surpassing full fine-tuning on VTAB-1K by 7.34% and outperforming VPT in low-data regimes (1% data) by over 50%, all while using fewer parameters.
Oct 6, 2025 🏆 I am honored to receive the Outstanding Resident in Research and Applied Demo Award 2025! The award is part of the 2025 Recognition Awards from the Qualcomm AI Residency Program, which honors “the exceptional achievements of our residents this year.”
Sep 18, 2025 Improved Training Technique for Shortcut Models got accepted at NeurIPS 2025. This paper tackle the five core issues that held shortcut models back: the hidden flaw of compounding guidance, inflexible fixed guidance, frequency bias, divergent self-consistency, and curvy flow trajectories. Our method achieves state-of-the-art FID scores, making shortcut models a viable class of generative models capable of one-step, few-step, and multi-step sampling.
Jun 26, 2025 Supercharged One-step Text-to-Image Diffusion Models with Negative Prompts got accepted at ICCV 2025. This paper, for the first time, enables negative guidance in one-step diffusion models, unlocking precise creative control without sacrificing speed. The proposed method boosts both controllability and quality, achieving a new state-of-the-art HPSv2 score.

selected publications [full list]

(*) denotes equal contribution
  1. NeurIPS
    Improved Training Technique for Shortcut Models
    Anh Nguyen*, Viet Nguyen*, Duc Vu, Trung Dao, Chi Tran, Toan Tran, and Anh Tran
    In The Thirty-nine Annual Conference on Neural Information Processing Systems, 2025
  2. CVPR
    Anti-I2V: Safeguarding your photos from malicious image-to-video generation
    Duc Vu, Anh Nguyen, Chi Tran, and Anh Tran
    In IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2026
  3. ICLR
    Revisit Visual Prompt Tuning: The Expressiveness of Prompt Experts
    Minh Le*, Anh Nguyen*, Huy Nguyen, Chau Nguyen, Anh Tran, and Nhat Ho
    In International Conference on Learning Representations, 2026
  4. ICCV
    Supercharged One-step Text-to-Image Diffusion Models with Negative Prompts
    Viet Nguyen*, Anh Nguyen*, Trung DaoKhoi NguyenCuong PhamToan Tran, and Anh Tran
    In International Conference on Computer Vision, 2025