VITA

About PI

Professor Zhangyang “Atlas” Wang [Google Scholar] is a tenured Associate Professor and holds the Temple Foundation Endowed Faculty Fellowship #7, in the Chandra Family Department of Electrical and Computer Engineering at The University of Texas at Austin. He is also a faculty member of UT Computer Science [CSRankings], and the Oden Institute CSEM program. Since May 2024, Dr. Wang has been on leave from UT Austin to serve as the full-time Research Director for XTX Markets in New York City, leading groundbreaking efforts at the intersection of algorithmic trading and deep learning.

Previously, he was the Jack Kilby/Texas Instruments Endowed Assistant Professor in the same department from 2020 to 2023; and an Assistant Professor of Computer Science and Engineering at Texas A&M University from 2017 to 2020. Alongside his academic career, he has also explored multiple exciting opportunities in the industry. He was a visiting scholar at Amazon Search from 2021 to 2022, leveraging geometric deep learning for recommendation systems. Later, he took on the (part-time) role of Director of AI Research & Technology for Picsart from 2022 to 2024, where he led the company’s ambitious initiative in video generative AI. He earned his Ph.D. in Electrical and Computer Engineering from UIUC in 2016, under the guidance of Professor Thomas S. Huang, and his B.E. in EEIS from USTC in 2012.

Prof. Wang has broad research interests in machine learning (ML) and optimization. Currently, his research passion centers on establishing the theoretical and algorithmic foundations of generative AI and neurosymbolic AI. His primary goal is developing structured, modular representations that enable efficient and robust learning within overparameterized model spaces, seamlessly connecting to symbolic knowledge and reasoning. This central vision guides his focused pursuit of enhancing efficiency, trustworthiness, and reasoning capabilities in large language models (LLMs), as well as driving innovation in 3D/4D computer vision. His research is gratefully supported by NSF, DARPA, ARL, ARO, IARPA, DOE, as well as dozens of industry and university grants. Prof. Wang co-founded the new Conference on Parsimony and Learning (CPAL) and served as its inaugural Program Chair. He regularly serves as conference (senior) area chairs, journal editors, invited speakers, tutorial/workshop organizers, various panelist positions and reviewers. He is an ACM Distinguished Speaker and an IEEE senior member.

Prof. Wang has received many research awards, including an NSF CAREER Award, an ARO Young Investigator Award, an IEEE AI's 10 To Watch Award, an AI 100 Top Thought Leader Award, an INNS Aharon Katzir Young Investigator Award, a Google Research Scholar award, an IBM Faculty Research Award, a J. P. Morgan Faculty Research Award, an Amazon Research Award, a Sony Faculty Research Award, an Adobe Data Science Research Award, a Meta Reality Labs Research Award, and two Google TensorFlow Model Garden Awards. His team has won three best paper awards (NeuS 2025, IEEE SPS 2024, LoG 2022) and two honorable mentions (MLSys 2025, VLDB 2024), as well as five competition prizes at CVPR/ICCV/ECCV. He feels most proud of being surrounded by some of the world's most brilliant students: his Ph.D. students include winners of nine prestigious fellowships (NSF GRFP, Apple, NVIDIA, Adobe, IBM ×2, Amazon, Qualcomm, and Snap), among many other honors.

About Our Research

At the VITA group, we pursue cutting-edge research spanning the theoretical foundations to practical applications of machine learning (ML). Our group's research continues to evolve, embracing new challenges at the forefront of AI and ML. We collaborate closely with industry partners and other academic institutions to ensure our work has real-world impact and addresses pressing technological needs.

Our current work is organized around three key themes, throughout which we maintain a commitment to developing ML algorithms that are efficient, scalable, and robust. We also explore the broader implications of our work, including applications in robotics, healthcare, and AI for social good.

Theme 1: Large Language Models (LLMs) - Fundamental Optimization, Mechanistic Understanding, & System Co-design

We focus on advancing the efficiency, scalability and trust of LLMs through innovative approaches to training and inference. Our research explores memory-efficient LLM training techniques (APOLLO, GaLoRe, & LiGO), efficient generative inference methods (H₂O & Flextron), understanding pre-trained model weights (lottery ticket) or training artifacts (oversmoothening & LLM-PBE): many accompanied with system or hardware co-design.

Selected Notable Works:

H. Zhu, Z. Zhang*, W. Cong*, X. Liu, S. Park, V. Chandra, B. Long, D. Pan, Z. Wang, and J. Lee, "APOLLO: SGD-like Memory, AdamW-level Performance", Conference on Machine Learning and Systems (MLSys), 2025. (Outstanding Paper Honorable Mention) [Paper] [Code] (Integrated into HuggingFace, LLaMA-Factory, Axolotl, FluxML, etc.)

P. Wang*, R. Cai*, Y. Wang*, J. Zhu*, P. Srivastava, Z. Wang, and P. Li, "Understanding and Mitigating Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing”, International Conference on Learning Representations (ICLR), 2025. [Paper] [Code]

J. Zhao, Z. Zhang*, B. Chen, Z. Wang, A. Anandkumar, and Y. Tian, "GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection”, International Conference on Machine Learning (ICML), 2024. (Oral) [Paper] [Code] (Integrated into HuggingFace, PyTorch, LLaMA-Factory, FedML, Axolotl, etc.)

R. Cai*, S. Muralidharan, G. Heinrich, H. Yin, Z. Wang, J. Kautz, and P. Molchanov, “Flextron: Many-in-One Flexible Large Language Model”, International Conference on Machine Learning (ICML), 2024. (Oral) [Paper] [Code]

Q. Li, J. Hong*, C. Xie, J. Tan, R. Xin, J. Hou, X. Yin, Z. Wang, D. Hendrycks, Z. Wang, B. Li, B. He, and D. Song, “LLM-PBE: Assessing Data Privacy in Large Language Models”, International Conference on Very Large Data Bases (VLDB), 2024. (Best Paper Finalist) [Paper] [Code]

Z. Zhang*, Y. Sheng, T. Zhou, T. Chen*, L. Zheng, R. Cai*, Z. Song, Y. Tian, C. Ré, C. Barrett, Z. Wang, and B. Chen, "H₂O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models”, Advances in Neural Information Processing Systems (NeurIPS), 2023. [Paper] [Code] (Integrated into DeepSpeed by Microsoft, and Llama-Recipes by Meta)

P. Wang*, R. Panda, L. Hennigen, P. Greengard, L. Karlinsky, R. Feris, D. Cox, Z. Wang, and Y. Kim, "Learning to Grow Pretrained Models for Efficient Transformer Training”, International Conference on Learning Representations (ICLR), 2023. (Spotlight) [Paper] [Code] (Implemented in IBM’s AI production system)

T. Chen*, J. Frankle, S. Chang, S. Liu, Y. Zhang, Z. Wang, and M. Carbin, “The Lottery Ticket Hypothesis for Pre-trained BERT Networks”, Advances in Neural Information Processing Systems (NeurIPS), 2020. [Paper] [Code]

Theme 2: Optimization in Modern ML - Learning to Optimize, Black-box optimization, Neurosymbolic learning

Our research in this theme focuses on developing novel optimization techniques for modern machine learning challenges. We have spearheaded the Learning to Optimize (L2O) framework (LISTA-CPSS & ALISTA) and benchmark (L2O Primer), and recently explore the new frontiers in black-box LLM optimization (DP-OPT) and neurosymbolic AI (formal fine-tuning, symbolic L2O, neurosymbolic visual RL, & formalized VLM uncertainty).

Selected Notable Works:

N. Bhatt*, Y. Yang, R. Siva, D. Milan, U. Topcu, and Z. Wang, "Know Where You're Uncertain When Planning with Multimodal Foundation Models: A Formal Framework”, Conference on Machine Learning and Systems (MLSys), 2025. [Paper] [Code]

W. Zheng*, S. Sharan*, Z. Fan*, K. Wang*, Y. Xi*, and Z. Wang, “Symbolic Visual Reinforcement Learning: A Scalable Framework with Object-Level Abstraction and Differentiable Expression Search”, IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024. [Paper] [Code]

Y. Yang, N. Bhatt*, T. Ingebrand, W. Ward, S. Carr, Z. Wang, and U. Topcu, "Fine-Tuning Language Models Using Formal Methods Feedback”, Conference on Machine Learning and Systems (MLSys), 2024. [Paper] [Code]

J. Hong*, J. Wang, C. Zhang, Z. LI*, B. Li, and Z. Wang, "DP-OPT: Make Large Language Model Your Differentially-Private Prompt Engineer”, International Conference on Learning Representations (ICLR), 2024. (Spotlight) [Paper] [Code]

(α-β) T. Chen*, X. Chen*, W. Chen*, H. Heaton, J. Liu, Z. Wang, and W. Yin, “Learning to Optimize: A Primer and A Benchmark”, Journal of Machine Learning Research (JMLR), 2022. [Paper] [Code]

W. Zheng*, T. Chen*, T. Hu*, and Z. Wang, “Symbolic Learning to Optimize: Towards Interpretability and Scalability”, International Conference on Learning Representations (ICLR), 2022. [Paper] [Code]

J. Liu, X. Chen*, Z. Wang, and W. Yin, “ALISTA: Analytic Weights Are As Good As Learned Weights in LISTA”, International Conference on Learning Representations (ICLR), 2019. [Paper] [Code]

X. Chen*, J. Liu, Z. Wang, and W. Yin, “Theoretical Linear Convergence of Unfolded ISTA and Its Practical Weights and Thresholds”, Advances in Neural Information Processing Systems (NeurIPS), 2018. (Spotlight) [Paper] [Code]

Theme 3: Generative Vision - 3D/4D/Video Synthesis, and Related Applications

Our group's earlier (pre-2021) work includes several influential algorithms for GAN-based image enhancement and editing “in the wild”. More recently (post-2021), we push the boundaries of generative AI for visual tasks, with a focus on 3D/4D reconstruction (LSM, InstantSplat, LightGaussian, & NeuralLift-360), novel view synthesis (GNT & SinNeRF), and video generation (StreamingT2V & Text2Video-Zero).

Selected Notable Works:

R. Henschel, L. Khachatryan, D. Hayrapetyan, H. Poghosyan, V. Tadevosyan, Z. Wang, S. Navasardyan, and H. Shi, "StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text”, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2025. [Paper] [Code]

Z. Fan*, J. Zhang, W. Cong*, P. Wang*, R. Li, K. Wen, S. Zhou, A Kadambi, Z. Wang, D. Xu, B. Ivanovic, M. Pavone, and Y. Wang, “Large Spatial Model: End-to-end Unposed Images to Semantic 3D”, Advances in Neural Information Processing Systems (NeurIPS), 2024. [Paper] [Code]

Z. Fan*, K. Wang*, K. Wen, Z. Zhu*, D. Xu*, and Z. Wang, "LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPS”, Advances in Neural Information Processing Systems (NeurIPS), 2024. (Spotlight) [Paper] [Code]

Z. Fan*, W. Cong*, K. Wen, K. Wang*, J. Zhang, X. Ding, D. Xu, B. Ivanovic, M. Pavone, G. Pavlakos, Z. Wang, and Y. Wang, "InstantSplat: Unbounded Sparse-view Pose-free Gaussian Splatting in 40 Seconds”, arXiv preprint arXiv:2403.20309, 2024. [Paper] [Code]

L. Khachatryan, A. Movsisyan, V. Tadevosyan, R. Henschel, Z. Wang, S. Navasardyan, and H. Shi, "Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators”, IEEE International Conference on Computer Vision (ICCV), 2023. (Oral) [Paper] [Code] (Commercialized as Picsart AI GIF generator)

D. Xu*, Y. Jiang*, P. Wang*, Z. Fan*, Y. Wang*, and Z. Wang, "NeuralLift-360: Lifting An In-the-wild 2D Photo to A 3D Object with 360◦ Views”, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023. (Highlight) [Paper] [Code]

M. Varma*, P. Wang*, X. Chen*, T. Chen*, S. Venugopalan, and Z. Wang, "Is Attention All That NeRF Needs?”, International Conference on Learning Representations (ICLR), 2023. [Paper] [Code]

D. Xu*, Y. Jiang*, P. Wang*, Z. Fan*, H. Shi, and Z. Wang, “SinNeRF: Training Neural Radiance Field on Complex Scenes from a Single Image”, European Conference on Computer Vision (ECCV), 2022. [Paper] [Code]

About PI

About Our Research

Theme 1: Large Language Models (LLMs) - Fundamental Optimization, Mechanistic Understanding, & System Co-design

Theme 2: Optimization in Modern ML - Learning to Optimize, Black-box optimization, Neurosymbolic learning

Theme 3: Generative Vision - 3D/4D/Video Synthesis, and Related Applications

Prospective Students Shall Read More...

Sponsor