Jan Kautz
Vice President of Learning and Perception Research @ NVIDIA

I lead the Learning and Perception Research Team at NVIDIA, working predominantly on computer vision problems — from low-level vision (denoising, super-resolution, computational photography) and geometric vision (structure from motion, SLAM, optical flow) to visual perception (detection, recognition, classification), as well as machine learning problems (deep learning, reinforcement learning, generative models).

News
apr
2025
ICLR
We have many papers accepted at ICLR 2025. We hope you can join us!
mar
2025
GTC
We released our robotic foundation model GR00T N1. Read our announcement and various articles: VentureBeat, TechCrunch, TechRadar, Tom's Guide and more.
dec
2024
NeurIPS
My team has over 10 papers at NeurIPS 2024. Please stop by and say hi!
oct
2024
ECCV
We have several papers accepted to ECCV 2024. Please stop by our posters!
jun
2024
CVPR
My team has over TODO papers accepted to CVPR 2024. Please stop by our posters and talks.
may
2024
ICLR
We have several papers accepted at ICLR 2024. We hope you can join us!
dec
2023
NeurIPS
My team has over XXX papers at NeurIPS 2023. Please stop by and say hi!
oct
2023
ICCV
My team has TODO papers accepted to ICCV 2023. Please stop by our posters!
aug
2023
SIGGRAPH
Honored to see our work on Precomputed Radiance Transfer make the SIGGRAPH Seminal Graphics papers list.
jun
2023
CVPR
My team has over 10 papers accepted to CVPR 2023. Come and join us in Vancouver.
may
2023
ICLR
We have 3 papers accepted at ICLR 2023. We hope to see you there!
may
2023
ICRA
My team has 4 papers at ICRA 2023. See you in London!
Research Areas
Efficient AI
We are interested in efficient AI, for both training and inferencing, which includes methods such as pruning, neural architecture search, and so forth.
Visual Perception
Solving perception tasks is an important aspect of our work. In particular, we work on a variety of 2D and 3D perception tasks using deep learning.
Foundation Models
My team is investigating foundation models in the area of computer vision, multi-modal LLMs, as well as embodied AI.
Generative AI
We investigate learning-based methods that can synthesize multi-dimensional data for creative as well as scientific applications.
Publications
2025
NeurIPS
Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models
G. Chen, Z. Li, S. Wang, J. Jiang, Y. Liu, L. Lu, D.-A. Huang, W. Byeon, M. Le, T. Rintamaki, T. Poon, M. Ehrlich, T. Lu, L. Wang, B. Catanzaro, J. Kautz, A. Tao, Z. Yu, G. Liu
Advances in Neural Information Processing Systems (NeurIPS)
December 2025
NeurIPS
CLIMB: Clustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training
S. Diao, Y. Yang, Y. Fu, X. Dong, D. Su, M. Kliegl, Z. Chen, P. Belcak, Y. Suhara, H. Yin, M. Patwary, Y. C. Lin, J. Kautz, P. Molchanov
Advances in Neural Information Processing Systems (NeurIPS)
December 2025
NeurIPS
GSPN-2: Efficient Parallel Sequence Modeling
H. Wang, Y. Liang, D. Wehr, H. Ye, X. Li, K. C. Cheung, K. Han, H. Yin, P. Molchanov, S. Liu, W. Byeon, C. McCarthy, J. Gu, J. Kautz, K. Chen
Advances in Neural Information Processing Systems (NeurIPS)
December 2025
PDF
NeurIPS
Scaling RL to Long Videos
Y. Chen, W. Huang, B. Shi, Q. Hu, H. Ye, L. Zhu, Z. Liu, P. Molchanov, J. Kautz, X. Qi, S. Liu, H. Yin, Y. Lu, S. Han
Advances in Neural Information Processing Systems (NeurIPS)
December 2025
NeurIPS
Fast-SLM: Towards Latency-Optimal Hybrid Small Language Models
Y. Fu, X. Dong, S. Diao, M. V. Keirsbilck, H. Ye, W. Byeon, Y. Karnati, L. Liebenwein, M. Khadkevich, A. Keller, J. Kautz, Y. C. Lin, P. Molchanov
Advances in Neural Information Processing Systems (NeurIPS)
December 2025
NeurIPS
Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning
A. Taghibakhshi, S. T. Sreenivas, S. Muralidharan, M. Chochowski, Y. Karnati, R. B. Joshi, A. S. Mahabaleshwarkar, Z. Chen, Y. Suhara, O. Olabiyi, D. Korzekwa, M. Patwary, M. Shoeybi, J. Kautz, B. Catanzaro
Advances in Neural Information Processing Systems (NeurIPS)
December 2025
NeurIPS
ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models
M. Liu, S. Diao, X. Lu, J. Hu, X. Dong, Y. Choi, J. Kautz, Y. Dong
Advances in Neural Information Processing Systems (NeurIPS)
December 2025
ICCV
AdaHuman: Animatable Detailed 3D Human Generation with Compositional Multiview Diffusion
Y. Huang, Y. Yuan, X. Li, J. Kautz, U. Iqbal
IEEE International Conference on Computer Vision (ICCV)
October 2025
ICCV
GeoMan: Temporally Consistent Human Geometry Estimation using Image-to-Video Diffusion
G. Kim, X. Li, Y. Yuan, K. Nagano, T. Li, J. Kautz, S. Y. Chun, U. Iqbal
IEEE International Conference on Computer Vision (ICCV)
October 2025
ICCV
HumanOLAT: A Large-Scale Dataset for Full-Body Human Relighting and Novel-View Synthesis
T. Teufel, X. Zhou, U. Iqbal, P. Rao, P. Gera, J. Kautz, V. Golyanik, C. Theobalt
IEEE International Conference on Computer Vision (ICCV)
October 2025
ICCV
GEM: A GENeralist Model for Human MOtion
J. Li, J. Cao, H. Zhang, D. Rempe, J. Kautz, U. Iqbal, Y. Yuan
IEEE International Conference on Computer Vision (ICCV)
October 2025
CoRL
DreamGen: Unlocking Generalization in Robot Learning through Video World Models
J. Jang, S. Ye, Z. Lin, J. Xiang, J. Bjorck, Y. Fang, F. Hu, S. Huang, K. Kundalia, L. Magne, A. Mandlekar, A. Narayan, Y. L. Tan, G. Wang, J. Wang, Q. Wang, Y. Xu, K. Zheng, R. Zheng, L. Zettlemoyer, D. Fox, J. Kautz, S. Reed, Y. Zhu, L. Fan
Conference on Robot Learning (CoRL)
September 2025
CoRL
FLARE: Robot Learning with Implicit World Modeling
R. Zheng, J. Wang, S. Reed, Y. Fang, F. Hu, J. Jang, K. Kundalia, Z. Lin, L. Magne, A. Narayan, Y. L. Tan, G. Wang, Q. Wang, J. Xiang, Y. Xu, S. Ye, J. Kautz, F. Huang, Y. Zhu, L. Fan
Conference on Robot Learning (CoRL)
September 2025
TMLR
Wolf: Dense Video Captioning with a World Summarization Framework
B. Li, L. Zhu, R. Tian, S. Tan, Y. Chen, Y. Lu, Y. Cui, S. Veer, M. Ehrlich, J. Philion, X. Weng, F. Xue, L. Fan, Y. Zhu, J. Kautz, A. Tao, M.-Y. Liu, S. Fidler, B. Ivanovic, T. Darrell, J. Malik, S. Han, M. Pavone
Transactions on Machine Learning Research
September 2025
ArXiV
NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model
NVIDIA
ArXiV
August 2025
ICML
FeatSharp: Your Vision Model Features, Sharper
M. Ranzinger, G. Heinrich, P. Molchanov, J. Kautz, B. Catanzaro, A. Tao
International Conference on Machine Learning (ICML)
July 2025
ICML
LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models
D. Shi, Y. Fu, X. Yuan, Z. Yu, H. You, S. Li, X. Dong, J. Kautz, P. Molchanov, Y. C. Lin
International Conference on Machine Learning (ICML)
July 2025
CVPR
One-Minute Video Generation with Test-Time Training
J. Xu, S. Han, K. Dalal, D. Koceja, X. Li, Y. Zhao, K. C. Cheung, Y. Choi, J. Kautz, S. Liu, Y. Sun, X. Wang
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2025
CVPR
Parallel Sequence Modeling via Generalization Spatial Propagation Network (GSPN)
H. Wang, W. Byeon, J. Xu, J. Gu, K. C. Cheung, X. Wang, K. Han, J. Kautz, S. Liu
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2025
CVPR
Scaling Vision Pre-Training to 4K Resolution
B. Shi, B. Li, H. Cai, Y. Lu, S. Liu, M. Pavone, J. Kautz, S. Han, T. Darrell, P. Molchanov, H. Yin
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2025 (highlight)
CVPR
RADIO Amplified: Improved Baselines for Agglomerative Vision Foundation Models
G. Heinrich, M. Ranzinger, H. Yin, Y. Lu, J. Kautz, B. Catanzaro, A. Tao, P. Molchanov
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2025
CVPR
FoundationStereo: Zero-Shot Stereo Matching
B. Wen, M. Trepte, O. J. Aribido, J. Kautz, O. Gallo, S. Birchfield
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2025 (best paper award candidate, oral)
CVPR
Argus: Vision-Centric Reasoning with Grounded Chain-of-Thought
Y. Man, D.-A. Huang, G. Liu, S. Sheng, S. Liu, L. Gui, J. Kautz, Y.-X. Wang, Z. Yu
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2025
CVPR
NVILA: Efficient Frontier Visual Language Models
Z. Liu, L. Zhu, B. Shi, Z. Zhang, Y. Lou, S. Yang, H. Xi, S. Cao, Y. Gu, D. Li, X. Li, Y. Fang, Y. Chen, C.-Y. Hsieh, D.-A. Huang, A.-C. Cheng, V. Nath, A. Myronenko, J. Hu, S. Liu, R. Krishna, D. Xu, X. Wang, P. Molchanov, J. Kautz, H. Yin, S. Han, a. Y. Lu
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2025
CVPR
Mosaic3D: Foundation Dataset and Model for Open-Vocabulary 3D Segmentation
J. Lee, C. Park, J. Choe, Y.-C. F. Wang, J. Kautz, M. Cho, C. Choy
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2025
CVPR
SimAvatar: Simulation-Ready Avatars with Layered Hair and Clothing
X. Li, Y. Yuan, S. D. Mello, G. Daviet, J. Leaf, M. Macklin, J. Kautz, U. Iqbal
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2025
CVPR
OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counter Factual Reasoning
S. Wang, Z. Yu, X. Jiang, S. Lan, M. Shi, N. Chang, J. Kautz, Y. Li, J. M. Alvarez
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2025
CVPR
MambaVision: A Hybrid Mamba-Transformer Vision Backbone
A. Hatamizadeh, J. Kautz
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2025
RSS
NaVILA: Legged Robot Vision-Language-Action Model for Navigation
A.-C. Cheng, Y. Ji, Z. Yang, Z. Gongye, X. Zou, J. Kautz, E. Biyik, H. Yin, S. Liu, X. Wang
Robotics: Science and Systems (RSS)
June 2025
ICRA
HOVER: Versatile Neural Whole-Body Controller for Humanoid Robots
T. He, W. Xiao, T. Lin, Z. Luo, Z. Xu, Z. Jiang, J. Kautz, C. Liu, G. Shi, X. Wang, L. “. Fan, Y. Zhu
International Conference on Robotics and Automation (ICRA)
May 2025
ArXiV
Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models
NVIDIA
ArXiV
April 2025
ICLR
Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
M. Shi, F. Liu, S. Wang, S. Liao, S. Radhakrishnan, D.-A. Huang, H. Yin, K. Sapra, Y. Yacoob, H. Shi, B. Catanzaro, A. Tao, J. Kautz, G. Liu, Z. Yu
International Conference on Learning Representations (ICLR)
April 2025
ICLR
LongVILA: Scaling Long-Context Visual Language Models for Long Videos
Y. Chen, F. Xue, D. Li, Q. Hu, L. Zhu, X. Li, Y. Fang, H. Tang, S. Yang, Z. Liu, Y. He, H. Yin, P. Molchanov, J. Kautz, L. Fan, Y. Zhu, Y. Lu, S. Han
International Conference on Learning Representations (ICLR)
April 2025
ICLR
Gated Delta Networks: Improving Mamba2 with Delta Rule
S. Yang, J. Kautz, A. Hatamizadeh
International Conference on Learning Representations (ICLR)
April 2025
ICLR
Hymba: A Hybrid-head Architecture for Small Language Models
X. Dong, Y. Fu, S. Diao, W. Byeon, Z. Chen, A. S. Mahabaleshwarkar, S.-Y. Liu, M. V. Keirsbilck, M.-H. Chen, Y. Suhara, Y. C. Lin, J. Kautz, P. Molchanov
International Conference on Learning Representations (ICLR)
April 2025
ICLR
LlamaFlex: Many-in-One LLMs via Generalized Pruning and Weight Sharing
R. Cai, S. Muralidharan, H. Yin, Z. Wang, J. Kautz, P. Molchanov
International Conference on Learning Representations (ICLR)
April 2025
PDF
ICLR
LongMamba: Enhancing Mamba's Long-Context Capabilities via Training-Free Receptive Field Enlargement
Z. Ye, K. Xia, Y. Fu, X. Dong, J. Hong, X. Yuan, S. Diao, J. Kautz, P. Molchanov, Y. Lin
International Conference on Learning Representations (ICLR)
April 2025
PDF
ArXiV
GR00T N1: An Open Foundation Model for Generalist Humanoid Robots
NVIDIA
ArXiV
March 2025
Nature
Residual corrective diffusion modeling for km-scale atmospheric downscaling
M. Mardani, N. Brenowitz, Y. Cohen, J. Pathak, C.-Y. Chen, C.-C. Liu, A. Vahdat, M. A. Nabian, T. Ge, A. Subramaniam, K. Kashinath, J. Kautz, M. Pritchard
Nature Communications Earth & Environment
February 2025
2024
NeurIPS
MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models
G. Fang, H. Yin, S. Muralidharan, G. Heinrich, J. Pool, J. Kautz, P. Molchanov, X. Wang
Advances in Neural Information Processing Systems (NeurIPS)
December 2024
NeurIPS
Compact Language Models via Pruning and Knowledge Distillation
S. Muralidharan, S. T. Sreenivas, R. Joshi, M. Chochowski, M. Patwary, M. Shoeybi, B. Catanzaro, J. Kautz, P. Molchanov
Advances in Neural Information Processing Systems (NeurIPS)
December 2024
NeurIPS
SpatialRGPT: Grounded Spatial Reasoning in Vision-Language Models
A.-C. Cheng, H. Yin, Y. Fu, Q. Guo, R. Yang, J. Kautz, X. Wang, S. Liu
Advances in Neural Information Processing Systems (NeurIPS)
December 2024
NeurIPS
CosAE: Learnable Fourier Series for Image Restoration
S. Liu, S. D. Mello, J. Kautz
Advances in Neural Information Processing Systems (NeurIPS)
December 2024
ECCV
COIN: Control-Inpainting Diffusion Prior for Human and Camera Motion Estimation
J. Li, Y. Yuan, D. Rempe, H. Zhang, P. Molchanov, C. Lu, J. Kautz, U. Iqbal
European Conference on Computer Vision (ECCV)
September 2024
ECCV
LITA: Language Instructed Temporal-localization Assistant
D.-A. Huang, S. Liao, S. Radhakrishnan, H. Yin, P. Molchanov, Z. Yu, J. Kautz
European Conference on Computer Vision (ECCV)
September 2024
ECCV
DiffiT: Diffusion Vision Transformers for Image Generation
A. Hatamizadeh, J. Song, G. Liu, J. Kautz, A. Vahdat
European Conference on Computer Vision (ECCV)
September 2024
ICML
Flextron: Many-in-One Flexible Large Language Model
R. Cai, S. Muralidharan, G. Heinrich, H. Yin, Z. Wang, J. Kautz, P. Molchanov
International Conference on Machine Learning (ICML)
July 2024 (oral)
ArXiV
An Empirical Study of Mamba-based Language Models
R. Waleffe, W. Byeon, D. Riach, B. Norick, V. Korthikanti, T. Dao, A. Gu, A. Hatamizadeh, S. Singh, D. Narayanan, G. Kulshreshtha, V. Singh, J. Casper, J. Kautz, M. Shoeybi, B. Catanzaro
ArXiV
June 2024
CVPR
FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects
B. Wen, W. Yang, J. Kautz, S. Birchfield
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2024 (highlight)
CVPR
GAvatar: Animatable 3D Gaussian Avatars with Implicit Mesh Learning
Y. Yuan, X. Li, Y. Huang, S. D. Mello, K. Nagano, J. Kautz, U. Iqbal
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2024 (highlight)
CVPR
COLMAP-Free 3D Gaussian Splatting
Y. Fu, S. Liu, A. Kulkarni, J. Kautz, A. A. Efros, X. Wang
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2024 (highlight)
CVPR
VILA: On pretraining for vision language models
J. Lin, H. Yin, W. Ping, Y. Lu, P. Molchanov, A. Tao, H. Mao, J. Kautz, M. Shoeybi, S. Han
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2024
CVPR
AM-RADIO: Agglomerative Model - Reduce All Domains Into One
M. Ranzinger, G. Heinrich, P. Molchanov, J. Kautz
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2024
CVPR
Is Ego Status All You Need for Open-Loop End-to-End Autonomous Driving?
Z. Li, Z. Yu, S. Lan, J. Li, J. Kautz, T. Lu, J. M. Alvarez
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2024
ICLR
FasterViT: Fast Vision Transformers with Hierarchical Attention
A. Hatamizadeh, G. Heinrich, H. Yin, A. Tao, J. M. Alvarez, J. Kautz, P. Molchanov
International Conference on Learning Representations (ICLR)
May 2024
ICLR
3D Reconstruction with Generalizable Neural Fields using Scene Priors
Y. Fu, S. D. Mello, X. Li, A. Kulkarni, J. Kautz, X. Wang, S. Liu
International Conference on Learning Representations (ICLR)
May 2024
ICLR
Learning to Jointly Understand Visual and Tactile Signals
Y. Li, Y. Du, C. Liu, F. Williams, M. Foshey, B. Eckart, J. Kautz, J. B. Tenenbaum, A. Torralba, W. Matusik
International Conference on Learning Representations (ICLR)
May 2024
PDF
ICLR
A Variational Perspective on Solving Inverse Problems with Diffusion Models
M. Mardani, J. Song, J. Kautz, A. Vahdat
International Conference on Learning Representations (ICLR)
May 2024
3DV
Field-of-View Agnostic Depth Estimation for Cross-Dataset Generalization
D. Lichy, H. Su, A. Badki, J. Kautz, O. Gallo
International Conference on 3D Vision
March 2024 (oral)
3DV
PACE: Human and Camera Motion Estimation from in-the-wild Videos
M. Kocabas, Y. Yuan, P. Molchanov, Y. Guo, M. Black, O. Hilliges, J. Kautz, U. Iqbal
International Conference on 3D Vision
March 2024
2023
NeurIPS
Generalizable One-shot Neural Head Avatar
X. Li, S. D. Mello, S. Liu, K. Nagano, U. Iqbal, J. Kautz
Advances in Neural Information Processing Systems (NeurIPS)
December 2023
NeurIPS
Convolutional State Space Models for Long-Range Spatiotemporal Modeling
J. T. Smith, S. D. Mello, J. Kautz, S. Linderman, W. Byeon
Advances in Neural Information Processing Systems (NeurIPS)
December 2023
MICCAI
SMRD: SURE-based Robust MRI Reconstruction with Diffusion Models
B. Ozturkler, C. Liu, B. Eckart, M. Mardani, J. Song, J. Kautz
International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI)
October 2023
ICCV
RANA: Relightable and Articulated Neural Avatars
U. Iqbal, A. Caliskan, K. Nagano, S. Khamis, P. Molchanov, J. Kautz
IEEE International Conference on Computer Vision (ICCV)
October 2023
ICCV
PhysDiff: Physics-Guided Human Motion Diffusion Model
Y. Yuan, J. Song, U. Iqbal, A. Vahdat, J. Kautz
IEEE International Conference on Computer Vision (ICCV)
October 2023 (oral)
SIGGRAPH
Single-Shot Implicit Morphable Faces with Consistent Texture Parameterization
C. Lin, K. Nagano, J. Kautz, E. Chan, U. Iqbal, L. Guibas, G. Wetzstein, S. Khamis
ACM SIGGRAPH
August 2023
ICML
Global Context Vision Transformers
A. Hatamizadeh, H. Yin, J. Kautz, P. Molchanov
International Conference on Machine Learning (ICML)
July 2023
ICML
Loss-Guided Diffusion Models for Plug-and-Play Controllable Generation
J. Song, Q. Zhang, H. Yin, M. Mardani, M.-Y. Liu, J. Kautz, Y. Chen, A. Vahdat
International Conference on Machine Learning (ICML)
July 2023
PDF
CVPR
Heterogeneous Continual Learning
D. Madaan, H. Yin, W. Byeon, J. Kautz, P. Molchanov
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
June 2023 (highlight)
PDF
CVPR
Zero-shot Pose Transfer for Unrigged Stylized 3D Characters
J. Wang, X. Li, S. Liu, S. D. Mello, O. Gallo, X. Wang, J. Kautz
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
June 2023
PDF
CVPR
The Best Defense is a Good Offense: Adversarial Augmentation Against Adversarial Attacks
I. Frosio, J. Kautz
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
June 2023
CVPR
Global Vision Transformer Pruning with Hessian-Aware Saliency
H. Yang, H. Yin, M. Shen, P. Molchanov, H. Li, J. Kautz
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
June 2023
PDF
CVPR
Recurrence without Recurrence: Stable Video Landmark Detection with Deep Equilibrium Models
P. Micaelli, P. Molchanov, A. Vahdat, H. Yin, J. Kautz
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
June 2023
CVPR
BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown Objects
B. Wen, J. Tremblay, V. Blukis, S. Tyree, T. Müller, A. Evans, D. Fox, J. Kautz, S. Birchfield
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
June 2023
ICLR
Pseudoinverse-Guided Diffusion Models for Inverse Problems
J. Song, A. Vahdat, M. Mardani, J. Kautz
International Conference on Learning Representations (ICLR), 2023
May 2023
PDF
ICRA
Online Consistent Video Depth using Continuous Geometric Representations
C. Liu, B. Eckart, J. Kautz
IEEE International Conference on Robotics and Automation (ICRA)
May 2023
PDF
IEEE TMI
Do Gradient Inversion Attacks Make Federated Learning Unsafe?
H. Roth, A. Hatamizadeh, H. Yin, P. Molchanov, A. Myronenko, W. Li, P. Dogra, A. Feng, M. Flores, J. Kautz, D. Xu
IEEE Transactions on Medical Imaging
42(7), January 2023
2022
SIGGRAPH ASIA
Learning to Relight Portrait Images via a Virtual Light Stage and Synthetic-to-Real Adaptation
Y.-Y. Yeh, K. Nagano, S. Khamis, J. Kautz, M.-Y. Liu, T.-C. Wang
ACM Transactions on Graphics (Proceedings SIGGRAPH Asia 2022)
41(6), December 2022
MIA
Towards Annotation-efficient Segmentation via Image-to-image Translation
E. Vorontsov, P. Molchanov, M. Gazda, C. Beckham, J. Kautz, S. Kadoury
Medical Image Analysis
82, November 2022
ECCV
LANA: Latency Aware Network Acceleration
P. Molchanov, J. Hall, H. Yin, N. Fusi, J. Kautz, A. Vahdat
European Conference on Computer Vision (ECCV)
October 2022
ECCV
Neural Light Field Estimation for Outdoor Scenes with Differentiable Virtual Object Insertion
Z. Wang, W. Chen, D. Acuna, J. Kautz, S. Fidler
European Conference on Computer Vision (ECCV)
October 2022
CVPR
GLAMR: Global Occlusion-Aware Human Mesh Recovery with Dynamic Cameras
Y. Yuan, U. Iqbal, P. Molchanov, K. Kitani, J. Kautz
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2022 (oral)
CVPR
A-ViT: Adaptive Tokens for Efficient Vision Transformer
H. Yin, A. Vahdat, J. M. Alvarez, A. Mallya, J. Kautz, P. Molchanov
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2022 (oral)
CVPR
GradViT: Gradient Inversion of Vision Transformers
A. Hatamizadeh, H. Yin, H. Roth, W. Li, J. Kautz, D. Xu, P. Molchanov
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2022
CVPR
GroupViT: Zero-Shot Transfer to Semantic Segmentation with Text Supervision
J. Xu, S. D. Mello, S. Liu, W. Byeon, T. Breuel, J. Kautz, X. Wang
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2022
CVPR
CoordGAN: Self-Supervised Dense Correspondences Emerge from GANs
J. Mu, S. Liu, S. D. Mello, Z. Yu, N. Vasconcelos, X. Wang, J. Kautz
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2022
CVPR
FreeSOLO: Learning to Segment Objects without Annotations
X. Wang, Z. Yu, S. D. Mello, J. Kautz, A. Anandkumar, C. Shen, J. M. Alvarez
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2022
PDF
ICLR
Learning Continuous Environment Fields via Implicit Functions
X. Li, S. D. Mello, X. Wang, M.-H. Yang, J. Kautz, S. Liu
International Conference on Learning Representations (ICLR)
April 2022
PDF
IJCV
Learning Contrastive Representation for Semantic Correspondence
T. Xiao, S. Liu, S. D. Mello, Z. Yu, J. Kautz, M.-H. Yang
International Journal on Computer Vision (IJCV)
March 2022
PDF
IJCV
Displacement-Invariant Cost Computation for Stereo Matching
Y. Zhong, C. Loop, W. Byeon, S. Birchfield, Y. Dai, K. Zhang, A. Kamenev, T. Breuel, H. Li, J. Kautz
International Journal on Computer Vision (IJCV)
March 2022
PDF
AAAI
Neural Interferometry: Image Reconstruction from Astronomical Interferometers using Implicit Neural Representations
B. Wu, B. Eckart, C. Liu, J. Kautz
AAAI Conference on Artificial Intelligence (AAAI)
February 2022
PDF
2021
NeurIPS
Coupled Segmentation and Edge Learning Using Dynamic Graph Propagation
Z. Yu, R. Huang, W. Byeon, S. Liu, G. Liu, T. Breuel, A. Anandkumar, J. Kautz
Neural Information Processing Systems (NeurIPS)
December 2021
PDF
NeurIPS
A Contrastive Learning Approach for Training Variational Autoencoder Priors
J. Aneja, A. Schwing, J. Kautz, A. Vahdat
Neural Information Processing Systems (NeurIPS)
December 2021
PDF
NeurIPS
Score-based Generative Modeling in Latent Space
A. Vahdat, K. Kreis, J. Kautz
Neural Information Processing Systems (NeurIPS)
December 2021
3DV
KAMA: 3D Keypoint Aware Body Mesh Articulation
U. Iqbal, K. Xie, Y. Guo, J. Kautz, P. Molchanov
International Conference on 3D Vision (3DV)
December 2021
BMVC
Hierarchical Contrastive Motion Learning for Video Action Recognition
X. Yang, X. Yang, S. Liu, D. Sun, L. Davis, J. Kautz
British Machine Vision Conference (BMVC)
November 2021
ICCV
Learning Indoor Inverse Rendering with 3D Spatially-Varying Lighting
Z. Wang, J. Philion, S. Fidler, J. Kautz
International Conference on Computer Vision (ICCV)
October 2021 (oral)
ICCV
Self-Supervised Object Detection via Generative Image Synthesis
S. K. Mustikovela, S. De Mello, A. Prakash, U. Iqbal, S. Liu, T. Nguyen-Phuoc, C. Rother, J. Kautz
International Conference on Computer Vision (ICCV)
October 2021
TPAMI
Domain Stylization: A Fast Covariance Matching Framework towards Domain Adaptation
A. Dundar, M.-Y. Liu, Z. Yu, T.-C. Wang, J. Zedlewski, J. Kautz
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
43(7), July 2021
CVPR
Binary TTC: A Temporal Geofence for Autonomous Navigation
A. Badki, O. Gallo, J. Kautz, P. Sen
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2021 (Best Student Paper Honorable Mention & oral)
CVPR
Weakly-Supervised Physically Unconstrained Gaze Estimation
R. Kothari, S. De Mello, U. Iqbal, W. Byeon, S. Park, J. Kautz
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2021 (oral)
CVPR
Learning to Track Instances without Video Annotations
Y. Fu, S. Liu, U. Iqbal, S. De Mello, H. Shi, J. Kautz
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2021 (oral)
CVPR
See Through Gradients: Image Batch Recovery via GradInversion
H. Yin, A. Mallya, A. Vahdat, J. M. Alvarez, J. Kautz, P. Molchanov
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2021
CVPR
Self-Supervised Learning on 3D Point Clouds by Learning Latent Generative Models
B. Eckart, W. Yuan, C. Liu, J. Kautz
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2021
CVPR
DexYCB: A Benchmark for Capturing Hand Grasping of Objects
Y.-W. Chao, W. Yang, A. Handa, Y. Xiang, Y. Narang, K. V. Wyk, U. Iqbal, P. Molchanov, J. Tremblay, S. Birchfield, J. Kautz, D. Fox
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2021
ICLR
VAEBM: A Symbiosis between Variational Autoencoders and Energy-based Models
Z. Xiao, K. Kreis, J. Kautz, A. Vahdat
International Conference on Learning Representations (ICLR)
May 2021 (spotlight)
PDF
ICLR
Parameter Efficient Multimodal Transformers for Video Representation Learning
S. Lee, Y. Yu, G. Kim, T. Breuel, J. Kautz, Y. Song
International Conference on Learning Representations (ICLR)
May 2021
PDF
2020
NeurIPS
NVAE: A Deep Hierarchical Variational Autoencoder
A. Vahdat, J. Kautz
Neural Information Processing Systems (NeurIPS)
December 2020 (spotlight)
PDF
NeurIPS
Online Adaptation for Consistent Mesh Reconstruction in the Wild
X. Li, S. Liu, S. De Mello, K. Kim, X. Wang, M.-H. Yang, J. Kautz
Neural Information Processing Systems (NeurIPS)
December 2020
PDF
NeurIPS
Convolutional Tensor-Train LSTM for Spatio-Temporal Learning
J. Su, W. Byeon, J. Kossaifi, F. Huang, J. Kautz, A. Anandkumar
Neural Information Processing Systems (NeurIPS)
December 2020
PDF
ISMAR
Optical Gaze Tracking with Spatially-Sparse Single-Pixel Detectors
R. Li, E. Whitmire, M. Stengel, B. Boudaoud, J. Kautz, D. Luebke, S. Patel, K. Aksit
IEEE International Symposium on Mixed and Augmented Reality (ISMAR)
November 2020
PDF
ECCV
Contrastive Learning for Weakly Supervised Phrase Grounding
T. Gupta, A. Vahdat, G. Chechik, X. Yang, J. Kautz, D. Hoiem
European Conference on Computer Vision (ECCV)
August 2020 (spotlight)
PDF
ECCV
DeepGMR: Learning Latent Gaussian Mixture Models for Registration
W. Yuan, B. Eckart, K. Kim, V. Jampani, D. Fox, J. Kautz
European Conference on Computer Vision (ECCV)
August 2020 (spotlight)
PDF
ECCV
Joint Disentangling and Adaptation for Cross-Domain Person Re-Identification
Y. Zou, X. Yang, Z. Yu, B. V. K. V. Kumar, J. Kautz
European Conference on Computer Vision (ECCV)
August 2020 (oral)
PDF
ECCV
Self-supervised Single-view 3D Reconstruction via Semantic Consistency
X. Li, S. Liu, K. Kim, S. De Mello, V. Jampani, M.-H. Yang, J. Kautz
European Conference on Computer Vision (ECCV)
August 2020
PDF
ECCV
Weakly Supervised 3D Hand Pose Estimation via Biomechanical Constraints
A. Spurr, P. Molchanov, U. Iqbal, O. Hilliges, J. Kautz
European Conference on Computer Vision (ECCV)
August 2020
PDF
ECCV
UFO2: A Unified Framework towards Omni-supervised Object Detection
Z. Ren, Z. Yu, X. Yang, M.-Y. Liu, A. Schwing, J. Kautz
European Conference on Computer Vision (ECCV)
August 2020
PDF
ICML
Angular Visual Hardness
B. Chen, W. Liu, A. Garg, Z. Yu, A. Shrivastava, J. Kautz, A. Anandkumar
International Conference on Machine Learning (ICML)
July 2020
PDF
CVPR
Dreaming to Distill: Data-free Knowledge Transfer via DeepInversion
H. Yin, P. Molchanov, J. M. Alvarez, Z. Li, A. Mallya, D. Hoiem, N. Jha, J. Kautz
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2020 (oral)
CVPR
UNAS: Differentiable Architecture Search Meets Reinforcement Learning
A. Vahdat, A. Mallya, M.-Y. Liu, J. Kautz
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2020 (oral)
CVPR
Self-Supervised Viewpoint Learning from Image Collections
S. K. Mustikovela, V. Jampani, S. De Mello, U. Iqbal, S. Liu, C. Rother, J. Kautz
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2020
CVPR
Bi3D: Stereo Depth Estimation via Binary Classifications
A. Badki, O. Gallo, A. Troccoli, K. Kim, P. Sen, J. Kautz
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2020
CVPR
Meshlet Priors for 3D Mesh Reconstruction
A. Badki, O. Gallo, P. Sen, J. Kautz
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2020
CVPR
Weakly-Supervised 3D Human Pose Learning via Multi-view Images in the Wild
U. Iqbal, P. Molchanov, J. Kautz
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2020
CVPR
Two-shot Spatially-varying BRDF and Shape Estimation
M. Boss, V. Jampani, K. Kim, H. Lensch, J. Kautz
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2020
CVPR
Novel View Synthesis of Dynamic Scenes with Globally Coherent Depths from a Monocular Camera
J. S. Yoon, K. Kim, O. Gallo, H. S. Park, J. Kautz
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2020
CVPR
Instance-aware, Context-focused, and Memory-efficient Weakly-Supervised Object Detection
Z. Ren, Z. Yu, X. Yang, M.-Y. Liu, Y. J. Lee, A. Schwing, J. Kautz
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2020
IJCV
Exploiting Semantics for Face Image Deblurring
Z. Shen, W.-S. Lai, T. Xu, J. Kautz, M.-H. Yang
International Journal on Computer Vision (IJCV)
March 2020
WACV
NRMVS: Non-Rigid Multi-View Stereo
M. Innmann, K. Kim, J. Gu, M. Niessner, C. Loop, M. Stamminger, J. Kautz
IEEE Winter Conference on Applications of Computer Vision (WACV)
March 2020, pages 2754-2763
2019
NeurIPS
Joint-task Self-supervised Learning for Temporal Correspondence
X. Li, S. Liu, S. De Mello, X. Wang, M.-H. Yang, J. Kautz
Neural Information Processing Systems (NeurIPS)
December 2019
NeurIPS
Dancing to Music
H.-Y. Lee, X. Yang, M.-Y. Liu, T.-C. Wang, Y.-D. Lu, M.-H. Yang, J. Kautz
Neural Information Processing Systems (NeurIPS)
December 2019
NeurIPS
Few-shot Video-to-Video Synthesis
T.-C. Wang, M.-Y. Liu, A. Tao, G. Liu, J. Kautz, B. Catanzaro
Neural Information Processing Systems (NeurIPS)
December 2019
ICCV
Extreme View Synthesis
I. Choi, O. Gallo, A. Troccoli, M. H. Kim, J. Kautz
IEEE International Conference on Computer Vision (ICCV)
October 2019 (oral)
ICCV
SENSE: A Shared Encoder Network for Scene-flow Estimation
H. Jiang, D. Sun, V. Jampani, Z. Lv, E. Learned-Miller, J. Kautz
IEEE International Conference on Computer Vision (ICCV)
October 2019 (oral)
ICCV
Few-shot Adaptive Gaze Estimation
S. Park, S. De Mello, P. Molchanov, U. Iqbal, O. Hilliges, J. Kautz
IEEE International Conference on Computer Vision (ICCV)
October 2019 (oral)
ICCV
Learning Propagation for Arbitrarily-structured Data
S. Liu, X. Li, V. Jampani, S. De Mello, J. Kautz
IEEE International Conference on Computer Vision (ICCV)
October 2019
ICCV
Few-shot Unsupervised Image-to-Image Translation
M.-Y. Liu, X. Huang, A. Mallya, T. Karras, T. Aila, J. Lehtinen, J. Kautz
IEEE International Conference on Computer Vision (ICCV)
October 2019
ICCV
Neural Inverse Rendering of an Indoor Scene from a Single Image
S. Sengupta, J. Gu, K. Kim, G. Liu, D. Jacobs, J. Kautz
IEEE International Conference on Computer Vision (ICCV)
October 2019
PDF
ICCV
Unsupervised Video Interpolation Using Cycle Consistency
F. Reda, D. Sun, A. Dundar, M. Shoeybi, G. Liu, K. Shih, A. Tao, J. Kautz, B. Catanzaro
IEEE International Conference on Computer Vision (ICCV)
October 2019
BMVC
Few-Shot Viewpoint Estimation
H.-Y. Tseng, S. De Mello, J. Tremblay, S. Liu, S. Birchfield, M.-H. Yang, J. Kautz
British Machine Vision Conference (BMVC)
September 2019
PDF
BMVC
Video Stitching for Linear Camera Arrays
W.-S. Lai, O. Gallo, J. Gu, D. Sun, M.-H. Yang, J. Kautz
British Machine Vision Conference (BMVC)
September 2019
CVPR
Joint Discriminative and Generative Learning for Person Re-identification
Z. Zheng, X. Yang, Z. Yu, L. Zheng, Y. Yang, J. Kautz
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2019 (oral)
CVPR
STEP: Spatio-Temporal Progressive Learning for Video Action Detection
X. Yang, X. Yang, M.-Y. Liu, F. Xiao, L. Davis, J. Kautz
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2019 (oral)
CVPR
PlaneRCNN: 3D Plane Detection and Reconstruction from a Single View
C. Liu, K. Kim, J. Gu, Y. Furukawa, J. Kautz
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2019 (oral)
CVPR
Neural RGB → D Sensing: Depth and Uncertainty from a Video Camera
C. Liu, J. Gu, K. Kim, S. Narasimhan, J. Kautz
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2019 (Best Paper Finalist & oral)
CVPR
SCOPS: Self-Supervised Co-Part Segmentation
W.-C. Hung, V. Jampani, S. Liu, P. Molchanov, M.-H. Yang, J. Kautz
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2019
CVPR
Pixel Adaptive Convolutional Neural Networks
H. Su, V. Jampani, D. Sun, O. Gallo, E. Learned-Miller, J. Kautz
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2019
CVPR
Learning Linear Transformations for Fast Image and Video Style Transfer
X. Li, S. Liu, J. Kautz, M.-H. Yang
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2019
CVPR
Putting Humans in a Scene: Learning Affordance in 3D Indoor Environments
X. Li, S. Liu, K. Kim, M.-H. Yang, X. Wang, J. Kautz
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2019
CVPR
Importance Estimation for Neural Network Pruning
P. Molchanov, A. Mallya, S. Tyree, I. Frosio, J. Kautz
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
June 2019
TPAMI
Models Matter, So Does Training: An Empirical Study of CNNs for Optical Flow Estimation
D. Sun, X. Yang, M.-Y. Liu, J. Kautz
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
?(?), ? 2019
TIP
Statistical Nearest Neighbors for Image Denoising
I. Frosio, J. Kautz
IEEE Transactions on Image Processing
28(2), February 2019, pages 723-728
WACV
A Fusion Approach for Multi-Frame Optical Flow Estimation
Z. Ren, O. Gallo, D. Sun, M.-H. Yang, E. Sudderth, J. Kautz
IEEE Winter Conference on Applications of Computer Vision (WACV)
January 2019
PDF
CV
oct 2018

Vice President of Learning and Perception Research, NVIDIA, USA

Head of the learning and perception research group at NVIDIA.

apr 2017

Senior Director of Visual Computing and Machine Learning Research, NVIDIA, USA

oct 2015

Director of Visual Computing Research, NVIDIA, USA

oct 2014

Senior Research Manager, NVIDIA, USA

Head of the visual computing research group at NVIDIA.

sep 2013

Senior Research Scientist, NVIDIA, USA

Research in comp. photography and computer vision.

oct 2012

Professor of Visual Computing, University College London, UK

oct 2011

Associate Professor (Reader), University College London, UK

oct 2009

Associate Professor (Senior Lecturer), University College London, UK

mar 2006

Assistant Professor (Lecturer), University College London, UK

Research in visual computing, teaching and supervision of students (BSc, MSc, PhD).

jul 2003

Post-Doctoral Researcher, Massachusetts Institute of Technology, USA

Working on appearance editing and realistic, real-time rendering.

sep 1999

PhD Student, Max-Planck-Institut fur Informatik, Germany

Received PhD (summa cum laude).

may 1998

Graduate Student, University of Waterloo, Canada

Received MMath.

oct 1993

Student, University Erlangen-Nurnberg, Germany

Received Diplom-Informatiker (MSc in Computer Science).

Contact
Jan Kautz
NVIDIA | Learning and Perception Research Group
2 Technology Park Drive | Westford, MA 01886