Bryan Catanzaro

Ameya Sunil Mahabaleshwarkar

Saurav Muralidharan

Ruisi Cai

Marcin Chochowski

CoRR, November, 2025

Music Flamingo: Scaling Music Understanding in Audio Language Models.

[BibT_eX]

[DOI]

CoRR, November, 2025

OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM.

[BibT_eX]

[DOI]

CoRR, October, 2025

UALM: Unified Audio Language Model for Understanding, Generation and Reasoning.

[BibT_eX]

[DOI]

CoRR, October, 2025

Front-Loading Reasoning: The Synergy between Pretraining and Post-Training Data.

[BibT_eX]

[DOI]

CoRR, October, 2025

RLP: Reinforcement as a Pretraining Objective.

[BibT_eX]

[DOI]

CoRR, October, 2025

Nemotron-CC-Math: A 133 Billion-Token-Scale High Quality Math Pretraining Dataset.

[BibT_eX]

[DOI]

Rabeeh Karimi Mahabadi

CoRR, August, 2025

Audio Flamingo Sound-CoT Technical Report: Improving Chain-of-Thought Reasoning in Sound Understanding.

[BibT_eX]

[DOI]

CoRR, August, 2025

Fusing LLM Capabilities with Routing Data.

[BibT_eX]

[DOI]

CoRR, July, 2025

Audio Flamingo 3: Advancing Audio Intelligence with Fully Open Large Audio Language Models.

[BibT_eX]

[DOI]

CoRR, July, 2025

AceReason-Nemotron 1.1: Advancing Math and Code Reasoning through SFT and RL Synergy.

[BibT_eX]

[DOI]

CoRR, June, 2025

Prismatic Synthesis: Gradient-based Data Diversification Boosts Generalization in LLM Reasoning.

[BibT_eX]

[DOI]

CoRR, May, 2025

Multi-Domain Audio Question Answering Toward Acoustic Content Reasoning in The DCASE 2025 Challenge.

[BibT_eX]

[DOI]

CoRR, May, 2025

Nemotron-Research-Tool-N1: Exploring Tool-Using Language Models with Reinforced Reasoning.

[BibT_eX]

[DOI]

CoRR, May, 2025

Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models.

[BibT_eX]

[DOI]

CoRR, April, 2025

From 128K to 4M: Efficient Training of Ultra-Long Context Large Language Models.

[BibT_eX]

[DOI]

CoRR, April, 2025

Retro-Search: Exploring Untaken Paths for Deeper and Efficient Reasoning.

[BibT_eX]

[DOI]

CoRR, April, 2025

FeatSharp: Your Vision Model Features, Sharper.

[BibT_eX]

[DOI]

CoRR, February, 2025

Eagle 2: Building Post-Training Data Strategies from Scratch for Frontier Vision-Language Models.

[BibT_eX]

[DOI]

CoRR, January, 2025

A2SB: Audio-to-Audio Schrodinger Bridges.

[BibT_eX]

[DOI]

CoRR, January, 2025

Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning.

[BibT_eX]

[DOI]

Ali Taghibakhshi

Ameya Mahabaleshwarkar

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Prismatic Synthesis: Gradient-based Data Diversification Boosts Generalization in LLM Reasoning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Nemotron-CORTEXA: Enhancing LLM Agents for Software Engineering Tasks via Improved Localization and Solution Diversity.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

FeatSharp: Your Vision Model Features, Sharper.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

ETTA: Elucidating the Design Space of Text-to-Audio Models.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Fugatto 1: Foundational Generative Audio Transformer Opus 1.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders.

[BibT_eX]

[DOI]

Subhashree Radhakrishnan

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

UniWav: Towards Unified Pre-training for Speech Representation Learning and Generation.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Mm-Embed: Universal Multimodal Retrieval with Multimodal LLMS.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

MIND: Math Informed syNthetic Dialogues for Pretraining LLMs.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

RADIOv2.5: Improved Baselines for Agglomerative Vision Foundation Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

AceMath: Advancing Frontier Math Reasoning with Post-Training and Reward Modeling.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2025

Nemotron-CC: Transforming Common Crawl into a Refined Long-Horizon Pretraining Dataset.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

Progressive Learning of 3D Reconstruction Network From 2D GAN Data.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., February, 2024

TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization.

[BibT_eX]

[DOI]

CoRR, 2024

Maximize Your Data's Potential: Enhancing LLM Accuracy with Two-Phase Pretraining.

[BibT_eX]

[DOI]

CoRR, 2024

RADIO Amplified: Improved Baselines for Agglomerative Vision Foundation Models.

[BibT_eX]

[DOI]

CoRR, 2024

OMCAT: Omni Context Aware Transformer.

[BibT_eX]

[DOI]

CoRR, 2024

Upcycling Large Language Models into Mixture of Experts.

[BibT_eX]

[DOI]

CoRR, 2024

PHI-S: Distribution Balancing for Label-Free Multi-Teacher Distillation.

[BibT_eX]

[DOI]

CoRR, 2024

NVLM: Open Frontier-Class Multimodal LLMs.

[BibT_eX]

[DOI]

CoRR, 2024

Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders.

[BibT_eX]

[DOI]

Subhashree Radhakrishnan

CoRR, 2024

LLM Pruning and Distillation in Practice: The Minitron Approach.

[BibT_eX]

[DOI]

CoRR, 2024

Effective Large Language Model Debugging with Best-first Tree Search.

[BibT_eX]

[DOI]

Jialin Song

Jonathan Raiman

CoRR, 2024

ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities.

[BibT_eX]

[DOI]

CoRR, 2024

Reuse, Don't Retrain: A Recipe for Continued Pretraining of Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

Data, Data Everywhere: A Guide for Pretraining Dataset Construction.

[BibT_eX]

[DOI]

CoRR, 2024

Improving Text-To-Audio Models with Synthetic Captions.

[BibT_eX]

[DOI]

CoRR, 2024

An Empirical Study of Mamba-based Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

Audio Dialogues: Dialogues dataset for audio and music understanding.

[BibT_eX]

[DOI]

CoRR, 2024

Nemotron-4 15B Technical Report.

[BibT_eX]

[DOI]

CoRR, 2024

ChatQA: Building GPT-4 Level Conversational QA Models.

[BibT_eX]

[DOI]

CoRR, 2024

Leveraging Bitstream Metadata for Fast, Accurate, Generalized Compressed Video Quality Enhancement.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Compact Language Models via Pruning and Knowledge Distillation.

[BibT_eX]

[DOI]

Saurav Muralidharan

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

ChatQA: Surpassing GPT-4 on Conversational QA and RAG.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

InstructRetro: Instruction Tuning post Retrieval-Augmented Pretraining.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

ODIN: Disentangled Reward Mitigates Hacking in RLHF.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Retrieval meets Long Context Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Scaling Nvidia's Multi-Speaker Multi-Lingual TTS Systems With Zero-Shot TTS to Indic Languages.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

LLM-Evolve: Evaluation for LLM's Evolving Capability on Benchmarks.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Data, Data Everywhere: A Guide for Pretraining Dataset Construction.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

CircuitVAE: Efficient and Scalable Latent Circuit Optimization.

[BibT_eX]

[DOI]

Proceedings of the 61st ACM/IEEE Design Automation Conference, 2024

2023

Fine Detailed Texture Learning for 3D Meshes With Generative Models.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

Partial Convolution for Padding, Inpainting, and Image Synthesis.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., May, 2023

ChipNeMo: Domain-Adapted LLMs for Chip Design.

[BibT_eX]

[DOI]

CoRR, 2023

RAVEN: In-Context Learning with Retrieval Augmented Encoder-Decoder Language Models.

[BibT_eX]

[DOI]

Kevin Chen-Chuan Chang

CoRR, 2023

Multilingual Multiaccented Multispeaker TTS with RADTTS.

[BibT_eX]

[DOI]

CoRR, 2023

P-Flow: A Fast and Data-Efficient Zero-Shot TTS through Speech Prompting.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Reducing Activation Recomputation in Large Transformer Models.

[BibT_eX]

[DOI]

Vijay Anand Korthikanti

Proceedings of the Sixth Conference on Machine Learning and Systems, 2023

CleanUNet 2: A Hybrid Speech Denoising Model on Waveform and Spectrogram.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

RAD-MMM: Multilingual Multiaccented Multispeaker Text To Speech.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

BigVGAN: A Universal Neural Vocoder with Large-Scale Training.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Preserve Your Own Correlation: A Noise Prior for Video Diffusion Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

GraPhSyM: Graph Physical Synthesis Model.

[BibT_eX]

[DOI]

Proceedings of the IEEE/ACM International Conference on Computer Aided Design, 2023

High-Acoustic Fidelity Text To Speech Synthesis With Fine-Grained Control Of Speech Attributes.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Any-to-Any Voice Conversion with F0 and Timbre Disentanglement and Novel Timbre Conditioning.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Vani: Very-Lightweight Accent-Controllable TTS for Native And Non-Native Speakers With Identity Preservation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Shall We Pretrain Autoregressive Language Models with Retrieval? A Comprehensive Study.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Context Generation Improves Open Domain Question Answering.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

Adding Instructions during Pretraining: Effective way of Controlling Toxicity in Language Models.

[BibT_eX]

[DOI]

Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Language Models: The Most Important Compute Challenge of Our Time (Keynote).

[BibT_eX]

[DOI]

Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

2022

Unsupervised Disentanglement of Pose, Appearance and Background from Images and Videos.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2022

eDiff-I: Text-to-Image Diffusion Models with an Ensemble of Expert Denoisers.

[BibT_eX]

[DOI]

CoRR, 2022

Factuality Enhanced Language Models for Open-Ended Text Generation.

[BibT_eX]

[DOI]

CoRR, 2022

Generative Modeling for Low Dimensional Speech Attributes with Neural Spline Flows.

[BibT_eX]

[DOI]

CoRR, 2022

Leveraging Bitstream Metadata for Fast and Accurate Video Compression Correction.

[BibT_eX]

[DOI]

CoRR, 2022

Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model.

[BibT_eX]

[DOI]

CoRR, 2022

Exploring the Limits of Domain-Adaptive Training for Detoxifying Large-Scale Language Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Factuality Enhanced Language Models for Open-Ended Text Generation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Efficient Token Mixing for Transformers via Adaptive Fourier Neural Operators.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

Speech Denoising in the Waveform Domain With Self-Attention.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

One TTS Alignment to Rule Them All.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Evaluating Parameter Efficient Learning for Generation.

[BibT_eX]

[DOI]

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Multi-Stage Prompting for Knowledgeable Dialogue Generation.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021

Few-shot Instruction Prompts for Pretrained Language Models to Detect Social Biases.

[BibT_eX]

[DOI]

CoRR, 2021

Adaptive Fourier Neural Operators: Efficient Token Mixers for Transformers.

[BibT_eX]

[DOI]

CoRR, 2021

Guiding Global Placement With Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2021

Efficient Large-Scale Language Model Training on GPU Clusters.

[BibT_eX]

[DOI]

CoRR, 2021

Efficient large-scale language model training on GPU clusters using megatron-LM.

[BibT_eX]

[DOI]

Proceedings of the International Conference for High Performance Computing, 2021

Long-Short Transformer: Efficient Transformers for Language and Vision.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Flowtron: an Autoregressive Flow-based Generative Network for Text-to-Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Learning Representations, 2021

DiffWave: A Versatile Diffusion Model for Audio Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Learning Representations, 2021

Dual Contrastive Loss and Attention for GANs.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

PrefixRL: Optimization of Parallel Prefix Circuits using Deep Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the 58th ACM/IEEE Design Automation Conference, 2021

View Generalization for Single Image Textured 3D Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

End-to-End Training of Neural Retrievers for Open-Domain Question Answering.

[BibT_eX]

[DOI]

Devendra Singh Sachan

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020

Accelerating Chip Design With Machine Learning.

[BibT_eX]

[DOI]

Rangharajan Venkatesan

Yanqing Zhang

William J. Dally

IEEE Micro, 2020

Local Knowledge Powered Conversational Agents.

[BibT_eX]

[DOI]

CoRR, 2020

Transposer: Universal Texture Synthesis Using Feature Maps as Transposed Convolution Filter.

[BibT_eX]

[DOI]

CoRR, 2020

Hierarchical Multi-Scale Attention for Semantic Segmentation.

[BibT_eX]

[DOI]

Andrew Tao

Karan Sapra

CoRR, 2020

Neural FFTs for Universal Texture Image Synthesis.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Can Q-Learning with Graph Networks Learn a Generalizable Branching Heuristic for a SAT Solver?

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Mellotron: Multispeaker Expressive Voice Synthesis by Conditioning on Rhythm, Pitch and Global Style Tokens.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

MEGATRON-CNTRL: Controllable Story Generation with External Knowledge Using Large-Scale Language Models.

[BibT_eX]

[DOI]

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Training Question Answering Models From Synthetic Data.

[BibT_eX]

[DOI]

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Panoptic-Based Image Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Large Scale Multi-Actor Generative Dialog Modeling.

[BibT_eX]

[DOI]

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019

Neural ODEs for Image Segmentation with Level Sets.

[BibT_eX]

[DOI]

CoRR, 2019

Zero-shot Text Classification With Generative Language Models.

[BibT_eX]

[DOI]

Raul Puri

Dimitris S. Papailiopoulos

CoRR, 2019

Improving SAT Solver Heuristics with Graph Networks and Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2019

Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism.

[BibT_eX]

[DOI]

CoRR, 2019

Video Interpolation and Prediction with Unsupervised Landmarks.

[BibT_eX]

[DOI]

CoRR, 2019

SysML: The New Frontier of Machine Learning Systems.

[BibT_eX]

[DOI]

Alexandros G. Dimakis

Anastasios Kyrillidis

Shivaram Venkataraman

CoRR, 2019

Graphical Contrastive Losses for Scene Graph Generation.

[BibT_eX]

[DOI]

CoRR, 2019

CongestionNet: Routing Congestion Prediction Using Deep Graph Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 27th IFIP/IEEE International Conference on Very Large Scale Integration, 2019

Few-shot Video-to-Video Synthesis.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Unsupervised Video Interpolation Using Cycle Consistency.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Waveglow: A Flow-based Generative Network for Speech Synthesis.

[BibT_eX]

[DOI]

Ryan Prenger

Rafael Valle

Proceedings of the IEEE International Conference on Acoustics, 2019

Graphical Contrastive Losses for Scene Graph Parsing.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Improving Semantic Segmentation via Video Propagation and Label Relaxation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

Practical Text Classification With Large Pre-Trained Language Models.

[BibT_eX]

[DOI]

CoRR, 2018

Partial Convolution based Padding.

[BibT_eX]

[DOI]

CoRR, 2018

An Interpretable Model for Scene Graph Generation.

[BibT_eX]

[DOI]

CoRR, 2018

SDCNet: Video Prediction Using Spatially-Displaced Convolution.

[BibT_eX]

[DOI]

CoRR, 2018

Introduction to the 1st Place Winning Model of OpenImages Relationship Detection Challenge.

[BibT_eX]

[DOI]

CoRR, 2018

Video-to-Video Synthesis.

[BibT_eX]

[DOI]

CoRR, 2018

Large Scale Language Modeling: Converging on 40GB of Text in Four Hours.

[BibT_eX]

[DOI]

Proceedings of the 30th International Symposium on Computer Architecture and High Performance Computing, 2018

Video-to-Video Synthesis.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

SDC-Net: Video Prediction Using Spatially-Displaced Convolution.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

Image Inpainting for Irregular Holes Using Partial Convolutions.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

High-Resolution Image Synthesis and Semantic Manipulation With Conditional GANs.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Malware Detection by Eating a Whole EXE.

[BibT_eX]

[DOI]

Proceedings of the Workshops of the The Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

DSD: Dense-Sparse-Dense Training for Deep Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 5th International Conference on Learning Representations, 2017

2016

DSD: Regularizing Deep Neural Networks with Dense-Sparse-Dense Training Flow.

[BibT_eX]

[DOI]

CoRR, 2016

Persistent RNNs: Stashing Recurrent Weights On-Chip.

[BibT_eX]

[DOI]

Proceedings of the 33nd International Conference on Machine Learning, 2016

Deep Speech 2 : End-to-End Speech Recognition in English and Mandarin.

[BibT_eX]

[DOI]

Proceedings of the 33nd International Conference on Machine Learning, 2016

2015

Deep Speech 2: End-to-End Speech Recognition in English and Mandarin.

[BibT_eX]

[DOI]

CoRR, 2015

A collection-oriented programming model for performance portability.

[BibT_eX]

[DOI]

Proceedings of the 20th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2015

2014

Deep Speech: Scaling up end-to-end speech recognition.

[BibT_eX]

[DOI]

CoRR, 2014

cuDNN: Efficient Primitives for Deep Learning.

[BibT_eX]

[DOI]

Sharan Chetlur

Cliff Woolley

Philippe Vandermersch

CoRR, 2014

A decomposition for in-place matrix transposition.

[BibT_eX]

[DOI]

Bryan Christopher Catanzaro

Alexander Keller

Michael Garland

Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2014

Nitro: A Framework for Adaptive Code Variant Tuning.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE 28th International Parallel and Distributed Processing Symposium, 2014

2013

GPU Scripting and Code Generation with PyCUDA

[BibT_eX]

[DOI]

CoRR, 2013

Deep learning with COTS HPC systems.

[BibT_eX]

[DOI]

Proceedings of the 30th International Conference on Machine Learning, 2013

2012

PyCUDA and PyOpenCL: A scripting-based approach to GPU run-time code generation.

[BibT_eX]

[DOI]

Parallel Comput., 2012

2011

Compilation Techniques for Embedded Data Parallel Languages.

[BibT_eX]

[DOI]

PhD thesis, 2011

Copperhead: compiling an embedded data parallel language.

[BibT_eX]

[DOI]

Michael Garland

Proceedings of the 16th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2011

Considerations When Evaluating Microprocessor Platforms.

[BibT_eX]

[DOI]

Proceedings of the 3rd USENIX Workshop on Hot Topics in Parallelism, 2011

PALLAS: Mapping Applications onto Manycore.

[BibT_eX]

[DOI]

Proceedings of the Multiprocessor System-on-Chip - Hardware Design and Tool Integration., 2011

2010

Ubiquitous Parallel Computing from Berkeley, Illinois, and Stanford.

[BibT_eX]

[DOI]

IEEE Micro, 2010

Parallel computing with patterns and frameworks.

[BibT_eX]

[DOI]

XRDS, 2010

2009

PyCUDA: GPU Run-Time Code Generation for High-Performance Computing

[BibT_eX]

[DOI]

CoRR, 2009

Efficient, high-quality image contour detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

2008

Fast support vector machine training and classification on graphics processors.

[BibT_eX]

[DOI]

Narayanan Sundaram

Proceedings of the Machine Learning, 2008

Parallelizing CAD: a timely research agenda for EDA.

[BibT_eX]

[DOI]

Bor-Yiing Su

Proceedings of the 45th Design Automation Conference, 2008

2007

Efficient Parallelization of H.264 Decoding with Macro Block Level Scheduling.

[BibT_eX]

[DOI]

Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

2005

Choice of base revisited: higher radices for FPGA-based floating-point computation (abstract only).

[BibT_eX]

[DOI]

Brent E. Nelson

Proceedings of the ACM/SIGDA 13th International Symposium on Field Programmable Gate Arrays, 2005

Higher Radix Floating-Point Representations for FPGA-Based Arithmetic.

[BibT_eX]

[DOI]