Udit Gupta

ORCID: 0000-0002-9118-0961

Affiliations:
  • Cornell University, USA


According to our database, Udit Gupta authored at least 46 papers between 2018 and 2025.

Bibliography

2025
GreenScale: Carbon Optimization for Edge Computing.
IEEE Internet Things J., August, 2025

Empowering Users to Make Sustainability-Forward Decisions for Computing Services.
Commun. ACM, July, 2025

Accelerating Diffusion Language Model Inference via Efficient KV Caching and Guided Diffusion.
CoRR, May, 2025

EcoServe: Designing Carbon-Aware AI Inference Systems.
CoRR, February, 2025

A view of the sustainable computing landscape.
Patterns, 2025

Slower is Greener: Acceptance of Eco-feedback Interventions on Carbon Heavy Internet Services.
ACM J. Comput. Sustain. Soc., 2025

Hermes: Algorithm-System Co-design for Efficient Retrieval-Augmented Generation At-Scale.
Proceedings of the 52nd Annual International Symposium on Computer Architecture, 2025

Fair-CO2: Fair Attribution for Cloud Carbon Emissions.
Proceedings of the 52nd Annual International Symposium on Computer Architecture, 2025

CORDOBA: Carbon-Efficient Optimization Framework for Computing Systems.
Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2025

Silicon Foraging: Harvesting Excess Compute for Sustainable Edge Computing.
Proceedings of the 2025 ACM Designing Interactive Systems Conference, 2025

2024
Towards Understanding Systems Trade-offs in Retrieval-Augmented Generation Model Inference.
CoRR, 2024

Carbon Connect: An Ecosystem for Sustainable Computing.
CoRR, 2024

Information Flow Control in Machine Learning through Modular Model Architecture.
Proceedings of the 33rd USENIX Security Symposium, 2024

3D IC Architecture Evaluation and Optimization with Digital Compute-in-Memory Designs.
Proceedings of the 29th ACM/IEEE International Symposium on Low Power Electronics and Design, 2024

GPU-based Private Information Retrieval for On-Device Machine Learning Inference.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

2023
Architectural CO2 Footprint Tool: Designing Sustainable Computer Systems With an Architectural Carbon Modeling Tool.
IEEE Micro, 2023

Design Space Exploration and Optimization for Carbon-Efficient Extended Reality Systems.
CoRR, 2023

GreenScale: Carbon-Aware Systems for Edge Computing.
CoRR, 2023

Peeling Back the Carbon Curtain: Carbon Optimization Challenges in Cloud Computing.
Proceedings of the 2nd Workshop on Sustainable Computer Systems, 2023

Carbon-Efficient Design Optimization for Computing Systems.
Proceedings of the 2nd Workshop on Sustainable Computer Systems, 2023

MP-Rec: Hardware-Software Co-design to Enable Multi-path Recommendation.
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

Carbon Explorer: A Holistic Framework for Designing Carbon Aware Datacenters.
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

2022
A Holistic Approach for Designing Carbon Aware Datacenters.
CoRR, 2022

ACT: designing sustainable computer systems with an architectural carbon modeling tool.
Proceedings of the ISCA '22: The 49th Annual International Symposium on Computer Architecture, New York, New York, USA, June 18, 2022

Hercules: Heterogeneity-Aware Inference Serving for At-Scale Personalized Recommendation.
Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2022

2021
Sustainable AI: Environmental Implications, Challenges and Opportunities.
CoRR, 2021

RecPipe: Co-designing Models and Hardware to Jointly Optimize Recommendation Quality and Performance.
Proceedings of the MICRO '21: 54th Annual IEEE/ACM International Symposium on Microarchitecture, 2021

Chasing Carbon: The Elusive Environmental Footprint of Computing.
Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2021

RecSSD: near data processing for solid state drive based recommendation inference.
Proceedings of the ASPLOS '21: 26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2021

2020
RecNMP: Accelerating Personalized Recommendation with Near-Memory Processing.
Proceedings of the 47th ACM/IEEE Annual International Symposium on Computer Architecture, 2020

DeepRecSys: A System for Optimizing End-To-End At-Scale Neural Recommendation Inference.
Proceedings of the 47th ACM/IEEE Annual International Symposium on Computer Architecture, 2020

Cross-Stack Workload Characterization of Deep Recommendation Systems.
Proceedings of the IEEE International Symposium on Workload Characterization, 2020

The Architectural Implications of Facebook's DNN-Based Personalized Recommendation.
Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2020

Emerging Neural Workloads and Their Impact on Hardware.
Proceedings of the 2020 Design, Automation & Test in Europe Conference & Exhibition, 2020

2019
MLPerf Training Benchmark.
CoRR, 2019

The Architectural Implications of Facebook's DNN-based Personalized Recommendation.
CoRR, 2019

Deep Learning Recommendation Model for Personalization and Recommendation Systems.
CoRR, 2019

A 16nm 25mm² SoC with a 54.5x Flexibility-Efficiency Range from Dual-Core Arm Cortex-A53 to eFPGA and Cache-Coherent Accelerators.
Proceedings of the 2019 Symposium on VLSI Circuits, Kyoto, Japan, June 9-14, 2019

CHAMPVis: Comparative Hierarchical Analysis of Microarchitectural Performance.
Proceedings of the IEEE/ACM International Workshop on Programming and Performance Visualization Tools, 2019

MaxNVM: Maximizing DNN Storage Density and Inference Efficiency with Sparse Encoding and Error Mitigation.
Proceedings of the 52nd Annual IEEE/ACM International Symposium on Microarchitecture, 2019

MASR: A Modular Accelerator for Sparse RNNs.
Proceedings of the 28th International Conference on Parallel Architectures and Compilation Techniques, 2019

2018
Weightless: Lossy weight encoding for deep neural network compression.
Proceedings of the 6th International Conference on Learning Representations, 2018

Ares: a framework for quantifying the resilience of deep neural networks.
Proceedings of the 55th Annual Design Automation Conference, 2018

On-chip deep neural network storage with multi-level eNVM.
Proceedings of the 55th Annual Design Automation Conference, 2018

