Lingjia Tang

Orcid: 0000-0002-5609-7775

According to our database1, Lingjia Tang authored at least 60 papers between 2007 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
One Agent Too Many: User Perspectives on Approaches to Multi-agent Conversational AI.
CoRR, 2024

2023
Scaling Down to Scale Up: A Cost-Benefit Analysis of Replacing OpenAI's GPT-4 with Self-Hosted Open Source SLMs in Production.
CoRR, 2023

The Jaseci Programming Paradigm and Runtime Stack: Building Scale-Out Production Applications Easy and Fast.
IEEE Comput. Archit. Lett., 2023

Label Agnostic Pre-training for Zero-shot Text Classification.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Towards Personalized Intelligence at Scale.
CoRR, 2022

One Agent To Rule Them All: Towards Multi-agent Conversational AI.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2020
A Benchmarking Framework for Interactive 3D Applications in the Cloud.
Proceedings of the 53rd Annual IEEE/ACM International Symposium on Microarchitecture, 2020

2019
Caliper: Interference Estimator for Multi-tenant Environments Sharing Architectural Resources.
ACM Trans. Archit. Code Optim., 2019

Outlier Detection for Improved Data Quality and Diversity in Dialog Systems.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Understanding the Impact of Socket Density in Density Optimized Servers.
Proceedings of the 25th IEEE International Symposium on High Performance Computer Architecture, 2019

GrandSLAm: Guaranteeing SLAs for Jobs in Microservices Execution Frameworks.
Proceedings of the Fourteenth EuroSys Conference 2019, Dresden, Germany, March 25-28, 2019, 2019

An Evaluation Dataset for Intent Classification and Out-of-Scope Prediction.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

2018
Rethinking Numerical Representations for Deep Neural Networks.
CoRR, 2018

Adasa: A Conversational In-Vehicle Digital Assistant for Advanced Driver Assistance Features.
Proceedings of the 31st Annual ACM Symposium on User Interface Software and Technology, 2018

Data Collection for Dialogue System: A Startup Perspective.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Proctor: Detecting and Investigating Interference in Shared Datacenters.
Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2018

Virtual Melting Temperature: Managing Server Load to Minimize Cooling Overhead with Phase Change Materials.
Proceedings of the 45th ACM/IEEE Annual International Symposium on Computer Architecture, 2018

Gist: Efficient Data Encoding for Deep Neural Network Training.
Proceedings of the 45th ACM/IEEE Annual International Symposium on Computer Architecture, 2018

The Architectural Implications of Autonomous Driving: Constraints and Acceleration.
Proceedings of the Twenty-Third International Conference on Architectural Support for Programming Languages and Operating Systems, 2018

SmoothOperator: Reducing Power Fragmentation and Improving Power Utilization in Large-scale Datacenters.
Proceedings of the Twenty-Third International Conference on Architectural Support for Programming Languages and Operating Systems, 2018

Architectural support for convolutional neural networks on modern CPUs.
Proceedings of the 27th International Conference on Parallel Architectures and Compilation Techniques, 2018

2017
Reining in Long Tails in Warehouse-Scale Computers with Quick Voltage Boosting Using Adrenaline.
ACM Trans. Comput. Syst., 2017

Thermal Time Shifting: Decreasing Data Center Cooling Costs with Phase-Change Materials.
IEEE Internet Comput., 2017

DeftNN: addressing bottlenecks for DNN execution on GPUs via synapse vector elimination and near-compute data fission.
Proceedings of the 50th Annual IEEE/ACM International Symposium on Microarchitecture, 2017

PowerChief: Intelligent Power Allocation for Multi-Stage Applications to Improve Responsiveness on Power Constrained CMP.
Proceedings of the 44th Annual International Symposium on Computer Architecture, 2017

Neurosurgeon: Collaborative Intelligence Between the Cloud and Mobile Edge.
Proceedings of the Twenty-Second International Conference on Architectural Support for Programming Languages and Operating Systems, 2017

Prophet: Precise QoS Prediction on Non-Preemptive Accelerators to Improve Utilization in Warehouse-Scale Computers.
Proceedings of the Twenty-Second International Conference on Architectural Support for Programming Languages and Operating Systems, 2017

2016
Designing Future Warehouse-Scale Computers for Sirius, an End-to-End Voice and Vision Personal Assistant.
ACM Trans. Comput. Syst., 2016

Sirius Implications for Future Warehouse-Scale Computers.
IEEE Micro, 2016

Input responsiveness: using canary inputs to dynamically steer approximation.
Proceedings of the 37th ACM SIGPLAN Conference on Programming Language Design and Implementation, 2016

CrystalBall: Statically analyzing runtime behavior via deep sequence learning.
Proceedings of the 49th Annual IEEE/ACM International Symposium on Microarchitecture, 2016

Continuous shape shifting: Enabling loop co-optimization via near-free dynamic code rewriting.
Proceedings of the 49th Annual IEEE/ACM International Symposium on Microarchitecture, 2016

Concise loads and stores: The case for an asymmetric compute-memory architecture for approximation.
Proceedings of the 49th Annual IEEE/ACM International Symposium on Microarchitecture, 2016

Treadmill: Attributing the Source of Tail Latency through Precise Load Testing and Statistical Inference.
Proceedings of the 43rd ACM/IEEE Annual International Symposium on Computer Architecture, 2016

PowerChop: Identifying and Managing Non-critical Units in Hybrid Processor Architectures.
Proceedings of the 43rd ACM/IEEE Annual International Symposium on Computer Architecture, 2016

Baymax: QoS Awareness and Increased Utilization for Non-Preemptive Accelerators in Warehouse Scale Computers.
Proceedings of the Twenty-First International Conference on Architectural Support for Programming Languages and Operating Systems, 2016

2015
Thermal time shifting: leveraging phase change materials to reduce cooling costs in warehouse-scale computers.
Proceedings of the 42nd Annual International Symposium on Computer Architecture, 2015

DjiNN and Tonic: DNN as a service and its implications for future warehouse scale computers.
Proceedings of the 42nd Annual International Symposium on Computer Architecture, 2015

Octopus-Man: QoS-driven task management for heterogeneous multicores in warehouse-scale computers.
Proceedings of the 21st IEEE International Symposium on High Performance Computer Architecture, 2015

Adrenaline: Pinpointing and reining in tail queries with quick voltage boosting.
Proceedings of the 21st IEEE International Symposium on High Performance Computer Architecture, 2015

Sirius: An Open End-to-End Voice and Vision Personal Assistant and Its Implications for Future Warehouse Scale Computers.
Proceedings of the Twentieth International Conference on Architectural Support for Programming Languages and Operating Systems, 2015

2014
Enabling fair pricing on high performance computer systems with node sharing.
Sci. Program., 2014

HaPPy: Hyperthread-aware Power Profiling Dynamically.
Proceedings of the 2014 USENIX Annual Technical Conference, 2014

SMiTe: Precise QoS Prediction on Real-System SMT Processors to Improve Utilization in Warehouse Scale Computers.
Proceedings of the 47th Annual IEEE/ACM International Symposium on Microarchitecture, 2014

Protean Code: Achieving Near-Free Online Code Transformations for Warehouse Scale Computers.
Proceedings of the 47th Annual IEEE/ACM International Symposium on Microarchitecture, 2014

2013
Understanding Application Contentiousness and Sensitivity on Modern Multicores.
Adv. Comput., 2013

Enabling fair pricing on HPC systems with node sharing.
Proceedings of the International Conference for High Performance Computing, 2013

Bubble-flux: precise online QoS management for increased utilization in warehouse scale computers.
Proceedings of the 40th Annual International Symposium on Computer Architecture, 2013

Whare-map: heterogeneity in "homogeneous" warehouse-scale computers.
Proceedings of the 40th Annual International Symposium on Computer Architecture, 2013

Optimizing Google's warehouse scale computers: The NUMA experience.
Proceedings of the 19th IEEE International Symposium on High Performance Computer Architecture, 2013

ReQoS: reactive static/dynamic compilation for QoS in warehouse scale computers.
Proceedings of the Architectural Support for Programming Languages and Operating Systems, 2013

2012
Increasing Utilization in Modern Warehouse-Scale Computers Using Bubble-Up.
IEEE Micro, 2012

Performance analysis of thread mappings with a holistic view of the hardware resources.
Proceedings of the 2012 IEEE International Symposium on Performance Analysis of Systems & Software, 2012

Compiling for niceness: mitigating contention for QoS in warehouse scale computers.
Proceedings of the 10th Annual IEEE/ACM International Symposium on Code Generation and Optimization, 2012

2011
Heterogeneity in "Homogeneous" Warehouse-Scale Computers: A Performance Opportunity.
IEEE Comput. Archit. Lett., 2011

Bubble-Up: increasing utilization in modern warehouse scale computers via sensible co-locations.
Proceedings of the 44rd Annual IEEE/ACM International Symposium on Microarchitecture, 2011

The impact of memory subsystem resource sharing on datacenter applications.
Proceedings of the 38th International Symposium on Computer Architecture (ISCA 2011), 2011

Directly characterizing cross core interference through contention synthesis.
Proceedings of the High Performance Embedded Architectures and Compilers, 2011

2007
Agile optimization for coercion.
Proceedings of the Winter Simulation Conference, 2007

Explanation Exploration: Exploring Emergent Behavior.
Proceedings of the 21st International Workshop on Principles of Advanced and Distributed Simulation, 2007


  Loading...