Jason Mars

Orcid: 0000-0002-7029-5292

According to our database, Jason Mars authored at least 69 papers between 2009 and 2024.

Bibliography

2024
One Agent Too Many: User Perspectives on Approaches to Multi-agent Conversational AI.
CoRR, 2024

2023
Scaling Down to Scale Up: A Cost-Benefit Analysis of Replacing OpenAI's GPT-4 with Self-Hosted Open Source SLMs in Production.
CoRR, 2023

The Jaseci Programming Paradigm and Runtime Stack: Building Scale-Out Production Applications Easy and Fast.
IEEE Comput. Archit. Lett., 2023

Rule By Example: Harnessing Logical Rules for Explainable Hate Speech Detection.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Label Agnostic Pre-training for Zero-shot Text Classification.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
The Case for a Wholistic Serverless Programming Paradigm and Full Stack Automation for AI and Beyond - The Philosophy of Jaseci and Jac.
CoRR, 2022

Towards Personalized Intelligence at Scale.
CoRR, 2022

One Agent To Rule Them All: Towards Multi-agent Conversational AI.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2020
A Benchmarking Framework for Interactive 3D Applications in the Cloud.
Proceedings of the 53rd Annual IEEE/ACM International Symposium on Microarchitecture, 2020

2019
Caliper: Interference Estimator for Multi-tenant Environments Sharing Architectural Resources.
ACM Trans. Archit. Code Optim., 2019

Outlier Detection for Improved Data Quality and Diversity in Dialog Systems.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Understanding the Impact of Socket Density in Density Optimized Servers.
Proceedings of the 25th IEEE International Symposium on High Performance Computer Architecture, 2019

GrandSLAm: Guaranteeing SLAs for Jobs in Microservices Execution Frameworks.
Proceedings of the Fourteenth EuroSys Conference 2019, Dresden, Germany, March 25-28, 2019, 2019

An Evaluation Dataset for Intent Classification and Out-of-Scope Prediction.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

2018
Rethinking Numerical Representations for Deep Neural Networks.
CoRR, 2018

Adasa: A Conversational In-Vehicle Digital Assistant for Advanced Driver Assistance Features.
Proceedings of the 31st Annual ACM Symposium on User Interface Software and Technology, 2018

Data Collection for Dialogue System: A Startup Perspective.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Proctor: Detecting and Investigating Interference in Shared Datacenters.
Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2018

Virtual Melting Temperature: Managing Server Load to Minimize Cooling Overhead with Phase Change Materials.
Proceedings of the 45th ACM/IEEE Annual International Symposium on Computer Architecture, 2018

Gist: Efficient Data Encoding for Deep Neural Network Training.
Proceedings of the 45th ACM/IEEE Annual International Symposium on Computer Architecture, 2018

The Architectural Implications of Autonomous Driving: Constraints and Acceleration.
Proceedings of the Twenty-Third International Conference on Architectural Support for Programming Languages and Operating Systems, 2018

SmoothOperator: Reducing Power Fragmentation and Improving Power Utilization in Large-scale Datacenters.
Proceedings of the Twenty-Third International Conference on Architectural Support for Programming Languages and Operating Systems, 2018

Architectural support for convolutional neural networks on modern CPUs.
Proceedings of the 27th International Conference on Parallel Architectures and Compilation Techniques, 2018

2017
Reining in Long Tails in Warehouse-Scale Computers with Quick Voltage Boosting Using Adrenaline.
ACM Trans. Comput. Syst., 2017

Thermal Time Shifting: Decreasing Data Center Cooling Costs with Phase-Change Materials.
IEEE Internet Comput., 2017

DeftNN: addressing bottlenecks for DNN execution on GPUs via synapse vector elimination and near-compute data fission.
Proceedings of the 50th Annual IEEE/ACM International Symposium on Microarchitecture, 2017

PowerChief: Intelligent Power Allocation for Multi-Stage Applications to Improve Responsiveness on Power Constrained CMP.
Proceedings of the 44th Annual International Symposium on Computer Architecture, 2017

Enhancing human capabilities.
Proceedings of the Workshop on Trends in Machine-Learning (and impact on computer architecture), 2017

Neurosurgeon: Collaborative Intelligence Between the Cloud and Mobile Edge.
Proceedings of the Twenty-Second International Conference on Architectural Support for Programming Languages and Operating Systems, 2017

Prophet: Precise QoS Prediction on Non-Preemptive Accelerators to Improve Utilization in Warehouse-Scale Computers.
Proceedings of the Twenty-Second International Conference on Architectural Support for Programming Languages and Operating Systems, 2017

2016
Designing Future Warehouse-Scale Computers for Sirius, an End-to-End Voice and Vision Personal Assistant.
ACM Trans. Comput. Syst., 2016

Sirius Implications for Future Warehouse-Scale Computers.
IEEE Micro, 2016

Input responsiveness: using canary inputs to dynamically steer approximation.
Proceedings of the 37th ACM SIGPLAN Conference on Programming Language Design and Implementation, 2016

CrystalBall: Statically analyzing runtime behavior via deep sequence learning.
Proceedings of the 49th Annual IEEE/ACM International Symposium on Microarchitecture, 2016

Continuous shape shifting: Enabling loop co-optimization via near-free dynamic code rewriting.
Proceedings of the 49th Annual IEEE/ACM International Symposium on Microarchitecture, 2016

Concise loads and stores: The case for an asymmetric compute-memory architecture for approximation.
Proceedings of the 49th Annual IEEE/ACM International Symposium on Microarchitecture, 2016

Treadmill: Attributing the Source of Tail Latency through Precise Load Testing and Statistical Inference.
Proceedings of the 43rd ACM/IEEE Annual International Symposium on Computer Architecture, 2016

PowerChop: Identifying and Managing Non-critical Units in Hybrid Processor Architectures.
Proceedings of the 43rd ACM/IEEE Annual International Symposium on Computer Architecture, 2016

Baymax: QoS Awareness and Increased Utilization for Non-Preemptive Accelerators in Warehouse Scale Computers.
Proceedings of the Twenty-First International Conference on Architectural Support for Programming Languages and Operating Systems, 2016

2015
Thermal time shifting: leveraging phase change materials to reduce cooling costs in warehouse-scale computers.
Proceedings of the 42nd Annual International Symposium on Computer Architecture, 2015

DjiNN and Tonic: DNN as a service and its implications for future warehouse scale computers.
Proceedings of the 42nd Annual International Symposium on Computer Architecture, 2015

Octopus-Man: QoS-driven task management for heterogeneous multicores in warehouse-scale computers.
Proceedings of the 21st IEEE International Symposium on High Performance Computer Architecture, 2015

Adrenaline: Pinpointing and reining in tail queries with quick voltage boosting.
Proceedings of the 21st IEEE International Symposium on High Performance Computer Architecture, 2015

Sirius: An Open End-to-End Voice and Vision Personal Assistant and Its Implications for Future Warehouse Scale Computers.
Proceedings of the Twentieth International Conference on Architectural Support for Programming Languages and Operating Systems, 2015

AREP: Adaptive Resource Efficient Prefetching for Maximizing Multicore Performance.
Proceedings of the 2015 International Conference on Parallel Architectures and Compilation, 2015

2014
Enabling fair pricing on high performance computer systems with node sharing.
Sci. Program., 2014

HaPPy: Hyperthread-aware Power Profiling Dynamically.
Proceedings of the 2014 USENIX Annual Technical Conference, 2014

SMiTe: Precise QoS Prediction on Real-System SMT Processors to Improve Utilization in Warehouse Scale Computers.
Proceedings of the 47th Annual IEEE/ACM International Symposium on Microarchitecture, 2014

Protean Code: Achieving Near-Free Online Code Transformations for Warehouse Scale Computers.
Proceedings of the 47th Annual IEEE/ACM International Symposium on Microarchitecture, 2014

2013
Understanding Application Contentiousness and Sensitivity on Modern Multicores.
Adv. Comput., 2013

Enabling fair pricing on HPC systems with node sharing.
Proceedings of the International Conference for High Performance Computing, 2013

Bubble-flux: precise online QoS management for increased utilization in warehouse scale computers.
Proceedings of the 40th Annual International Symposium on Computer Architecture, 2013

Whare-map: heterogeneity in "homogeneous" warehouse-scale computers.
Proceedings of the 40th Annual International Symposium on Computer Architecture, 2013

Optimizing Google's warehouse scale computers: The NUMA experience.
Proceedings of the 19th IEEE International Symposium on High Performance Computer Architecture, 2013

ReQoS: reactive static/dynamic compilation for QoS in warehouse scale computers.
Proceedings of the Architectural Support for Programming Languages and Operating Systems, 2013

2012
Increasing Utilization in Modern Warehouse-Scale Computers Using Bubble-Up.
IEEE Micro, 2012

THeME: a system for testing by hardware monitoring events.
Proceedings of the International Symposium on Software Testing and Analysis, 2012

Performance analysis of thread mappings with a holistic view of the hardware resources.
Proceedings of the 2012 IEEE International Symposium on Performance Analysis of Systems & Software, 2012

BlockChop: Dynamic squash elimination for hybrid processor architecture.
Proceedings of the 39th International Symposium on Computer Architecture (ISCA 2012), 2012

Compiling for niceness: mitigating contention for QoS in warehouse scale computers.
Proceedings of the 10th Annual IEEE/ACM International Symposium on Code Generation and Optimization, 2012

2011
Evaluating indirect branch handling mechanisms in software dynamic translation systems.
ACM Trans. Archit. Code Optim., 2011

Heterogeneity in "Homogeneous" Warehouse-Scale Computers: A Performance Opportunity.
IEEE Comput. Archit. Lett., 2011

Bubble-Up: increasing utilization in modern warehouse scale computers via sensible co-locations.
Proceedings of the 44th Annual IEEE/ACM International Symposium on Microarchitecture, 2011

The impact of memory subsystem resource sharing on datacenter applications.
Proceedings of the 38th International Symposium on Computer Architecture (ISCA 2011), 2011

Exploiting hardware advances for software testing and debugging.
Proceedings of the 33rd International Conference on Software Engineering, 2011

Directly characterizing cross core interference through contention synthesis.
Proceedings of the High Performance Embedded Architectures and Compilers, 2011

2010
Contention aware execution: online contention detection and response.
Proceedings of the CGO 2010, 2010

2009
A cross-layer approach to heterogeneity and reliability.
Proceedings of the 7th ACM/IEEE International Conference on Formal Methods and Models for Codesign (MEMOCODE 2009), 2009

Scenario Based Optimization: A Framework for Statically Enabling Online Optimizations.
Proceedings of the CGO 2009, 2009

