Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

AgentGen: Enhancing Planning Abilities for Large Language Model based Agent via Environment and Task Generation.

[BibT_eX]

[DOI]

Mengkang Hu

Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.1, 2025

Navigating the Unknown: A Chat-Based Collaborative Interface for Personalized Exploratory Tasks.

[BibT_eX]

[DOI]

Proceedings of the 30th International Conference on Intelligent User Interfaces, 2025

Self-Evolved Reward Learning for LLMS.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

RuAG: Learned-rule-augmented Generation for Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

AllHands :Ask Me Anything on Large-scale Verbatim Feedback via Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the 41st IEEE International Conference on Data Engineering, 2025

Performance Aware LLM Load Balancer for Mixed Workloads.

[BibT_eX]

[DOI]

Proceedings of the 5th Workshop on Machine Learning and Systems, 2025

Reason-before-Retrieve: One-Stage Reflective Chain-of-Thoughts for Training-Free Zero-Shot Composed Image Retrieval.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Coach: Exploiting Temporal Patterns for All-Resource Oversubscription in Cloud Platforms.

[BibT_eX]

[DOI]

Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2025

DI-BENCH: Benchmarking Large Language Models on Dependency Inference with Testable Repositories at Scale.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2025

TACO-RL: Task Aware Prompt Compression Optimization with Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2025

AXIS: Efficient Human-Agent-Computer Interaction with API-First LLM-Based Agents.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Synergistic Weak-Strong Collaboration by Aligning Preferences.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

CARMO: Dynamic Criteria Generation for Context Aware Reward Modelling.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2025

WarriorCoder: Learning from Expert Battles to Augment Code Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

REFA: Reference Free Alignment for multi-preference optimization.

[BibT_eX]

[DOI]

CoRR, 2024

Reason-before-Retrieve: One-Stage Reflective Chain-of-Thoughts for Training-Free Zero-Shot Composed Image Retrieval.

[BibT_eX]

[DOI]

CoRR, 2024

Large Action Models: From Inception to Implementation.

[BibT_eX]

[DOI]

CoRR, 2024

TurboAttention: Efficient Attention Approximation For High Throughputs LLMs.

[BibT_eX]

[DOI]

CoRR, 2024

SWEPO: Simultaneous Weighted Preference Optimization for Group Contrastive Alignment.

[BibT_eX]

[DOI]

CoRR, 2024

Ensuring Fair LLM Serving Amid Diverse Applications.

[BibT_eX]

[DOI]

Redwan Ibne Seraj Khan

CoRR, 2024

Sharingan: Extract User Action Sequence from Desktop Recordings.

[BibT_eX]

[DOI]

CoRR, 2024

Token-level Proximal Policy Optimization for Query Generation.

[BibT_eX]

[DOI]

CoRR, 2024

Unveiling Context-Aware Criteria in Self-Assessing LLMs.

[BibT_eX]

[DOI]

CoRR, 2024

AI Delegates with a Dual Focus: Ensuring Privacy and Strategic Self-Disclosure.

[BibT_eX]

[DOI]

CoRR, 2024

Turn Every Application into an Agent: Towards Efficient Human-Agent-Computer Interaction with API-First LLM-Based Agents.

[BibT_eX]

[DOI]

CoRR, 2024

Intelligent Router for LLM Workloads: Improving Performance Through Workload-Aware Scheduling.

[BibT_eX]

[DOI]

CoRR, 2024

AgentGen: Enhancing Planning Abilities for Large Language Model based Agent via Environment and Task Generation.

[BibT_eX]

[DOI]

CoRR, 2024

The Vision of Autonomic Computing: Can LLMs Make It a Reality?

[BibT_eX]

[DOI]

CoRR, 2024

Thread: A Logic-Based Data Organization Paradigm for How-To Question Answering with Retrieval Augmented Generation.

[BibT_eX]

[DOI]

CoRR, 2024

An Advanced Reinforcement Learning Framework for Online Scheduling of Deferrable Workloads in Cloud Computing.

[BibT_eX]

[DOI]

CoRR, 2024

Large Language Models can Deliver Accurate and Interpretable Time Series Anomaly Detection.

[BibT_eX]

[DOI]

CoRR, 2024

Lean Attention: Hardware-Aware Scalable Attention Mechanism for the Decode-Phase of Transformers.

[BibT_eX]

[DOI]

CoRR, 2024

Workload Intelligence: Punching Holes Through the Cloud Abstraction.

[BibT_eX]

[DOI]

CoRR, 2024

Nissist: An Incident Mitigation Copilot based on Troubleshooting Guides.

[BibT_eX]

[DOI]

CoRR, 2024

Why does Prediction Accuracy Decrease over Time? Uncertain Positive Learning for Cloud Failure Prediction.

[BibT_eX]

[DOI]

CoRR, 2024

Contrastive Learning with Negative Sampling Correction.

[BibT_eX]

[DOI]

CoRR, 2024

COIN: Chance-Constrained Imitation Learning for Uncertainty-aware Adaptive Resource Oversubscription Policy.

[BibT_eX]

[DOI]

CoRR, 2024

Risk-aware Adaptive Virtual CPU Oversubscription in Microsoft Cloud via Prototypical Human-in-the-loop Imitation Learning.

[BibT_eX]

[DOI]

CoRR, 2024

Revisiting VAE for Unsupervised Time Series Anomaly Detection: A Frequency Perspective.

[BibT_eX]

[DOI]

Proceedings of the ACM on Web Conference 2024, 2024

Dependency Aware Incident Linking in Large Cloud Systems.

[BibT_eX]

[DOI]

Proceedings of the Companion Proceedings of the ACM on Web Conference 2024, 2024

SMuCo: Reinforcement Learning for Visual Control via Sequential Multi-view Total Correlation.

[BibT_eX]

[DOI]

Proceedings of the Uncertainty in Artificial Intelligence, 2024

LM-PACE: Confidence Estimation by Large Language Models for Effective Root Causing of Cloud Incidents.

[BibT_eX]

[DOI]

Dylan Zhang

Xuchao Zhang

Chetan Bansal

Pedro Henrique B. Las-Casas

Rodrigo Fonseca

Saravan Rajmohan

Proceedings of the Companion Proceedings of the 32nd ACM International Conference on the Foundations of Software Engineering, 2024

Automated Root Causing of Cloud Incidents using In-Context Learning with GPT-4.

[BibT_eX]

[DOI]

Proceedings of the Companion Proceedings of the 32nd ACM International Conference on the Foundations of Software Engineering, 2024

MonitorAssistant: Simplifying Cloud Service Monitoring via Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Companion Proceedings of the 32nd ACM International Conference on the Foundations of Software Engineering, 2024

Exploring LLM-Based Agents for Root Cause Analysis.

[BibT_eX]

[DOI]

Pedro Henrique B. Las-Casas

Rodrigo Fonseca

Saravan Rajmohan

Proceedings of the Companion Proceedings of the 32nd ACM International Conference on the Foundations of Software Engineering, 2024

X-Lifecycle Learning for Cloud Incident Management using LLMs.

[BibT_eX]

[DOI]

Proceedings of the Companion Proceedings of the 32nd ACM International Conference on the Foundations of Software Engineering, 2024

Pre-trained KPI Anomaly Detection Model Through Disentangled Transformer.

[BibT_eX]

[DOI]

Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

Large Language Models Can Provide Accurate and Interpretable Incident Triage.

[BibT_eX]

[DOI]

Proceedings of the 35th IEEE International Symposium on Software Reliability Engineering, 2024

Early Bird: Ensuring Reliability of Cloud Systems Through Early Failure Prediction.

[BibT_eX]

[DOI]

Proceedings of the 35th IEEE International Symposium on Software Reliability Engineering, 2024

Can We Trust Auto-Mitigation? Improving Cloud Failure Prediction with Uncertain Positive Learning.

[BibT_eX]

[DOI]

Proceedings of the 35th IEEE International Symposium on Software Reliability Engineering, 2024

UniLog: Automatic Logging via LLM and In-Context Learning.

[BibT_eX]

[DOI]

Proceedings of the 46th IEEE/ACM International Conference on Software Engineering, 2024

Intelligent Monitoring Framework for Cloud Services: A Data-Driven Approach.

[BibT_eX]

[DOI]

Proceedings of the 46th International Conference on Software Engineering: Software Engineering in Practice, 2024

Xpert: Empowering Incident Management with Query Recommendations via Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the 46th IEEE/ACM International Conference on Software Engineering, 2024

Automatic Root Cause Analysis via Large Language Models for Cloud Incidents.

[BibT_eX]

[DOI]

Proceedings of the Nineteenth European Conference on Computer Systems, 2024

EfficientRAG: Efficient Retriever for Multi-Hop Question Answering.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Hybrid-RACA: Hybrid Retrieval-Augmented Composition Assistance for Real-time Text Prediction.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: EMNLP 2024, 2024

AutoRAG-HP: Automatic Online Hyper-Parameter Tuning for Retrieval-Augmented Generation.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Nissist: An Incident Mitigation Copilot based on Troubleshooting Guides.

[BibT_eX]

[DOI]

Proceedings of the ECAI 2024 - 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain, 2024

Building AI Agents for Autonomous Clouds: Challenges and Design Principles.

[BibT_eX]

[DOI]

Pedro Henrique B. Las-Casas

Proceedings of the 2024 ACM Symposium on Cloud Computing, 2024

COIN: Chance-Constrained Imitation Learning for Safe and Adaptive Resource Oversubscription under Uncertainty.

[BibT_eX]

[DOI]

Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

Everything of Thoughts: Defying the Law of Penrose Triangle for Thought Generation.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Call Me When Necessary: LLMs can Efficiently and Faithfully Reason over Structured Environments.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023

ImDiffusion: Imputed Diffusion Models for Multivariate Time Series Anomaly Detection.

[BibT_eX]

[DOI]

Proc. VLDB Endow., November, 2023

TaskWeaver: A Code-First Agent Framework.

[BibT_eX]

[DOI]

CoRR, 2023

Rethinking Privacy in Machine Learning Pipelines from an Information Flow Control Perspective.

[BibT_eX]

[DOI]

Santiago Zanella-Béguelin

Menglin Xia

Victor Rühle

CoRR, 2023

PACE-LM: Prompting and Augmentation for Calibrated Confidence Estimation with GPT-4 in Cloud Incident Root Cause Analysis.

[BibT_eX]

[DOI]

Dylan Zhang

Xuchao Zhang

Chetan Bansal

Pedro Henrique B. Las-Casas

Rodrigo Fonseca

Saravan Rajmohan

CoRR, 2023

Diffusion-based Time Series Data Imputation for Microsoft 365.

[BibT_eX]

[DOI]

CoRR, 2023

Hybrid Retrieval-Augmented Generation for Real-time Composition Assistance.

[BibT_eX]

[DOI]

CoRR, 2023

Empowering Practical Root Cause Analysis by Large Language Models for Cloud Incidents.

[BibT_eX]

[DOI]

CoRR, 2023

Introspective Tips: Large Language Model for In-Context Decision Making.

[BibT_eX]

[DOI]

CoRR, 2023

Learning Cooperative Oversubscription for Cloud by Chance-Constrained Multi-Agent Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the ACM Web Conference 2023, 2023

EDITS: An Easy-to-difficult Training Strategy for Cloud Failure Prediction.

[BibT_eX]

[DOI]

Proceedings of the Companion Proceedings of the ACM Web Conference 2023, 2023

Multi-Agent Reinforcement Learning with Shared Policy for Cloud Quota Management Problem.

[BibT_eX]

[DOI]

Proceedings of the Companion Proceedings of the ACM Web Conference 2023, 2023

Diffusion-Based Time Series Data Imputation for Cloud Failure Prediction at Microsoft 365.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2023

Assess and Summarize: Improve Outage Understanding with Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2023

STEAM: Observability-Preserving Trace Sampling.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2023

TraceDiag: Adaptive, Interpretable, and Efficient Root Cause Analysis on Large-Scale Microservice Systems.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2023

Robust Positive-Unlabeled Learning via Noise Negative Sample Self-correction.

[BibT_eX]

[DOI]

Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Root Cause Analysis for Microservice Systems via Hierarchical Reinforcement Learning from Human Feedback.

[BibT_eX]

[DOI]

Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

CODEC: Cost-Effective Duration Prediction System for Deadline Scheduling in the Cloud.

[BibT_eX]

[DOI]

Harshwardhan Chaturvedi

Proceedings of the 34th IEEE International Symposium on Software Reliability Engineering, 2023

TraceArk: Towards Actionable Performance Anomaly Alerting for Online Service Systems.

[BibT_eX]

[DOI]

Proceedings of the 45th IEEE/ACM International Conference on Software Engineering: Software Engineering in Practice, 2023

Incident-aware Duplicate Ticket Aggregation for Cloud Systems.

[BibT_eX]

[DOI]

Proceedings of the 45th IEEE/ACM International Conference on Software Engineering, 2023

CONAN: Diagnosing Batch Failures for Cloud Systems.

[BibT_eX]

[DOI]

Proceedings of the 45th IEEE/ACM International Conference on Software Engineering: Software Engineering in Practice, 2023

Recommending Root-Cause and Mitigation Steps for Cloud Incidents using Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the 45th IEEE/ACM International Conference on Software Engineering, 2023

Towards Lightweight, Model-Agnostic and Diversity-Aware Active Anomaly Detection.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Empower Large Language Model to Perform Better on Industrial Domain-Specific Question Answering.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: EMNLP 2023, 2023

How Different are the Cloud Workloads? Characterizing Large-Scale Private and Public Cloud Workloads.

[BibT_eX]

[DOI]

Proceedings of the 53rd Annual IEEE/IFIP International Conference on Dependable Systems and Network, 2023

Snape: Reliable and Low-Cost Computing with Mixture of Spot and On-Demand VMs.

[BibT_eX]

[DOI]

Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

2022

An Intelligent Framework for Timely, Accurate, and Comprehensive Cloud Incident Detection.

[BibT_eX]

[DOI]

ACM SIGOPS Oper. Syst. Rev., 2022

Spot Virtual Machine Eviction Prediction in Microsoft Cloud.

[BibT_eX]

[DOI]

Senthil Baladhandayutham

Proceedings of the Companion of The Web Conference 2022, Virtual Event / Lyon, France, April 25, 2022

UniParser: A Unified Log Parser for Heterogeneous Log Data.

[BibT_eX]

[DOI]

Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

Solving the Batch Stochastic Bin Packing Problem in Cloud: A Chance-constrained Optimization Approach.

[BibT_eX]

[DOI]

Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

NENYA: Cascade Reinforcement Learning for Cost-Aware Failure Mitigation at Microsoft 365.

[BibT_eX]

[DOI]

Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Multi-task Hierarchical Classification for Disk Failure Prediction in Online Service Systems.

[BibT_eX]

[DOI]

Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

T-SMOTE: Temporal-oriented Synthetic Minority Oversampling Technique for Imbalanced Time Series Classification.

[BibT_eX]

[DOI]

Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Saravan Rajmohan

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...