Dongmei Zhang

ORCID: 0000-0002-9230-2799

Affiliations:
  • Microsoft Research Asia, Beijing, China
  • Carnegie Mellon University, School of Computer Science, Pittsburgh, PA, USA (PhD 1999)


According to our database, Dongmei Zhang authored at least 280 papers between 1997 and 2024.

Bibliography

2024
AllHands: Ask Me Anything on Large-scale Verbatim Feedback via Large Language Models.
CoRR, 2024

CONLINE: Complex Code Generation and Refinement with Online Searching and Correctness Testing.
CoRR, 2024

LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression.
CoRR, 2024

Call Me When Necessary: LLMs can Efficiently and Faithfully Reason over Structured Environments.
CoRR, 2024

Ploutos: Towards interpretable stock movement prediction with financial large language model.
CoRR, 2024

UFO: A UI-Focused Agent for Windows OS Interaction.
CoRR, 2024

Revisiting VAE for Unsupervised Time Series Anomaly Detection: A Frequency Perspective.
CoRR, 2024

Why does Prediction Accuracy Decrease over Time? Uncertain Positive Learning for Cloud Failure Prediction.
CoRR, 2024

Contrastive Learning with Negative Sampling Correction.
CoRR, 2024

COIN: Chance-Constrained Imitation Learning for Uncertainty-aware Adaptive Resource Oversubscription Policy.
CoRR, 2024

Risk-aware Adaptive Virtual CPU Oversubscription in Microsoft Cloud via Prototypical Human-in-the-loop Imitation Learning.
CoRR, 2024

Table Meets LLM: Can Large Language Models Understand Structured Table Data? A Benchmark and Empirical Study.
Proceedings of the 17th ACM International Conference on Web Search and Data Mining, 2024

Source Free Graph Unsupervised Domain Adaptation.
Proceedings of the 17th ACM International Conference on Web Search and Data Mining, 2024

UniLog: Automatic Logging via LLM and In-Context Learning.
Proceedings of the 46th IEEE/ACM International Conference on Software Engineering, 2024

Automatic Root Cause Analysis via Large Language Models for Cloud Incidents.
Proceedings of the Nineteenth European Conference on Computer Systems, 2024

Text-to-Image Generation for Abstract Concepts.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Text2Analysis: A Benchmark of Table Question Answering with Advanced Data Analysis and Unclear Queries.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
ImDiffusion: Imputed Diffusion Models for Multivariate Time Series Anomaly Detection.
Proc. VLDB Endow., November, 2023

CoCoAST: Representing Source Code via Hierarchical Splitting and Reconstruction of Abstract Syntax Trees.
Empir. Softw. Eng., November, 2023

Aesthetics++: Refining Graphic Designs by Exploring Design Principles and Human Preference.
IEEE Trans. Vis. Comput. Graph., June, 2023

Towards Natural Language-Based Visualization Authoring.
IEEE Trans. Vis. Comput. Graph., 2023

XInsight: eXplainable Data Analysis Through The Lens of Causality.
Proc. ACM Manag. Data, 2023

Xpert: Empowering Incident Management with Query Recommendations via Large Language Models.
CoRR, 2023

TAP4LLM: Table Provider on Sampling, Augmenting, and Packing Semi-structured Data for Large Language Model Reasoning.
CoRR, 2023

Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and Layers.
CoRR, 2023

TaskWeaver: A Code-First Agent Framework.
CoRR, 2023

Everything of Thoughts: Defying the Law of Penrose Triangle for Thought Generation.
CoRR, 2023

Breaking Language Barriers in Multilingual Mathematical Reasoning: Insights and Observations.
CoRR, 2023

Table-GPT: Table-tuned GPT for Diverse Table Tasks.
CoRR, 2023

Text-to-Image Generation for Abstract Concepts.
CoRR, 2023

Diffusion-based Time Series Data Imputation for Microsoft 365.
CoRR, 2023

A Survey for Graphic Design Intelligence.
CoRR, 2023

SoTaNa: The Open-Source Software Development Assistant.
CoRR, 2023

WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct.
CoRR, 2023

WonderFlow: Narration-Centric Design of Animated Data Videos.
CoRR, 2023

Leveraging LLMs for KPIs Retrieval from Hybrid Long-Document: A Comprehensive Framework and Dataset.
CoRR, 2023

Empowering Practical Root Cause Analysis by Large Language Models for Cloud Incidents.
CoRR, 2023

Evaluating and Enhancing Structural Understanding Capabilities of Large Language Models on Tables via Input Designs.
CoRR, 2023

Introspective Tips: Large Language Model for In-Context Decision Making.
CoRR, 2023

Empower Large Language Model to Perform Better on Industrial Domain-Specific Question Answering.
CoRR, 2023

Demonstration of InsightPilot: An LLM-Empowered Automated Data Exploration System.
CoRR, 2023

Conservative State Value Estimation for Offline Reinforcement Learning.
CoRR, 2023

Learning Cooperative Oversubscription for Cloud by Chance-Constrained Multi-Agent Reinforcement Learning.
Proceedings of the ACM Web Conference 2023, 2023

HAPENS: Hardness-Personalized Negative Sampling for Implicit Collaborative Filtering.
Proceedings of the Companion Proceedings of the ACM Web Conference 2023, 2023

EDITS: An Easy-to-difficult Training Strategy for Cloud Failure Prediction.
Proceedings of the Companion Proceedings of the ACM Web Conference 2023, 2023

Robust Mid-Pass Filtering Graph Convolutional Networks.
Proceedings of the ACM Web Conference 2023, 2023

Homophily-oriented Heterogeneous Graph Rewiring.
Proceedings of the ACM Web Conference 2023, 2023

Multi-Agent Reinforcement Learning with Shared Policy for Cloud Quota Management Problem.
Proceedings of the Companion Proceedings of the ACM Web Conference 2023, 2023

Revisiting Code Search in a Two-Stage Paradigm.
Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, 2023

MM-GNN: Mix-Moment Graph Neural Network towards Modeling Neighborhood Feature Distribution.
Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, 2023

Diffusion-Based Time Series Data Imputation for Cloud Failure Prediction at Microsoft 365.
Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2023

Assess and Summarize: Improve Outage Understanding with Large Language Models.
Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2023

STEAM: Observability-Preserving Trace Sampling.
Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2023

TraceDiag: Adaptive, Interpretable, and Efficient Root Cause Analysis on Large-Scale Microservice Systems.
Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2023

ML4C: Seeing Causality Through Latent Vicinity.
Proceedings of the 2023 SIAM International Conference on Data Mining, 2023

LayoutPrompter: Awaken the Design Ability of Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Conservative State Value Estimation for Offline Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Focusing on Pinocchio's Nose: A Gradients Scrutinizer to Thwart Split-Learning Hijacking Attacks Using Intrinsic Attributes.
Proceedings of the 30th Annual Network and Distributed System Security Symposium, 2023

Robust Positive-Unlabeled Learning via Noise Negative Sample Self-correction.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Robust Multimodal Failure Detection for Microservice Systems.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Contextual Self-attentive Temporal Point Process for Physical Decommissioning Prediction of Cloud Assets.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Root Cause Analysis for Microservice Systems via Hierarchical Reinforcement Learning from Human Feedback.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Auto-Validate by-History: Auto-Program Data Quality Constraints to Validate Recurring Data Pipelines.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

GetPt: Graph-enhanced General Table Pre-training with Alternate Attention Network.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

On Manipulating Signals of User-Item Graph: A Jacobi Polynomial-based Graph Collaborative Filtering.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Towards Efficient Fine-Tuning of Pre-trained Code Models: An Experimental Study and Beyond.
Proceedings of the 32nd ACM SIGSOFT International Symposium on Software Testing and Analysis, 2023

CODEC: Cost-Effective Duration Prediction System for Deadline Scheduling in the Cloud.
Proceedings of the 34th IEEE International Symposium on Software Reliability Engineering, 2023

TraceArk: Towards Actionable Performance Anomaly Alerting for Online Service Systems.
Proceedings of the 45th IEEE/ACM International Conference on Software Engineering: Software Engineering in Practice, 2023

Aegis: Attribution of Control Plane Change Impact across Layers and Components for Cloud Systems.
Proceedings of the 45th IEEE/ACM International Conference on Software Engineering: Software Engineering in Practice, 2023

CoCoSoDa: Effective Contrastive Learning for Code Search.
Proceedings of the 45th IEEE/ACM International Conference on Software Engineering, 2023

Incident-aware Duplicate Ticket Aggregation for Cloud Systems.
Proceedings of the 45th IEEE/ACM International Conference on Software Engineering, 2023

CONAN: Diagnosing Batch Failures for Cloud Systems.
Proceedings of the 45th IEEE/ACM International Conference on Software Engineering: Software Engineering in Practice, 2023

Did We Miss Something Important? Studying and Exploring Variable-Aware Log Abstraction.
Proceedings of the 45th IEEE/ACM International Conference on Software Engineering, 2023

Towards Lightweight, Model-Agnostic and Diversity-Aware Active Anomaly Detection.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Out-of-Distribution Detection based on In-Distribution Data Patterns Memorization with Modern Hopfield Energy.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

CASR: Generating Complex Sequences with Autoregressive Self-Boost Refinement.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

A Parse-Then-Place Approach for Generating Graphic Layouts from Textual Descriptions.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

LayoutDiffusion: Improving Graphic Layout Generation by Discrete Diffusion Probabilistic Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Empower Large Language Model to Perform Better on Industrial Domain-Specific Question Answering.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: EMNLP 2023, 2023

InsightPilot: An LLM-Empowered Automated Data Exploration System.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

How Different are the Cloud Workloads? Characterizing Large-Scale Private and Public Cloud Workloads.
Proceedings of the 53rd Annual IEEE/IFIP International Conference on Dependable Systems and Networks, 2023

LayoutFormer++: Conditional Graphic Layout Generation via Constraint Serialization and Decoding Space Restriction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Hallucination Detection: Robustly Discerning Reliable Answers in Large Language Models.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

Hadamard Adapter: An Extreme Parameter-Efficient Adapter Tuning Method for Pre-trained Language Models.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

Snape: Reliable and Low-Cost Computing with Mixture of Spot and On-Demand VMs.
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

AnaMeta: A Table Understanding Dataset of Field Metadata Knowledge Shared by Multi-dimensional Data Analysis Tasks.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

HermEs: Interactive Spreadsheet Formula Prediction via Hierarchical Formulet Expansion.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

How Do In-Context Examples Affect Compositional Generalization?
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Unveiling the Black Box of PLMs with Semantic Anchors: Towards Interpretable Neural Semantic Parsing.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

SheetPT: Spreadsheet Pre-training Based on Hierarchical Attention Network.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
MultiVision: Designing Analytical Dashboards with Deep Learning Based Recommendation.
IEEE Trans. Vis. Comput. Graph., 2022

AI4VIS: Survey on Artificial Intelligence Approaches for Data Visualization.
IEEE Trans. Vis. Comput. Graph., 2022

A Mixed-Initiative Approach to Reusing Infographic Charts.
IEEE Trans. Vis. Comput. Graph., 2022

An Intelligent Framework for Timely, Accurate, and Comprehensive Cloud Incident Detection.
ACM SIGOPS Oper. Syst. Rev., 2022

A large-scale empirical study of commit message generation: models, datasets and evaluation.
Empir. Softw. Eng., 2022

LUNA: Language Understanding with Number Augmentations on Transformers via Number Plugins and Pre-training.
CoRR, 2022

Reflection of Thought: Inversely Eliciting Numerical Reasoning in Language Models via Solving Linear Systems.
CoRR, 2022

Guiding the PLMs with Semantic Anchors as Intermediate Supervision: Towards Interpretable Semantic Parsing.
CoRR, 2022

Enhanced Fairness Testing via Generating Effective Initial Individual Discriminatory Instances.
CoRR, 2022

Make Heterophily Graphs Better Fit GNN: A Graph Rewiring Approach.
CoRR, 2022

Inferring Tabular Analysis Metadata by Infusing Distribution and Knowledge Information.
CoRR, 2022

Long Code for Code Search.
CoRR, 2022

UniLayout: Taming Unified Sequence-to-Sequence Transformers for Graphic Layout Generation.
CoRR, 2022

ASTA: Learning Analytical Semantics over Tables for Intelligent Data Analysis and Visualization.
CoRR, 2022

TaCube: Pre-computing Data Cubes for Answering Numerical-Reasoning Questions over Tabular Data.
CoRR, 2022

Enhancing Semantic Code Search with Multimodal Contrastive Learning and Soft Data Augmentation.
CoRR, 2022

ECMG: Exemplar-based Commit Message Generation.
CoRR, 2022

Table Pre-training: A Survey on Model Architectures, Pretraining Objectives, and Downstream Tasks.
CoRR, 2022

Investigating the Role and Interplay of Narrations and Animations in Data Videos.
Comput. Graph. Forum, 2022

Spot Virtual Machine Eviction Prediction in Microsoft Cloud.
Proceedings of the Companion of The Web Conference 2022, Virtual Event / Lyon, France, April 25, 2022

UniParser: A Unified Log Parser for Heterogeneous Log Data.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

GBK-GNN: Gated Bi-Kernel Graph Neural Networks for Modeling Both Homophily and Heterophily.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

Understanding and Improvement of Adversarial Training for Network Embedding from an Optimization Perspective.
Proceedings of the WSDM '22: The Fifteenth ACM International Conference on Web Search and Data Mining, Virtual Event / Tempe, AZ, USA, February 21, 2022

SPINE: a scalable log parser with feedback guidance.
Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2022

Neuron with Steady Response Leads to Better Generalization.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

LibDB: An Effective and Efficient Framework for Detecting Third-Party Libraries in Binaries.
Proceedings of the 19th IEEE/ACM International Conference on Mining Software Repositories, 2022

ChartStamp: Robust Chart Embedding for Real-World Applications.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Solving the Batch Stochastic Bin Packing Problem in Cloud: A Chance-constrained Optimization Approach.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

NENYA: Cascade Reinforcement Learning for Cost-Aware Failure Mitigation at Microsoft 365.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

pureGAM: Learning an Inherently Pure Additive Model.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Multi-task Hierarchical Classification for Disk Failure Prediction in Online Service Systems.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

ML4S: Learning Causal Skeleton from Vicinal Graphs.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

GridBook: Natural Language Formulas for the Spreadsheet Grid.
Proceedings of the IUI 2022: 27th International Conference on Intelligent User Interfaces, Helsinki, Finland, March 22, 2022

T-SMOTE: Temporal-oriented Synthetic Minority Oversampling Technique for Imbalanced Time Series Classification.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Table Pre-training: A Survey on Model Architectures, Pre-training Objectives, and Downstream Tasks.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

DeepTraLog: Trace-Log Combined Microservice Anomaly Detection through Graph-based Deep Learning.
Proceedings of the 44th IEEE/ACM International Conference on Software Engineering, 2022

On the Evaluation of Neural Code Summarization.
Proceedings of the 44th IEEE/ACM International Conference on Software Engineering, 2022

Testing Machine Learning Systems in Industry: An Empirical Study.
Proceedings of the 44th IEEE/ACM International Conference on Software Engineering: Software Engineering in Practice, 2022

TaCube: Pre-computing Data Cubes for Answering Numerical-Reasoning Questions over Tabular Data.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Towards Robust Numerical Question Answering: Diagnosing Numerical Capabilities of NLP Systems.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

RACE: Retrieval-augmented Commit Message Generation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

FormLM: Recommending Creation Ideas for Online Forms by Modelling Semantic and Structural Information.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

PLOG: Table-to-Logic Pretraining for Logical Table-to-Text Generation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Learning Rate Perturbation: A Generic Plugin of Learning Rate Schedule towards Flatter Local Minima.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

OneLabeler: A Flexible System for Building Data Labeling Tools.
Proceedings of the CHI '22: CHI Conference on Human Factors in Computing Systems, New Orleans, LA, USA, 29 April 2022, 2022

Accelerating Code Search with Deep Hashing and Code Classification.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

HiTab: A Hierarchical Table Dataset for Question Answering and Natural Language Generation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

FORTAP: Using Formulas for Numerical-Reasoning-Aware Table Pretraining.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Coarse-to-Fine Generative Modeling for Graphic Layouts.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Retrieve-Then-Adapt: Example-based Automatic Generation for Proportion-related Infographics.
IEEE Trans. Vis. Comput. Graph., 2021

Chartem: Reviving Chart Images with Data Embedding.
IEEE Trans. Vis. Comput. Graph., 2021

Source Free Unsupervised Graph Domain Adaptation.
CoRR, 2021

A Surrogate Objective Framework for Prediction+Optimization with Soft Constraints.
CoRR, 2021

A Unified and Fast Interpretable Model for Predictive Analytics.
CoRR, 2021

GBK-GNN: Gated Bi-Kernel Graph Neural Networks for Modeling Both Homophily and Heterophily.
CoRR, 2021

FORTAP: Using Formulae for Numerical-Reasoning-Aware Table Pretraining.
CoRR, 2021

Neural Code Summarization: How Far Are We?
CoRR, 2021

Is a Single Model Enough? MuCoS: A Multi-Model Ensemble Learning for Semantic Code Search.
CoRR, 2021

CoCoSum: Contextual Code Summarization with Multi-Relational Graph Neural Network.
CoRR, 2021

AniVis: Generating Animated Transitions Between Statistical Charts with a Tree Model.
CoRR, 2021

Understanding and Improvement of Adversarial Training for Network Embedding from an Optimization Perspective.
CoRR, 2021

Survey on Artificial Intelligence Approaches for Visualization Data.
CoRR, 2021

Animated Presentation of Static Infographics with InfoMotion.
Comput. Graph. Forum, 2021

NTAM: Neighborhood-Temporal Attention Model for Disk Failure Prediction in Cloud Platforms.
Proceedings of the WWW '21: The Web Conference 2021, 2021

AnaSearch: Extract, Retrieve and Visualize Structured Results from Unstructured Text for Analytical Queries.
Proceedings of the WSDM '21, 2021

Fighting the Fog of War: Automated Incident Detection for Cloud Systems.
Proceedings of the 2021 USENIX Annual Technical Conference, 2021

Onion: identifying incident-indicating logs for cloud systems.
Proceedings of the ESEC/FSE '21: 29th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2021

Intelligent container reallocation at Microsoft 365.
Proceedings of the ESEC/FSE '21: 29th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2021

LS-sampling: an effective local search based sampling approach for achieving high t-wise coverage.
Proceedings of the ESEC/FSE '21: 29th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2021

Effective low capacity status prediction for cloud systems.
Proceedings of the ESEC/FSE '21: 29th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2021

MetaInsight: Automatic Discovery of Structured Knowledge for Exploratory Data Analysis.
Proceedings of the SIGMOD '21: International Conference on Management of Data, 2021

A Surrogate Objective Framework for Prediction+Programming with Soft Constraints.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Table2Charts: Recommending Charts by Learning Shared Table Representations.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

HALO: Hierarchy-aware Fault Localization for Cloud Systems.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

TUTA: Tree-based Transformers for Generally Structured Table Pre-training.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

TabularNet: A Neural Network Architecture for Understanding Semantic Structures of Tabular Data.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Can Neural Clone Detection Generalize to Unseen Functionalities?
Proceedings of the 36th IEEE/ACM International Conference on Automated Software Engineering, 2021

Semantic table structure identification in spreadsheets.
Proceedings of the ISSTA '21: 30th ACM SIGSOFT International Symposium on Software Testing and Analysis, 2021

How Long Will it Take to Mitigate this Incident for Online Service Systems?
Proceedings of the 32nd IEEE International Symposium on Software Reliability Engineering, 2021

Keep the Structure: A Latent Shift-Reduce Parser for Semantic Parsing.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Predictive Job Scheduling under Uncertain Constraints in Cloud Computing.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

On the Evaluation of Commit Message Generation Models: An Experimental Study.
Proceedings of the IEEE International Conference on Software Maintenance and Evolution, 2021

AutoCCAG: An Automated Approach to Constrained Covering Array Generation.
Proceedings of the 43rd IEEE/ACM International Conference on Software Engineering, 2021

CAST: Enhancing Code Summarization with Hierarchical Splitting and Reconstruction of Abstract Syntax Trees.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Weakly Supervised Semantic Parsing by Learning from Mistakes.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Neuron Campaign for Initialization Guided by Information Bottleneck Theory.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

Is a Single Model Enough? MuCoS: A Multi-Model Ensemble Learning Approach for Semantic Code Search.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

Returning to the Office During the COVID-19 Pandemic Recovery: Early Indicators from China.
Proceedings of the CHI '21: CHI Conference on Human Factors in Computing Systems, 2021

Learning Algebraic Recombination for Compositional Generalization.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Revisiting Iterative Back-Translation from the Perspective of Compositional Generalization.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Iterative Utterance Segmentation for Neural Semantic Parsing.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
DataShot: Automatic Generation of Fact Sheets from Tabular Data.
IEEE Trans. Vis. Comput. Graph., 2020

Text-to-Viz: Automatic Generation of Infographics from Proportion-Related Natural Language Statements.
IEEE Trans. Vis. Comput. Graph., 2020

Structure-aware Pre-training for Table Understanding with Tree-based Transformers.
CoRR, 2020

Table2Charts: Learning Shared Representations for Recommending Charts on Multi-dimensional Data.
CoRR, 2020

How to mitigate the incident? an effective troubleshooting guide recommendation technique for online service systems.
Proceedings of the ESEC/FSE '20: 28th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2020

Efficient customer incident triage via linking with system incidents.
Proceedings of the ESEC/FSE '20: 28th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2020

Efficient incident identification from multi-dimensional issue reports via meta-heuristic search.
Proceedings of the ESEC/FSE '20: 28th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2020

Identifying linked incidents in large-scale online service systems.
Proceedings of the ESEC/FSE '20: 28th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2020

Towards intelligent incident management: why we need it and how we make it.
Proceedings of the ESEC/FSE '20: 28th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2020

Compositional Generalization by Learning Analytical Expressions.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Hierarchical Poset Decoding for Compositional Generalization in Language.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

How Incidental are the Incidents? Characterizing and Prioritizing Incidents for Large-Scale Online Service Systems.
Proceedings of the 35th IEEE/ACM International Conference on Automated Software Engineering, 2020

RECPARSER: A Recursive Semantic Parsing Framework for Text-to-SQL Task.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

How Far are We from Effective Context Modeling? An Exploratory Study on Semantic Parsing in Context.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Incomplete Utterance Rewriting as Semantic Segmentation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

"What Do You Mean by That?" A Parser-Independent Interactive Approach for Enhancing Text-to-SQL.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Learning Formatting Style Transfer and Structure Extraction for Spreadsheet Tables with a Hybrid Neural Network Architecture.
Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

Neural Formatting for Spreadsheet Tables.
Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

You Impress Me: Dialogue Generation via Mutual Persona Perception.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Table2Analysis: Modeling and Recommendation of Common Analysis Patterns for Multi-Dimensional Data.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Reliable and Efficient Anytime Skeleton Learning.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
A Hybrid Semantic Parsing Approach for Tabular Data Analysis.
CoRR, 2019

Oui! Outlier Interpretation on Multi-dimensional Data via Visual Analytics.
Comput. Graph. Forum, 2019

Outage Prediction and Diagnosis for Cloud Service Systems.
Proceedings of the World Wide Web Conference, 2019

Cross-dataset Time Series Anomaly Detection for Cloud Systems.
Proceedings of the 2019 USENIX Annual Technical Conference, 2019

Robust log-based anomaly detection on unstable log data.
Proceedings of the ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2019

QuickInsights: Quick and Automatic Discovery of Insights from Multi-Dimensional Data.
Proceedings of the 2019 International Conference on Management of Data, 2019

Continuous Incident Triage for Large-Scale Online Service Systems.
Proceedings of the 34th IEEE/ACM International Conference on Automated Software Engineering, 2019

Local Search with Efficient Automatic Configuration for Minimum Vertex Cover.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

An empirical investigation of incident triage for online service systems.
Proceedings of the 41st International Conference on Software Engineering: Software Engineering in Practice, 2019

Neural Feature Search: A Neural Architecture for Automated Feature Engineering.
Proceedings of the 2019 IEEE International Conference on Data Mining, 2019

A Split-and-Recombine Approach for Follow-up Query Analysis.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Data-Anonymous Encoding for Text-to-SQL Generation.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Towards Complex Text-to-SQL in Cross-Domain Database with Intermediate Representation.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

FANDA: A Novel Approach to Perform Follow-Up Query Analysis.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

TableSense: Spreadsheet Table Detection with Convolutional Neural Networks.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Systematically Ensuring the Confidence of Real-Time Home Automation IoT Systems.
ACM Trans. Cyber Phys. Syst., 2018

QuanFuzz: Fuzz Testing of Quantum Program.
CoRR, 2018

Improving Service Availability of Cloud Systems by Predicting Disk Error.
Proceedings of the 2018 USENIX Annual Technical Conference, 2018

Automated refactoring of nested-IF formulae in spreadsheets.
Proceedings of the 2018 ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2018

Predicting Node failure in cloud service systems.
Proceedings of the 2018 ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2018

Identifying impactful service system problems via log analysis.
Proceedings of the 2018 ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2018

BigIN4: Instant, Interactive Insight Identification for Multi-Dimensional Big Data.
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

Expandable group identification in spreadsheets.
Proceedings of the 33rd ACM/IEEE International Conference on Automated Software Engineering, 2018

Decoding Technology Transfer through Experiences at Microsoft.
Proceedings of the 5th IEEE/ACM International Workshop on Software Engineering Research and Industrial Practice, 2018

SemRegex: A Semantics-Based Approach for Generating Regular Expressions from Natural Language Specifications.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

InfoNice: Easy Creation of Information Graphics.
Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, 2018

Generating Regular Expressions from Natural Language Specifications: Are We There Yet?
Proceedings of the Workshops of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Experience report on applying software analytics in incident management of online service.
Autom. Softw. Eng., 2017

Extracting Top-K Insights from Multi-dimensional Data.
Proceedings of the 2017 ACM International Conference on Management of Data, 2017

DeepAM: Migrate APIs with Multi-modal Sequence to Sequence Learning.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Transferring Code-Clone Detection and Analysis to Practice.
Proceedings of the 39th IEEE/ACM International Conference on Software Engineering: Software Engineering in Practice Track, 2017

2016
The Future of Software Engineering.
IEEE Softw., 2016

Deep API learning.
Proceedings of the 24th ACM SIGSOFT International Symposium on Foundations of Software Engineering, 2016

Systematically Debugging IoT Control System Correctness for Building Automation.
Proceedings of the 3rd ACM International Conference on Systems for Energy-Efficient Built Environments, 2016

iDice: problem identification for emerging issues.
Proceedings of the 38th International Conference on Software Engineering, 2016

Software analytics and its application in practice.
Proceedings of the Perspectives on Data Science for Software Engineering, 2016

How to tame your online services.
Proceedings of the Perspectives on Data Science for Software Engineering, 2016

Visual analytics for software engineering data.
Proceedings of the Perspectives on Data Science for Software Engineering, 2016

2015
Roundtable: The Future of Software Engineering for Internet Computing.
IEEE Softw., 2015

YADING: Fast Clustering of Large-Scale Time Series Data.
Proc. VLDB Endow., 2015

Roundtable: Research Opportunities and Challenges for Emerging Software Systems.
J. Comput. Sci. Technol., 2015

Log2: A Cost-Aware Logging Mechanism for Performance Diagnosis.
Proceedings of the 2015 USENIX Annual Technical Conference, 2015

CodeHow: Effective Code Search Based on API Understanding and Extended Boolean Model (E).
Proceedings of the 30th IEEE/ACM International Conference on Automated Software Engineering, 2015

Learning to Log: Helping Developers Make Informed Logging Decisions.
Proceedings of the 37th IEEE/ACM International Conference on Software Engineering, 2015

Uncovering JavaScript Performance Code Smells Relevant to Type Mutations.
Proceedings of the Programming Languages and Systems - 13th Asian Symposium, 2015

2014
Predicting Consistency-Maintenance Requirement of Code Clones at Copy-and-Paste Time.
IEEE Trans. Software Eng., 2014

Querying sequential software engineering data.
Proceedings of the 22nd ACM SIGSOFT International Symposium on Foundations of Software Engineering, (FSE-22), Hong Kong, China, November 16, 2014

Correlating events with time series for incident diagnosis.
Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2014

Where do developers log? an empirical study on logging practices in industry.
Proceedings of the 36th International Conference on Software Engineering, 2014

Identifying Recurrent and Unknown Performance Issues.
Proceedings of the 2014 IEEE International Conference on Data Mining, 2014

Mining Historical Issue Repositories to Heal Large-Scale Online Service Systems.
Proceedings of the 44th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, 2014

Comprehending performance from real-world execution traces: a device-driver case.
Proceedings of the Architectural Support for Programming Languages and Operating Systems, 2014

2013
Software Analytics in Practice.
IEEE Softw., 2013

Software Analytics Principles and Practices (NII Shonan Meeting 2013-12).
NII Shonan Meet. Rep., 2013

Mining succinct and high-coverage API usage patterns from source code.
Proceedings of the 10th Working Conference on Mining Software Repositories, 2013

Contextual analysis of program logs for understanding system behaviors.
Proceedings of the 10th Working Conference on Mining Software Repositories, 2013

Software analytics for incident management of online services: An experience report.
Proceedings of the 2013 28th IEEE/ACM International Conference on Automated Software Engineering, 2013

Context-sensitive delta inference for identifying workload-dependent performance bottlenecks.
Proceedings of the International Symposium on Software Testing and Analysis, 2013

Pathways to technology transfer and adoption: achievements and challenges (mini-tutorial).
Proceedings of the 35th International Conference on Software Engineering, 2013

Software analytics: achievements and challenges.
Proceedings of the 35th International Conference on Software Engineering, 2013

2012
Performance Issue Diagnosis for Online Service Systems.
Proceedings of the IEEE 31st Symposium on Reliable Distributed Systems, 2012

How do software engineers understand code changes?: an exploratory study in industry.
Proceedings of the 20th ACM SIGSOFT Symposium on the Foundations of Software Engineering (FSE-20), 2012

MSR 2012 keynote: Software analytics in practice - Approaches and experiences.
Proceedings of the 9th IEEE Working Conference of Mining Software Repositories, 2012

Can I clone this piece of code here?
Proceedings of the IEEE/ACM International Conference on Automated Software Engineering, 2012

Healing online service systems via mining historical issue repositories.
Proceedings of the IEEE/ACM International Conference on Automated Software Engineering, 2012

Software analytics in practice: Mini tutorial.
Proceedings of the 34th International Conference on Software Engineering, 2012

Performance debugging in the large via mining millions of stack traces.
Proceedings of the 34th International Conference on Software Engineering, 2012

ReBucket: A method for clustering duplicate crash reports based on call stack similarity.
Proceedings of the 34th International Conference on Software Engineering, 2012

Teaching and Training for Software Analytics.
Proceedings of the 25th IEEE Conference on Software Engineering Education and Training, 2012

XIAO: tuning code clones at hands of engineers in practice.
Proceedings of the 28th Annual Computer Security Applications Conference, 2012

2011
Code clone detection experience at Microsoft.
Proceedings of the 5th ICSE International Workshop on Software Clones, 2011

2009
A Unified Framework for Recognizing Handwritten Chemical Expressions.
Proceedings of the 10th International Conference on Document Analysis and Recognition, 2009

2007
Systematic Multi-Path HMM Topology Design for Online Handwriting Recognition of East Asian Characters.
Proceedings of the 9th International Conference on Document Analysis and Recognition (ICDAR 2007), 2007

2006
Face-tracking as an augmented input in video games: enhancing presence, role-playing and control.
Proceedings of the 2006 Conference on Human Factors in Computing Systems, 2006

1999
Harmonic Shape Images: A Representation for 3D Free-Form Surfaces Based on Energy Minimization.
Proceedings of the Energy Minimization Methods in Computer Vision and Pattern Recognition, 1999

Harmonic Maps and Their Applications in Surface Matching.
Proceedings of the 1999 Conference on Computer Vision and Pattern Recognition (CVPR '99), 1999

Experimental Analysis of Harmonic Shape Images.
Proceedings of the 2nd International Conference on 3D Digital Imaging and Modeling (3DIM '99), 1999

1997
Multi-Scale Classification of 3-D Objects.
Proceedings of the 1997 Conference on Computer Vision and Pattern Recognition (CVPR '97), 1997

