Yue Yu

Orcid: 0000-0002-9865-2212

Affiliations:
  • National University of Defense Technology, National Laboratory for Parallel and Distributed Processing, China


According to our database1, Yue Yu authored at least 199 papers between 2012 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
Towards to real world vehicle privacy protection: A new dataset and benchmark.
Pattern Recognit., 2026

2025
Preserve and Sculpt: Manifold-Aligned Fine-tuning of Vision-Language Models for Few-Shot Learning.
CoRR, August, 2025

NeMo: A Neuron-Level Modularizing-While-Training Approach for Decomposing DNN Models.
CoRR, August, 2025

RAG in the Wild: On the (In)effectiveness of LLMs with Mixture-of-Knowledge Retrieval Augmentation.
CoRR, July, 2025

SoftSignSGD(S3): An Enhanced Optimizer for Practical DNN Training and Loss Spikes Minimization Beyond Adam.
CoRR, July, 2025

Simple Convergence Proof of Adam From a Sign-like Descent Perspective.
CoRR, July, 2025

SecFwT: Efficient Privacy-Preserving Fine-Tuning of Large Language Models Using Forward-Only Passes.
CoRR, June, 2025

MedAgentGym: Training LLM Agents for Code-Based Medical Reasoning at Scale.
CoRR, June, 2025

Collab-RAG: Boosting Retrieval-Augmented Generation for Complex Question Answering via White-Box and Black-Box LLM Collaboration.
CoRR, April, 2025

RingMoE: Mixture-of-Modality-Experts Multi-Modal Foundation Models for Universal Remote Sensing Image Interpretation.
CoRR, April, 2025

Accurate Expert Predictions in MoE Inference via Cross-Layer Gate.
CoRR, February, 2025

Continuous Knowledge-Preserving Decomposition for Few-Shot Continual Learning.
CoRR, January, 2025

StoCFL: A stochastically clustered federated learning framework for Non-IID data with dynamic client participation.
Neural Networks, 2025

A review on knowledge graphs for healthcare: Resources, applications, and promises.
J. Biomed. Informatics, 2025

A Structure-Preserving Denoising Diffusion Model for AV45 PET Quantification Without MRI in Alzheimer's Disease Diagnosis.
Int. J. Imaging Syst. Technol., 2025

Obscura: Concealing Recomputation Overhead in Training of Large Language Models with Bubble-filling Pipeline Transformation.
Proceedings of the 2025 USENIX Annual Technical Conference, 2025

Preference-Strength-Aware Self-Improving Alignment with Generative Preference Models.
Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2025

Instruct or Interact? Exploring and Eliciting LLMs' Capability in Code Snippet Adaptation Through Prompt Engineering.
Proceedings of the 47th IEEE/ACM International Conference on Software Engineering, 2025

Efficient Evolutionary Search Over Chemical Space with Large Language Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Causally Motivated Sycophancy Mitigation for Large Language Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Centrality-guided Pre-training for Graph.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Klotski: Efficient Mixture-of-Expert Inference via Expert-Aware Multi-Batch Pipeline.
Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2025

COPR: Continual Human Preference Learning via Optimal Policy Regularization.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

CENTAUR: Bridging the Impossible Trinity of Privacy, Efficiency, and Performance in Privacy-Preserving Transformer Inference.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Correcting Large Language Model Behavior via Influence Function.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Towards More Precise Coincidental Correctness Detection With Deep Semantic Learning.
IEEE Trans. Software Eng., December, 2024

Continual Learning of Image Classes With Language Guidance From a Vision-Language Model.
IEEE Trans. Circuits Syst. Video Technol., December, 2024

How Do Developers Adapt Code Snippets to Their Contexts? An Empirical Study of Context-Based Code Snippet Adaptations.
IEEE Trans. Software Eng., November, 2024

BrainMass: Advancing Brain Network Analysis for Diagnosis With Large-Scale Self-Supervised Learning.
IEEE Trans. Medical Imaging, November, 2024

Intelligence-Endogenous Management Platform for Computing and Network Convergence.
IEEE Netw., July, 2024

Re-Thinking the Effectiveness of Batch Normalization and Beyond.
IEEE Trans. Pattern Anal. Mach. Intell., January, 2024

MUBen: Benchmarking the Uncertainty of Molecular Representation Models.
Trans. Mach. Learn. Res., 2024

Cut to the Chase: An Error-Oriented Approach to Detect Error-Handling Bugs.
Proc. ACM Softw. Eng., 2024

Training and Serving System of Foundation Models: A Comprehensive Survey.
IEEE Open J. Comput. Soc., 2024

Efficient privacy-preserving Gaussian process via secure multi-party computation.
J. Syst. Archit., 2024

Accelerating LASG/IAP climate system ocean model version 3 for performance portability using Kokkos.
Future Gener. Comput. Syst., 2024

Efficient Evolutionary Search Over Chemical Space with Large Language Models.
CoRR, 2024

Online Self-Preferring Language Models.
CoRR, 2024

MedAdapter: Efficient Test-Time Adaptation of Large Language Models towards Medical Reasoning.
CoRR, 2024

ARL2: Aligning Retrievers for Black-box Large Language Models via Self-guided Adaptive Relevance Labeling.
CoRR, 2024

EHRAgent: Code Empowers Large Language Models for Complex Tabular Reasoning on Electronic Health Records.
CoRR, 2024

SecFormer: Towards Fast and Accurate Privacy-Preserving Inference for Large Language Models.
CoRR, 2024

Uncertainty-Penalized Reinforcement Learning from Human Feedback with Diverse Reward LoRA Ensembles.
CoRR, 2024

A Survey on Scheduling Techniques in Computing and Network Convergence.
IEEE Commun. Surv. Tutorials, 2024

GGT: Graph-guided testing for adversarial sample detection of deep neural network.
Comput. Secur., 2024

Accelerating denoising diffusion probabilistic model via truncated inverse processes for medical image segmentation.
Comput. Biol. Medicine, 2024

ModelFoundry: A Tool for DNN Modularization and On-Demand Model Reuse Inspired by the Wisdom of Software Engineering.
Proceedings of the Companion Proceedings of the 32nd ACM International Conference on the Foundations of Software Engineering, 2024

A Performance-Portable Kilometer-Scale Global Ocean Model on ORISE and New Sunway Heterogeneous Supercomputers.
Proceedings of the International Conference for High Performance Computing, 2024

HYDRA: Model Factorization Framework for Black-Box LLM Personalization.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

SiFT: A Serial Framework with Textual Guidance for Federated Learning.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

Easing Concept Bleeding in Diffusion via Entity Localization and Anchoring.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Causally Motivated Personalized Federated Invariant Learning with Shortcut-Averse Information-Theoretic Regularization.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

At Which Training Stage Does Code Data Help LLMs Reasoning?
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Optimus: Warming Serverless ML Inference via Inter-Function Model Transformation.
Proceedings of the Nineteenth European Conference on Computer Systems, 2024

EHRAgent: Code Empowers Large Language Models for Few-shot Complex Tabular Reasoning on Electronic Health Records.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

MedAdapter: Efficient Test-Time Adaptation of Large Language Models Towards Medical Reasoning.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning.
Proceedings of the Computer Vision - ECCV 2024, 2024

HiGen: Hierarchy-Aware Sequence Generation for Hierarchical Text Classification.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

Revisiting Data Reconstruction Attacks on Real-world Dataset for Federated Natural Language Understanding.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

PCLmed: Champion Solution for ImageCLEFmedical 2024 Caption Prediction Challenge via Medical Vision-Language Foundation Models.
Proceedings of the Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2024), 2024

Explanation-aware Soft Ensemble Empowers Large Language Model In-context Learning.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Predicting Text Preference Via Structured Comparative Reasoning.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

RAM-EHR: Retrieval Augmentation Meets Clinical Predictions on Electronic Health Records.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, 2024

Knowledge-Infused Prompting: Assessing and Advancing Clinical Text Data Generation with Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

SecFormer: Fast and Accurate Privacy-Preserving Inference for Transformer Models via SMPC.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Multi-modal Stance Detection: New Datasets and Model.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Excitement surfeited turns to errors: Deep learning testing framework based on excitable neurons.
Inf. Sci., August, 2023

Influential Global and Local Contexts Guided Trace Representation for Fault Localization.
ACM Trans. Softw. Eng. Methodol., May, 2023

To Follow or Not to Follow: Understanding Issue/Pull-Request Templates on GitHub.
IEEE Trans. Software Eng., April, 2023

Towards Usable Neural Comment Generation via Code-Comment Linkage Interpretation: Method and Empirical Study.
IEEE Trans. Software Eng., April, 2023

deGraphCS: Embedding Variable-based Flow Graph for Neural Code Search.
ACM Trans. Softw. Eng. Methodol., April, 2023

Multitask joint strategies of self-supervised representation learning on biomedical networks for drug discovery.
Nat. Mac. Intell., April, 2023

Pull Request Decisions Explained: An Empirical Overview.
IEEE Trans. Software Eng., February, 2023

Plumber: Boosting the Propagation of Vulnerability Fixes in the npm Ecosystem.
IEEE Trans. Software Eng., 2023

When Database Meets New Storage Devices: Understanding and Exposing Performance Mismatches via Configurations.
Proc. VLDB Endow., 2023

How Many Validation Labels Do You Need? Exploring the Design Space of Label-Efficient Model Ranking.
CoRR, 2023

On What Basis? Predicting Text Preference Via Structured Comparative Reasoning.
CoRR, 2023

Knowledge-Infused Prompting: Assessing and Advancing Clinical Text Data Generation with Large Language Models.
CoRR, 2023

Personalized Federated Learning via Amortized Bayesian Meta-Learning.
CoRR, 2023

MUBen: Benchmarking the Uncertainty of Pre-Trained Models for Molecular Property Prediction.
CoRR, 2023

A Survey on Knowledge Graphs for Healthcare: Resources, Applications, and Promises.
CoRR, 2023

Stochastic Clustered Federated Learning.
CoRR, 2023

Ginver: Generative Model Inversion Attacks Against Collaborative Inference.
Proceedings of the ACM Web Conference 2023, 2023

MulCS: Towards a Unified Deep Representation for Multilingual Code Search.
Proceedings of the IEEE International Conference on Software Analysis, 2023

Practical privacy-preserving Gaussian process regression via secret sharing.
Proceedings of the Uncertainty in Artificial Intelligence, 2023

Can Machine Learning Pipelines Be Better Configured?
Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2023

Weakly-Supervised Scientific Document Classification via Retrieval-Augmented Multi-Stage Training.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

ToolQA: A Dataset for LLM Question Answering with External Tools.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Open Visual Knowledge Extraction via Relation-Oriented Multimodality Model Prompting.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Robust Multi-Agent Reinforcement Learning via Adversarial Regularization: Theoretical Foundation and Stable Algorithms.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Fine-grained Key-Value Memory Enhanced Predictor for Video Representation Learning.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Mask Again: Masked Knowledge Distillation for Masked Video Modeling.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

DyGen: Learning from Noisy Labels via Dynamics-Enhanced Generative Modeling.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Local Boosting for Weakly-Supervised Learning.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

R-Mixup: Riemannian Mixup for Biological Networks.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

A Two-Stage Framework for Ambiguous Classification in Software Engineering.
Proceedings of the 34th IEEE International Symposium on Software Reliability Engineering, 2023

Deep Dag Learning of Effective Brain Connectivity for FMRI Analysis.
Proceedings of the 20th IEEE International Symposium on Biomedical Imaging, 2023

Understanding and Detecting On-The-Fly Configuration Bugs.
Proceedings of the 45th IEEE/ACM International Conference on Software Engineering, 2023

Two Sides of the Same Coin: Exploiting the Impact of Identifiers in Neural Code Comprehension.
Proceedings of the 45th IEEE/ACM International Conference on Software Engineering, 2023

On the Reliability of Coverage Data for Fault Localization.
Proceedings of the 30th Asia-Pacific Software Engineering Conference, 2023

FedPETuning: When Federated Learning Meets the Parameter-Efficient Tuning Methods of Pre-trained Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

ReGen: Zero-Shot Text Classification via Training Data Generation with Progressive Dense Retrieval.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Cold-Start Data Selection for Better Few-shot Language Model Fine-tuning: A Prompt-based Uncertainty Propagation Approach.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Neighborhood-Regularized Self-Training for Learning with Few Labels.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Redundancy, Context, and Preference: An Empirical Study of Duplicate Pull Requests in OSS Projects.
IEEE Trans. Software Eng., 2022

Are You Still Working on This? An Empirical Study on Pull Request Abandonment.
IEEE Trans. Software Eng., 2022

Transductive Relation-Propagation With Decoupling Training for Few-Shot Learning.
IEEE Trans. Neural Networks Learn. Syst., 2022

Opportunities and Challenges in Repeated Revisions to Pull-Requests: An Empirical Study.
Proc. ACM Hum. Comput. Interact., 2022

Pull request latency explained: an empirical overview.
Empir. Softw. Eng., 2022

FENSE: A feature-based ensemble modeling approach to cross-project just-in-time defect prediction.
Empir. Softw. Eng., 2022

Learning Task-Aware Effective Brain Connectivity for fMRI Analysis with Graph Neural Networks.
CoRR, 2022

COCO-DR: Combating Distribution Shifts in Zero-Shot Dense Retrieval with Contrastive and Distributionally Robust Learning.
CoRR, 2022

ReSel: N-ary Relation Extraction from Scientific Text and Tables by Learning to Retrieve and Select.
CoRR, 2022

Cold-Start Data Selection for Few-shot Language Model Fine-tuning: A Prompt-Based Uncertainty Propagation Approach.
CoRR, 2022

PRBoost: Prompt-Based Rule Discovery and Boosting for Interactive Weakly-Supervised Learning.
CoRR, 2022

The Development and Prospect of Code Clone.
CoRR, 2022

DeepSensor: Deep Learning Testing Framework Based on Neuron Sensitivity.
CoRR, 2022

A Survey on Programmatic Weak Supervision.
CoRR, 2022

HAF: a hybrid annotation framework based on expert knowledge and learning technique.
Sci. China Inf. Sci., 2022

A Distributed Graph Inference Computation Framework Based on Graph Neural Network Model.
Proceedings of the 34th International Conference on Software Engineering and Knowledge Engineering, 2022

OpenMedIA: Open-Source Medical Image Analysis Toolbox and Benchmark Under Heterogeneous AI Computing Platforms.
Proceedings of the Pattern Recognition and Computer Vision - 5th Chinese Conference, 2022

Self-Training with Differentiable Teacher.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

AcTune: Uncertainty-Based Active Self-Training for Active Fine-Tuning of Pretrained Language Models.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Counterfactual and Factual Reasoning over Hypergraphs for Interpretable Clinical Predictions on EHR.
Proceedings of the Machine Learning for Health, 2022

Reentrancy Vulnerability Detection and Localization: A Deep Learning Based Two-phase Approach.
Proceedings of the 37th IEEE/ACM International Conference on Automated Software Engineering, 2022

NeuronFair: Interpretable White-Box Fairness Testing through Biased Neuron Identification.
Proceedings of the 44th IEEE/ACM 44th International Conference on Software Engineering, 2022

A Universal Data Augmentation Approach for Fault Localization.
Proceedings of the 44th IEEE/ACM 44th International Conference on Software Engineering, 2022

Bridging Pre-trained Models and Downstream Tasks for Source Code Understanding.
Proceedings of the 44th IEEE/ACM 44th International Conference on Software Engineering, 2022

Multi-Intention-Aware Configuration Selection for Performance Tuning.
Proceedings of the 44th IEEE/ACM 44th International Conference on Software Engineering, 2022

ReSel: N-ary Relation Extraction from Scientific Text and Tables by Learning to Retrieve and Select.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

COCO-DR: Combating the Distribution Shift in Zero-Shot Dense Retrieval with Contrastive and Distributionally Robust Learning.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Who, What, Why and How? Towards the Monetary Incentive in Crowd Collaboration: A Case Study of Github's Sponsor Mechanism.
Proceedings of the CHI '22: CHI Conference on Human Factors in Computing Systems, New Orleans, LA, USA, 29 April 2022, 2022

Learning Task-Aware Effective Brain Connectivity for fMRI Analysis with Graph Neural Networks (Extended Abstract).
Proceedings of the IEEE International Conference on Big Data, 2022

Prompt-Based Rule Discovery and Boosting for Interactive Weakly-Supervised Learning.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Near-Online Multi-Pedestrian Tracking via Combining Multiple Consistent Appearance Cues.
IEEE Trans. Circuits Syst. Video Technol., 2021

Detecting Duplicate Contributions in Pull-Based Model Combining Textual and Change Similarities.
J. Comput. Sci. Technol., 2021

Dual Channel Among Task and Contribution on OSS Communities: An Empirical Study.
Int. J. Softw. Eng. Knowl. Eng., 2021

How to cherry pick the bug report for better summarization?
Empir. Softw. Eng., 2021

ATM: An Uncertainty-aware Active Self-training Framework for Label-efficient Text Classification.
CoRR, 2021

Pull Request Decision Explained: An Empirical Overview.
CoRR, 2021

PanGu-α: Large-scale Autoregressive Pretrained Chinese Language Models with Auto-parallel Computation.
CoRR, 2021

deGraphCS: Embedding Variable-based Flow Graph for Neural Code Search.
CoRR, 2021

Why API documentation is insufficient for developers: an empirical study.
Sci. China Inf. Sci., 2021

SumGNN: multi-typed drug interaction prediction via efficient knowledge graph summarization.
Bioinform., 2021

MulCode: A Multi-task Learning Approach for Source Code Understanding.
Proceedings of the 28th IEEE International Conference on Software Analysis, 2021

WRENCH: A Comprehensive Benchmark for Weak Supervision.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self-Training Approach.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Detecting Adversarial Samples with Graph-Guided Testing.
Proceedings of the 36th IEEE/ACM International Conference on Automated Software Engineering, 2021

2020
Automatic Voter Recommendation Method for Closing Questions in Stack Overflow.
Int. J. Softw. Eng. Knowl. Eng., 2020

On the Shoulders of Giants: A New Dataset for Pull-based Development Research.
Proceedings of the MSR '20: 17th International Conference on Mining Software Repositories, 2020

STEAM: Self-Supervised Taxonomy Expansion with Mini-Paths.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

BOND: BERT-Assisted Open-Domain Named Entity Recognition with Distant Supervision.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

CP-Detector: Using Configuration-related Performance Properties to Expose Performance Bugs.
Proceedings of the 35th IEEE/ACM International Conference on Automated Software Engineering, 2020

BugSum: Deep Context Understanding for Bug Report Summarization.
Proceedings of the ICPC '20: 28th International Conference on Program Comprehension, 2020

An Empirical Study of Multi-discussing Pattern in Open-Source Software Development.
Proceedings of the Internetware'20: 12th Asia-Pacific Symposium on Internetware, 2020

Software visualization and deep transfer learning for effective software defect prediction.
Proceedings of the ICSE '20: 42nd International Conference on Software Engineering, Seoul, South Korea, 27 June, 2020

SeqMix: Augmenting Active Sequence Labeling via Sequence Mixup.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Sia-RAE: A Siamese Network based on Recursive AutoEncoder for Effective Clone Detection.
Proceedings of the 27th Asia-Pacific Software Engineering Conference, 2020

2019
RepoLike: amulti-feature-based personalized recommendation approach for open-source repositories.
Frontiers Inf. Technol. Electron. Eng., 2019

Multi-reviewing pull-requests: An exploratory study on GitHub OSS projects.
Inf. Softw. Technol., 2019

History-Driven Fix for Code Quality Issues.
IEEE Access, 2019

2018
Correlation-based software search by leveraging software term database.
Frontiers Comput. Sci., 2018

An Initial Step Towards Organ Transplantation Based on GitHub Repository.
IEEE Access, 2018

Adaptive software search toward users' customized requirements in GitHub.
Proceedings of the 30th International Conference on Software Engineering and Knowledge Engineering, 2018

A dataset of duplicate pull-requests in github.
Proceedings of the 15th International Conference on Mining Software Repositories, 2018

Within-ecosystem issue linking: a large-scale study of rails.
Proceedings of the 7th International Workshop on Software Mining, 2018

Transferring Well-Trained Models for Cross-Project Issue Classification: A Large-Scale Empirical Study.
Proceedings of the Tenth Asia-Pacific Symposium on Internetware, 2018

Who Will Become a Long-Term Contributor?: A Prediction Model based on the Early Phase Behaviors.
Proceedings of the Tenth Asia-Pacific Symposium on Internetware, 2018

A Hybrid Approach for Tag Hierarchy Construction.
Proceedings of the New Opportunities for Software Reuse - 17th International Conference, 2018

Cross-Project Issue Classification Based on Ensemble Modeling in a Social Coding World.
Proceedings of the Neural Information Processing - 25th International Conference, 2018

Building a Cloud-Ready Program: A highly scalable Implementation based on Kubernetes.
Proceedings of the 2nd International Conference on Advances in Image Processing, 2018

An Insight Into the Impact of Dockerfile Evolutionary Trajectories on Quality and Latency.
Proceedings of the 2018 IEEE 42nd Annual Computer Software and Applications Conference, 2018

Recommending Similar Bug Reports: A Novel Approach Using Document Embedding Model.
Proceedings of the 25th Asia-Pacific Software Engineering Conference, 2018

2017
What Are They Talking About? Analyzing Code Reviews in Pull-Based Development Model.
J. Comput. Sci. Technol., 2017

Social media in GitHub: the role of @-mention in assisting software development.
Sci. China Inf. Sci., 2017

Automatic Classification of Review Comments in Pull-based Development Model.
Proceedings of the 29th International Conference on Software Engineering and Knowledge Engineering, 2017

Detecting Duplicate Pull-requests in GitHub.
Proceedings of the 9th Asia-Pacific Symposium on Internetware, 2017

DevRec: A Developer Recommendation System for Open Source Repositories.
Proceedings of the Mastering Scale and Complexity in Software Reuse, 2017

Where Is the Road for Issue Reports Classification Based on Text Mining?
Proceedings of the 2017 ACM/IEEE International Symposium on Empirical Software Engineering and Measurement, 2017

2016
Reviewer recommendation for pull-requests in GitHub: What can we learn from code review and bug assignment?
Inf. Softw. Technol., 2016

Initial and Eventual Software Quality Relating to Continuous Integration in GitHub.
CoRR, 2016

Determinants of pull-based development in the context of continuous integration.
Sci. China Inf. Sci., 2016

2015
OSSEAN: Mining Crowd Wisdom in Open Source Communities.
Proceedings of the 2015 IEEE Symposium on Service-Oriented System Engineering, 2015

Quality and productivity outcomes relating to continuous integration in GitHub.
Proceedings of the 2015 10th Joint Meeting on Foundations of Software Engineering, 2015

Wait for It: Determinants of Pull Request Evaluation Latency on GitHub.
Proceedings of the 12th IEEE/ACM Working Conference on Mining Software Repositories, 2015

Evaluating Bug Severity Using Crowd-based Knowledge: An Exploratory Study.
Proceedings of the 7th Asia-Pacific Symposium on Internetware, 2015

Exploring the Use of @-mention to Assist Software Development in GitHub.
Proceedings of the 7th Asia-Pacific Symposium on Internetware, 2015

2014
Investigating social media in GitHub's pull-requests: a case study on Ruby on Rails.
Proceedings of the 1st International Workshop on Crowd-based Software Development Methods and Technologies, 2014

Exploring the patterns of social behavior in GitHub.
Proceedings of the 1st International Workshop on Crowd-based Software Development Methods and Technologies, 2014

Reviewer Recommender of Pull-Requests in GitHub.
Proceedings of the 30th IEEE International Conference on Software Maintenance and Evolution, Victoria, BC, Canada, September 29, 2014

A Exploratory Study of @-Mention in GitHub's Pull-Requests.
Proceedings of the 21st Asia-Pacific Software Engineering Conference, 2014

Who Should Review this Pull-Request: Reviewer Recommendation to Expedite Crowd Collaboration.
Proceedings of the 21st Asia-Pacific Software Engineering Conference, 2014

2013
A Trusted Remote Attestation Model Based on Trusted Computing.
Proceedings of the 12th IEEE International Conference on Trust, 2013

HESA: The Construction and Evaluation of Hierarchical Software Feature Repository.
Proceedings of the 25th International Conference on Software Engineering and Knowledge Engineering, 2013

Mining and recommending software features across multiple web repositories.
Proceedings of the 5th Asia-Pacific Symposium on Internetware, 2013

2012
Inducing Taxonomy from Tags: An Agglomerative Hierarchical Clustering Framework.
Proceedings of the Advanced Data Mining and Applications, 8th International Conference, 2012


  Loading...