Audris Mockus

Orcid: 0000-0002-7987-7598

Affiliations:
  • University of Tennessee, Knoxville, Department of Elecrical Engineering and Computer Science
  • Avaya Labs Research, Basking Ridge, USA


According to our database1, Audris Mockus authored at least 149 papers between 1994 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Beyond Dependencies: The Role of Copy-Based Reuse in Open Source Software Development.
CoRR, 2024

OSS License Identification at Scale: A Comprehensive Dataset Using World of Code.
CoRR, 2024

Towards Automation of Human Stage of Decay Identification: An Artificial Intelligence Approach.
CoRR, 2024

The Role of Data Filtering in Open Source Software Ranking and Selection.
Proceedings of the 1st IEEE/ACM International Workshop on Methodological Issues with Empirical Studies in Software Engineering, 2024

Dataset: Copy-based Reuse in Open Source Software.
Proceedings of the 21st IEEE/ACM International Conference on Mining Software Repositories, 2024

2023
On the Variability of Software Engineering Needs for Deep Learning: Stages, Trends, and Application Types.
IEEE Trans. Software Eng., February, 2023

SciCat: A Curated Dataset of Scientific Software Repositories.
CoRR, 2023

Modeling the Centrality of Developer Output with Software Supply Chains.
Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2023

Applying the Universal Version History Concept to Help De-Risk Copy-Based Code Reuse.
Proceedings of the 23rd IEEE International Working Conference on Source Code Analysis and Manipulation, 2023

Stage of Decay Estimation Exploiting Exogenous and Endogenous Image Attributes to Minimize Manual Labeling Efforts and Maximize Classification Performance.
Proceedings of the IEEE International Conference on Image Processing, 2023

How R Developers explain their Package Choice: A Survey.
Proceedings of the ACM/IEEE International Symposium on Empirical Software Engineering and Measurement, 2023

2022
A Methodology for Analyzing Uptake of Software Technologies Among Developers.
IEEE Trans. Software Eng., 2022

One-off events? An empirical study of hackathon code creation and reuse.
Empir. Softw. Eng., 2022

How are Software Repositories Mined? A Systematic Literature Review of Workflows, Methodologies, Reproducibility, and Tools.
CoRR, 2022

SLRNet: Semi-Supervised Semantic Segmentation Via Label Reuse for Human Decomposition Images.
CoRR, 2022

The Extent of Orphan Vulnerabilities from Code Reuse in Open Source Software.
Proceedings of the 44th IEEE/ACM 44th International Conference on Software Engineering, 2022

2021
Companies' Participation in OSS Development-An Empirical Study of OpenStack.
IEEE Trans. Software Eng., 2021

World of code: enabling a research workflow for mining and analyzing the universe of open source VCS data.
Empir. Softw. Eng., 2021

Pseudo Pixel-level Labeling for Images with Evolving Content.
CoRR, 2021

The Secret Life of Hackathon Code.
CoRR, 2021

SChISM: Semantic Clustering via Image Sequence Merging for Images of Human-Decomposition.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

The Secret Life of Hackathon Code Where does it come from and where does it go?
Proceedings of the 18th IEEE/ACM International Conference on Mining Software Repositories, 2021

Replication Package for Representation of Developer Expertise in Open Source Software.
Proceedings of the 43rd IEEE/ACM International Conference on Software Engineering: Companion Proceedings, 2021

Representation of Developer Expertise in Open Source Software.
Proceedings of the 43rd IEEE/ACM International Conference on Software Engineering, 2021

2020
A Dataset for GitHub Repository Deduplication: Replication Package.
Dataset, February, 2020

A Dataset for GitHub Repository Deduplication: Replication Package.
Dataset, February, 2020

A Dataset for GitHub Repository Deduplication.
Dataset, February, 2020

Do code review measures explain the incidence of post-release defects?
Empir. Softw. Eng., 2020

Deriving a usage-independent software quality metric.
Empir. Softw. Eng., 2020

ALFAA: Active Learning Fingerprint based Anti-Aliasing for correcting developer identity errors in version control systems.
Empir. Softw. Eng., 2020

More Effective Software Repository Mining.
CoRR, 2020

Which Pull Requests Get Accepted and Why? A study of popular NPM Packages.
CoRR, 2020

An Analytical Workflow for Clustering Forensic Images.
CoRR, 2020

A Dataset for GitHub Repository Deduplication.
Proceedings of the MSR '20: 17th International Conference on Mining Software Repositories, 2020

A Complete Set of Related Git Repositories Identified via Community Detection Approaches Based on Shared Commits.
Proceedings of the MSR '20: 17th International Conference on Mining Software Repositories, 2020

A Dataset and an Approach for Identity Resolution of 38 Million Author IDs extracted from 2B Git Commits.
Proceedings of the MSR '20: 17th International Conference on Mining Software Repositories, 2020

Detecting and Characterizing Bots that Commit Code.
Proceedings of the MSR '20: 17th International Conference on Mining Software Repositories, 2020

An Exploratory Study of Bot Commits.
Proceedings of the ICSE '20: 42nd International Conference on Software Engineering, Workshops, Seoul, Republic of Korea, 27 June, 2020

Collaborative Learning Of Semi-Supervised Clustering And Classification For Labeling Uncurated Data.
Proceedings of the IEEE International Conference on Image Processing, 2020

Effect of Technical and Social Factors on Pull Request Quality for the NPM Ecosystem.
Proceedings of the ESEM '20: ACM / IEEE International Symposium on Empirical Software Engineering and Measurement, 2020

An Analytical Workflow for Clustering Forensic Images (Student Abstract).
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Machine-assisted annotation of forensic imagery.
CoRR, 2019

ALFAA: Active Learning Fingerprint Based Anti-Aliasing for Correcting Developer Identity Errors in Version Control Data.
CoRR, 2019

Insights from open source software supply chains (keynote).
Proceedings of the ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2019

Patterns of Effort Contribution and Demand and User Classification based on Participation Patterns in NPM Ecosystem.
Proceedings of the Fifteenth International Conference on Predictive Models and Data Analytics in Software Engineering, 2019

World of code: an infrastructure for mining the universe of open source VCS data.
Proceedings of the 16th International Conference on Mining Software Repositories, 2019

Developer Reputation Estimator (DRE).
Proceedings of the 34th IEEE/ACM International Conference on Automated Software Engineering, 2019

Machine-Assisted Annotation of Forensic Imagery.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

2018
Are Software Dependency Supply Chain Metrics Useful in Predicting Change of Popularity of NPM Packages?
Proceedings of the 14th International Conference on Predictive Models and Data Analytics in Software Engineering, 2018

Modeling Relationship between Post-Release Faults and Usage in Mobile Software.
Proceedings of the 14th International Conference on Predictive Models and Data Analytics in Software Engineering, 2018

2017
Towards Engineering Free/Libre OpenSource Software (FLOSS) Ecosystems forImpact and Sustainability (NII Shonan Meeting 2017-8).
NII Shonan Meet. Rep., 2017

On the scalability of Linux kernel maintainers' work.
Proceedings of the 2017 11th Joint Meeting on Foundations of Software Engineering, 2017

WIP: Live Restructuring of Data Architecture.
Proceedings of the 12th IEEE/ACM International Workshop on Software Engineering for Science, 2017

2016
Inflow and Retention in OSS Communities with Commercial Involvement: A Case Study of Three Hybrid Projects.
ACM Trans. Softw. Eng. Methodol., 2016

Improving Software Quality as Customers Perceive It.
IEEE Softw., 2016

Software Engineering for Big Data Systems.
IEEE Softw., 2016

Crowdsourcing the discovery of software repositories in an educational environment.
PeerJ Prepr., 2016

Towards building a universal defect prediction model with rank transformed predictors.
Empir. Softw. Eng., 2016

Effectiveness of code contribution: from patch-based to pull-request-based tools.
Proceedings of the 24th ACM SIGSOFT International Symposium on Foundations of Software Engineering, 2016

Thresholds for Size and Complexity Metrics: A Case Study from the Perspective of Defect Density.
Proceedings of the 2016 IEEE International Conference on Software Quality, 2016

Exploring a framework for identity and attribute linking across heterogeneous data systems.
Proceedings of the 2nd International Workshop on BIG Data Software Engineering, 2016

Decisions as a service for application centric real time analytics.
Proceedings of the 2nd International Workshop on BIG Data Software Engineering, 2016

Quantifying and mitigating turnover-induced knowledge loss: case studies of chrome and a project at avaya.
Proceedings of the 38th International Conference on Software Engineering, 2016

Modularizing global variable in climate simulation software: position paper.
Proceedings of the International Workshop on Software Engineering for Science, 2016

Analysis of Popularity of Game Mods: A Case Study.
Proceedings of the 2016 Annual Symposium on Computer-Human Interaction in Play, 2016

Operational data are missing, incorrect, and decontextualized.
Proceedings of the Perspectives on Data Science for Software Engineering, 2016

2015
Who Will Stay in the FLOSS Community? Modeling Participant's Initial Behavior.
IEEE Trans. Software Eng., 2015

An empirical study of goto in C code.
PeerJ Prepr., 2015

A Large-Scale Empirical Study of the Relationship between Build Technology and Build Maintenance.
Empir. Softw. Eng., 2015

A method to identify and correct problematic software activity data: exploiting capacity constraints and data redundancies.
Proceedings of the 2015 10th Joint Meeting on Foundations of Software Engineering, 2015

An empirical study of goto in C code from GitHub repositories.
Proceedings of the 2015 10th Joint Meeting on Foundations of Software Engineering, 2015

Evidence Engineering.
Proceedings of the 8th India Software Engineering Conference, 2015

Commit Quality in Five High Performance Computing Projects.
Proceedings of the 1st IEEE/ACM International Workshop on Software Engineering for High Performance Computing in Science, 2015

Assessing the State of Software in a Large Enterprise.
Proceedings of the Art and Science of Analyzing Software Data, 2015

2014
Mining micro-practices from operational data.
Proceedings of the 22nd ACM SIGSOFT International Symposium on Foundations of Software Engineering, (FSE-22), Hong Kong, China, November 16, 2014

Defect prediction and software risk.
Proceedings of the 10th International Conference on Predictive Models in Software Engineering, 2014

Is mining software repositories data science? (keynote).
Proceedings of the 11th Working Conference on Mining Software Repositories, 2014

Towards building a universal defect prediction model.
Proceedings of the 11th Working Conference on Mining Software Repositories, 2014

Product assignment recommender.
Proceedings of the 36th International Conference on Software Engineering, 2014

Engineering big data solutions.
Proceedings of the on Future of Software Engineering, 2014

Collecting and leveraging a benchmark of build system clones to aid in quality assessments.
Proceedings of the 36th International Conference on Software Engineering, 2014

Patterns of folder use and project popularity: a case study of github repositories.
Proceedings of the 2014 ACM-IEEE International Symposium on Empirical Software Engineering and Measurement, 2014

Forking and coordination in multi-platform development: a case study.
Proceedings of the 2014 ACM-IEEE International Symposium on Empirical Software Engineering and Measurement, 2014

2013
Quantifying the Effect of Code Smells on Maintenance Effort.
IEEE Trans. Software Eng., 2013

A Large-Scale Empirical Study of Just-in-Time Quality Assurance.
IEEE Trans. Software Eng., 2013

Risky files: an approach to focus quality improvement effort.
Proceedings of the Joint Meeting of the European Software Engineering Conference and the ACM SIGSOFT Symposium on the Foundations of Software Engineering, 2013

How Does Context Affect the Distribution of Software Maintainability Metrics?
Proceedings of the 2013 IEEE International Conference on Software Maintenance, 2013

The chunking pattern.
Proceedings of the 1st International Workshop on Data Analysis Patterns in Software Engineering, 2013

Impact of Triage: A Study of Mozilla and Gnome.
Proceedings of the 2013 ACM / IEEE International Symposium on Empirical Software Engineering and Measurement, 2013

2012
What make long term contributors: Willingness and opportunity in OSS community.
Proceedings of the 34th International Conference on Software Engineering, 2012

Questioning software maintenance metrics: a comparative case study.
Proceedings of the 2012 ACM-IEEE International Symposium on Empirical Software Engineering and Measurement, 2012

2011
Guest Editors' Introduction: Special Section on the Socio-Technical Environment of Software Development Projects.
IEEE Trans. Software Eng., 2011

High-impact defects: a study of breakage and surprise defects.
Proceedings of the SIGSOFT/FSE'11 19th ACM SIGSOFT Symposium on the Foundations of Software Engineering (FSE-19) and ESEC'11: 13th European Software Engineering Conference (ESEC-13), 2011

Does the initial environment impact the future of developers.
Proceedings of the 33rd International Conference on Software Engineering, 2011

2010
Assessing the state of software in a large enterprise.
Empir. Softw. Eng., 2010

Developer fluency: achieving true mastery in software projects.
Proceedings of the 18th ACM SIGSOFT International Symposium on Foundations of Software Engineering, 2010

Growth of newcomer competence: challenges of globalization.
Proceedings of the Workshop on Future of Software Engineering Research, 2010

Organizational volatility and its effects on software defects.
Proceedings of the 18th ACM SIGSOFT International Symposium on Foundations of Software Engineering, 2010

2009
Software Dependencies, Work Dependencies, and Their Impact on Failures.
IEEE Trans. Software Eng., 2009

Variability and Reproducibility in Software Engineering: A Study of Four Companies that Developed the Same System.
IEEE Trans. Software Eng., 2009

Future of Mining Software Archives: A Roundtable.
IEEE Softw., 2009

Amassing and indexing a large sample of version control systems: Towards the census of public source code history.
Proceedings of the 6th International Working Conference on Mining Software Repositories, 2009

Succession: Measuring transfer of code and developer productivity.
Proceedings of the 31st International Conference on Software Engineering, 2009

2<sup>nd</sup> international workshop on socio-technical congruence (STC 2009).
Proceedings of the 31st International Conference on Software Engineering, 2009

Test coverage and post-verification defects: A multiple case study.
Proceedings of the Third International Symposium on Empirical Software Engineering and Measurement, 2009

2008
Evaluation of source code copy detection methods on freebsd.
Proceedings of the 2008 International Working Conference on Mining Software Repositories, 2008

Interval quality: relating customer-perceived quality to process quality.
Proceedings of the 30th International Conference on Software Engineering (ICSE 2008), 2008

Socio-technical congruence (STC 2008).
Proceedings of the 30th International Conference on Software Engineering (ICSE 2008), 2008

Missing Data in Software Engineering.
Proceedings of the Guide to Advanced Empirical Software Engineering, 2008

2006
TA-RE: an exchange language for mining software repositories.
Proceedings of the 2006 International Workshop on Mining Software Repositories, 2006

Constructing universal version history.
Proceedings of the 2006 International Workshop on Mining Software Repositories, 2006

Empirical estimates of software availability of deployed systems.
Proceedings of the 2006 International Symposium on Empirical Software Engineering (ISESE 2006), 2006

Software Support Tools and Experimental Work.
Proceedings of the Empirical Software Engineering Issues. Critical Assessment and Future Directions, 2006

Quantifying the Value of New Technologies for Software Development.
Proceedings of the Value-Based Software Engineering, 2006

2005
Guest Editor's Introduction: Special Issue on Mining Software Repositories.
IEEE Trans. Software Eng., 2005

Report on MSR 2004: International workshop on mining software repositories.
ACM SIGSOFT Softw. Eng. Notes, 2005

Refactoring for Changeability: A Way to Go?
Proceedings of the 11th IEEE International Symposium on Software Metrics (METRICS 2005), 2005

Predictors of customer perceived software quality.
Proceedings of the 27th International Conference on Software Engineering (ICSE 2005), 2005

2004
MSR 2004: International Workshop on Mining Software Repositories.
Proceedings of the 26th International Conference on Software Engineering (ICSE 2004), 2004

2003
An Empirical Study of Speed and Communication in Globally Distributed Software Development.
IEEE Trans. Software Eng., 2003

Formulation and preliminary test of an empirical theory of coordination in software engineering.
Proceedings of the 11th ACM SIGSOFT Symposium on Foundations of Software Engineering 2003 held jointly with 9th European Software Engineering Conference, 2003

Analogy Based Prediction of Work Item Flow in Software Projects: a Case Study.
Proceedings of the 2003 International Symposium on Empirical Software Engineering (ISESE 2003), 30 September, 2003

Understanding and Predicting Effort in Software Projects.
Proceedings of the 25th International Conference on Software Engineering, 2003

2002
Visualizing Software Changes.
IEEE Trans. Software Eng., 2002

Using Version Control Data to Evaluate the Impact of Software Tools: A Case Study of the Version Editor.
IEEE Trans. Software Eng., 2002

Two case studies of open source software development: Apache and Mozilla.
ACM Trans. Softw. Eng. Methodol., 2002

handiMessenger: Awareness-Enhanced Universal Communication for Mobile Users.
Proceedings of the Mobile Human-Computer Interaction, 4th International Symposium, 2002

Expertise browser: a quantitative approach to identifying expertise.
Proceedings of the 24th International Conference on Software Engineering, 2002

Shared Mental Models, Familiarity, and Coordination: A Multi-Method Study of Distributed Software Teams.
Proceedings of the International Conference on Information Systems, 2002

2001
Does Code Decay? Assessing the Evidence from Change Management Data.
IEEE Trans. Software Eng., 2001

Identifying Productivity Drivers by Modeling Work Units Using Partial Data.
Technometrics, 2001

Globalization by Chunking: A Quantitative Approach.
IEEE Softw., 2001

Making the Software Factory Work: Lessons from a Decade of Experience.
Proceedings of the 7th IEEE International Software Metrics Symposium (METRICS 2001), 2001

Challenges of Global Software Development.
Proceedings of the 7th IEEE International Software Metrics Symposium (METRICS 2001), 2001

An Empirical Study of Global Software Development: Distance and Speed.
Proceedings of the 23rd International Conference on Software Engineering, 2001

Shared Mental Models and Coordination in Large-Scale, Distributed Software Development.
Proceedings of the International Conference on Information Systems, 2001

2000
Predicting risk of software changes.
Bell Labs Tech. J., 2000

Measuring technology effects on software change cost.
Bell Labs Tech. J., 2000

Identifying Reasons for Software Changes using Historic Databases.
Proceedings of the 2000 International Conference on Software Maintenance, 2000

A case study of open source software development: the Apache server.
Proceedings of the 22nd International Conference on on Software Engineering, 2000

Distance, dependencies, and delay in a global collaboration.
Proceedings of the CSCW 2000, 2000

A Web-Based Approach to Interactive Visualization in Context.
Proceedings of the working conference on Advanced visual interfaces, 2000

1999
Measuring Domain Engineering Effects on Software Change Cost.
Proceedings of the 6th IEEE International Software Metrics Symposium (METRICS 1999), 1999

Using Version Control Data to Evaluate the Impact of Software Tools.
Proceedings of the 1999 International Conference on Software Engineering, 1999

1998
A Web Laboratory for Software Data Analysis.
World Wide Web, 1998

Understanding the Sources of Variation in Software Inspections.
ACM Trans. Softw. Eng. Methodol., 1998

Inferring Change Effort from Configuration Management Databases.
Proceedings of the 5th IEEE International Software Metrics Symposium (METRICS 1998), 1998

1997
Bayesian approach for randomization of heuristic algorithms of discrete programming.
Proceedings of the Randomization Methods in Algorithm Design, 1997

1994
An Example of the Estimation and Display of a Smoothly Varying Function of Time and Space - The Incidence of the Disease Mumps.
J. Am. Soc. Inf. Sci., 1994


  Loading...