Weijia Xu

Orcid: 0000-0002-5732-8926

According to our database, Weijia Xu authored at least 108 papers between 2003 and 2023.

Bibliography

2023
Stronger Inductive Biases for Sample-Efficient and Controllable Neural Machine Translation.
PhD thesis, 2023

GRIM: GRaph-based Interactive narrative visualization for gaMes.
CoRR, 2023

Reprompting: Automated Chain-of-Thought Prompt Inference Through Gibbs Sampling.
CoRR, 2023

Understanding and Detecting Hallucinations in Neural Machine Translation via Model Introspection.
CoRR, 2023

2022
Deep Neural Network Training With Distributed K-FAC.
IEEE Trans. Parallel Distributed Syst., 2022

Synchronic Curation for Assessing Reuse and Integration Fitness of Multiple Data Collections.
Int. J. Digit. Curation, 2022

Research on multi factor stock selection model based on LightGBM and Bayesian Optimization.
Proceedings of the 9th International Conference on Information Technology and Quantitative Management, 2022

BatchLens: A Visualization Approach for Analyzing Batch Jobs in Cloud Systems.
Proceedings of the 2022 Design, Automation & Test in Europe Conference & Exhibition, 2022

Constrained Regeneration for Cross-Lingual Query-Focused Extractive Summarization.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Accelerating Deep Learning Training Through Transparent Storage Tiering.
Proceedings of the 22nd IEEE International Symposium on Cluster, 2022

2021
EDITOR: an Edit-Based Transformer with Repositioning for Neural Machine Translation with Soft Lexical Constraints.
Trans. Assoc. Comput. Linguistics, 2021

Gramene 2021: harnessing the power of comparative genomics and pathways for plant research.
Nucleic Acids Res., 2021

Improved incremental local outlier detection for data streams based on the landmark window model.
Knowl. Inf. Syst., 2021

Recognition and Co-Analysis of Pedestrian Activities in Different Parts of Road using Traffic Camera Video.
CoRR, 2021

Soft Layer Selection with Meta-Learning for Zero-Shot Cross-Lingual Transfer.
CoRR, 2021

Research on Tourism Prosperity Index Based on the Power Big Data.
Proceedings of the WI-IAT '21: IEEE/WIC/ACM International Conference on Web Intelligence, Hybrid Event / Melbourne, VIC, Australia, December 14 - 17, 2021, 2021

The Impact of COVID-19 on China's Capital Market and Major Industry Sectors.
Proceedings of the 8th International Conference on Information Technology and Quantitative Management, 2021

Improving Multilingual Neural Machine Translation with Auxiliary Source Languages.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Rule-based Morphological Inflection Improves Neural Terminology Translation.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Lightning Talks of EduHPC 2021.
Proceedings of the 9th IEEE/ACM Workshop on Education for High Performance Computing, 2021

The Case for Storage Optimization Decoupling in Deep Learning Frameworks.
Proceedings of the IEEE International Conference on Cluster Computing, 2021

MONARCH: Hierarchical Storage Management for Deep Learning Frameworks.
Proceedings of the IEEE International Conference on Cluster Computing, 2021

Tracking Property Ownership Variance and Forecasting Housing Price with Machine Learning and Deep Learning.
Proceedings of the 2021 IEEE International Conference on Big Data (Big Data), 2021

How Does Distilled Data Complexity Impact the Quality and Confidence of Non-Autoregressive Machine Translation?
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

A Non-Autoregressive Edit-Based Approach to Controllable Text Simplification.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
Building a PubMed knowledge graph.
CoRR, 2020

Hierarchical Dirichlet Multinomial Allocation Model for Multi-Source Document Clustering.
IEEE Access, 2020

A New Data Fusion Framework of Business Intelligence and Analytics in Economy, Finance and Management.
Proceedings of the IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology, 2020

Convolutional neural network training with distributed K-FAC.
Proceedings of the International Conference for High Performance Computing, 2020

Modeling Data Curation to Scientific Inquiry: A Case Study for Multimodal Data Integration.
Proceedings of the JCDL '20: Proceedings of the ACM/IEEE Joint Conference on Digital Libraries in 2020, 2020

Dual Reconstruction: a Unifying Objective for Semi-Supervised Neural Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

End-to-End Slot Alignment and Recognition for Cross-Lingual NLU.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

AI4AV (Artificial Intelligence for Audiovisual): Design and Evaluation of a Shared System for LAMs.
Proceedings of the 15th Annual International Conference of the Alliance of Digital Humanities Organizations, 2020

A Study of Spoken Audio Processing using Machine Learning for Libraries, Archives and Museums (LAM).
Proceedings of the 2020 IEEE International Conference on Big Data (IEEE BigData 2020), 2020

2019
Improving Publication Pipeline with Automated Biological Entity Detection and Validation Service.
Data Inf. Manag., 2019

Special Issue on Cyberinfrastructure, Machine Learning, and Digital Library.
Data Inf. Manag., 2019

Identifier Services: Modeling and Implementing Distributed Data Management in Cyberinfrastructure.
Data Inf. Manag., 2019

HPC AI500: A Benchmark Suite for HPC AI Systems.
CoRR, 2019

Extracting Domain Information using Deep Learning.
Proceedings of the Practice and Experience in Advanced Research Computing on Rise of the Machines (learning), 2019

Adjusting the Inheritance of Topic for Dynamic Document Clustering.
Proceedings of the Theoretical Computer Science - 37th National Conference, 2019

Differentiable Sampling with Flexible Reference Word Order for Neural Machine Translation.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Bi-Directional Differentiable Input Reconstruction for Low-Resource Neural Machine Translation.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Surprise Languages: Rapid-Response Cross-Language IR.
Proceedings of the 9th International Workshop on Evaluating Information Access co-located with the 14th NTCIR Conference on the Evaluation of Information Access Technologies (NTCIR 2019), 2019

Quantifying the Impact of Memory Errors in Deep Learning.
Proceedings of the 2019 IEEE International Conference on Cluster Computing, 2019

Detecting Pedestrian Crossing Events in Large Video Data from Traffic Monitoring Cameras.
Proceedings of the 2019 IEEE International Conference on Big Data (IEEE BigData), 2019

Performance Comparison of Julia Distributed Implementations of Dirichlet Process Mixture Models.
Proceedings of the 2019 IEEE International Conference on Big Data (IEEE BigData), 2019

2018
Authentication with User Driven Web Application for Accessing Remote Resources.
Proceedings of the Practice and Experience on Advanced Research Computing, 2018

A Case Study of R Performance Analysis and Optimization.
Proceedings of the Practice and Experience on Advanced Research Computing, 2018

Building Big Data Processing and Visualization Pipeline through Apache Zeppelin.
Proceedings of the Practice and Experience on Advanced Research Computing, 2018

The University of Maryland's Chinese-English Neural Machine Translation Systems at WMT18.
Proceedings of the Third Conference on Machine Translation: Shared Task Papers, 2018

Enabling User Driven Web Applications on Remote Computing Resource.
Proceedings of the 2018 IEEE World Congress on Services, 2018

Cyberinfrastructure for Digital Libraries and Archives: Integrating Data Management, Analysis, and Publication.
Proceedings of the 18th ACM/IEEE on Joint Conference on Digital Libraries, 2018

Domain Informational Vocabulary Extraction Experiences with Publication Pipeline Integration and Ontology Curation.
Proceedings of the 9th International Conference on Biological Ontology (ICBO 2018), 2018

Enabling User Driven Big Data Application on Remote Computing Resources.
Proceedings of the IEEE International Conference on Big Data (IEEE BigData 2018), 2018

Integrated HPC Scheduler Data Processing Workflow using Apache Zeppelin.
Proceedings of the IEEE International Conference on Big Data (IEEE BigData 2018), 2018

HPC AI500: A Benchmark Suite for HPC AI Systems.
Proceedings of the Benchmarking, Measuring, and Optimizing, 2018

2017
Insights into Research Computing Operations using Big Data-Powered Log Analysis.
Proceedings of the Practice and Experience in Advanced Research Computing 2017: Sustainability, 2017

A Portable Strategy for Preserving Web Applications Functionality.
Proceedings of the 2017 ACM/IEEE Joint Conference on Digital Libraries, 2017

Big data system for information aggregation and model comparison for precision medicine.
Proceedings of the 2017 IEEE International Conference on Big Data (IEEE BigData 2017), 2017

Enabling versatile analysis of large scale traffic video data with deep learning and HiveQL.
Proceedings of the 2017 IEEE International Conference on Big Data (IEEE BigData 2017), 2017

2016
A web interface for XALT log data analysis.
Proceedings of the XSEDE16 Conference on Diversity, 2016

Data Curation with a Focus on Reuse.
Proceedings of the 16th ACM/IEEE-CS on Joint Conference on Digital Libraries, 2016

A Web Application for Extracting Key Domain Information for Scientific Publications Using Ontology.
Proceedings of the Joint International Conference on Biological Ontology and BioCreative, 2016

Enhancing Information Accessibility of Publications with Text Mining and Ontology.
Proceedings of the Joint International Conference on Biological Ontology and BioCreative, 2016

Computation-Aided Analysis on Film Credits.
Proceedings of the 11th Annual International Conference of the Alliance of Digital Humanities Organizations, 2016

Supporting large scale connected vehicle data analysis using HIVE.
Proceedings of the 2016 IEEE International Conference on Big Data (IEEE BigData 2016), 2016

Content-based comparison for collections identification.
Proceedings of the 2016 IEEE International Conference on Big Data (IEEE BigData 2016), 2016

A workload aware model of computational resource selection for big data applications.
Proceedings of the 2016 IEEE International Conference on Big Data (IEEE BigData 2016), 2016

2015
FlowGate: towards extensible and scalable web-based flow cytometry data analysis.
Proceedings of the 2015 XSEDE Conference: Scientific Advancements Enabled by Enhanced Cyberinfrastructure, St. Louis, MO, USA, July 26, 2015

Multi-Column Query Method Research and Optimization on HBase.
Proceedings of the Knowledge Management in Organizations - 10th International Conference, 2015

Wrangler's user environment: A software framework for management of data-intensive computing system.
Proceedings of the 2015 IEEE International Conference on Big Data (IEEE BigData 2015), Santa Clara, CA, USA, October 29, 2015

Performance evaluation of enabling logistic regression for big data with R.
Proceedings of the 2015 IEEE International Conference on Big Data (IEEE BigData 2015), Santa Clara, CA, USA, October 29, 2015

Supporting Data Driven Access through Automatic Keyword Extraction and Summarization.
Proceedings of the 2015 IEEE International Congress on Big Data, New York City, NY, USA, June 27, 2015

2014
Interactive visualization for curatorial analysis of large digital collection.
Inf. Vis., 2014

The Adaptive Projection Forest: Using adjustable exclusion and parallelism in metric space indexes.
Proceedings of the 2014 IEEE International Conference on Big Data (IEEE BigData 2014), 2014

On scaling time dependent shortest path computations for Dynamic Traffic Assignment.
Proceedings of the 2014 IEEE International Conference on Big Data (IEEE BigData 2014), 2014

2013
Two accurate sequence, structure, and phylogenetic template-based RNA alignment systems.
BMC Syst. Biol., 2013

Lost in the Data, Aerial Views of an Archaeological Collection.
Proceedings of the 8th Annual International Conference of the Alliance of Digital Humanities Organizations, 2013

A case study on entity Resolution for Distant Processing of big Humanities data.
Proceedings of the 2013 IEEE International Conference on Big Data (IEEE BigData 2013), 2013

Fast scalable selection algorithms for large scale data.
Proceedings of the 2013 IEEE International Conference on Big Data (IEEE BigData 2013), 2013

Performance evaluation of R with Intel Xeon Phi coprocessor.
Proceedings of the 2013 IEEE International Conference on Big Data (IEEE BigData 2013), 2013

Data mining for "big archives" analysis: A case study.
Proceedings of the Beyond the Cloud: Rethinking Information Boundaries, 2013

2012
On automatically tagging web documents from examples.
Proceedings of the 35th International ACM SIGIR conference on research and development in Information Retrieval, 2012

Analysis and Optimization of Data Import with Hadoop.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium Workshops & PhD Forum, 2012

An accurate scalable template-based alignment algorithm.
Proceedings of the 2012 IEEE International Conference on Bioinformatics and Biomedicine, 2012

Designing an Interface for Exploring Online Autism Support Communities.
Proceedings of the AMIA 2012, 2012

2011
Finding stories in the archive through paragraph alignment.
Lit. Linguistic Comput., 2011

Assessing the Preservation Condition of Large and Heterogeneous Electronic Records Collections with Visualization.
Int. J. Digit. Curation, 2011

Integrating multi-touch in high-resolution display environments.
Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, 2011

Analysis of large digital collections with interactive visualization.
Proceedings of the 6th IEEE Conference on Visual Analytics Science and Technology, 2011

Facilitating Understanding of Large Document Collections.
Proceedings of the 2011 International Conference on Document Analysis and Recognition, 2011

rCAD: A Novel Database Schema for the Comparative Analysis of RNA.
Proceedings of the IEEE 7th International Conference on E-Science, 2011

RNA2DMap: A Visual Exploration Tool of the Information in RNA's Higher-Order Structure.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2011

R-PASS: A Fast Structure-Based RNA Sequence Alignment Algorithm.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2011

2010
Visualizing personal digital collections.
Proceedings of the 2010 Joint International Conference on Digital Libraries, 2010

Distributed, Scalable Clustering for Detecting Halos in Terascale Astronomy Datasets.
Proceedings of the ICDMW 2010, 2010

2009
Visual Representation of Multiple Associations in Data using Constrained Graph Layout.
Proceedings of the EG UK Theory and Practice of Computer Graphics, 2009

Covariant Evolutionary Event Analysis for Base Interaction Prediction Using a Relational Database Management System for RNA.
Proceedings of the Scientific and Statistical Database Management, 2009

Composing and executing parallel data-flow graphs with shell pipes.
Proceedings of the 4th Workshop on Workflows in Support of Large-Scale Science, 2009

2008
Anytime k-nearest neighbor search for database applications.
Proceedings of the 24th International Conference on Data Engineering Workshops, 2008

2006
A fast coarse filtering method for peptide identification by mass spectrometry.
Bioinform., 2006

On Integrating Peptide Sequence Analysis and Relational Distance-Based Indexing.
Proceedings of the Sixth IEEE International Symposium on BioInformatics and BioEngineering (BIBE 2006), 2006

2005
An Assessment of a Metric Space Database Index to Support Sequence Homology.
Int. J. Artif. Intell. Tools, 2005

On Optimizing Distance-Based Similarity Search for Biological Databases.
Proceedings of the Fourth International IEEE Computer Society Computational Systems Bioinformatics Conference, 2005

2004
Biosequence Use Cases in MoBIoS SQL.
IEEE Data Eng. Bull., 2004

A metric model of amino acid substitution.
Bioinform., 2004

Using MoBIoS' scalable genome join to find conserved primer pair candidates between two genomes.
Proceedings of the Twelfth International Conference on Intelligent Systems for Molecular Biology/Third European Conference on Computational Biology 2004, 2004

2003
MoBIoS: A Metric-Space DBMS to Support Biological Discovery.
Proceedings of the 15th International Conference on Scientific and Statistical Database Management (SSDBM 2003), 2003

