Yihua Huang

Orcid: 0000-0003-1806-0936

Affiliations:
  • Nanjing University, State Key Laboratory for Novel Software Technology, China


According to our database1, Yihua Huang authored at least 85 papers between 2009 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023
High-Level Data Abstraction and Elastic Data Caching for Data-Intensive AI Applications on Cloud-Native Platforms.
IEEE Trans. Parallel Distributed Syst., November, 2023

Coral: federated query join order optimization based on deep reinforcement learning.
World Wide Web (WWW), September, 2023

Nested relation extraction via self-contrastive learning guided by structure and semantic similarity.
Neural Networks, May, 2023

ShadowAQP: Efficient Approximate Group-by and Join Query via Attribute-oriented Sample Size Allocation and Data Generation.
Proc. VLDB Endow., 2023

Simple and Efficient Partial Graph Adversarial Attack: A New Perspective.
CoRR, 2023

HAGNN: Hybrid Aggregation for Heterogeneous Graph Neural Networks.
CoRR, 2023

Adaptive Online Cache Capacity Optimization via Lightweight Working Set Size Estimation at Scale.
Proceedings of the 2023 USENIX Annual Technical Conference, 2023

AdaMCL: Adaptive Fusion Multi-View Contrastive Learning for Collaborative Filtering.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Operation-Level Early Stopping for Robustifying Differentiable NAS.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Time and Cost-Efficient Cloud Data Transmission based on Serverless Computing Compression.
Proceedings of the IEEE INFOCOM 2023, 2023

STSD: Modeling Spatial Temporal Staticity and Dynamicity in Traffic Forecasting.
Proceedings of the IEEE International Conference on Data Mining, 2023

AutoAC: Towards Automated Attribute Completion for Heterogeneous Graph Neural Network.
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

2022
Liquid: Intelligent Resource Estimation and Network-Efficient Scheduling for Deep Learning Jobs on Distributed GPU Clusters.
IEEE Trans. Parallel Distributed Syst., 2022

Octopus-DF: Unified DataFrame-based cross-platform data analytic system.
Parallel Comput., 2022

One-Stage Tree: end-to-end tree builder and pruner.
Mach. Learn., 2022

Pronounce differently, mean differently: A multi-tagging-scheme learning method for Chinese NER integrated with lexicon and phonetic features.
Inf. Process. Manag., 2022

Transition Relation Aware Self-Attention for Session-based Recommendation.
CoRR, 2022

Pretraining Multi-modal Representations for Chinese NER Task with Cross-Modality Attention.
Proceedings of the WSDM '22: The Fifteenth ACM International Conference on Web Search and Data Mining, Virtual Event / Tempe, AZ, USA, February 21, 2022

Meces: Latency-efficient Rescaling via Prioritized State Migration for Stateful Distributed Stream Processing Systems.
Proceedings of the 2022 USENIX Annual Technical Conference, 2022

NAS-CTR: Efficient Neural Architecture Search for Click-Through Rate Prediction.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

AutoGSR: Neural Architecture Search for Graph-based Session Recommendation.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Evolutionary Automated Feature Engineering.
Proceedings of the PRICAI 2022: Trends in Artificial Intelligence, 2022

A2: Efficient Automated Attacker for Boosting Adversarial Training.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Knowledge-enhanced Black-box Attacks for Recommendations.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

PSP: Progressive Space Pruning for Efficient Graph Neural Architecture Search.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

Fluid: Dataset Abstraction and Elastic Acceleration for Cloud-native Deep Learning Training Jobs.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

Efficient. Scalable and Robust Data Shuffle Service for Distributed MapReduce Computing on Cloud.
Proceedings of the 24th IEEE Int Conf on High Performance Computing & Communications; 8th Int Conf on Data Science & Systems; 20th Int Conf on Smart City; 8th Int Conf on Dependability in Sensor, 2022

Towards Efficient Reverse-time Migration Imaging Computation by Pipeline and Fine-grained Execution Parallelization.
Proceedings of the 25th IEEE International Conference on Computational Science and Engineering, 2022

Towards Self-supervised Learning on Graphs with Heterophily.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

DIFER: Differentiable Automated Feature Engineering.
Proceedings of the International Conference on Automated Machine Learning, 2022

2021
Towards Efficient Distributed Subgraph Enumeration Via Backtracking-Based Framework.
IEEE Trans. Parallel Distributed Syst., 2021

Towards Efficient Large-Scale Interprocedural Program Static Analysis on Distributed Data-Parallel Computation.
IEEE Trans. Parallel Distributed Syst., 2021

Alchemy: Distributed financial quantitative analysis system with high-level programming model.
Softw. Pract. Exp., 2021

Improving in-memory file system reading performance by fine-grained user-space cache mechanisms.
J. Syst. Archit., 2021

VSIM: Distributed local structural vertex similarity calculation on big graphs.
J. Parallel Distributed Comput., 2021

SparkDQ: Efficient generic big data quality management on distributed data-parallel computation.
J. Parallel Distributed Comput., 2021

Empirical analysis of performance bottlenecks in graph neural network training and inference with GPUs.
Neurocomputing, 2021

Progressive AutoSpeech: An Efficient and General Framework for Automatic Speech Classification.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2021

UniGPS: A Unified Programming Framework for Distributed Graph Processing.
Proceedings of the 27th IEEE International Conference on Parallel and Distributed Systems, 2021

2020
DIFER: Differentiable Automated Feature Engineering.
CoRR, 2020

Distributed Subgraph Enumeration via Backtracking-based Framework.
CoRR, 2020

Semi-supervised Embedding Learning for High-dimensional Bayesian Optimization.
CoRR, 2020

2019
Correlation-aware partitioning for skewed range query optimization.
World Wide Web, 2019

Efficient and Scalable Functional Dependency Discovery on Distributed Data-Parallel Platforms.
IEEE Trans. Parallel Distributed Syst., 2019

DGST: Efficient and scalable suffix tree construction on distributed data-parallel platforms.
Parallel Comput., 2019

ForestLayer: Efficient training of deep forests on distributed task-parallel platforms.
J. Parallel Distributed Comput., 2019

Adaptive cache policy scheduling for big data applications on distributed tiered storage system.
Concurr. Comput. Pract. Exp., 2019

BigSpa: An Efficient Interprocedural Static Analysis Engine in the Cloud.
Proceedings of the 2019 IEEE International Parallel and Distributed Processing Symposium, 2019

Push-Based Network-efficient Hadoop YARN Scheduling Mechanism for In-Memory Computing.
Proceedings of the 25th IEEE International Conference on Parallel and Distributed Systems, 2019

HyMJ: A Hybrid Structure-Aware Approach to Distributed Multi-way Join Query.
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019

BENU: Distributed Subgraph Enumeration with Backtracking-Based Framework.
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019

2018
Penguin: Efficient Query-Based Framework for Replaying Large Scale Historical Data.
IEEE Trans. Parallel Distributed Syst., 2018

Parallelizing Machine Learning Optimization Algorithms on Distributed Data-Parallel Platforms with Parameter Server.
Proceedings of the 24th IEEE International Conference on Parallel and Distributed Systems, 2018

Seal: Efficient Training Large Scale Statistical Machine Translation Models on Spark.
Proceedings of the 24th IEEE International Conference on Parallel and Distributed Systems, 2018

2017
Improving Execution Concurrency of Large-Scale Matrix Multiplication on Distributed Data-Parallel Platforms.
IEEE Trans. Parallel Distributed Syst., 2017

AutoMJ: Towards Efficient Multi-way Join Query on Distributed Data-Parallel Platform.
Proceedings of the 23rd IEEE International Conference on Parallel and Distributed Systems, 2017

2016
Accelerating Big Data Applications on Tiered Storage System with Various Eviction Policies.
Proceedings of the 2016 IEEE Trustcom/BigDataSE/ISPA, 2016

A Time-Cost Based Automatic Scheduling Framework for Matrix Computation on Various Distributed Computing Platforms.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016

Stock market prediction exploiting microblog sentiment analysis.
Proceedings of the 2016 International Joint Conference on Neural Networks, 2016

PTR: Phrase-Based Topical Ranking for Automatic Keyphrase Extraction in Scientific Publications.
Proceedings of the Neural Information Processing - 23rd International Conference, 2016

An Adaptive Partition-Based Caching Approach for Efficient Range Queries on Key-Value Data.
Proceedings of the Web Technologies and Applications - 18th Asia-Pacific Web Conference, 2016

2015
AutoRM: An effective approach for automatic Web data record mining.
Knowl. Based Syst., 2015

Cichlid: Efficient Large Scale RDFS/OWL Reasoning with Spark.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium, 2015

iPLAR: Towards Interactive Programming with Parallel Linear Algebra in R.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2015

Parallel Training GBRT Based on KMeans Histogram Approximation for Big Data.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2015

Unified Programming Model and Software Framework for Big Data Machine Learning and Data Analytics.
Proceedings of the 39th Annual Computer Software and Applications Conference, 2015

Efficient large scale distributed matrix computation with spark.
Proceedings of the 2015 IEEE International Conference on Big Data (IEEE BigData 2015), Santa Clara, CA, USA, October 29, 2015

2014
SHadoop: Improving MapReduce performance by optimizing job execution mechanism in Hadoop clusters.
J. Parallel Distributed Comput., 2014

Multi-feature and DAG-Based Multi-tree Matching Algorithm for Automatic Web Data Mining.
Proceedings of the 2014 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT), Warsaw, Poland, August 11-14, 2014, 2014

YAFIM: A Parallel Frequent Itemset Mining Algorithm with Spark.
Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014

Training Large Scale Deep Neural Networks on the Intel Xeon Phi Many-Core Coprocessor.
Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014

Rainbow: A distributed and hierarchical RDF triple store with dynamic scalability.
Proceedings of the 2014 IEEE International Conference on Big Data (IEEE BigData 2014), 2014

CinHBa: A Secondary Index with Hotscore Caching Policy on Key-Value Data Store.
Proceedings of the Advanced Data Mining and Applications - 10th International Conference, 2014

2013
NEXIR: A Novel Web Extraction Rule Language toward a Three-Stage Web Data Extraction Model.
Proceedings of the Web Information Systems Engineering - WISE 2013, 2013

Large Scale Nearest Neighbors Search Based on Neighborhood Graph.
Proceedings of the International Conference on Advanced Cloud and Big Data, 2013

Parallel Approach and Platform for Large-Scale WEB Data Extraction.
Proceedings of the International Conference on Advanced Cloud and Big Data, 2013

A parallel computing platform for training large scale neural networks.
Proceedings of the 2013 IEEE International Conference on Big Data (IEEE BigData 2013), 2013

Extraction Rule Language for Web Information Extraction and Integration.
Proceedings of the 2013 10th Web Information System and Application Conference, 2013

Layered and Weighted Tree Matching Algorithm for Automatic Web Data Records Recognition.
Proceedings of the 2013 10th Web Information System and Application Conference, 2013

2012
Parallelized Similarity Flooding Algorithm for Processing Large Scale Graph Datasets with MapReduce.
Proceedings of the 13th International Conference on Parallel and Distributed Computing, 2012

Parallelized Near-Duplicate Document Detection Algorithm for Large Scale Chinese Web Pages.
Proceedings of the 13th International Conference on Parallel and Distributed Computing, 2012

Performance Optimization for Short MapReduce Job Execution in Hadoop.
Proceedings of the 2012 Second International Conference on Cloud and Green Computing, 2012

2011
Parallelization of BLAST with MapReduce for Long Sequence Alignment.
Proceedings of the Fourth International Symposium on Parallel Architectures, 2011

PSON: A Parallelized SON Algorithm with MapReduce for Mining Frequent Sets.
Proceedings of the Fourth International Symposium on Parallel Architectures, 2011

2009
Improvements on Teaching Methods and Contents for the "Computer Organization and Architecture" Curriculum.
Proceedings of the International Conference on Scalable Computing and Communications / Eighth International Conference on Embedded Computing, 2009


  Loading...