Di Jiang

Orcid: 0000-0003-2309-1809

Affiliations:
  • WeBank


According to our database, Di Jiang authored at least 53 papers between 2016 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
Contextualized Token Discrimination for Speech Search Query Correction.
CoRR, September, 2025

Semantic-Augmented Latent Topic Modeling with LLM-in-the-Loop.
CoRR, July, 2025

On Efficient Single-Source Personalized PageRank Computation in Online Social Networks.
IEEE Trans. Knowl. Data Eng., June, 2025

QualBench: Benchmarking Chinese LLMs with Localized Professional Qualifications for Vertical Domain Evaluation.
CoRR, May, 2025

Dual Learning Between Molecules and Natural Language.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2025

Dialogue Language Model with Large-Scale Persona Data Engineering.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Interactive Search with Reinforcement Learning.
Proceedings of the 41st IEEE International Conference on Data Engineering, 2025

2024
A Communication Theory Perspective on Prompting Engineering Methods for Large Language Models.
J. Comput. Sci. Technol., July, 2024

Neural Moderation of ASMR Erotica Content in Social Networks.
IEEE Trans. Knowl. Data Eng., January, 2024

Dial-In LLM: Human-Aligned Dialogue Intent Clustering with LLM-in-the-loop.
CoRR, 2024

ASR-EC Benchmark: Evaluating Large Language Models on Chinese ASR Error Correction.
CoRR, 2024

Acoustic Model Optimization over Multiple Data Sources: Merging and Valuation.
CoRR, 2024

Expanding Chatbot Knowledge in Customer Service: Context-Aware Similar Question Generation Using Large Language Models.
CoRR, 2024

Neural-Bayesian Program Learning for Few-shot Dialogue Intent Parsing.
CoRR, 2024

InfantCryNet: A Data-driven Framework for Intelligent Analysis of Infant Cries.
Proceedings of the Asian Conference on Machine Learning, 2024

2023
Burstiness-Aware Web Search Analysis on Different Levels of Evidences.
IEEE Trans. Knowl. Data Eng., March, 2023

Scalable Identity-Oriented Speech Retrieval.
IEEE Trans. Knowl. Data Eng., March, 2023

Heterogeneous Latent Topic Discovery for Semantic Text Mining.
IEEE Trans. Knowl. Data Eng., 2023

Enhance Mono-modal Sentiment Classification with Federated Cross-modal Transfer.
IEEE Data Eng. Bull., 2023

A Communication Theory Perspective on Prompting Engineering Methods for Large Language Models.
CoRR, 2023

Hierarchical Crowdsourcing for Data Labeling with Heterogeneous Crowd.
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

Opponent-aware Order Pricing towards Hub-oriented Mobility Services.
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

Probabilistic Topic Models - Foundation and Application.
Springer, ISBN: 978-981-99-2430-1, 2023

2022
Cleaning Uncertain Data With Crowdsourcing - A General Model With Diverse Accuracy Rates.
IEEE Trans. Knowl. Data Eng., 2022

Speech-to-SQL: Towards Speech-driven SQL Query Generation From Natural Language Question.
CoRR, 2022

VoiceQuerySystem: A Voice-driven Database Querying System Using Natural Language Questions.
Proceedings of the SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12, 2022

A Platform for Deploying the TFE Ecosystem of Automatic Speech Recognition.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

RGVisNet: A Hybrid Retrieval-Generation Neural Framework Towards Automatic Data Visualization Generation.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Knowledge-Enhanced Learning for KG Embedding.
Proceedings of the 28th IEEE International Conference on Parallel and Distributed Systems, 2022

2021
Industrial Federated Topic Modeling.
ACM Trans. Intell. Syst. Technol., 2021

A GDPR-compliant Ecosystem for Speech Recognition with Transfer, Federated, and Evolutionary Learning.
ACM Trans. Intell. Syst. Technol., 2021

Federated Topic Discovery: A Semantic Consistent Approach.
IEEE Intell. Syst., 2021

Memetic Federated Learning for Biomedical Natural Language Processing.
Proceedings of the Natural Language Processing and Chinese Computing, 2021

SmartSales: An AI-Powered Telemarketing Coaching System in FinTech.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

SmartMeeting: Automatic Meeting Transcription and Summarization for In-Person Conversations.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

L2RS: A Learning-to-Rescore Mechanism for Hybrid Speech Recognition.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Multimodal N-best List Rescoring with Weakly Supervised Pre-training in Hybrid Speech Recognition.
Proceedings of the IEEE International Conference on Data Mining, 2021

FedSP: Federated Speaker Verification with Personal Privacy Preservation.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2021

Familia: A Configurable Topic Modeling Framework for Industrial Text Engineering.
Proceedings of the Database Systems for Advanced Applications, 2021

A Health-friendly Speaker Verification System Supporting Mask Wearing.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
GoldenRetriever: A Speech Recognition System Powered by Modern Information Retrieval.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

A De Novo Divide-and-Merge Paradigm for Acoustic Model Optimization in Automatic Speech Recognition.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

TopicOcean: An Ever-Increasing Topic Model With Meta-learning.
Proceedings of the 20th IEEE International Conference on Data Mining, 2020

Federated Acoustic Model Optimization for Automatic Speech Recognition.
Proceedings of the Database Systems for Advanced Applications, 2020

2019
L2RS: A Learning-to-Rescore Mechanism for Automatic Speech Recognition.
CoRR, 2019

DAL: Dual Adversarial Learning for Dialogue Generation.
CoRR, 2019

Integrating Bayesian and Neural Networks for Discourse Coherence.
Proceedings of the Companion of The 2019 World Wide Web Conference, 2019

Topic-Aware Dialogue Speech Recognition with Transfer Learning.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Chameleon: A Language Model Adaptation Toolkit for Automatic Speech Recognition of Conversational Speech.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Federated Topic Modeling.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

2018
Familia: A Configurable Topic Modeling Framework for Industrial Text Engineering.
CoRR, 2018

2016
Cross-Lingual Topic Discovery From Multilingual Search Engine Query Log.
ACM Trans. Inf. Syst., 2016

Latent Topic Embedding.
Proceedings of the COLING 2016, 2016

