Wenjie Tian

This page is a disambiguation page, it actually contains multiple papers from persons of the same or a similar name.

Bibliography

2026
YingMusic-Singer: Controllable Singing Voice Synthesis with Flexible Lyric Manipulation and Annotation-free Melody Guidance.
CoRR, March, 2026

Seeing the Context: Rich Visual Context-Aware Speech Recognition via Multimodal Reasoning.
CoRR, March, 2026

EmoOmni: Bridging Emotional Understanding and Expression in Omni-Modal LLMs.
CoRR, February, 2026

Integrating Fine-Grained Audio-Visual Evidence for Robust Multimodal Emotion Reasoning.
CoRR, January, 2026

dLLM-ASR: A Faster Diffusion LLM-based Framework for Speech Recognition.
CoRR, January, 2026

Self-Supervised Absolute-Scale Multi-Sensor Fusion Odometry via Multi-Layer Feature Fusion and Pose Refinement for Autonomous Driving.
IEEE Trans Autom. Sci. Eng., 2026

Geometry knowledge-embedded self-supervised deep monocular visual odometry for autonomous driving.
Knowl. Based Syst., 2026

Diffusion-Enhanced Identifiable Confounder Modeling for Debiased Recommendation.
IEEE Access, 2026

KALL-E: Autoregressive Speech Synthesis with Next-Distribution Prediction.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Pose Error Prediction, Compensation Method, and Applicable Condition Determination of Parallel Motion Platform Based on Transfer Learning.
IEEE Trans. Neural Networks Learn. Syst., October, 2025

SoulX-Podcast: Towards Realistic Long-form Podcasts with Dialectal and Paralinguistic Diversity.
CoRR, October, 2025

PodEval: A Multimodal Evaluation Framework for Podcast Audio Generation.
CoRR, October, 2025

OSUM-EChat: Enhancing End-to-End Empathetic Spoken Chatbot via Understanding-Driven Spoken Dialogue.
CoRR, August, 2025

MAL: multilevel active learning with BERT for Chinese textual affective structure analysis.
Frontiers Inf. Technol. Electron. Eng., June, 2025

Steering Language Model to Stable Speech Emotion Recognition via Contextual Perception and Chain of Thought.
CoRR, February, 2025

CosyAudio: Improving Audio Generation with Confidence Scores and Synthetic Captions.
CoRR, January, 2025

OSUM: Advancing Open Speech Understanding Models with Limited Resources in Academia.
CoRR, January, 2025

Identifying the High-Risk Trigger Events in Aviation Activities: A Directed Weighted CN-SIR Methodology.
Qual. Reliab. Eng. Int., 2025

Bibliometric analysis of natural language processing using CiteSpace and VOSviewer.
Nat. Lang. Process. J., 2025

Enhancing food safety review classification with large language models and label embedding.
Expert Syst. Appl., 2025

DualDub: Video-to-Soundtrack Generation via Joint Speech and Background Audio Synthesis.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Llasa+: Free Lunch for Accelerated and Streaming Llama-Based Speech Synthesis.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2025

Dialospeech: Dual-Speaker Dialogue Generation with LLM and Flow Matching.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2025

2024
A Survey of the Applications of Text Mining for the Food Domain.
Algorithms, May, 2024

Using Multisource Data and Time Series Features to Construct a Global Terrestrial CO₂ Coverage by Deep Learning.
IEEE Trans. Geosci. Remote. Sens., 2024

Autoregressive Speech Synthesis with Next-Distribution Prediction.
CoRR, 2024

YingSound: Video-Guided Sound Effects Generation with Multi-modal Chain-of-Thought Controls.
CoRR, 2024

CoDiff-VC: A Codec-Assisted Diffusion Model for Zero-shot Voice Conversion.
CoRR, 2024

UniStyle: Unified Style Modeling for Speaking Style Captioning and Stylistic Speech Synthesis.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Text-aware and Context-aware Expressive Audiobook Speech Synthesis.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Efficient Neural Architecture Design via Capturing Architecture-Performance Joint Distribution.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024

2023
An efficient calibration method for serial industrial robots based on kinematics decomposition and equivalent systems.
Robotics Comput. Integr. Manuf., December, 2023

Open-architecture of CNC system and mirror milling technology for a 5-axis hybrid robot.
Robotics Comput. Integr. Manuf., June, 2023

Appaction: Automatic GUI Interaction for Mobile Apps via Holistic Widget Perception.
Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2023

2022
Kinematics of a 5-axis hybrid robot near singular configurations.
Robotics Comput. Integr. Manuf., 2022

Safety Assessment of Transport Aircraft Heavy Equipment Airdrop: An Improved STPA-BN Mechanism.
IEEE Access, 2022

2021
Evaluation of the Kinematic Performance of a 3-RRS Parallel Mechanism.
Robotica, 2021

Equipment Mission Safety Evaluation Method Based on Function-Structure.
IEEE Access, 2021

Meta-data Augmentation Based Search Strategy Through Generative Adversarial Network for AutoML Model Selection.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2021

AutoCluster: Meta-learning Based Ensemble Method for Automated Unsupervised Clustering.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2021

2020
A Systematic Approach for Accuracy Design of Lower-Mobility Parallel Mechanism.
Robotica, 2020

2019
Kinematic calibration of a 5-DOF hybrid kinematic machine tool by considering the ill-posed identification problem using regularisation method.
Robotics Comput. Integr. Manuf., 2019

CF-PROV: A Content-Rich and Fine-Grained Scientific Workflow Provenance Model.
IEEE Access, 2019

2018
UMDISW: A Universal Multi-Domain Intelligent Scientific Workflow Framework for the Whole Life Cycle of Scientific Data.
Proceedings of the Benchmarking, Measuring, and Optimizing, 2018


  Loading...