Wenjie Tian

This is a disambiguation page: it lists papers by several people who share the same or a similar name.

Bibliography

2026

YingMusic-Singer: Controllable Singing Voice Synthesis with Flexible Lyric Manipulation and Annotation-free Melody Guidance.

CoRR, March, 2026

Seeing the Context: Rich Visual Context-Aware Speech Recognition via Multimodal Reasoning.

CoRR, March, 2026

EmoOmni: Bridging Emotional Understanding and Expression in Omni-Modal LLMs.

CoRR, February, 2026

Integrating Fine-Grained Audio-Visual Evidence for Robust Multimodal Emotion Reasoning.

CoRR, January, 2026

dLLM-ASR: A Faster Diffusion LLM-based Framework for Speech Recognition.

CoRR, January, 2026

Self-Supervised Absolute-Scale Multi-Sensor Fusion Odometry via Multi-Layer Feature Fusion and Pose Refinement for Autonomous Driving.

IEEE Trans. Autom. Sci. Eng., 2026

Geometry knowledge-embedded self-supervised deep monocular visual odometry for autonomous driving.

Knowl. Based Syst., 2026

Diffusion-Enhanced Identifiable Confounder Modeling for Debiased Recommendation.

Xiuming Chen

Wenjie Tian

Haoyun Fang

IEEE Access, 2026

KALL-E: Autoregressive Speech Synthesis with Next-Distribution Prediction.

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

Pose Error Prediction, Compensation Method, and Applicable Condition Determination of Parallel Motion Platform Based on Transfer Learning.

IEEE Trans. Neural Networks Learn. Syst., October, 2025

SoulX-Podcast: Towards Realistic Long-form Podcasts with Dialectal and Paralinguistic Diversity.

CoRR, October, 2025

PodEval: A Multimodal Evaluation Framework for Podcast Audio Generation.

CoRR, October, 2025

OSUM-EChat: Enhancing End-to-End Empathetic Spoken Chatbot via Understanding-Driven Spoken Dialogue.

CoRR, August, 2025

MAL: multilevel active learning with BERT for Chinese textual affective structure analysis.

Frontiers Inf. Technol. Electron. Eng., June, 2025

Steering Language Model to Stable Speech Emotion Recognition via Contextual Perception and Chain of Thought.

CoRR, February, 2025

CosyAudio: Improving Audio Generation with Confidence Scores and Synthetic Captions.

CoRR, January, 2025

OSUM: Advancing Open Speech Understanding Models with Limited Resources in Academia.

CoRR, January, 2025

Identifying the High-Risk Trigger Events in Aviation Activities: A Directed Weighted CN-SIR Methodology.

Wenjie Tian

Jihui Xu

Lu Chen

Qual. Reliab. Eng. Int., 2025

Bibliometric analysis of natural language processing using CiteSpace and VOSviewer.

Xiuming Chen

Wenjie Tian

Haoyun Fang

Nat. Lang. Process. J., 2025

Enhancing food safety review classification with large language models and label embedding.

Expert Syst. Appl., 2025

DualDub: Video-to-Soundtrack Generation via Joint Speech and Background Audio Synthesis.

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Llasa+: Free Lunch for Accelerated and Streaming Llama-Based Speech Synthesis.

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2025

Dialospeech: Dual-Speaker Dialogue Generation with LLM and Flow Matching.

Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2025

2024

A Survey of the Applications of Text Mining for the Food Domain.

Algorithms, May, 2024

Using Multisource Data and Time Series Features to Construct a Global Terrestrial CO₂ Coverage by Deep Learning.

IEEE Trans. Geosci. Remote. Sens., 2024

Autoregressive Speech Synthesis with Next-Distribution Prediction.

Xinfa Zhu

Wenjie Tian

Lei Xie

CoRR, 2024

YingSound: Video-Guided Sound Effects Generation with Multi-modal Chain-of-Thought Controls.

CoRR, 2024

CoDiff-VC: A Codec-Assisted Diffusion Model for Zero-shot Voice Conversion.

CoRR, 2024

UniStyle: Unified Style Modeling for Speaking Style Captioning and Stylistic Speech Synthesis.

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Text-aware and Context-aware Expressive Audiobook Speech Synthesis.

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Efficient Neural Architecture Design via Capturing Architecture-Performance Joint Distribution.

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024

2023

An efficient calibration method for serial industrial robots based on kinematics decomposition and equivalent systems.

Robotics Comput. Integr. Manuf., December, 2023

Open-architecture of CNC system and mirror milling technology for a 5-axis hybrid robot.

Robotics Comput. Integr. Manuf., June, 2023

Appaction: Automatic GUI Interaction for Mobile Apps via Holistic Widget Perception.

Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2023

2022

Kinematics of a 5-axis hybrid robot near singular configurations.

Robotics Comput. Integr. Manuf., 2022

Safety Assessment of Transport Aircraft Heavy Equipment Airdrop: An Improved STPA-BN Mechanism.

IEEE Access, 2022

2021

Evaluation of the Kinematic Performance of a 3-RRS Parallel Mechanism.

Robotica, 2021

Equipment Mission Safety Evaluation Method Based on Function-Structure.

IEEE Access, 2021

Meta-data Augmentation Based Search Strategy Through Generative Adversarial Network for AutoML Model Selection.

Yue Liu

Wenjie Tian

Shuang Li

Proceedings of the Advances in Knowledge Discovery and Data Mining, 2021

AutoCluster: Meta-learning Based Ensemble Method for Automated Unsupervised Clustering.

Yue Liu

Shuang Li

Wenjie Tian

Proceedings of the Advances in Knowledge Discovery and Data Mining, 2021

2020

A Systematic Approach for Accuracy Design of Lower-Mobility Parallel Mechanism.

Robotica, 2020

2019

Kinematic calibration of a 5-DOF hybrid kinematic machine tool by considering the ill-posed identification problem using regularisation method.

Robotics Comput. Integr. Manuf., 2019

CF-PROV: A Content-Rich and Fine-Grained Scientific Workflow Provenance Model.

IEEE Access, 2019

2018

UMDISW: A Universal Multi-Domain Intelligent Scientific Workflow Framework for the Whole Life Cycle of Scientific Data.

Proceedings of the Benchmarking, Measuring, and Optimizing, 2018
