Ziyang Ma

Orcid: 0000-0002-8195-3262

According to our database1, Ziyang Ma authored at least 35 papers between 2013 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
HAM-TTS: Hierarchical Acoustic Modeling for Token-Based Zero-Shot Text-to-Speech with Model and Data Scaling.
CoRR, 2024

ChatMusician: Understanding and Generating Music Intrinsically with LLM.
CoRR, 2024

An Embarrassingly Simple Approach for LLM with Strong ASR Capacity.
CoRR, 2024

BAT: Learning to Reason about Spatial Sounds with Large Language Models.
CoRR, 2024

ELLA-V: Stable Neural Codec Language Modeling with Alignment-guided Sequence Reordering.
CoRR, 2024

EAT: Self-Supervised Pre-Training with Efficient Audio Transformer.
CoRR, 2024

Towards Weakly Supervised Text-to-Audio Grounding.
CoRR, 2024

2023
emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation.
CoRR, 2023

Hourglass-AVSR: Down-Up Sampling-based Computational Efficiency Model for Audio-Visual Speech Recognition.
CoRR, 2023

LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT.
CoRR, 2023

Leveraging Speech PTM, Text LLM, and Emotional TTS for Speech Emotion Recognition.
CoRR, 2023

Towards Universal Speech Discrete Tokens: A Case Study for ASR and TTS.
CoRR, 2023

VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching.
CoRR, 2023

Unsupervised Active Learning: Optimizing Labeling Cost-Effectiveness for Automatic Speech Recognition.
CoRR, 2023

Pushing the Limits of Unsupervised Unit Discovery for SSL Speech Representation.
CoRR, 2023

Improving Code-Switching and Named Entity Recognition in ASR with Speech Editing based Data Augmentation.
CoRR, 2023

LTCR: Long-Text Chinese Rumor Detection Dataset.
CoRR, 2023

Front-End Adapter: Adapting Front-End Input of Speech Based Self-Supervised Learning for Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

Improving Few-Shot Learning for Talking Face System with TTS Data Augmentation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Fast-Hubert: an Efficient Training Framework for Self-Supervised Speech Representation Learning.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Exploring Effective Distillation of Self-Supervised Speech Models for Automatic Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
TESSP: Text-Enhanced Self-Supervised Speech Pre-training.
CoRR, 2022

MT4SSL: Boosting Self-Supervised Speech Representation Learning by Integrating Multiple Targets.
CoRR, 2022

Exploring Effective Distillation of Self-Supervised Speech Models for Automatic Speech Recognition.
CoRR, 2022

2021
Feature-weighted ordinal classification for predicting drug response in multiple myeloma.
Bioinform., 2021

Joint Optimization of Computation Offloading, Data Compression, Energy Harvesting, and Application Scenarios in Fog Computing.
IEEE Access, 2021

Hierarchical Deep Residual Reasoning for Temporal Moment Localization.
Proceedings of the MMAsia '21: ACM Multimedia Asia, Gold Coast, Australia, December 1, 2021

2020
A Blockchain-Based Trust Management With Conditional Privacy-Preserving Announcement Scheme for VANETs.
IEEE Internet Things J., 2020

The Application of TED Talk Strategies in Freshmen Library Orientation Lecture.
Proceedings of the ICIEI 2020: The 5th International Conference on Information and Education Innovations, 2020

2015
Bounded-Distortion Metric Learning.
CoRR, 2015

Video Super-Resolution via Deep Draft-Ensemble Learning.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Handling motion blur in multi-frame super-resolution.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014
Real-time and robust hand tracking with a single depth camera.
Vis. Comput., 2014

2013
Coherence-enhancing line drawing for color images.
Sci. China Inf. Sci., 2013

Constant Time Weighted Median Filtering for Stereo Matching and Beyond.
Proceedings of the IEEE International Conference on Computer Vision, 2013


  Loading...