Pingchuan Ma

Orcid: 0000-0003-3752-0803

Affiliations:
  • Beijing Institute of Technology, School of Computer Science and Technology, China
  • Imperial College London, UK (PhD 2022)


According to our database, Pingchuan Ma authored at least 28 papers between 2018 and 2023.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2023
KAN-AV dataset for audio-visual face and speech analysis in the wild.
Image Vis. Comput., December, 2023

Self-Supervised Video-Centralised Transformer for Video Face Clustering.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2023

End-to-End Video-to-Speech Synthesis Using Generative Adversarial Networks.
IEEE Trans. Cybern., June, 2023

Omnidirectional Image Quality Assessment With Knowledge Distillation.
IEEE Signal Process. Lett., 2023

SparseVSR: Lightweight and Noise Robust Visual Speech Recognition.
CoRR, 2023

Is dataset condensation a silver bullet for healthcare data sharing?
CoRR, 2023

SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision.
CoRR, 2023

Jointly Learning Visual and Auditory Speech Representations from Raw Data.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Learning Cross-Lingual Visual Speech Representations.
Proceedings of the IEEE International Conference on Acoustics, 2023

Auto-AVSR: Audio-Visual Speech Recognition with Automatic Labels.
Proceedings of the IEEE International Conference on Acoustics, 2023

SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Visual speech recognition for multiple languages in the wild.
Nat. Mac. Intell., November, 2022

Streaming Audio-Visual Speech Recognition with Alignment Regularization.
CoRR, 2022

Training Strategies for Improved Lip-Reading.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Lip-reading with Densely Connected Temporal Convolutional Networks.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

LiRA: Learning Visual Speech Representations from Audio Through Self-Supervision.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

End-To-End Audio-Visual Speech Recognition with Conformers.
Proceedings of the IEEE International Conference on Acoustics, 2021

Detecting Adversarial Attacks on Audiovisual Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

Towards Practical Lipreading with Distilled and Efficient Models.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
End-to-end visual speech recognition for small-scale datasets.
Pattern Recognit. Lett., 2020

Visually Guided Self Supervised Learning of Speech Representations.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Lipreading Using Temporal Convolutional Networks.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Towards Pose-Invariant Lip-Reading.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Detecting Adversarial Attacks On Audio-Visual Speech Recognition.
CoRR, 2019

Video-Driven Speech Reconstruction Using Generative Adversarial Networks.
Proceedings of the Interspeech 2019, 2019

Investigating the Lombard Effect Influence on End-to-End Audio-Visual Speech Recognition.
Proceedings of the Interspeech 2019, 2019

2018
Audio-Visual Speech Recognition with a Hybrid CTC/Attention Architecture.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

End-to-End Audiovisual Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

