Lu Lu

Orcid: 0000-0003-1311-9097

Affiliations:
  • Bytedance Inc., ByteDance AI Lab, Speech and Audio Team


According to our database1, Lu Lu authored at least 26 papers between 2023 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Slice Sandwich: Jagged Slicing Multi-Tier Dynamic Resources for Diversified V2X Services.
IEEE Trans. Mob. Comput., May, 2024

A Comprehensive Solution to Connect Speech Encoder and Large Language Model for ASR.
CoRR, 2024

SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words.
CoRR, 2024

Can Large Language Models Understand Spatial Audio?
CoRR, 2024

MINT: Boosting Audio-Language Model via Multi-Target Pre-Training and Instruction Tuning.
CoRR, 2024

video-SALMONN: Speech-Enhanced Audio-Visual Large Language Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Challenges in Training PINNs: A Loss Landscape Perspective.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

SALMONN: Towards Generic Hearing Abilities for Large Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

PolyVoice: Language Models for Speech to Speech Translation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Connecting Speech Encoder and Large Language Model for ASR.
Proceedings of the IEEE International Conference on Acoustics, 2024

Extending Large Language Models for Speech and Audio Captioning.
Proceedings of the IEEE International Conference on Acoustics, 2024

SA-SOT: Speaker-Aware Serialized Output Training for Multi-Talker ASR.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Multi-SP Network Slicing Parallel Relieving Edge Network Conflict.
IEEE Trans. Parallel Distributed Syst., November, 2023

Learning-Based Real-Time Transmission Control for Multi-Path TCP Networks.
IEEE Trans. Cogn. Commun. Netw., October, 2023

Network Meets ChatGPT: Intent Autonomous Management, Control and Operation.
J. Commun. Inf. Networks, September, 2023

Fine-grained Audio-Visual Joint Representations for Multimodal Large Language Models.
CoRR, 2023

Language-specific Acoustic Boundary Learning for Mandarin-English Code-switching Speech Recognition.
CoRR, 2023

PolyVoice: Language Models for Speech to Speech Translation.
CoRR, 2023

Unleashing Infinite-Length Input Capacity for Large-scale Language Models with Self-Controlled Memory System.
CoRR, 2023

Towards Building Voice-based Conversational Recommender Systems: Datasets, Potential Solutions and Prospects.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Random Utterance Concatenation Based Data Augmentation for Improving Short-video Speech Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Text-only Domain Adaptation using Unified Speech-Text Representation in Transducer.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Language-specific Boundary Learning for Improving Mandarin-English Code-switching Speech Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

AudioQR: Deep Neural Audio Watermarks For QR Code.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Improving Large-Scale Deep Biasing With Phoneme Features and Text-Only Data in Streaming Transducer.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

CIF-PT: Bridging Speech and Text Representations for Spoken Language Understanding via Continuous Integrate-and-Fire Pre-Training.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023


  Loading...