Xiaomeng Yang

This is a disambiguation page: it contains multiple papers by persons with the same or a similar name.

Bibliography

2025
IPAD: Iterative, Parallel, and Diffusion-Based Network for Scene Text Recognition.
Int. J. Comput. Vis., August, 2025

Uni-cot: Towards Unified Chain-of-Thought Reasoning Across Text and Vision.
CoRR, August, 2025

VOTE: Vision-Language-Action Optimization with Trajectory Ensemble Voting.
CoRR, July, 2025

SDPO: Importance-Sampled Direct Preference Optimization for Stable Diffusion Training.
CoRR, May, 2025

ALTER: All-in-One Layer Pruning and Temporal Expert Routing for Efficient Diffusion Generation.
CoRR, May, 2025

Visual Text Processing: A Comprehensive Review and Unified Evaluation.
CoRR, April, 2025

Cockatiel: Ensembling Synthetic and Human Preferenced Training for Detailed Video Caption.
CoRR, March, 2025

Gold-medalist Performance in Solving Olympiad Geometry with AlphaGeometry2.
CoRR, February, 2025

IPO: Iterative Preference Optimization for Text-to-Video Generation.
CoRR, February, 2025

Linguistics-aware Masked Image Modeling for Self-supervised Scene Text Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Enabling Natural Human-Computer Interaction Through AI-Powered Nanocomposite IoT Throat Vibration Sensor.
IEEE Internet Things J., July, 2024

Fine Calibration Method for Laser Altimeter Pointing and Ranging Based on Dense Control Points.
Remote. Sens., February, 2024

Masked and Permuted Implicit Context Learning for Scene Text Recognition.
IEEE Signal Process. Lett., 2024

LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment.
CoRR, 2024

VidGen-1M: A Large-Scale Dataset for Text-to-video Generation.
CoRR, 2024

EVALALIGN: Supervised Fine-Tuning Multimodal LLMs with Human-Aligned Data for Evaluating Text-to-Image Models.
CoRR, 2024

Impact of Dispersion Uniformity on the Conductivity of Carbon Nanotubes Based Tactile Sensor: A Tunneling Theory Approach.
Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2024

Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

TorchRL: A data-driven decision-making library for PyTorch.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Advancing Human-Machine Interaction Using Intelligent Wearable Acoustic Sensors in Noisy Environments.
Proceedings of the Intelligent Robotics and Applications - 17th International Conference, 2024

Accurate and Robust Scene Text Recognition via Adversarial Training.
Proceedings of the IEEE International Conference on Acoustics, 2024

Learning Personalized Alignment for Evaluating Open-ended Text Generation.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023
On-orbit geometric calibration of satellite laser altimeters using infrared detectors and corner-cube retroreflectors.
Int. J. Digit. Earth, December, 2023

Facial Expression Recognition Based on Fine-Tuned Channel-Spatial Attention Transformer.
Sensors, August, 2023

Beyond OCR + VQA: Towards end-to-end reading and reasoning for robust and accurate textvqa.
Pattern Recognit., June, 2023

Research and implementation of modulation recognition based on cascaded feature fusion.
IET Commun., June, 2023

Characteristics and driving factors of the technology cooperation network evolution: a case study of solid waste treatment field in China.
Technol. Anal. Strateg. Manag., May, 2023

Centroid Extraction of Laser Spots Captured by Infrared Detectors Combining Laser Footprint Images and Detector Observation Data.
Remote. Sens., April, 2023

Research on Glacier Elevation Variability in the Qilian Mountains of the Qinghai-Tibet Plateau Based on Topographic Correction by Pyramid Registration.
Remote. Sens., January, 2023

A Geometric Calibration Method Without a Field Site of the GF-7 Satellite Laser Relying on a Surface Mathematical Model.
IEEE Trans. Geosci. Remote. Sens., 2023

Denoising and Accuracy Evaluation of ICESat-2/ATLAS Photon Data for Nearshore Waters Based on Improved Local Distance Statistics.
Remote. Sens., 2023

Towards precision medicine based on a continuous deep learning optimization and ensemble approach.
npj Digit. Medicine, 2023

IPAD: Iterative, Parallel, and Diffusion-based Network for Scene Text Recognition.
CoRR, 2023

End-to-end Story Plot Generator.
CoRR, 2023

Learning Personalized Story Evaluation.
CoRR, 2023

Masked and Permuted Implicit Context Learning for Scene Text Recognition.
CoRR, 2023

Modeling Scattering Coefficients using Self-Attentive Complex Polynomials with Image-based Representation.
CoRR, 2023

Learning Compiler Pass Orders using Coreset and Normalized Value Prediction.
Proceedings of the International Conference on Machine Learning, 2023

MACTA: A Multi-agent Reinforcement Learning Approach for Cache Timing Attacks and Detection.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

AutoCAT: Reinforcement Learning for Automated Exploration of Cache-Timing Attacks.
Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2023

2022
A Separate Calibration Method of Laser Pointing and Ranging for the GF-7 Satellite Laser That Does Not Require Field Detectors.
Remote. Sens., December, 2022

Feature-Selection High-Resolution Network With Hypersphere Embedding for Semantic Segmentation of VHR Remote Sensing Images.
IEEE Trans. Geosci. Remote. Sens., 2022

Research on the formation mechanism of big data technology cooperation networks: empirical evidence from China.
Scientometrics, 2022

Spatiotemporal Variations of Aerosols in China during the COVID-19 Pandemic Lockdown.
Remote. Sens., 2022

AutoCAT: Reinforcement Learning for Automated Exploration of Cache Timing-Channel Attacks.
CoRR, 2022

Nocturne: a scalable driving benchmark for bringing multi-agent learning one step closer to the real world.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Individual Recognition Method of Radiation Source Based on Deep Subdomain Adaptation Network.
Proceedings of the 22nd IEEE International Conference on Communication Technology, 2022

2021
An Improved Fmask Method for Cloud Detection in GF-6 WFV Based on Spectral-Contextual Information.
Remote. Sens., 2021

Innovation Cooperation Network Evolution About Green Building Technology With Government Intervention: Based on Evolutionary Game Theory.
IEEE Access, 2021

A Cost-Efficient Framework for Scene Text Detection in the Wild.
Proceedings of the PRICAI 2021: Trends in Artificial Intelligence, 2021

Beyond OCR + VQA: Involving OCR into the Flow for Robust and Accurate TextVQA.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Educational Data Mining: Discovering Principal Factors for Better Academic Performance.
Proceedings of the BDET 2021: The 3rd International Conference on Big Data Engineering and Technology, 2021

2019
Hybrid Composition with IdleBlock: More Efficient Networks for Image Recognition.
CoRR, 2019

2018
Efficient Group Signature Scheme Over NTRU Lattice.
Proceedings of the Cloud Computing and Security - 4th International Conference, 2018

