Bing Yin

This is a disambiguation page: it lists papers by several people who share the same or a similar name.

Bibliography

2026
Prototype-Based Modality-Specific Feature Distillation for Incomplete Multimodal Sentiment Analysis.
IEEE Trans. Emerg. Top. Comput. Intell., April, 2026

Expert Upcycling: Shifting the Compute-Efficient Frontier of Mixture-of-Experts.
CoRR, April, 2026

Controllable and Verifiable Tool-Use Data Synthesis for Agentic Reinforcement Learning.
CoRR, April, 2026

Dictionary-Aligned Concept Control for Safeguarding Multimodal LLMs.
CoRR, April, 2026

Training LLMs for Multi-Step Tool Orchestration with Constrained Data Synthesis and Graduated Rewards.
CoRR, March, 2026

TDATR: Improving End-to-End Table Recognition via Table Detail-Aware Learning and Cell-Level Visual Alignment.
CoRR, March, 2026

Stepwise Penalization for Length-Efficient Chain-of-Thought Reasoning.
CoRR, March, 2026

HeaPA: Difficulty-Aware Heap Sampling and On-Policy Query Augmentation for LLM Reinforcement Learning.
CoRR, January, 2026

Teach Diffusion Language Models to Learn from Their Own Mistakes.
CoRR, January, 2026

MCLMR: A Model-Agnostic Causal Learning Framework for Multi-Behavior Recommendation.
Proceedings of the ACM Web Conference 2026, 2026

Rethinking Popularity Bias in Collaborative Filtering via Analytical Vector Decomposition.
Proceedings of the 32nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining V.1, 2026

Binary-Gaussian: Compact and Progressive Representation for 3D Gaussian Segmentation.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Breaking Model Lock-in: Cost-Efficient Zero-Shot LLM Routing via a Universal Latent Space.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

DocR1: Evidence Page-Guided GRPO for Multi-Page Document Understanding.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

READ: Real-time and Efficient Asynchronous Diffusion for Audio-driven Talking Head Generation.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
REST: Diffusion-based Real-time End-to-end Streaming Talking Head Generation via ID-Context Caching and Asynchronous Streaming Distillation.
CoRR, December, 2025

Rethinking Popularity Bias in Collaborative Filtering via Analytical Vector Decomposition.
CoRR, December, 2025

Self-Rewarding PPO: Aligning Large Language Models with Demonstrations Only.
CoRR, October, 2025

POPI: Personalizing LLMs via Optimized Natural Language Preference Inference.
CoRR, October, 2025

DeepPlanner: Scaling Planning Capability for Deep Research Agents via Advantage Shaping.
CoRR, October, 2025

Improving Sampling Efficiency in RLVR through Adaptive Rollout and Response Reuse.
CoRR, September, 2025

TransLLM: A Unified Multi-Task Foundation Framework for Urban Transportation via Learnable Prompting.
CoRR, August, 2025

SessionIntentBench: A Multi-task Inter-session Intention-shift Modeling Benchmark for E-commerce Customer Behavior Understanding.
CoRR, July, 2025

A multimodal multidomain multilingual medical foundation model for zero shot clinical diagnosis.
npj Digit. Medicine, 2025

Diff-TST: Diffusion model for one-shot text-image style transfer.
Expert Syst. Appl., 2025

VGTS: Visually Guided Text Spotting for novel categories in historical manuscripts.
Expert Syst. Appl., 2025

Length-aware center loss for sequence to sequence Thai scene text recognition.
Eng. Appl. Artif. Intell., 2025

Hephaestus: Improving Fundamental Agent Capabilities of Large Language Models through Continual Pre-Training.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

IHEval: Evaluating Language Models on Following the Instruction Hierarchy.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

LongLeader: A Comprehensive Leaderboard for Large Language Models in Long-context Scenarios.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Exploring Part-Informed Visual-Language Learning for Person Re-Identification.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

Col-OLHTR: A Novel Framework for Multimodal Online Handwritten Text Recognition.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Efficient Fine-tuning Strategies for Enhancing Face Recognition Performance in Challenging Scenarios.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

DORM: Preference Data Weights Optimization for Reward Modeling in LLM Alignment.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

DrAgent: Empowering Large Language Models as Medical Agents for Multi-hop Medical Reasoning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

Can Language Models Follow Multiple Turns of Entangled Instructions?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

Improving Large Language Models Function Calling and Interpretability via Guided-Structured Templates.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

EmotiveTalk: Expressive Talking Head Generation through Audio Information Decoupling and Emotional Video Diffusion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

M-LLM Based Video Frame Selection for Efficient Video Understanding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Aligning Large Language Models with Implicit Preferences from User-Generated Content.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

UniConv: Unifying Retrieval and Response Generation for Large Language Models in Conversations.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

AutoMixAlign: Adaptive Data Mixing for Multi-Task Preference Optimization in LLMs.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Cross-modulated Attention Transformer for RGBT Tracking.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond.
ACM Trans. Knowl. Discov. Data, July, 2024

Evaluation of GPM IMERG-FR Product for Computing Rainfall Erosivity for Mainland China.
Remote. Sens., April, 2024

Weakly supervised scene text generation for low-resource languages.
Expert Syst. Appl., March, 2024

Dynamic facial expression recognition with pseudo-label guided multi-modal pre-training.
IET Comput. Vis., February, 2024

SEMv2: Table separation line detection based on instance segmentation.
Pattern Recognit., 2024

NDOrder: Exploring a novel decoding order for scene text recognition.
Expert Syst. Appl., 2024

RNR: Teaching Large Language Models to Follow Roles and Rules.
CoRR, 2024

Inductive or Deductive? Rethinking the Fundamental Reasoning Abilities of LLMs.
CoRR, 2024

COSMO: A Large-Scale E-commerce Common Sense Knowledge Generation and Serving System at Amazon.
Proceedings of the Companion of the 2024 International Conference on Management of Data, 2024

Shopping MMLU: A Massive Multi-Task Online Shopping Benchmark for Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

1DFormer: A Transformer Architecture Learning 1D Landmark Representations for Facial Landmark Tracking.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

MEMORYLLM: Towards Self-Updatable Large Language Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

LightLT: A Lightweight Representation Quantization Framework for Long-Tail Data.
Proceedings of the 40th IEEE International Conference on Data Engineering, 2024

ICDAR 2024 Competition on Recognition of Chemical Structures.
Proceedings of the Document Analysis and Recognition - ICDAR 2024 - 18th International Conference, Athens, Greece, August 30, 2024

MIND: Multimodal Shopping Intention Distillation from Large Vision-language Models for E-commerce Purchase Understanding.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Democratizing Large Language Models via Personalized Parameter-Efficient Fine-tuning.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Large Language Models Are Poor Clinical Decision-Makers: A Comprehensive Benchmark.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Evolutionary Contrastive Distillation for Language Model Alignment.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Data Diversity Matters for Robust Instruction Tuning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

IntentionQA: A Benchmark for Evaluating Purchase Intention Comprehension Abilities of Language Models in E-commerce.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

NAMER: Non-autoregressive Modeling for Handwritten Mathematical Expression Recognition.
Proceedings of the Computer Vision - ECCV 2024, 2024

Image as a Language: Revisiting Scene Text Recognition via Balanced, Unified and Synchronized Vision-Language Reasoning Network.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Rainfall Erosivity Mapping for Tibetan Plateau Using High-Resolution Temporal and Spatial Precipitation Datasets for the Third Pole.
Remote. Sens., November, 2023

Visible-infrared person re-identification via specific and shared representations learning.
Vis. Intell., 2023

1DFormer: Learning 1D Landmark Representations via Transformer for Facial Landmark Tracking.
CoRR, 2023

Situated Natural Language Explanations.
CoRR, 2023

CCGen: Explainable Complementary Concept Generation in E-Commerce.
CoRR, 2023

Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond.
CoRR, 2023

OTS: A One-shot Learning Approach for Text Spotting in Historical Manuscripts.
CoRR, 2023

SEMv2: Table Separation Line Detection Based on Conditional Convolution.
CoRR, 2023

Mutually-paced Knowledge Distillation for Cross-lingual Temporal Knowledge Graph Reasoning.
Proceedings of the ACM Web Conference 2023, 2023

Implicit Query Parsing at Amazon Product Search.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Enhancing User Intent Capture in Session-Based Recommendation with Attribute Patterns.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Amazon-M2: A Multilingual Multi-locale Shopping Session Dataset for Recommendation and Text Generation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Handwritten Chemical Structure Image to Structure-Specific Markup Using Random Conditional Guided Decoder.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

LightToken: A Task and Model-agnostic Lightweight Token Embedding Framework for Pre-trained Language Models.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

A Unified Framework of Graph Information Bottleneck for Robustness and Membership Privacy.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Knowledge Graph Reasoning over Entities and Numerical Values.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Exploiting Intent Evolution in E-commercial Query Recommendation.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

HomoDistil: Homotopic Task-Agnostic Distillation of Pre-trained Transformers.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

End-to-End Multilingual Text Recognition Based on Byte Modeling.
Proceedings of the Image and Graphics - 12th International Conference, 2023

A Multimodal Text Block Segmentation Framework for Photo Translation.
Proceedings of the Image and Graphics - 12th International Conference, 2023

Speech4Mesh: Speech-Assisted Monocular 3D Facial Reconstruction for Speech-Driven 3D Facial Animation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Knowledge-Selective Pretraining for Attribute Value Extraction.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Improving Consistency for Text Summarization with Energy Functions.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Context-Aware Query Rewriting for Improving Users' Search Experience on E-commerce Websites.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 5: Industry Track), 2023

FolkScope: Intention Knowledge Graph Construction for E-commerce Commonsense Discovery.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Multimodal Prompt Learning for Product Title Generation with Extremely Limited Labels.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

SCOTT: Self-Consistent Chain-of-Thought Distillation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Graph Reasoning for Question Answering with Triplet Retrieval.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
A multimodal attention fusion network with a dynamic vocabulary for TextVQA.
Pattern Recognit., 2022

FolkScope: Intention Knowledge Graph Construction for Discovering E-commerce Commonsense.
CoRR, 2022

Short Text Pre-training with Extended Token Classification for E-commerce Query Understanding.
CoRR, 2022

DiP-GNN: Discriminative Pre-Training of Graph Neural Networks.
CoRR, 2022

Vision-Language Adaptive Mutual Decoder for OOV-STR.
CoRR, 2022

RETE: Retrieval-Enhanced Temporal Event Forecasting on Unified Query Product Evolutionary Graph.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

ROSE: Robust Caches for Amazon Product Search.
Proceedings of the Companion of The Web Conference 2022, Virtual Event / Lyon, France, April 25, 2022

Can Clicks Be Both Labels and Features?: Unbiased Behavior Feature Collection and Uncertainty-aware Learning to Rank.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Query Attribute Recommendation at Amazon Search.
Proceedings of the RecSys '22: Sixteenth ACM Conference on Recommender Systems, Seattle, WA, USA, September 18, 2022

Learning to Sample and Aggregate: Few-shot Reasoning over Temporal Knowledge Graphs.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

SEQZERO: Few-shot Compositional Semantic Parsing with Sequential Prompts and Zero-shot Models.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

CERES: Pretraining of Graph-Conditioned Transformer for Semi-Structured Session Data.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Retrieval-Augmented Multilingual Keyphrase Generation with Retriever-Generator Iterative Training.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

AutoGDA: Automated Graph Data Augmentation for Node Classification.
Proceedings of the Learning on Graphs Conference, 2022

Condensing Graphs via One-Step Gradient Matching.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

SGBANet: Semantic GAN and Balanced Attention Network for Arbitrarily Oriented Scene Text Recognition.
Proceedings of the Computer Vision - ECCV 2022, 2022

Multilingual Knowledge Graph Completion with Self-Supervised Adaptive Graph Alignment.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Letter from the Special Issue Editor.
IEEE Data Eng. Bull., 2021

Graph-based Multilingual Product Retrieval in E-Commerce Search.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Industry Papers, 2021

Improving Pretrained Models for Zero-shot Multi-label Text Classification through Reinforced Label Hierarchy Reasoning.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

MetaTS: Meta Teacher-Student Network for Multilingual Sequence Labeling with Minimal Supervision.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

QUEACO: Borrowing Treasures from Weakly-labeled Behavior Data for Query Attribute Value Extraction.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

Named Entity Recognition with Small Strongly Labeled and Large Weakly Labeled Data.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
On Data Augmentation for Extreme Multi-label Classification.
CoRR, 2020

Shareable Representations for Search Query Understanding.
CoRR, 2020

Learn to Cross-lingual Transfer with Meta Graph Learning Across Heterogeneous Languages.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

2019
Semantic Product Search.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

2018
Speaker Diarization with Enhancing Speech for the First DIHARD Challenge.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

An Analysis of Speaker Diarization Fusion Methods For The First DIHARD Challenge.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2016
A purchasing management strategy model about grading and classifying railway materials based on material grouping.
Proceedings of the International Conference on Logistics, Informatics and Service Sciences, 2016

2014
Evaluating the Ability of NPP-VIIRS Nighttime Light Data to Estimate the Gross Domestic Product and the Electric Power Consumption of China at Multiple Scales: A Comparison with DMSP-OLS Data.
Remote. Sens., 2014

2012
Endpoint learning for multilinear commutator of singular integral on space of homogeneous type.
Proceedings of the International Conference on Machine Learning and Cybernetics, 2012

2009
Hierarchical Stability-Based Model Selection for Clustering Algorithms.
Proceedings of the International Conference on Machine Learning and Applications, 2009

