Weinan Zhang

Orcid: 0000-0002-0127-2425

Affiliations:
  • Shanghai Jiao Tong University, John Hopcroft Center for Computer Science, China
  • University College London, Department of Computer Science, UK


According to our database1, Weinan Zhang authored at least 332 papers between 2011 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
A Survey on Bid Optimization in Real-Time Bidding Display Advertising.
ACM Trans. Knowl. Discov. Data, April, 2024

Play to Your Strengths: Collaborative Intelligence of Conventional Recommender Models and Large Language Models.
CoRR, 2024

An Aligning and Training Framework for Multimodal Recommendations.
CoRR, 2024

TRAD: Enhancing LLM Agents with Step-Wise Thought Retrieval and Aligned Decision.
CoRR, 2024

Looking Ahead to Avoid Being Late: Solving Hard-Constrained Traveling Salesman Problem.
CoRR, 2024

Towards Efficient and Effective Unlearning of Large Language Models for Recommendation.
CoRR, 2024

Offline Fictitious Self-Play for Competitive Games.
CoRR, 2024

Large-Scale Actionless Video Pre-Training via Discrete Diffusion for Efficient Policy Learning.
CoRR, 2024

Entropy-Regularized Token-Level Policy Optimization for Large Language Models.
CoRR, 2024

CityFlowER: An Efficient and Realistic Traffic Simulator with Embedded Machine Learning Models.
CoRR, 2024

Contrastive Diffuser: Planning Towards High Return States via Contrastive Learning.
CoRR, 2024

DiffStitch: Boosting Offline Reinforcement Learning with Diffusion-based Trajectory Stitching.
CoRR, 2024

ODICE: Revealing the Mystery of Distribution Correction Estimation via Orthogonal-gradient Update.
CoRR, 2024

InfoRank: Unbiased Learning-to-Rank via Conditional Mutual Information Minimization.
CoRR, 2024

D2K: Turning Historical Data into Retrievable Knowledge for Recommender Systems.
CoRR, 2024

Adapting Large Language Models for Education: Foundational Capabilities, Potentials, and Challenges.
CoRR, 2024

GeoGalactica: A Scientific Large Language Model in Geoscience.
CoRR, 2024

K2: A Foundation Language Model for Geoscience Knowledge Understanding and Utilization.
Proceedings of the 17th ACM International Conference on Web Search and Data Mining, 2024

2023
Large sequence models for sequential decision-making: a survey.
Frontiers Comput. Sci., December, 2023

Learning to Retrieve User Behaviors for Click-through Rate Estimation.
ACM Trans. Inf. Syst., October, 2023

AIM: Automatic Interaction Machine for Click-Through Rate Prediction.
IEEE Trans. Knowl. Data Eng., April, 2023

Large-Scale Interactive Recommendation With Tree-Structured Reinforcement Learning.
IEEE Trans. Knowl. Data Eng., April, 2023

Offline Pre-trained Multi-agent Decision Transformer.
Mach. Intell. Res., April, 2023

Time-Series Representation Learning in Topology Prediction for Passive Optical Network of Telecom Operators.
Sensors, March, 2023

Information Retrieval meets Large Language Models: A strategic report from Chinese IR community.
AI Open, January, 2023

MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning.
J. Mach. Learn. Res., 2023

Adaptation Augmented Model-based Policy Optimization.
J. Mach. Learn. Res., 2023

GFS: Graph-based Feature Synthesis for Prediction over Relational Databases.
CoRR, 2023

Vision-Language Foundation Models as Effective Robot Imitators.
CoRR, 2023

Diffusion Models for Reinforcement Learning: A Survey.
CoRR, 2023

ALT: Towards Fine-grained Alignment between Language and CTR Models for Click-Through Rate Prediction.
CoRR, 2023

ClickPrompt: CTR Models are Strong Prompt Generators for Adapting Language Models to CTR Prediction.
CoRR, 2023

Quantifying Zero-shot Coordination Capability with Behavior Preferring Partners.
CoRR, 2023

Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training.
CoRR, 2023

Boosting Studies of Multi-Agent Reinforcement Learning on Google Research Football Environment: the Past, Present, and Future.
CoRR, 2023

CodeApex: A Bilingual Programming Evaluation Benchmark for Large Language Models.
CoRR, 2023

ReLLa: Retrieval-enhanced Large Language Models for Lifelong Sequential Behavior Comprehension in Recommendation.
CoRR, 2023

Information Retrieval Meets Large Language Models: A Strategic Report from Chinese IR Community.
CoRR, 2023

Is Risk-Sensitive Reinforcement Learning Properly Resolved?
CoRR, 2023

Towards Open-World Recommendation with Knowledge Augmentation from Large Language Models.
CoRR, 2023

How Can Recommender Systems Benefit from Large Language Models: A Survey.
CoRR, 2023

Learning A Foundation Language Model for Geoscience Knowledge Understanding and Utilization.
CoRR, 2023

Privileged Knowledge Distillation for Sim-to-Real Policy Generalization.
CoRR, 2023

Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning.
CoRR, 2023

MADiff: Offline Multi-agent Learning with Diffusion Models.
CoRR, 2023

An Empirical Study on Google Research Football Multi-agent Scenarios.
CoRR, 2023

Covidia: COVID-19 Interdisciplinary Academic Knowledge Graph.
CoRR, 2023

FMGNN: Fused Manifold Graph Neural Network.
CoRR, 2023

Integrated Ranking for News Feed with Reinforcement Learning.
Proceedings of the Companion Proceedings of the ACM Web Conference 2023, 2023

AutoGen: An Automated Dynamic Model Generation Framework for Recommender System.
Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, 2023

A Bird's-eye View of Reranking: From List Level to Page Level.
Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, 2023

Learning to Distinguish Multi-User Coupling Behaviors for TV Recommendation.
Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, 2023

An F-shape Click Model for Information Retrieval on Multi-block Mobile Pages.
Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, 2023

International Workshop on Deep Learning Practice for High-Dimensional Sparse Data with RecSys 2023.
Proceedings of the 17th ACM Conference on Recommender Systems, 2023

Lending Interaction Wings to Recommender Systems with Conversational Agents.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

A Feature-Based Coalition Game Framework with Privileged Knowledge Transfer for User-tag Profile Modeling.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

ReLoop2: Building Self-Adaptive Recommendation Models via Responsive Error Compensation Loop.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Dense Representation Learning and Retrieval for Tabular Data Prediction.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Specify Robust Causal Representation from Mixed Observations.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

On-device Integrated Re-ranking with Heterogeneous Behavior Modeling.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

GMOCAT: A Graph-Enhanced Multi-Objective Method for Computerized Adaptive Testing.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Deep Landscape Forecasting in Multi-Slot Real-Time Bidding.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

MAP: A Model-agnostic Pretraining Framework for Click-through Rate Prediction.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Learning Multi-Agent Intention-Aware Communication for Optimal Multi-Order Execution in Finance.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Large Decision Models.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Multi-embodiment Legged Robot Control as a Sequence Modeling Problem.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Sim-to-Real Transfer for Quadrupedal Locomotion via Terrain Transformer.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

GEAR: A GPU-Centric Experience Replay System for Large Reinforcement Learning Models.
Proceedings of the International Conference on Machine Learning, 2023

Order Matters: Agent-by-agent Policy Optimization.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Visual Imitation Learning with Patch Rewards.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Personalized Diversification for Neural Re-ranking in Recommendation.
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

Text Classification In The Wild: A Large-Scale Long-Tailed Name Normalization Dataset.
Proceedings of the IEEE International Conference on Acoustics, 2023

Optimal Real-Time Bidding Strategy for Position Auctions in Online Advertising.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

Graph Enhanced Hierarchical Reinforcement Learning for Goal-oriented Learning Path Recommendation.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

Replace Scoring with Arrangement: A Contextual Set-to-Arrangement Framework for Learning-to-Rank.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

DRL4IR: 4th Workshop on Deep Reinforcement Learning for Information Retrieval.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

Learning Decomposed Spatial Relations for Multi-Variate Time-Series Modeling.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Set-to-Sequence Ranking-Based Concept-Aware Learning Path Recommendation.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Efficient policy evaluation by matrix sketching.
Frontiers Comput. Sci., October, 2022

GraphHINGE: Learning Interaction Models of Structured Neighborhood on Heterogeneous Information Network.
ACM Trans. Inf. Syst., 2022

Beyond Relevance Ranking: A General Graph Matching Framework for Utility-Oriented Learning to Rank.
ACM Trans. Inf. Syst., 2022

Hierarchical Multiagent Reinforcement Learning for Allocating Guaranteed Display Ads.
IEEE Trans. Neural Networks Learn. Syst., 2022

Spatio-Temporal Meta Learning for Urban Traffic Prediction.
IEEE Trans. Knowl. Data Eng., 2022

Learning to select cuts for efficient mixed-integer programming.
Pattern Recognit., 2022

A gradient boosting tree model for multi-department venous thromboembolism risk assessment with imbalanced data.
J. Biomed. Informatics, 2022

On Realization of Intelligent Decision-Making in the Real World: A Foundation Decision Model Perspective.
CoRR, 2022

Planning Immediate Landmarks of Targets for Model-Free Skill Transfer across Agents.
CoRR, 2022

NeurIPS 2022 Competition: Driving SMARTS.
CoRR, 2022

RITA: Boost Autonomous Driving Simulators with Realistic Interactive Traffic Flow.
CoRR, 2022

Understanding or Manipulation: Rethinking Online Performance Gains of Modern Recommender Systems.
CoRR, 2022

Forgetting Fast in Recommender Systems.
CoRR, 2022

A Survey on Model-based Reinforcement Learning.
CoRR, 2022

Learning Enhanced Representations for Tabular Data via Neighborhood Propagation.
CoRR, 2022

PerfectDou: Dominating DouDizhu with Perfect Information Distillation.
CoRR, 2022

Model-based Multi-agent Reinforcement Learning: Recent Progress and Prospects.
CoRR, 2022

Efficient Policy Space Response Oracles.
CoRR, 2022

Towards Collaborative Question Answering: A Preliminary Study.
CoRR, 2022

Phrase-level Adversarial Example Generation for Neural Machine Translation.
CoRR, 2022

Who to Watch Next: Two-side Interactive Networks for Live Broadcast Recommendation.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

Learn over Past, Evolve for Future: Search-based Time-aware Recommendation with Sequential Behavior Data.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

Improving Knowledge Tracing with Collaborative Information.
Proceedings of the WSDM '22: The Fifteenth ACM International Conference on Web Search and Data Mining, Virtual Event / Tempe, AZ, USA, February 21, 2022

DRL4IR: 3rd Workshop on Deep Reinforcement Learning for Information Retrieval.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Multi-Level Interaction Reranking with User Behavior History.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

RankFlow: Joint Optimization of Multi-Stage Cascade Ranking Systems as Flows.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Neural Statistics for Click-Through Rate Prediction.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Branch Ranking for Efficient Mixed-Integer Programming via Offline Ranking-Based Policy Learning.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2022

PerfectDou: Dominating DouDizhu with Perfect Information Distillation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Multi-Agent Reinforcement Learning is a Sequence Modeling Problem.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Bootstrapped Transformer for Offline Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

NeoRL: A Near Real-World Benchmark for Offline Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Reinforcement Learning with Automated Auxiliary Loss Search.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Learning Enhanced Representation for Tabular Data via Neighborhood Propagation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Honor of Kings Arena: an Environment for Generalization in Competitive Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

User-tag Profile Modeling in Recommendation System via Contrast Weighted Tag Masking.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Combo-Fashion: Fashion Clothes Matching CTR Prediction with Item History.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Retrieval-Based Gradient Boosting Decision Trees for Disease Risk Assessment.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

4th Workshop on Deep Learning Practice and Theory for High-Dimensional Sparse and Imbalanced Data with KDD 2022.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Geometer: Graph Few-Shot Class-Incremental Learning via Prototype Representation.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Spatio-Temporal Graph Few-Shot Learning with Cross-City Knowledge Transfer.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Towards Applicable Reinforcement Learning: Improving the Generalization and Sample Efficiency with Policy Ensemble.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Goal-Conditioned Reinforcement Learning: Problems and Solutions.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Neural Re-ranking in Multi-stage Recommender Systems: A Review.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Promoting Quality and Diversity in Population-based Reinforcement Learning via Hierarchical Trajectory Space Exploration.
Proceedings of the 2022 International Conference on Robotics and Automation, 2022

Aggregating Intra-class and Inter-class Information for Multi-label Text Classification.
Proceedings of the Neural Information Processing - 29th International Conference, 2022

Heterogeneous Graph Representation for Knowledge Tracing.
Proceedings of the Neural Information Processing - 29th International Conference, 2022

Plan Your Target and Learn Your Skills: Transferable State-Only Imitation Learning via Decoupled Policy Optimization.
Proceedings of the International Conference on Machine Learning, 2022

Why Propagate Alone? Parallel Use of Labels and Features on Graphs.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Inductive Relation Prediction Using Analogy Subgraph Embeddings.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Automatical Graph-based Knowledge Tracing.
Proceedings of the 15th International Conference on Educational Data Mining, 2022

PAEG: Phrase-level Adversarial Example Generation for Neural Machine Translation.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Multi-Scale User Behavior Network for Entire Space Multi-Task Learning.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

Towards Return Parity in Markov Decision Processes.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

Nested Named Entity Recognition with Span-level Graphs.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Multi-View Graph Representation for Programming Language Processing: An Investigation into Algorithm Detection.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Learning Graph Representation With Generative Adversarial Nets.
IEEE Trans. Knowl. Data Eng., 2021

Which Heroes to Pick? Learning to Draft in MOBA Games With Neural Networks and Tree Search.
IEEE Trans. Games, 2021

Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC Tasks.
CoRR, 2021

Curriculum Offline Imitation Learning.
CoRR, 2021

Context-aware Reranking with Utility Maximization for Recommendation.
CoRR, 2021

MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning.
CoRR, 2021

NeoRL: A Near Real-World Benchmark for Offline Reinforcement Learning.
CoRR, 2021

QA4PRF: A Question Answering Based Framework for Pseudo Relevance Feedback.
IEEE Access, 2021

An Adversarial Imitation Click Model for Information Retrieval.
Proceedings of the WWW '21: The Web Conference 2021, 2021

Tracing Knowledge State with Individual Cognition and Acquisition Estimation.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

A Graph-Enhanced Click Model for Web Search.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

DRL4IR: 2nd Workshop on Deep Reinforcement Learning for Information Retrieval.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Learning to Build High-Fidelity and Robust Environment Models.
Proceedings of the Machine Learning and Knowledge Discovery in Databases. Research Track, 2021

Model-Based Offline Policy Optimization with Distribution Correcting Regularization.
Proceedings of the Machine Learning and Knowledge Discovery in Databases. Research Track, 2021

Curriculum Offline Imitating Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

On Effective Scheduling of Model-based Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

ModularNAS: Towards Modularized and Reusable Neural Architecture Search.
Proceedings of Machine Learning and Systems 2021, 2021

3rd International Workshop on Deep Learning Practice for High-Dimensional Sparse Data with KDD 2021.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Retrieval & Interaction Machine for Tabular Data Prediction.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

An Embedding Learning Framework for Numerical Features in CTR Prediction.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Task-wise Split Gradient Boosting Trees for Multi-center Diabetes Prediction.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

MapGo: Model-Assisted Policy Optimization for Goal-Oriented Tasks.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Deep Learning for Click-Through Rate Estimation.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

MARS: Markov Molecular Sampling for Multi-objective Drug Discovery.
Proceedings of the 9th International Conference on Learning Representations, 2021

Learning Logic Rules for Document-Level Relation Extraction.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Signal Instructed Coordination in Cooperative Multi-agent Reinforcement Learning.
Proceedings of the Distributed Artificial Intelligence - Third International Conference, 2021

LiteratureQA: A Qestion Answering Corpus with Graph Knowledge on Academic Literature.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

GAKG: A Multimodal Geoscience Academic Knowledge Graph.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

Energy-Based Imitation Learning.
Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021

Fork or Fail: Cycle-Consistent Training with Many-to-One Mappings.
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

Glancing Transformer for Non-Autoregressive Neural Machine Translation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Universal Trading for Order Execution with Oracle Policy Distillation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Data-Driven Multimodal Patrol Planning for Anti-poaching.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
State representation modeling for deep reinforcement learning based recommendation.
Knowl. Based Syst., 2020

Probabilistic robust regression with adaptive weights - a case study on face recognition.
Frontiers Comput. Sci., 2020

Sobolev Wasserstein GAN.
CoRR, 2020

Reciprocal Supervised Learning Improves Neural Machine Translation.
CoRR, 2020

Learning Interaction Models of Structured Neighborhood on Heterogeneous Information Network.
CoRR, 2020

SMARTS: Scalable Multi-Agent Reinforcement Learning Training School for Autonomous Driving.
CoRR, 2020

CycleGT: Unsupervised Graph-to-Text and Text-to-Graph Generation via Cycle Training.
CoRR, 2020

Active Sentence Learning by Adversarial Uncertainty Sampling in Discrete Space.
CoRR, 2020

Large-Scale Optimal Transport via Adversarial Training with Cycle-Consistency.
CoRR, 2020

Truth Inference With a Deep Clustering-Based Aggregation Model.
IEEE Access, 2020

QA4IE: A Question Answering Based System for Document-Level General Information Extraction.
IEEE Access, 2020

Sequential Recommendation with Dual Side Neighbor-based Collaborative Relation Modeling.
Proceedings of the WSDM '20: The Thirteenth ACM International Conference on Web Search and Data Mining, 2020

Interactive Recommender System via Knowledge Graph-enhanced Reinforcement Learning.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

TADS: Learning Time-Aware Scheduling Policy with Dyna-Style Planning for Spaced Repetition.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

QuAChIE: Question Answering based Chinese Information Extraction System.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

User Behavior Retrieval for Click-Through Rate Prediction.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

A Deep Recurrent Survival Model for Unbiased Ranking.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Deep Reinforcement Learning for Information Retrieval: Fundamentals and Advances.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

GIKT: A Graph-Based Interaction Model for Knowledge Tracing.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2020

Efficient Projection-free Algorithms for Saddle Point Problems.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Model-based Policy Optimization with Unsupervised Model Adaptation.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

AutoFIS: Automatic Feature Interaction Selection in Factorization Models for Click-Through Rate Prediction.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

An Efficient Neighborhood-based Interaction Model for Recommendation on Heterogeneous Graph.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Aggregating Crowd Wisdom with Side Information via a Clustering-based Label-aware Autoencoder.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

DropNAS: Grouped Operation Dropout for Differentiable Architecture Search.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Efficient and Robust High-Dimensional Linear Contextual Bandits.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Multi-Agent Determinantal Q-Learning.
Proceedings of the 37th International Conference on Machine Learning, 2020

Bidirectional Model-based Policy Optimization.
Proceedings of the 37th International Conference on Machine Learning, 2020

GraphAF: a Flow-based Autoregressive Model for Molecular Graph Generation.
Proceedings of the 8th International Conference on Learning Representations, 2020

Multi-Agent Interactions Modeling with Correlated Policies.
Proceedings of the 8th International Conference on Learning Representations, 2020

Active Sentence Learning by Adversarial Uncertainty Sampling in Discrete Space.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Improving Unsupervised Domain Adaptation with Variational Information Bottleneck.
Proceedings of the ECAI 2020 - 24th European Conference on Artificial Intelligence, 29 August-8 September 2020, Santiago de Compostela, Spain, August 29 - September 8, 2020, 2020


GeneraLight: Improving Environment Generalization of Traffic Signal Control via Meta Reinforcement Learning.
Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

Learning to Infer User Hidden States for Online Sequential Advertising.
Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

U-rank: Utility-oriented Learning to Rank with Implicit Feedback.
Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

Sequential Advertising Agent with Interpretable User Hidden Intents.
Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

Efficient Spectrum-Revealing CUR Matrix Decomposition.
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020

Bi-Level Actor-Critic for Multi-Agent Coordination.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Towards Making the Most of BERT in Neural Machine Translation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Author Name Disambiguation on Heterogeneous Information Network with Adversarial Representation Learning.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Product-Based Neural Networks for User Response Prediction over Multi-Field Categorical Data.
ACM Trans. Inf. Syst., 2019

Signal Instructed Coordination in Team Competition.
CoRR, 2019

An End-to-End Neighborhood-based Interaction Model for Knowledge-enhanced Recommendation.
CoRR, 2019

Towards Efficient and Unbiased Implementation of Lipschitz Continuity in GANs.
CoRR, 2019

Layout Design for Intelligent Warehouse by Evolution With Fitness Approximation.
IEEE Access, 2019

CityFlow: A Multi-Agent Reinforcement Learning Environment for Large Scale City Traffic Scenario.
Proceedings of the World Wide Web Conference, 2019

CommunityGAN: Community Detection with Generative Adversarial Nets.
Proceedings of the World Wide Web Conference, 2019

Sampled in Pairs and Driven by Text: A New Graph Embedding Framework.
Proceedings of the World Wide Web Conference, 2019

Triple-to-Text: Converting RDF Triples into High-Quality Natural Languages via Optimizing an Inverse KL Divergence.
Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019

Lifelong Sequential Modeling with Personalized Memorization for User Response Prediction.
Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019

Deep Landscape Forecasting for Real-time Bidding Advertising.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

TrajGuard: A Comprehensive Trajectory Copyright Protection Scheme.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Hybrid Actor-Critic Reinforcement Learning in Parameterized Action Space.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Lipschitz Generative Adversarial Nets.
Proceedings of the 36th International Conference on Machine Learning, 2019

CoT: Cooperative Training for Generative Modeling of Discrete Data.
Proceedings of the 36th International Conference on Machine Learning, 2019

AdaShift: Decorrelation and Convergence of Adaptive Learning Rate Methods.
Proceedings of the 7th International Conference on Learning Representations, 2019

Exploring Diverse Expressions for Paraphrase Generation.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Factorized Q-learning for large-scale multi-agent systems.
Proceedings of the First International Conference on Distributed Artificial Intelligence, 2019

Generative adversarial exploration for reinforcement learning.
Proceedings of the First International Conference on Distributed Artificial Intelligence, 2019

Multi-Agent Reinforcement Learning for Order-dispatching via Order-Vehicle Distribution Matching.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

CoLight: Learning Network-level Cooperation for Traffic Signal Control.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

Learning Adaptive Display Exposure for Real-Time Advertising.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

CoRide: Joint Order Dispatching and Fleet Management for Multi-Scale Ride-Hailing Platforms.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

Learning to Advertise for Organic Traffic Maximization in E-Commerce Product Feeds.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

CIKM 2019 Workshop on Artificial Intelligence in Transportation (AI in transportation).
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

Dynamically Fused Graph Network for Multi-hop Reasoning.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Deep Recurrent Survival Analysis.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Academic Reader: An Interactive Question Answering System on Academic Literatures.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Large-Scale Interactive Recommendation with Tree-Structured Policy Gradient.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Bidding Machine: Learning to Bid for Directly Optimizing Profits in Display Advertising.
IEEE Trans. Knowl. Data Eng., 2018

Delving Deep into Multiscale Pedestrian Detection via Single Scale Feature Maps.
Sensors, 2018

TGE-PS: Text-driven Graph Embedding with Pairs Sampling.
CoRR, 2018

Factorized Q-Learning for Large-Scale Multi-Agent Systems.
CoRR, 2018

Learning to Advertise with Adaptive Exposure via Constrained Two-Level Reinforcement Learning.
CoRR, 2018

CoT: Cooperative Training for Generative Modeling.
CoRR, 2018

Optimizing Sponsored Search Ranking Strategy by Deep Reinforcement Learning.
CoRR, 2018

Neural Text Generation: Past, Present and Beyond.
CoRR, 2018

Collaborative Filtering with Graph-based Implicit Feedback.
CoRR, 2018

Pedestrian Detection by Feature Selected Self-Similarity Features.
IEEE Access, 2018

A Bootstrapping Framework With Interactive Information Modeling for Network Alignment.
IEEE Access, 2018

Improving Negative Sampling for Word Representation using Self-embedded Features.
Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining, 2018

f<sub>BGD</sub>: Learning Embeddings From Positive Unlabeled Data with BGD.
Proceedings of the Thirty-Fourth Conference on Uncertainty in Artificial Intelligence, 2018

A Machine Learning Approach to Prevent Malicious Calls over Telephony Networks.
Proceedings of the 2018 IEEE Symposium on Security and Privacy, 2018

Texygen: A Benchmarking Platform for Text Generation Models.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

Generative Adversarial Nets for Information Retrieval: Fundamentals and Advances.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

QA4IE: A Question Answering Based Framework for Information Extraction.
Proceedings of the Semantic Web - ISWC 2018, 2018

Label-Aware Double Transfer Learning for Cross-Specialty Medical Named Entity Recognition.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Learning to Design Games: Strategic Environments in Reinforcement Learning.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

On the Equilibrium of Query Reformulation and Document Retrieval.
Proceedings of the 2018 ACM SIGIR International Conference on Theory of Information Retrieval, 2018

Mean Field Multi-Agent Reinforcement Learning.
Proceedings of the 35th International Conference on Machine Learning, 2018

Path-Level Network Transformation for Efficient Architecture Search.
Proceedings of the 35th International Conference on Machine Learning, 2018

Activation Maximization Generative Adversarial Nets.
Proceedings of the 6th International Conference on Learning Representations, 2018

AceKG: A Large-scale Knowledge Graph for Academic Data Mining.
Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018

Learning Multi-touch Conversion Attribution with Dual-attention Mechanisms for Online Advertising.
Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018

Real-Time Bidding with Multi-Agent Reinforcement Learning in Display Advertising.
Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018

A Study of AI Population Dynamics with Million-agent Reinforcement Learning.
Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, 2018

MAgent: A Many-Agent Reinforcement Learning Platform for Artificial Collective Intelligence.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

GraphGAN: Graph Representation Learning With Generative Adversarial Nets.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Wasserstein Distance Guided Representation Learning for Domain Adaptation.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

A Neural Stochastic Volatility Model.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Long Text Generation via Adversarial Training with Leaked Information.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Neural Link Prediction over Aligned Networks.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Efficient Architecture Search by Network Transformation.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Display Advertising with Real-Time Bidding (RTB) and Behavioural Targeting.
Found. Trends Inf. Retr., 2017

MAgent: A Many-Agent Reinforcement Learning Platform for Artificial Collective Intelligence.
CoRR, 2017

Face Transfer with Generative Adversarial Network.
CoRR, 2017

An Empirical Study of AI Population Dynamics with Million-agent Reinforcement Learning.
CoRR, 2017

Generative Adversarial Nets with Labeled Data by Activation Maximization.
CoRR, 2017

Learning to Design Games: Strategic Environments in Deep Reinforcement Learning.
CoRR, 2017

Adversarial Representation Learning for Domain Adaptation.
CoRR, 2017

Reinforcement Learning for Architecture Search by Network Transformation.
CoRR, 2017

We Make Choices We Think are Going to Save Us: Debate and Stance Identification for Online Breast Cancer CAM Discussions.
Proceedings of the 26th International Conference on World Wide Web Companion, 2017

Cataloguing Treatments Discussed and Used in Online Autism Communities.
Proceedings of the 26th International Conference on World Wide Web, 2017

Managing Risk of Bidding in Display Advertising.
Proceedings of the Tenth ACM International Conference on Web Search and Data Mining, 2017

Real-Time Bidding by Reinforcement Learning in Display Advertising.
Proceedings of the Tenth ACM International Conference on Web Search and Data Mining, 2017

IRGAN: A Minimax Game for Unifying Generative and Discriminative Information Retrieval Models.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Unsupervised Diverse Colorization via Generative Adversarial Networks.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2017

Dynamic Attention Deep Model for Article Recommendation by Learning Human Editors' Demonstration.
Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, NS, Canada, August 13, 2017

BoostFM: Boosted Factorization Machines for Top-N Feature-based Recommendation.
Proceedings of the 22nd International Conference on Intelligent User Interfaces, 2017

Aggregating Crowd Wisdoms with Label-aware Autoencoders.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

An adaptive coding and modulation based multicast scheme to reduce bandwidth consumption in the next generation satellite TV systems.
Proceedings of the 2017 International Conference on Computing, 2017

Volume Ranking and Sequential Selection in Programmatic Display Advertising.
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017

Content Recommendation by Noise Contrastive Transfer Learning of Feature Representation.
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017

SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Optimal real-time bidding for display advertising.
PhD thesis, 2016

Optimal Real-Time Bidding Frameworks Discussion.
CoRR, 2016

Learning text representation using recurrent convolutional neural network with highway layers.
CoRR, 2016

Feedback Control of Real-Time Display Advertising.
Proceedings of the Ninth ACM International Conference on Web Search and Data Mining, 2016

Optimizing Factorization Machines for Top-N Context-Aware Recommendations.
Proceedings of the Web Information Systems Engineering - WISE 2016, 2016

Functional Bid Landscape Forecasting for Display Advertising.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2016

Bid-aware Gradient Descent for Unbiased Learning with Censored Data in Display Advertising.
Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016

Product-Based Neural Networks for User Response Prediction.
Proceedings of the IEEE 16th International Conference on Data Mining, 2016

Deep Learning over Multi-field Categorical Data - - A Case Study on User Response Prediction.
Proceedings of the Advances in Information Retrieval, 2016

Implicit Look-Alike Modelling in Display Ads - Transfer Collaborative Filtering to CTR Estimation.
Proceedings of the Advances in Information Retrieval, 2016

Real-Time Bidding Based Display Advertising: Mechanisms and Algorithms.
Proceedings of the Advances in Information Retrieval, 2016

LambdaFM: Learning Optimal Ranking with Factorization Machines Using Lambda Surrogates.
Proceedings of the 25th ACM International Conference on Information and Knowledge Management, 2016

User Response Learning for Directly Optimizing Campaign Performance in Display Advertising.
Proceedings of the 25th ACM International Conference on Information and Knowledge Management, 2016

Collective Noise Contrastive Estimation for Policy Transfer Learning.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
An Empirical Study on Display Ad Impression Viewability Measurements.
CoRR, 2015

Risk-Hedged Venture Capital Investment Recommendation.
Proceedings of the 9th ACM Conference on Recommender Systems, 2015

Statistical Arbitrage Mining for Display Advertising.
Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015

Annotating Needles in the Haystack without Looking: Product Information Extraction from Emails.
Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015

2014
Bid keyword suggestion in sponsored search based on competitiveness and relevance.
Inf. Process. Manag., 2014

Real-Time Bidding Benchmarking with iPinYou Dataset.
CoRR, 2014

Optimal real-time bidding for display advertising.
Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2014

2013
MeDetect: A LOD-Based System for Collective Entity Annotation in Biomedicine.
Proceedings of the 2013 IEEE/WIC/ACM International Conferences on Web Intelligence, 2013

Optimizing top-n collaborative filtering via dynamic negative item sampling.
Proceedings of the 36th International ACM SIGIR conference on research and development in Information Retrieval, 2013

To personalize or not: a risk management perspective.
Proceedings of the Seventh ACM Conference on Recommender Systems, 2013

Interactive collaborative filtering.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

An empirical study of top-n recommendation for venture finance.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

2012
Advertising Keywords Recommendation for Short-Text Web Pages Using Wikipedia.
ACM Trans. Intell. Syst. Technol., 2012

SVDFeature: a toolkit for feature-based collaborative filtering.
J. Mach. Learn. Res., 2012

Serendipitous Personalized Ranking for Top-N Recommendation.
Proceedings of the 2012 IEEE/WIC/ACM International Conferences on Web Intelligence, 2012

Collaborative filtering with short term preferences mining.
Proceedings of the 35th International ACM SIGIR conference on research and development in Information Retrieval, 2012

MeDetect: Domain Entity Annotation in Biomedical References Using Linked Open Data.
Proceedings of the ISWC 2012 Posters & Demonstrations Track, 2012

A semantic approach to recommending text advertisements for images.
Proceedings of the Sixth ACM Conference on Recommender Systems, 2012

Local implicit feedback mining for music recommendation.
Proceedings of the Sixth ACM Conference on Recommender Systems, 2012

Joint optimization of bid and budget allocation in sponsored search.
Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2012

A Semantic-Driven Music Recommendation Model for Digital Photo Albums.
Proceedings of the Semantic Web and Web Science, 2012

Feature Based Informative Model for Discriminating Favorite Items from Unrated Ones.
Proceedings of the Web Technologies and Applications - 14th Asia-Pacific Web Conference, 2012

2011
Feature-Based Matrix Factorization
CoRR, 2011

LODDO: Using Linked Open Data Description Overlap to Measure Semantic Relatedness between Named Entities.
Proceedings of the Semantic Web - Joint International Semantic Technology Conference, 2011


  Loading...