Zhe Zhao

Orcid: 0000-0002-6847-0186

Affiliations:
  • Google AI, USA
  • University of Michigan, Department of Electrical Engineering and Computer Science, Ann Arbor, MI, USA (former)
  • Peking University, Beijing, China (former)


According to our database, Zhe Zhao authored at least 40 papers between 2010 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Wisdom of Committee: Distilling from Foundation Model to Specialized Application Model.
CoRR, 2024

LEVI: Generalizable Fine-tuning via Layer-wise Ensemble of Different Views.
CoRR, 2024

2023
Talking Models: Distill Pre-trained Knowledge to Downstream Models via Interactive Communication.
CoRR, 2023

COMET: Learning Cardinality Constrained Mixture of Experts with Trees and Local Search.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Fast as CHITA: Neural Network Pruning with Combinatorial Optimization.
Proceedings of the International Conference on Machine Learning, 2023

Multitask Ranking System for Immersive Feed and No More Clicks: A Case Study of Short-Form Video Recommendation.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

2022
Can Small Heads Help? Understanding and Improving Multi-Task Generalization.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

Transformer Memory as a Differentiable Search Index.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Improving Multi-Task Generalization via Regularizing Spurious Correlation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

HyperPrompt: Prompt-based Task-Conditioning of Transformers.
Proceedings of the International Conference on Machine Learning, 2022

2021
The Benchmark Lottery.
CoRR, 2021

DSelect-k: Differentiable Selection in the Mixture of Experts with Applications to Multi-Task Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Synthesizer: Rethinking Self-Attention for Transformer Models.
Proceedings of the 38th International Conference on Machine Learning, 2021

HyperGrid Transformers: Towards A Single Model for Multiple Tasks.
Proceedings of the 9th International Conference on Learning Representations, 2021

Learning-to-Rank with Partitioned Preference: Fast Estimation for the Plackett-Luce Model.
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

2020
Small Towers Make Big Differences.
CoRR, 2020

HyperGrid: Efficient Multi-Task Transformers with Grid-wise Decomposable Hyper Projections.
CoRR, 2020

Understanding and Improving Knowledge Distillation.
CoRR, 2020

Off-policy Learning in Two-stage Recommender Systems.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020

Multitask Mixture of Sequential Experts for User Activity Streams.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

2019
Recommending what video to watch next: a multitask ranking system.
Proceedings of the 13th ACM Conference on Recommender Systems, 2019

Sampling-bias-corrected neural modeling for large corpus item recommendations.
Proceedings of the 13th ACM Conference on Recommender Systems, 2019

Fairness in Recommendation Ranking through Pairwise Comparisons.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

SNR: Sub-Network Routing for Flexible Parameter Sharing in Multi-Task Learning.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Identify Shifts of Word Semantics through Bayesian Surprise.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

Modeling Task Relationships in Multi-task Learning with Multi-gate Mixture-of-Experts.
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

2017
Data Decisions and Theoretical Implications when Adversarially Learning Fair Representations.
CoRR, 2017

2016
Detecting Social Media Icebergs by Their Tips: Rumors, Persuasion Campaigns, and Information Needs.
Proceedings of the Ninth ACM International Conference on Web Search and Data Mining, 2016

2015
Towards the prediction problems of bursting hashtags on Twitter.
J. Assoc. Inf. Sci. Technol., 2015

Enquiring Minds: Early Detection of Rumors in Social Media from Enquiry Posts.
Proceedings of the 24th International Conference on World Wide Web, 2015

Improving User Topic Interest Profiles by Behavior Factorization.
Proceedings of the 24th International Conference on World Wide Web, 2015

2014
On the Real-time Prediction Problems of Bursting Hashtags in Twitter.
CoRR, 2014

Real-Time Predicting Bursting Hashtags on Twitter.
Proceedings of the Web-Age Information Management - 15th International Conference, 2014

Predicting bursts and popularity of hashtags in real-time.
Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2014

2013
Questions about questions: an empirical analysis of information needs on Twitter.
Proceedings of the 22nd International World Wide Web Conference, 2013

2012
A Framework for Similarity Search of Time Series Cliques with Natural Relations.
IEEE Trans. Knowl. Data Eng., 2012

Extracting representative motion flows for effective video retrieval.
Multim. Tools Appl., 2012

Recommending Flickr groups with social topic model.
Inf. Retr., 2012

2010
Multiple feature fusion for social media applications.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2010

Efficient similarity matching of Time Series Cliques with natural relations.
Proceedings of the 26th International Conference on Data Engineering, 2010

