Fei Sha

Affiliations:
  • University of Southern California, Los Angeles, USA


According to our database1, Fei Sha authored at least 169 papers between 1992 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
DySLIM: Dynamics Stable Learning by Invariant Measure for Chaotic Systems.
CoRR, 2024

2023
Drinking From a Firehose: Continual Learning With Web-Scale Natural Language.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2023

A Systematic Comparison of Syllogistic Reasoning in Humans and Language Models.
CoRR, 2023

The Impact of Depth and Width on Transformer Language Model Generalization.
CoRR, 2023

WeatherBench 2: A benchmark for the next generation of data-driven global weather models.
CoRR, 2023

SEEDS: Emulation of Weather Forecast Ensembles with Diffusion Models.
CoRR, 2023

Neural Ideal Large Eddy Simulation: Modeling Turbulence with Neural Stochastic Differential Equations.
CoRR, 2023

V2Meow: Meowing to the Visual Beat via Music Generation.
CoRR, 2023

Policy-Induced Self-Supervision Improves Representation Finetuning in Visual RL.
CoRR, 2023

Debias Coarsely, Sample Conditionally: Statistical Downscaling through Optimal Transport and Probabilistic Diffusion Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Neural Ideal Large Eddy Simulation: Modeling Turbulence with Neural Stochastic Differential Equations.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Pre-computed memory or on-the-fly encoding? A hybrid approach to retrieval augmentation makes the most of your compute.
Proceedings of the International Conference on Machine Learning, 2023

User-defined Event Sampling and Uncertainty Quantification in Diffusion Models for Physical Dynamical Systems.
Proceedings of the International Conference on Machine Learning, 2023

Evolve Smoothly, Fit Consistently: Learning Smooth Latent Dynamics For Advection-Dominated Systems.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Encyclopedic VQA: Visual questions about detailed properties of fine-grained categories.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

FiDO: Fusion-in-Decoder optimized for stronger performance and faster inference.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Adversarially robust subspace learning in the spiked covariance model.
Stat. Anal. Data Min., 2022

ALMA: Hierarchical Learning for Composite Multi-Agent Tasks.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Improving Compositional Generalization with Latent Structure and Data Augmentation.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Quickest Detection of the Change of Community via Stochastic Block Models.
Proceedings of the IEEE International Symposium on Information Theory, 2022

Mention Memory: incorporating textual knowledge into Transformers through entity mention attention.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Possibility Before Utility: Learning And Using Hierarchical Affordances.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Evaluating the Impact of Model Scale for Compositional Generalization in Semantic Parsing.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Generate-and-Retrieve: Use Your Predictions to Improve Retrieval for Semantic Parsing.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Policy Learning and Evaluation with Randomized Quasi-Monte Carlo.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

2021
Co-training Transformer with Videos and Images Improves Action Recognition.
CoRR, 2021

Learning to Generalize Compositionally by Transferring Across Semantic Parsing Tasks.
CoRR, 2021

HyperPINN: Learning parameterized differential equations with physics-informed hypernetworks.
CoRR, 2021

Embedding Adaptation is Still Needed for Few-Shot Learning.
CoRR, 2021

ReadTwice: Reading Very Large Documents with Memories.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning.
Proceedings of the 38th International Conference on Machine Learning, 2021

CoMSum and SIBERT: A Dataset and Neural Model for Query-Based Multi-document Summarization.
Proceedings of the 16th International Conference on Document Analysis and Recognition, 2021

Systematic Generalization on gSCAN: What is Nearly Solved and What is Next?
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Visually Grounded Concept Composition.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

DOCENT: Learning Self-Supervised Entity Representations from Large Document Collections.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

When MAML Can Adapt Fast and How to Assist When It Cannot.
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

2020
Classifier and Exemplar Synthesis for Zero-Shot Learning.
Int. J. Comput. Vis., 2020

A Hierarchical Multi-Modal Encoder for Moment Localization in Video Corpus.
CoRR, 2020

AQuaMuSe: Automatically Generating Datasets for Query-Based Multi-Document Summarization.
CoRR, 2020

Uncertainty Estimation with Infinitesimal Jackknife, Its Distribution and Mean-Field Approximation.
CoRR, 2020

AI-QMIX: Attention and Imagination for Dynamic Multi-Agent Reinforcement Learning.
CoRR, 2020

Visual Storytelling via Predicting Anchor Word Embeddings in the Stories.
CoRR, 2020

Learning to Represent Image and Text with Denotation Graph.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Few-Shot Learning via Embedding Adaptation With Set-to-Set Functions.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Amortized Inference of Variational Bounds for Learning Noisy-OR.
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020

BabyWalk: Going Farther in Vision-and-Language Navigation by Taking Baby Steps.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Kernel Approximation Methods for Speech Recognition.
J. Mach. Learn. Res., 2019

Decoupling Adaptation from Modeling with Meta-Optimizers for Meta Learning.
CoRR, 2019

Topic Augmented Generator for Abstractive Summarization.
CoRR, 2019

Neural Theorem Provers Do Not Learn Rules Without Exploration.
CoRR, 2019

Learning Classifier Synthesis for Generalized Few-Shot Learning.
CoRR, 2019

Coordinated Exploration via Intrinsic Rewards for Multi-Agent Reinforcement Learning.
CoRR, 2019

Synthesized Policies for Transfer and Adaptation across Tasks and Environments.
CoRR, 2019

Hyper-parameter Tuning under a Budget Constraint.
CoRR, 2019

Hyper-parameter Tuning under a Budget Constraint.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Actor-Attention-Critic for Multi-Agent Reinforcement Learning.
Proceedings of the 36th International Conference on Machine Learning, 2019

A Bayesian Theory of Mind Approach to Nonverbal Communication.
Proceedings of the 14th ACM/IEEE International Conference on Human-Robot Interaction, 2019

2018
Learning Embedding Adaptation for Few-Shot Learning.
CoRR, 2018

Toward an Internet of Battlefield Things: A Resilience Perspective.
Computer, 2018

Synthesize Policies for Transfer and Adaptation across Tasks and Environments.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Being Negative but Constructively: Lessons Learnt from Creating Better Visual Question Answering Datasets.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018


A Probabilistic Model for Joint Learning of Word Embeddings from Texts and Images.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Cross-Modal and Hierarchical Modeling of Video and Text.
Proceedings of the Computer Vision - ECCV 2018, 2018

Retrospective Encoders for Video Summarization.
Proceedings of the Computer Vision - ECCV 2018, 2018

Learning Answer Embeddings for Visual Question Answering.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Cross-Dataset Adaptation for Visual Question Answering.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Aiming to Know You Better Perhaps Makes Me a More Engaging Dialogue Partner.
Proceedings of the 22nd Conference on Computational Natural Language Learning, 2018

Multi-Task Learning for Sequence Tagging: An Empirical Study.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

Robust Active Label Correction.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2018

2017
Geodesic Flow Kernel and Landmarks: Kernel Methods for Unsupervised Domain Adaptation.
Proceedings of the Domain Adaptation in Computer Vision Applications., 2017

Aligning Where to See and What to Tell: Image Captioning with Region-Based Attention and Scene-Specific Contexts.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

LabelBank: Revisiting Global Perspectives for Semantic Segmentation.
CoRR, 2017

An Empirical Study on The Properties of Random Bases for Kernel Methods.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Predicting Visual Exemplars of Unseen Classes for Zero-Shot Learning.
Proceedings of the IEEE International Conference on Computer Vision, 2017

FastMask: Segment Multi-scale Object Candidates in One Shot.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Attention Correctness in Neural Image Captioning.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Active multi-view object recognition: A unifying view on online feature selection and view planning.
Robotics Auton. Syst., 2016

FastMask: Segment Object Multi-scale Candidates in One Shot.
CoRR, 2016

Recalling Holistic Information for Semantic Segmentation.
CoRR, 2016

Understanding Image and Text Simultaneously: a Dual Vision-Language Machine Comprehension Task.
CoRR, 2016

Supervised Word Mover's Distance.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

A comparison between deep neural nets and kernel acoustic models for speech recognition.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Video Summarization with Long Short-Term Memory.
Proceedings of the Computer Vision - ECCV 2016, 2016

An Empirical Study and Analysis of Generalized Zero-Shot Learning for Object Recognition in the Wild.
Proceedings of the Computer Vision - ECCV 2016, 2016

Summary Transfer: Exemplar-Based Subset Selection for Video Summarization.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Synthesized Classifiers for Zero-Shot Learning.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Metric Learning for Ordinal Data.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

The improvement of page sorting algorithm for music users in Nutch.
Proceedings of the 15th IEEE/ACIS International Conference on Computer and Information Science, 2016

2015
Marginalizing stacked linear denoising autoencoders.
J. Mach. Learn. Res., 2015

Aligning where to see and what to tell: image caption with region-based attention and scene factorization.
CoRR, 2015

Large-Margin Determinantal Point Processes.
Proceedings of the Thirty-First Conference on Uncertainty in Artificial Intelligence, 2015

A Distributed Frank-Wolfe Algorithm for Communication-Efficient Sparse Learning.
Proceedings of the 2015 SIAM International Conference on Data Mining, Vancouver, BC, Canada, April 30, 2015

Active Multi-view Object Recognition and Online Feature Selection.
Proceedings of the Robotics Research, 2015

Exponential Integration for Hamiltonian Monte Carlo.
Proceedings of the 32nd International Conference on Machine Learning, 2015

Designing a socially assistive robot for personalized number concepts learning in preschool children.
Proceedings of the 2015 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics, 2015

Similarity Learning for High-Dimensional Sparse Data.
Proceedings of the Eighteenth International Conference on Artificial Intelligence and Statistics, 2015

2014
Learning Kernels for Unsupervised Domain Adaptation with Applications to Visual Object Recognition.
Int. J. Comput. Vis., 2014

How to Scale Up Kernel Methods to Be As Good As Deep Neural Nets.
CoRR, 2014

Distributed Frank-Wolfe Algorithm: A Unified Framework for Communication-Efficient Sparse Learning.
CoRR, 2014

Diverse Sequential Subset Selection for Supervised Video Summarization.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Two-Stage Metric Learning.
Proceedings of the 31th International Conference on Machine Learning, 2014

Demystifying Information-Theoretic Clustering.
Proceedings of the 31th International Conference on Machine Learning, 2014

Marginalized Denoising Auto-encoders for Nonlinear Representations.
Proceedings of the 31th International Conference on Machine Learning, 2014

Discriminative non-negative matrix factorization for single-channel speech separation.
Proceedings of the IEEE International Conference on Acoustics, 2014

Decorrelating Semantic Visual Attributes by Resisting the Urge to Share.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Sparse Compositional Metric Learning.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2013
An alternative text representation to TF-IDF and Bag-of-Words
CoRR, 2013

Reshaping Visual Datasets for Domain Adaptation.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Similarity Component Analysis.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Analogy-preserving Semantic Embedding for Visual Object Categorization.
Proceedings of the 30th International Conference on Machine Learning, 2013

Connecting the Dots with Landmarks: Discriminatively Learning Domain-Invariant Features for Unsupervised Domain Adaptation.
Proceedings of the 30th International Conference on Machine Learning, 2013

Deformable Spatial Pyramid Matching for Fast Dense Correspondences.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

2012
Cloud-enabled privacy-preserving collaborative learning for mobile sensing.
Proceedings of the 10th ACM Conference on Embedded Network Sensor Systems, 2012

Non-linear Metric Learning.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Semantic Kernel Forests from Multiple Taxonomies.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Predicting Likability of Speakers with Gaussian Processes.
Proceedings of the INTERSPEECH 2012, 2012

Information-Theoretical Learning of Discriminative Clusters for Unsupervised Domain Adaptation.
Proceedings of the 29th International Conference on Machine Learning, 2012

Marginalized Denoising Autoencoders for Domain Adaptation.
Proceedings of the 29th International Conference on Machine Learning, 2012

Geodesic flow kernel for unsupervised domain adaptation.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

From sBoW to dCoT marginalized encoders for text representation.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

Learning the Kernel Matrix with Low-Rank Multiplicative Shaping.
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

2011
Predicting Pedestrian Counts in Crowded Scenes With Rich and High-Dimensional Features.
IEEE Trans. Intell. Transp. Syst., 2011

Information Theoretical Clustering via Semidefinite Programming.
Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, 2011

Learning Discriminative Metrics via Generative Models and Kernel Learning
CoRR, 2011

Rapid Feature Learning with Stacked Linear Denoisers
CoRR, 2011

Learning a Tree of Metrics with Disjoint Visual Features.
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

Online learning of large margin hidden Markov models for automatic speech recognition.
Proceedings of the 2011 Symposium on Machine Learning in Speech and Language Processing, 2011

A Study of Web Services Performance Prediction: A Client's Perspective.
Proceedings of the MASCOTS 2011, 2011

Learning with Whom to Share in Multi-task Feature Learning.
Proceedings of the 28th International Conference on Machine Learning, 2011

Speech recognitionwith segmental conditional random fields: A summary of the JHU CLSP 2010 Summer Workshop.
Proceedings of the IEEE International Conference on Acoustics, 2011

Application of moment invariants in checking the overall dimension of the head cover.
Proceedings of the International Conference on Electronic and Mechanical Engineering and Information Technology, 2011

Sharing features between objects and their attributes.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Metric learning for reinforcement learning agents.
Proceedings of the 10th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2011), 2011

2010
Convex Optimizations for Distance Metric Learning and Pattern Classification [Applications Corner].
IEEE Signal Process. Mag., 2010

Online Learning and Acoustic Feature Adaptation in Large-Margin Hidden Markov Models.
IEEE J. Sel. Top. Signal Process., 2010

Locally Linear Denoising on Image Manifolds.
Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, 2010

Active site prediction using evolutionary and structural information.
Bioinform., 2010

Unsupervised Kernel Dimension Reduction.
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

Invited Talk Abstracts.
Proceedings of the Manifold Learning and Its Applications, 2010

2009
Robust web extraction: an approach based on a probabilistic tree-edit model.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2009

A fast online algorithm for large margin training of continuous density hidden Markov models.
Proceedings of the INTERSPEECH 2009, 2009

Matrix updates for perceptron training of continuous density hidden Markov models.
Proceedings of the 26th Annual International Conference on Machine Learning, 2009

Large-margin feature adaptation for automatic speech recognition.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

2008
DiscLDA: Discriminative Learning for Dimensionality Reduction and Classification.
Proceedings of the Advances in Neural Information Processing Systems 21, 2008

2007
Multiplicative Updates for Nonnegative Quadratic Programming.
Neural Comput., 2007

Multiplicative Updates for <i>L</i><sub>1</sub>-Regularized Linear and Logistic Regression.
Proceedings of the Advances in Intelligent Data Analysis VII, 2007

Regression on manifolds using kernel dimension reduction.
Proceedings of the Machine Learning, 2007

Learning Globally-Consistent Local Distance Functions for Shape-Based Image Retrieval and Classification.
Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

Comparison of Large Margin Training to Other Discriminative Methods for Phonetic Recognition by Hidden Markov Models.
Proceedings of the IEEE International Conference on Acoustics, 2007

2006
Graph Laplacian Regularization for Large-Scale Semidefinite Programming.
Proceedings of the Advances in Neural Information Processing Systems 19, 2006

Large Margin Hidden Markov Models for Automatic Speech Recognition.
Proceedings of the Advances in Neural Information Processing Systems 19, 2006

Large Margin Gaussian Mixture Modeling for Phonetic Classification and Recognition.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Spectral Methods for Dimensionality Reduction.
Proceedings of the Semi-Supervised Learning, 2006

2005
Analysis and extension of spectral methods for nonlinear dimensionality reduction.
Proceedings of the Machine Learning, 2005

2004
Real-Time Pitch Determination of One or More Voices by Nonnegative Matrix Factorization.
Proceedings of the Advances in Neural Information Processing Systems 17 [Neural Information Processing Systems, 2004

Learning a kernel matrix for nonlinear dimensionality reduction.
Proceedings of the Machine Learning, 2004

Multiband statistical learning for f<sub>0</sub> estimation in speech.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
Shallow Parsing with Conditional Random Fields.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2003

Statistical signal processing with nonnegativity constraints.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Multiplicative Updates for Large Margin Classifiers.
Proceedings of the Computational Learning Theory and Kernel Machines, 2003

2002
Multiplicative Updates for Nonnegative Quadratic Programming in Support Vector Machines.
Proceedings of the Advances in Neural Information Processing Systems 15 [Neural Information Processing Systems, 2002

2000
A Semi-Structured Data Cartridge for Relational Databases.
Proceedings of the 16th International Conference on Data Engineering, San Diego, California, USA, February 28, 2000

1999
Miro Web: Integrating Multiple Data Sources through Semistructured Data Types.
Proceedings of the VLDB'99, 1999

XML-based Components for Federating Multiple Heterogeneous Data Sources.
Proceedings of the Conceptual Modeling, 1999

Managing Semistructured Data in Object-Relational DBMS.
Proceedings of the 15èmes Journées Bases de Données Avancées, 1999

1997
Using Conceptual Modeling and Intelligent Agents to Integrate Semi-structured Documents in Federated Databases.
Proceedings of the Conceptual Modeling, 1997

1996
Calibrating the Query Optimizer Cost Model of IRO-DB, an Object-Oriented Federated Database System.
Proceedings of the 12èmes Journées Bases de Données Avancées, 1996

1992
Performance Analysis of an Access Control Strategy in Integrated Networks.
Comput. Networks ISDN Syst., 1992


  Loading...