Tianlong Chen

Orcid: 0000-0001-7774-8197

Affiliations:
  • University of North Carolina at Chapel Hill, Department of Computer Science, Chapel Hill, NC, USA
  • Massachusetts Institute of Technology (MIT), Computer Science and Artificial Intelligence Laboratory (CSAIL), Cambridge, MA, USA (former)
  • Harvard University, Department of Biomedical Informatics, Boston, MA, USA (former)
  • University of Texas at Austin, TX, USA (PhD 2023)


According to our database1, Tianlong Chen authored at least 368 papers between 2019 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
Out-of-Distribution-Resistant Evaluations for Explanations of Graph Neural Networks.
IEEE Trans. Pattern Anal. Mach. Intell., July, 2026

Measuring Real-World Prompt Injection Attacks in LLM-based Resume Screening.
CoRR, May, 2026

Chreode: A Cell World Model for One-Step Temporal Dynamics and Perturbation Prediction.
CoRR, May, 2026

Detecting Unfaithful Chain-of-Thought via Circuit-Guided Internal-External Discrepancy.
CoRR, May, 2026

ConceptM<sup>3</sup>oE: Concept-Guided Multimodal Mixture of Experts for Interpretable Computational Pathology.
CoRR, May, 2026

GEMQ: Global Expert-Level Mixed-Precision Quantization for MoE LLMs.
CoRR, May, 2026

MuteBench: Modality Unavailability Tolerance Evaluation for Incomplete Multimodal Fusion.
CoRR, May, 2026

TRUST: A Framework for Decentralized AI Service v.0.1.
CoRR, April, 2026

Vision-Language-Action in Robotics: A Survey of Datasets, Benchmarks, and Data Engines.
CoRR, April, 2026

Clinically-Informed Modeling for Pediatric Brain Tumor Classification from Whole-Slide Histopathology Images.
CoRR, April, 2026

Graph-of-Agents: A Graph-based Framework for Multi-Agent LLM Collaboration.
CoRR, April, 2026

PRIME: Prototype-Driven Multimodal Pretraining for Cancer Prognosis with Missing Modalities.
CoRR, April, 2026

Benchmarking Multi-turn Medical Diagnosis: Hold, Lure, and Self-Correction.
CoRR, April, 2026

Understanding the Role of Hallucination in Reinforcement Post-Training of Multimodal Reasoning Models.
CoRR, April, 2026

LangMARL: Natural Language Multi-Agent Reinforcement Learning.
CoRR, April, 2026

Chimera: Latency- and Performance-Aware Multi-agent Serving for Heterogeneous LLMs.
CoRR, March, 2026

Probing to Refine: Reinforcement Distillation of LLMs via Explanatory Inversion.
CoRR, March, 2026

Tabular LLMs for Interpretable Few-Shot Alzheimer's Disease Prediction with Multimodal Biomedical Data.
CoRR, March, 2026

Progressive Residual Warmup for Language Model Pretraining.
CoRR, March, 2026

Farther the Shift, Sparser the Representation: Analyzing OOD Mechanisms in LLMs.
CoRR, March, 2026

PathMoE: Interpretable Multimodal Interaction Experts for Pediatric Brain Tumor Classification.
CoRR, March, 2026

Why Diffusion Language Models Struggle with Truly Parallel (Non-Autoregressive) Decoding?
CoRR, February, 2026

A Replicate-and-Quantize Strategy for Plug-and-Play Load Balancing of Sparse Mixture-of-Experts LLMs.
CoRR, February, 2026

Can Multimodal LLMs See Science Instruction? Benchmarking Pedagogical Reasoning in K-12 Classroom Videos.
CoRR, February, 2026

PETS: A Principled Framework Towards Optimal Trajectory Allocation for Efficient Test-Time Self-Consistency.
CoRR, February, 2026

FlashSchNet: Fast and Accurate Coarse-Grained Neural Network Molecular Dynamics.
CoRR, February, 2026

Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models.
CoRR, February, 2026

NeuroCanvas: VLLM-Powered Robust Seizure Detection by Reformulating Multichannel EEG as Image.
CoRR, February, 2026

TMS: Trajectory-Mixed Supervision for Reward-Free, On-Policy SFT.
CoRR, February, 2026

Dynamic Mix Precision Routing for Efficient Multi-step LLM Interaction.
CoRR, February, 2026

Geometry- and Relation-Aware Diffusion for EEG Super-Resolution.
CoRR, February, 2026

Towards Building Non-Fine-Tunable Foundation Models.
CoRR, February, 2026

From Models to Systems: A Comprehensive Survey of Efficient Multimodal Learning.
Trans. Mach. Learn. Res., 2026

$\texttt{LucidAtlas}$: Learning Uncertainty-Aware, Covariate-Disentangled, Individualized Atlas Representations.
Trans. Mach. Learn. Res., 2026

SDoH-GPT: using large language models to extract social determinants of health.
J. Am. Medical Informatics Assoc., 2026

GatorSC: multi-scale cell and gene graphs with mixture-of-experts fusion for single-cell transcriptomics.
Briefings Bioinform., 2026

Explaining the 'Unexplainable' Large Language Models.
Proceedings of the Nineteenth ACM International Conference on Web Search and Data Mining, 2026

EdgeTune: Efficient On-Device LLM Personalization at the Edge.
Proceedings of the 2026 ACM/IEEE International Conference on Embedded Artificial Intelligence and Sensing Systems, 2026

Dialogue is Better Than Monologue: Instructing Meidcal LLMs via Strategic Conversations.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2026, 2026

Can Multimodal LLMs 'See' Science Instruction? Benchmarking Pedagogical Reasoning in K-12 Classroom Videos.
Proceedings of the Artificial Intelligence in Education - 27th International Conference, 2026

Vulnerability-Aware Robust Multimodal Adversarial Training.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

COIN: Uncertainty-Guarding Selective Question Answering for Foundation Models with Provable Risk Guarantees.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Model Editing as a Double-Edged Sword: Steering Agent Behavior Toward Beneficence or Harm.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

OR-R1: Automating Modeling and Solving of Operations Research Optimization Problem via Test-Time Reinforcement Learning.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Multimodal Fusion of Regional Brain Experts for Interpretable Alzheimer's Disease Diagnosis.
CoRR, December, 2025

LEC: Linear Expectation Constraints for False-Discovery Control in Selective Prediction and Routing Systems.
CoRR, December, 2025

PPBoost: Progressive Prompt Boosting for Text-Driven Medical Image Segmentation.
CoRR, November, 2025

Fairness in Multi-modal Medical Diagnosis with Demonstration Selection.
CoRR, November, 2025

A Space-Time Transformer for Precipitation Forecasting.
CoRR, November, 2025

Unlocking Dynamic Inter-Client Spatial Dependencies: A Federated Spatio-Temporal Graph Learning Method for Traffic Flow Forecasting.
CoRR, November, 2025

Beyond Redundancy: Diverse and Specialized Multi-Expert Sparse Autoencoder.
CoRR, November, 2025

ResearchGPT: Benchmarking and Training LLMs for End-to-End Computer Science Research Workflows.
CoRR, October, 2025

TRUST: A Decentralized Framework for Auditing Large Language Model Reasoning.
CoRR, October, 2025

Leave It to the Experts: Detecting Knowledge Distillation via MoE Expert Signatures.
CoRR, October, 2025

SARHAchat: An LLM-Based Chatbot for Sexual and Reproductive Health Counseling.
CoRR, October, 2025

Can GRPO Help LLMs Transcend Their Pretraining Origin?
CoRR, October, 2025

Metacognitive Self-Correction for Multi-Agent System via Prototype-Guided Next-Execution Reconstruction.
CoRR, October, 2025

EditCast3D: Single-Frame-Guided 3D Editing with Video Propagation and View Selection.
CoRR, October, 2025

Multi-Agent Debate for LLM Judges with Adaptive Stability Detection.
CoRR, October, 2025

AsyncSpade: Efficient Test-Time Scaling with Asynchronous Sparse Decoding.
CoRR, October, 2025

FaithCoT-Bench: Benchmarking Instance-Level Faithfulness of Chain-of-Thought Reasoning.
CoRR, October, 2025

GEM: 3D Gaussian Splatting for Efficient and Accurate Cryo-EM Reconstruction.
CoRR, September, 2025

Quantum Variational Activation Functions Empower Kolmogorov-Arnold Networks.
CoRR, September, 2025

Transferring Expert Cognitive Models to Social Robots via Agentic Concept Bottleneck Models.
CoRR, August, 2025

Enabling Few-Shot Alzheimer's Disease Diagnosis on Tabular Biomarker Data with LLMs.
CoRR, July, 2025

SparseC-AFM: a deep learning method for fast and accurate characterization of MoS<sub>2</sub> with C-AFM.
CoRR, July, 2025

SPATIA: Multimodal Model for Prediction and Generation of Spatial Cell Phenotypes.
CoRR, July, 2025

GPAS: Accelerating Convergence of LLM Pretraining via Gradient-Preserving Activation Scaling.
CoRR, June, 2025

Double-Checker: Enhancing Reasoning of Slow-Thinking LLMs via Self-Critical Fine-Tuning.
CoRR, June, 2025

Model Editing as a Double-Edged Sword: Steering Agent Ethical Behavior Toward Beneficence or Harm.
CoRR, June, 2025

UProp: Investigating the Uncertainty Propagation of LLMs in Multi-Step Agentic Decision-Making.
CoRR, June, 2025

An Empirical Study of Federated Prompt Learning for Vision Language Model.
CoRR, May, 2025

VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction.
CoRR, May, 2025

DOGe: Defensive Output Generation for LLM Protection Against Knowledge Distillation.
CoRR, May, 2025

The Quest for Efficient Reasoning: A Data-Centric Benchmark to CoT Distillation.
CoRR, May, 2025

Occult: Optimizing Collaborative Communication across Experts for Accelerated Parallel MoE Training and Inference.
CoRR, May, 2025

DD-Ranking: Rethinking the Evaluation of Dataset Distillation.
CoRR, May, 2025

GroverGPT-2: Simulating Grover's Algorithm via Chain-of-Thought Reasoning and Quantum-Native Tokenization.
CoRR, May, 2025

A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment.
CoRR, April, 2025

Efficient MAP Estimation of LLM Judgment Performance with Prior Transfer.
CoRR, April, 2025

Are We Merely Justifying Results ex Post Facto? Quantifying Explanatory Inversion in Post-Hoc Model Explanations.
CoRR, April, 2025

Finding Fantastic Experts in MoEs: A Unified Study for Expert Dropping Strategies and Observations.
CoRR, April, 2025

More is Less: The Pitfalls of Multi-Model Synthetic Preference Data in DPO Safety Alignment.
CoRR, April, 2025

LightDefense: A Lightweight Uncertainty-Driven Defense against Jailbreaks via Shifted Token Distribution.
CoRR, April, 2025

Agents Under Siege: Breaking Pragmatic Multi-Agent LLM Systems with Optimized Prompt Attacks.
CoRR, April, 2025

ORAL: Prompting Your Large-Scale LoRAs via Conditional Recurrent Diffusion.
CoRR, March, 2025

Make Optimization Once and for All with Fine-grained Guidance.
CoRR, March, 2025

H3PIMAP: A Heterogeneity-Aware Multi-Objective DNN Mapping Framework on Electronic-Photonic Processing-in-Memory Architectures.
CoRR, March, 2025

Symbolic Mixture-of-Experts: Adaptive Skill-based Routing for Heterogeneous Reasoning.
CoRR, March, 2025

NeuroSymAD: A Neuro-Symbolic Framework for Interpretable Alzheimer's Disease Diagnosis.
CoRR, March, 2025

Stable-SPAM: How to Train in 4-Bit More Stably than 16-Bit Adam.
CoRR, February, 2025

LucidAtlas$: Learning Uncertainty-Aware, Covariate-Disentangled, Individualized Atlas Representations.
CoRR, February, 2025

Symbiotic Cooperation for Web Agents: Harnessing Complementary Strengths of Large and Small LLMs.
CoRR, February, 2025

Continually Evolved Multimodal Foundation Models for Cancer Prognosis.
CoRR, January, 2025

Dialogue is Better Than Monologue: Instructing Medical LLMs via Strategical Conversations.
CoRR, January, 2025

Layer-Level Self-Exposure and Patch: Affirmative Token Mitigation for Jailbreak Attack Defense.
CoRR, January, 2025

The Efficiency vs. Accuracy Trade-off: Optimizing RAG-Enhanced LLM Recommender Systems Using Multi-Head Early Exit.
CoRR, January, 2025

GroverGPT: A Large Language Model with 8 Billion Parameters for Quantum Searching.
CoRR, January, 2025

Data for "CryoNeRF: neural radiance field for homogeneous and heterogeneous cryo-EM reconstruction".
Dataset, January, 2025

Data for "CryoNeRF: neural radiance field for homogeneous and heterogeneous cryo-EM reconstruction".
Dataset, January, 2025

A Survey on Model MoErging: Recycling and Routing Among Specialized Experts for Collaborative Learning.
Trans. Mach. Learn. Res., 2025

Pushing the Limits of Sparsity: A Bag of Tricks for Extreme Pruning.
Trans. Mach. Learn. Res., 2025

A natural language processing-based approach for early detection of heart failure onset using electronic health records.
Knowl. Based Syst., 2025

GatorCLR: Personalized predictions of patient outcomes on electronic health records using self-supervised contrastive graph representation.
J. Biomed. Informatics, 2025

Word-Sequence Entropy: Towards uncertainty estimation in free-form medical question answering applications and beyond.
Eng. Appl. Artif. Intell., 2025

Deep Learning for Accurate Diagnosis of Viral Infections through scRNA-seq Analysis: A Comprehensive Benchmark Study.
J. Data-centric Mach. Learn. Res., 2025

A Survey on Reinforcement Learning for Optimal Decision-Making and Control of Intelligent Vehicles.
CAAI Trans. Intell. Technol., 2025

A Decentralized Framework for Auditing Large Language Model Reasoning.
Proceedings of the 7th IEEE International Conference on Trust, 2025

Protecting Privacy against Membership Inference Attack with LLM Fine-tuning through Flatness.
Proceedings of the 2025 SIAM International Conference on Data Mining, 2025

One Token Embedding Is Enough to Deadlock Your Large Reasoning Model.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

<tt>BetaConform</tt>: Efficient MAP Estimation of LLM Ensemble Judgment Performance with Prior Transfer.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Mozart: Modularized and Efficient MoE Training on 3.5D Wafer-Scale Chiplet Architectures.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

IndustryEQA: Pushing the Frontiers of Embodied Question Answering in Industrial Scenarios.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert Parallelism Design.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

BPO: Towards Balanced Preference Optimization between Knowledge Breadth and Depth in Alignment.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Layer-Level Self-Exposure and Patch: Affirmative Token Mitigation for Jailbreak Attack Defense.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

GuideLLM: Exploring LLM-Guided Conversation with Applications in Autobiography Interviewing.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

RTGS: Real-Time 3D Gaussian Splatting SLAM via Multi-Level Redundancy Reduction.
Proceedings of the 58th IEEE/ACM International Symposium on Microarchitecture, 2025

Oldie but Goodie: Re-illuminating Label Propagation on Graphs with Partially Observed Features.
Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.2, 2025

A Survey on Trustworthy LLM Agents: Threats and Countermeasures.
Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.2, 2025

<i>MerRec: </i> A Large-scale Multipurpose Mercari Dataset for Consumer-to-Consumer Recommendation Systems.
Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.1, 2025

G-Designer: Architecting Multi-agent Communication Topologies via Graph Neural Networks.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

I2MoE: Interpretable Multimodal Interaction-aware Mixture-of-Experts.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Occult: Optimizing Collaborative Communications across Experts for Accelerated Parallel MoE Training and Inference.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Modalities Contribute Unequally: Enhancing Medical Multi-modal Learning through Adaptive Modality Token Re-balancing.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Cut the Crap: An Economical Communication Pipeline for LLM-based Multi-Agent Systems.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Graph Sparsification via Mixture of Graphs.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

PortLLM: Personalizing Evolving Large Language Models with Training-Free and Portable Model Patches.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Adapt-∞: Scalable Continual Multimodal Instruction Tuning via Dynamic Data Selection.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Proactive Privacy Amnesia for Large Language Models: Safeguarding PII with Negligible Impact on Model Utility.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Composable Interventions for Language Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Thought Graph: Balancing Specificity and Uncertainty in LLM-Based Gene Set Annotation.
Proceedings of the 13th IEEE International Conference on Healthcare Informatics, 2025

Towards Stabilized and Efficient Diffusion Transformers Through Long-Skip-Connections With Spectral Constraints.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

HoloZip: High Hologram Compression via Latent-of-Latent Coding.
Proceedings of the IEEE International Conference on Computational Photography, 2025

AnyMAC: Cascading Flexible Multi-Agent Collaboration via Next-Agent Prediction.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

FIER: Fine-Grained and Efficient KV Cache Retrieval for Long-context LLM Inference.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

ORAL: Prompting Your Large-Scale LoRAs via Conditional Recurrent Diffusion.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

Bag of Tricks for Sparse Mixture-of-Experts: A Benchmark Across Reasoning, Efficiency, and Safety.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

MedHallu: A Comprehensive Benchmark for Detecting Medical Hallucinations in Large Language Models.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Task-Aware Resolution Optimization for Visual Large Language Models.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Glider: Global and Local Instruction-Driven Expert Router.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Bit-Flip Error Resilience in LLMs: A Comprehensive Analysis and Defense Framework.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

EQA-RM: A Generative Embodied Reward Model with Test-time Scaling.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Window Token Concatenation for Efficient Visual Large Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

Sparse MoE as a New Treatment: Addressing Forgetting, Fitting, Learning Issues in Multi-Modal Multi-Task Learning.
Proceedings of the Conference on Parsimony and Learning, 2025

You Only Debias Once: Towards Flexible Accuracy-Fairness Trade-offs at Inference Time.
Proceedings of the Conference on Parsimony and Learning, 2025

GraphRCG: Self-Conditioned Graph Generation.
Proceedings of the 34th ACM International Conference on Information and Knowledge Management, 2025

Enabling Few-Shot Alzheimer's Disease Diagnosis on Biomarker Data with Tabular LLMs.
Proceedings of the 16th ACM International Conference on Bioinformatics, 2025

The Efficiency vs. Accuracy Trade-off: Optimizing RAG-Enhanced LLM Recommender Systems Using Multi-Head Early Exit.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

SCALE: Towards Collaborative Content Analysis in Social Science with Large Language Model Agents and Human Intervention.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

SConU: Selective Conformal Uncertainty in Large Language Models.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Agents Under Siege: Breaking Pragmatic Multi-Agent LLM Systems with Optimized Prompt Attacks.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

UQ-Merge: Uncertainty Guided Multimodal Large Language Model Merging.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

GRNFormer: A Biologically-Guided Framework for Integrating Gene Regulatory Networks into RNA Foundation Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Unveiling Privacy Risks in Multi-modal Large Language Models: Task-specific Vulnerabilities and Mitigation Challenges.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Vision Language Model Helps Private Information De-Identification in Vision Data.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Spatial Coordinates as a Cell Language: A Multi-Sentence Framework for Imaging Mass Cytometry Analysis.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

In Prospect and Retrospect: Reflective Memory Management for Long-term Personalized Dialogue Agents.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

DLF: Disentangled-Language-Focused Multimodal Sentiment Analysis.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

BrainMAP: Learning Multiple Activation Pathways in Brain Networks.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

Evaluating topological fitness of human brain-inspired sub-circuits in Echo State Networks.
Proceedings of the AAAI Bridge Program on AI for Medicine and Healthcare, 2025

Sparse Transfer Learning Accelerates and Enhances Certified Robustness: A Comprehensive Study.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

Visual Prompting Upgrades Neural Network Sparsification: A Data-Model Perspective.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

Mapping from Meaning: Addressing the Miscalibration of Prompt-Sensitive Language Models.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

Breaking the Resource Monopoly from Industries: Sustainable and Reliable LLM Serving by Recycling Outdated and Resource-Constrained GPUs.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

Tuning-Free Accountable Intervention for LLM Deployment - a Metacognitive Approach.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Inductive Lottery Ticket Learning for Graph Neural Networks.
J. Comput. Sci. Technol., November, 2024

One is Not Enough: Parameter-Efficient Fine-Tuning With Multiplicative Sparse Factorization.
IEEE J. Sel. Top. Signal Process., September, 2024

Single-cell RNA sequencing data imputation using bi-level feature propagation.
Briefings Bioinform., May, 2024

Unlearning Sensitive Information in Multimodal LLMs: Benchmark and Attack-Defense Evaluation.
Trans. Mach. Learn. Res., 2024

Political-LLM: Large Language Models in Political Science.
CoRR, 2024

Integrating Social Determinants of Health into Knowledge Graphs: Evaluating Prediction Bias and Fairness in Healthcare.
CoRR, 2024

Accelerating Vision Diffusion Transformers with Skip Branches.
CoRR, 2024

FM-TS: Flow Matching for Time Series Generation.
CoRR, 2024

HEXA-MoE: Efficient and Heterogeneous-aware MoE Acceleration with ZERO Computation Redundancy.
CoRR, 2024

FairSkin: Fair Diffusion for Skin Disease Image Generation.
CoRR, 2024

Harnessing Your DRAM and SSD for Sustainable and Accessible LLM Inference with Mixed-Precision and Multi-level Caching.
CoRR, 2024

GDeR: Safeguarding Efficiency, Balancing, and Robustness via Prototypical Graph Pruning.
CoRR, 2024

PortLLM: Personalizing Evolving Large Language Models with Training-Free and Portable Model Patches.
CoRR, 2024

Adapt-∞: Scalable Lifelong Multimodal Instruction Tuning via Dynamic Data Selection.
CoRR, 2024

Leveraging Social Determinants of Health in Alzheimer's Research Using LLM-Augmented Literature Mining and Knowledge Graphs.
CoRR, 2024

Cut the Crap: An Economical Communication Pipeline for LLM-based Multi-Agent Systems.
CoRR, 2024

Knowledge-Driven Feature Selection and Engineering for Genotype Data with Large Language Models.
CoRR, 2024

(PASS) Visual Prompt Locates Good Structure Sparsity through a Recurrent HyperNetwork.
CoRR, 2024

SDoH-GPT: Using Large Language Models to Extract Social Determinants of Health (SDoH).
CoRR, 2024

DLO: Dynamic Layer Operation for Efficient Vertical Scaling of LLMs.
CoRR, 2024

Cross-Lingual Multi-Hop Knowledge Editing - Benchmarks, Analysis and a Simple Contrastive Learning based Approach.
CoRR, 2024

Benchmark on Drug Target Interaction Modeling from a Structure Perspective.
CoRR, 2024

Examining Post-Training Quantization for Mixture-of-Experts: A Benchmark.
CoRR, 2024

Graph Sparsification via Mixture of Graphs.
CoRR, 2024

Hybrid Quantum-Classical Scheduling for Accelerating Neural Network Training with Newton's Gradient Descent.
CoRR, 2024

RESSA: Repair Sparse Vision-Language Models via Sparse Cross-Modality Adaptation.
CoRR, 2024

Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order.
CoRR, 2024

Tuning-Free Accountable Intervention for LLM Deployment - A Metacognitive Approach.
CoRR, 2024

Privacy-preserving Fine-tuning of Large Language Models through Flatness.
CoRR, 2024

GraphRCG: Self-conditioned Graph Generation via Bootstrapped Representations.
CoRR, 2024

The Wolf Within: Covert Injection of Malice into MLLM Societies via an MLLM Operative.
CoRR, 2024

Take the Bull by the Horns: Hard Sample-Reweighted Continual Training Improves LLM Generalization.
CoRR, 2024

Word-Sequence Entropy: Towards Uncertainty Estimation in Free-Form Medical Question Answering Applications and Beyond.
CoRR, 2024

MerRec: A Large-scale Multipurpose Mercari Dataset for Consumer-to-Consumer Recommendation Systems.
CoRR, 2024

GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic Evaluations.
CoRR, 2024

TrustLLM: Trustworthiness in Large Language Models.
CoRR, 2024

Thought Graph: Generating Thought Process for Biological Reasoning.
Proceedings of the Companion Proceedings of the ACM on Web Conference 2024, 2024

Enhancing Quantum Security over Federated Learning via Post-Quantum Cryptography.
Proceedings of the 5th IEEE International Conference on Trust, 2024

Model-GLUE: Democratized LLM Scaling for A Large Model Zoo in the Wild.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

GDeR: Safeguarding Efficiency, Balancing, and Robustness via Prototypical Graph Pruning.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Flex-MoE: Modeling Arbitrary Modality Combination via the Flexible Mixture-of-Experts.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

GTBench: Uncovering the Strategic Reasoning Capabilities of LLMs via Game-Theoretic Evaluations.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

ReTA: Recursively Thinking Ahead to Improve the Strategic Reasoning of Large Language Models.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Path-RAG: Knowledge-Guided Key Region Retrieval for Open-ended Pathology Visual Question Answering.
Proceedings of the Machine Learning for Health, 2024

Distributed UAV Beamforming Using Graph Recurrent Neural Networks.
Proceedings of the 13th IEEE Sensor Array and Multichannel Signal Processing Workshop, 2024

Two Heads Are Better Than One: Boosting Graph Sparse Training via Semantic and Topological Awareness.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Sparse Cocktail: Every Sparse Pattern Every Sparse Ratio All At Once.
Proceedings of the Forty-first International Conference on Machine Learning, 2024


Evolution-Inspired Loss Functions for Protein Representation Learning.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-Experts.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Sparse MoE with Language Guided Routing for Multilingual Machine Translation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Glue pizza and eat rocks - Exploiting Vulnerabilities in Retrieval-Augmented Generative Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Cross-Lingual Multi-Hop Knowledge Editing.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

FFN-SkipLLM: A Hidden Gem for Autoregressive Decoding with Adaptive Feed Forward Skipping.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Is C4 Dataset Optimal for Pruning? An Investigation of Calibration Data for LLM Pruning.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

DALK: Dynamic Co-Augmentation of LLMs and KG to answer Alzheimer's Disease Questions with Scientific Literature.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Mew: Multiplexed Immunofluorescence Image Analysis Through an Efficient Multiplex Network.
Proceedings of the Computer Vision - ECCV 2024, 2024

Facial Affective Behavior Analysis with Instruction Tuning.
Proceedings of the Computer Vision - ECCV 2024, 2024

Contextualization Distillation from Large Language Model for Knowledge Graph Completion.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

Molecular Data Programming: Towards Molecule Pseudo-labeling with Systematic Weak Supervision.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

HRBP: Hardware-friendly Regrouping towards Block-based Pruning for Sparse CNN Training.
Proceedings of the Conference on Parsimony and Learning, 2024

Cross-Quality Few-Shot Transfer for Alloy Yield Strength Prediction: A New Materials Science Benchmark and A Sparsity-Oriented Optimization Framework.
Proceedings of the Conference on Parsimony and Learning, 2024

Towards Instructing Disease-Drug Link Prediction with Social Determinants of Health.
Proceedings of the 15th ACM International Conference on Bioinformatics, 2024

Sparsity-Guided Holistic Explanation for LLMs with Interpretable Inference-Time Intervention.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Don't Be So Dense: Sparse-to-Sparse GAN Training Without Sacrificing Performance.
Int. J. Comput. Vis., October, 2023

Bag of Tricks for Training Deeper Graph Neural Networks: A Comprehensive Benchmark Study.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2023

Troubleshooting image segmentation models with human-in-the-loop.
Mach. Learn., March, 2023

Can Pruning Improve Certified Robustness of Neural Networks?
Trans. Mach. Learn. Res., 2023

Graph Contrastive Learning: An Odyssey towards Generalizable, Scalable and Principled Representation Learning on Graphs.
IEEE Data Eng. Bull., 2023

The Counterattack of CNNs in Self-Supervised Learning: Larger Kernel Size might be All You Need.
CoRR, 2023

Rethinking PGD Attack: Is Sign Function Necessary?
CoRR, 2023

SiRA: Sparse Mixture of Low Rank Adaptation.
CoRR, 2023

H<sub>2</sub>O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.
CoRR, 2023

Learning imaging mechanism directly from optical microscopy observations.
CoRR, 2023

Attend Who is Weak: Pruning-assisted Medical Image Localization under Sophisticated and Implicit Imbalances.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

QuantumSEA: In-Time Sparse Exploration for Noise Adaptive Quantum Circuits.
Proceedings of the IEEE International Conference on Quantum Computing and Engineering, 2023

Enhancing Adversarial Training via Reweighting Optimization Trajectory.
Proceedings of the Machine Learning and Knowledge Discovery in Databases: Research Track, 2023

H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

The Emergence of Essential Sparsity in Large Pre-trained Models: The Weights that Matter.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Instant Soup: Cheap Pruning Ensembles in A Single Pass Can Draw Lottery Tickets from Large Models.
Proceedings of the International Conference on Machine Learning, 2023

Graph Ladling: Shockingly Simple Parallel GNN Training without Intermediate Communication.
Proceedings of the International Conference on Machine Learning, 2023

Learning to Optimize Differentiable Games.
Proceedings of the International Conference on Machine Learning, 2023

Graph Domain Adaptation via Theory-Grounded Spectral Regularization.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

M-L2O: Towards Generalizable Learning-to-Optimize by Test-Time Fast Self-Adaptation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Is Attention All That NeRF Needs?
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Sparsity May Cry: Let Us Fail (Current) Sparse Neural Networks Together!
Proceedings of the Eleventh International Conference on Learning Representations, 2023

More ConvNets in the 2020s: Scaling up Kernels Beyond 51x51 using Sparsity.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

HotProtein: A Novel Framework for Protein Thermostability Prediction and Editing.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Robust Mixture-of-Expert Training for Convolutional Neural Networks.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Enhancing NeRF akin to Enhancing LLMs: Generalizable NeRF Transformer with Mixture-of-View-Experts.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

AdaMV-MoE: Adaptive Multi-Task Vision Mixture-of-Experts.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Accelerable Lottery Tickets with the Mixed-Precision Quantization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Learning to Generalize Provably in Learning to Optimize.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

DSEE: Dually Sparsity-embedded Efficient Tuning of Pre-trained Language Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Peeling the Onion: Hierarchical Reduction of Data Redundancy for Efficient Vision Transformer Training.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Scalable Perception-Action-Communication Loops With Convolutional and Graph Neural Networks.
IEEE Trans. Signal Inf. Process. over Networks, 2022

DANCE: DAta-Network Co-optimization for Efficient Segmentation Model Training and Inference.
ACM Trans. Design Autom. Electr. Syst., 2022

Can You Win Everything with A Lottery Ticket?
Trans. Mach. Learn. Res., 2022

Queried Unlabeled Data Improves and Robustifies Class-Incremental Learning.
Trans. Mach. Learn. Res., 2022

Adversarial Feature Augmentation and Normalization for Visual Recognition.
Trans. Mach. Learn. Res., 2022

Learning to Optimize: A Primer and A Benchmark.
J. Mach. Learn. Res., 2022

QuanGCN: Noise-Adaptive Training for Robust Quantum Graph Convolutional Networks.
CoRR, 2022

M<sup>3</sup>ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task Learning with Model-Accelerator Co-design.
CoRR, 2022

Can We Solve 3D Vision Tasks Starting from A 2D Vision Transformer?
CoRR, 2022

Is Attention All NeRF Needs?
CoRR, 2022

Neural Implicit Dictionary via Mixture-of-Expert Training.
CoRR, 2022

More ConvNets in the 2020s: Scaling up Kernels Beyond 51x51 using Sparsity.
CoRR, 2022

APP: Anytime Progressive Pruning.
CoRR, 2022

VAQF: Fully Automatic Software-hardware Co-design Framework for Low-bit Vision Transformer.
CoRR, 2022

Bringing Your Own View: Graph Contrastive Learning without Prefabricated Data Augmentations.
Proceedings of the WSDM '22: The Fifteenth ACM International Conference on Web Search and Data Mining, Virtual Event / Tempe, AZ, USA, February 21, 2022

Sandwich Batch Normalization: A Drop-In Replacement for Feature Distribution Heterogeneity.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Advancing Model Pruning via Bi-level Optimization.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Augmentations in Hypergraph Contrastive Learning: Fabricated and Generative.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Sparse Winning Tickets are Data-Efficient Image Recognizers.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

M³ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task Learning with Model-Accelerator Co-design.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Old can be Gold: Better Gradient Flow can Make Vanilla-GCNs Great Again.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

A Comprehensive Study on Large-Scale Graph Training: Benchmarking and Rethinking.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Randomized Channel Shuffling: Minimal-Overhead Backdoor Attack Detection without Clean Datasets.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

You Can Have Better Graph Neural Networks by Not Training Weights at All: Finding Untrained GNNs Tickets.
Proceedings of the Learning on Graphs Conference, 2022

Towards Robust Detection and Segmentation Using Vertical and Horizontal Adversarial Training.
Proceedings of the International Joint Conference on Neural Networks, 2022

Neural Implicit Dictionary Learning via Mixture-of-Expert Training.
Proceedings of the International Conference on Machine Learning, 2022

Universality of Winning Tickets: A Renormalization Group Perspective.
Proceedings of the International Conference on Machine Learning, 2022

Training Your Sparse Neural Network Better with Any Mask.
Proceedings of the International Conference on Machine Learning, 2022

Linearity Grafting: Relaxed Neuron Pruning Helps Certifiable Robustness.
Proceedings of the International Conference on Machine Learning, 2022

Data-Efficient Double-Win Lottery Tickets from Robust Pre-training.
Proceedings of the International Conference on Machine Learning, 2022

Coarsening the Granularity: Towards Structurally Sparse Lottery Tickets.
Proceedings of the International Conference on Machine Learning, 2022

Symbolic Learning to Optimize: Towards Interpretability and Scalability.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Unified Visual Transformer Compression.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Bayesian Modeling and Uncertainty Quantification for Learning to Optimize: What, Why, and How.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Learning Pruning-Friendly Networks via Frank-Wolfe: One-Shot, Any-Sparsity, And No Retraining.
Proceedings of the Tenth International Conference on Learning Representations, 2022

The Unreasonable Effectiveness of Random Pruning: Return of the Most Naive Baseline for Sparse Training.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Deep Ensembling with No Overhead for either Training or Testing: The All-Round Blessings of Dynamic Sparsity.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Optimizer Amalgamation.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Sparsity Winning Twice: Better Robust Generalization from More Efficient Training.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Towards Lifelong Learning of Multilingual Text-to-Speech Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2022

Point Cloud Domain Adaptation via Masked Local 3D Structure Prediction.
Proceedings of the Computer Vision - ECCV 2022, 2022

DnA: Improving Few-Shot Transfer Learning with Low-Rank Decomposition and Alignment.
Proceedings of the Computer Vision - ECCV 2022, 2022

Scalable Learning to Optimize: A Learned Optimizer Can Train Big Models.
Proceedings of the Computer Vision - ECCV 2022, 2022

CADTransformer: Panoptic Symbol Spotting Transformer for CAD Drawings.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Quarantine: Sparsity Can Uncover the Trojan Attack Trigger for Free.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

The Principle of Diversity: Training Stronger Vision Transformers Calls for Reducing All Levels of Redundancy.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Aug-NeRF: Training Stronger Neural Radiance Fields with Triple-Level Physically-Grounded Augmentations.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

AutoCoG: A Unified Data-Model Co-Search Framework for Graph Neural Networks.
Proceedings of the International Conference on Automated Machine Learning, 2022

Playing Lottery Tickets with Vision and Language.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Improving Contrastive Learning on Imbalanced Seed Data via Open-World Sampling.
CoRR, 2021

CAP: Co-Adversarial Perturbation on Weights and Features for Improving Generalization of Graph Neural Networks.
CoRR, 2021

Universality of Deep Neural Network Lottery Tickets: A Renormalization Group Perspective.
CoRR, 2021

FreeTickets: Accurate, Robust and Efficient Deep Ensemble by Training with Dynamic Sparsity.
CoRR, 2021

Playing Lottery Tickets with Vision and Language.
CoRR, 2021

Learning Transferable 3D Adversarial Cloaks for Deep Trained Detectors.
CoRR, 2021

Ultra-Data-Efficient GAN Training: Drawing A Lottery Ticket First, Then Training It Toughly.
CoRR, 2021

Sandwich Batch Normalization.
CoRR, 2021

Good Students Play Big Lottery Better.
CoRR, 2021

Sanity Checks for Lottery Tickets: Does Your Winning Ticket Really Win the Jackpot?
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Sparse Training via Boosting Pruning Plasticity with Neuroregeneration.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Improving Contrastive Learning on Imbalanced Data via Open-World Sampling.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

You are caught stealing my winning lottery ticket! Making a lottery ticket claim its ownership.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Chasing Sparsity in Vision Transformers: An End-to-End Exploration.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Data-Efficient GAN Training Beyond (Just) Augmentations: A Lottery Ticket Perspective.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Sparse and Imperceptible Adversarial Attack via a Homotopy Algorithm.
Proceedings of the 38th International Conference on Machine Learning, 2021

Efficient Lottery Ticket Finding: Less Data is More.
Proceedings of the 38th International Conference on Machine Learning, 2021

Graph Contrastive Learning Automated.
Proceedings of the 38th International Conference on Machine Learning, 2021

Self-Damaging Contrastive Learning.
Proceedings of the 38th International Conference on Machine Learning, 2021

A Unified Lottery Ticket Hypothesis for Graph Neural Networks.
Proceedings of the 38th International Conference on Machine Learning, 2021

Learning A Minimax Optimizer: A Pilot Study.
Proceedings of the 9th International Conference on Learning Representations, 2021

Undistillable: Making A Nasty Teacher That CANNOT teach students.
Proceedings of the 9th International Conference on Learning Representations, 2021

GANs Can Play Lottery Tickets Too.
Proceedings of the 9th International Conference on Learning Representations, 2021

Long Live the Lottery: The Existence of Winning Tickets in Lifelong Learning.
Proceedings of the 9th International Conference on Learning Representations, 2021

Robust Overfitting may be mitigated by properly learned smoothening.
Proceedings of the 9th International Conference on Learning Representations, 2021

VGAI: End-to-End Learning of Vision-Based Decentralized Controllers for Robot Swarms.
Proceedings of the IEEE International Conference on Acoustics, 2021

Troubleshooting Blind Image Quality Models in the Wild.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

"BNN - BN = ?": Training Binary Neural Networks Without Batch Normalization.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

The Lottery Tickets Hypothesis for Supervised and Self-Supervised Pre-Training in Computer Vision Models.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

DynEHR: Dynamic adaptation of models with data heterogeneity in electronic health records.
Proceedings of the IEEE EMBS International Conference on Biomedical and Health Informatics, 2021

2020
PCAL: A Privacy-preserving Intelligent Credit Risk Modeling Framework Based on Adversarial Learning.
CoRR, 2020

Can 3D Adversarial Logos Cloak Humans?
CoRR, 2020

L^2-GCN: Layer-Wise and Learned Efficient Training of Graph Convolutional Networks.
CoRR, 2020

Calibrated Domain-Invariant Learning for Highly Generalizable Large Scale Re-Identification.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Graph Contrastive Learning with Augmentations.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Once-for-All Adversarial Training: In-Situ Tradeoff between Robustness and Accuracy for Free.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Robust Pre-Training by Adversarial Contrastive Learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Training Stronger Baselines for Learning to Optimize.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

The Lottery Ticket Hypothesis for Pre-trained BERT Networks.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Dataset and Enhanced Model for Eligibility Criteria-to-SQL Semantic Parsing.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

AutoSpeech: Neural Architecture Search for Speaker Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

When Does Self-Supervision Help Graph Convolutional Networks?
Proceedings of the 37th International Conference on Machine Learning, 2020

Self-PU: Self Boosted and Calibrated Positive-Unlabeled Training.
Proceedings of the 37th International Conference on Machine Learning, 2020

I Am Going MAD: Maximum Discrepancy Competition for Comparing Classifiers Adaptively.
Proceedings of the 8th International Conference on Learning Representations, 2020

Triple Wins: Boosting Accuracy, Robustness and Efficiency Together by Enabling Input-Adaptive Inference.
Proceedings of the 8th International Conference on Learning Representations, 2020

HALO: Hardware-Aware Learning to Optimize.
Proceedings of the Computer Vision - ECCV 2020, 2020

L2-GCN: Layer-Wise and Learned Efficient Training of Graph Convolutional Networks.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Focus Longer to See Better: Recursively Refined Attention for Fine-Grained Image Classification.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Adversarial Robustness: From Self-Supervised Pre-Training to Fine-Tuning.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Learning to Optimize in Swarms.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Cross-Modal Person Search: A Coarse-to-Fine Framework using Bi-Directional Text-Image Matching.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

ABD-Net: Attentive but Diverse Person Re-Identification.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019


  Loading...