Tianlong Chen

Yunmei Liu

CoRR, February, 2026

TMS: Trajectory-Mixed Supervision for Reward-Free, On-Policy SFT.

CoRR, February, 2026

Dynamic Mix Precision Routing for Efficient Multi-step LLM Interaction.

CoRR, February, 2026

Geometry- and Relation-Aware Diffusion for EEG Super-Resolution.

CoRR, February, 2026

Towards Building Non-Fine-Tunable Foundation Models.

CoRR, February, 2026

From Models to Systems: A Comprehensive Survey of Efficient Multimodal Learning.

Trans. Mach. Learn. Res., 2026

LucidAtlas: Learning Uncertainty-Aware, Covariate-Disentangled, Individualized Atlas Representations.

Trans. Mach. Learn. Res., 2026

SDoH-GPT: using large language models to extract social determinants of health.

J. Am. Medical Informatics Assoc., 2026

GatorSC: multi-scale cell and gene graphs with mixture-of-experts fusion for single-cell transcriptomics.

Briefings Bioinform., 2026

Explaining the 'Unexplainable' Large Language Models.

Proceedings of the Nineteenth ACM International Conference on Web Search and Data Mining, 2026

EdgeTune: Efficient On-Device LLM Personalization at the Edge.

Zhenyu Wang

Shahriar Nirjon

Proceedings of the 2026 ACM/IEEE International Conference on Embedded Artificial Intelligence and Sensing Systems, 2026

Dialogue is Better Than Monologue: Instructing Medical LLMs via Strategic Conversations.

Proceedings of the Findings of the Association for Computational Linguistics: EACL 2026, 2026

Can Multimodal LLMs 'See' Science Instruction? Benchmarking Pedagogical Reasoning in K-12 Classroom Videos.

Proceedings of the Artificial Intelligence in Education - 27th International Conference, 2026

Vulnerability-Aware Robust Multimodal Adversarial Training.

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

COIN: Uncertainty-Guarding Selective Question Answering for Foundation Models with Provable Risk Guarantees.

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Model Editing as a Double-Edged Sword: Steering Agent Behavior Toward Beneficence or Harm.

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

OR-R1: Automating Modeling and Solving of Operations Research Optimization Problem via Test-Time Reinforcement Learning.

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

Multimodal Fusion of Regional Brain Experts for Interpretable Alzheimer's Disease Diagnosis.

CoRR, December, 2025

LEC: Linear Expectation Constraints for False-Discovery Control in Selective Prediction and Routing Systems.

CoRR, December, 2025

PPBoost: Progressive Prompt Boosting for Text-Driven Medical Image Segmentation.

CoRR, November, 2025

Fairness in Multi-modal Medical Diagnosis with Demonstration Selection.

CoRR, November, 2025

A Space-Time Transformer for Precipitation Forecasting.

Levi Harris

CoRR, November, 2025

Unlocking Dynamic Inter-Client Spatial Dependencies: A Federated Spatio-Temporal Graph Learning Method for Traffic Flow Forecasting.

CoRR, November, 2025

Beyond Redundancy: Diverse and Specialized Multi-Expert Sparse Autoencoder.

CoRR, November, 2025

ResearchGPT: Benchmarking and Training LLMs for End-to-End Computer Science Research Workflows.

CoRR, October, 2025

TRUST: A Decentralized Framework for Auditing Large Language Model Reasoning.

CoRR, October, 2025

Leave It to the Experts: Detecting Knowledge Distillation via MoE Expert Signatures.

CoRR, October, 2025

SARHAchat: An LLM-Based Chatbot for Sexual and Reproductive Health Counseling.

CoRR, October, 2025

Can GRPO Help LLMs Transcend Their Pretraining Origin?

CoRR, October, 2025

Metacognitive Self-Correction for Multi-Agent System via Prototype-Guided Next-Execution Reconstruction.

Hossein Nourkhiz Mahjoub

Ehsan Moradi-Pari

Kwonjoon Lee

CoRR, October, 2025

EditCast3D: Single-Frame-Guided 3D Editing with Video Propagation and View Selection.

CoRR, October, 2025

Multi-Agent Debate for LLM Judges with Adaptive Stability Detection.

CoRR, October, 2025

AsyncSpade: Efficient Test-Time Scaling with Asynchronous Sparse Decoding.

CoRR, October, 2025

FaithCoT-Bench: Benchmarking Instance-Level Faithfulness of Chain-of-Thought Reasoning.

CoRR, October, 2025

GEM: 3D Gaussian Splatting for Efficient and Accurate Cryo-EM Reconstruction.

CoRR, September, 2025

Quantum Variational Activation Functions Empower Kolmogorov-Arnold Networks.

CoRR, September, 2025

Transferring Expert Cognitive Models to Social Robots via Agentic Concept Bottleneck Models.

CoRR, August, 2025

Enabling Few-Shot Alzheimer's Disease Diagnosis on Tabular Biomarker Data with LLMs.

CoRR, July, 2025

SparseC-AFM: a deep learning method for fast and accurate characterization of MoS2 with C-AFM.

CoRR, July, 2025

SPATIA: Multimodal Model for Prediction and Generation of Spatial Cell Phenotypes.

CoRR, July, 2025

GPAS: Accelerating Convergence of LLM Pretraining via Gradient-Preserving Activation Scaling.

CoRR, June, 2025

Double-Checker: Enhancing Reasoning of Slow-Thinking LLMs via Self-Critical Fine-Tuning.

CoRR, June, 2025

Model Editing as a Double-Edged Sword: Steering Agent Ethical Behavior Toward Beneficence or Harm.

CoRR, June, 2025

UProp: Investigating the Uncertainty Propagation of LLMs in Multi-Step Agentic Decision-Making.

CoRR, June, 2025

An Empirical Study of Federated Prompt Learning for Vision Language Model.

CoRR, May, 2025

VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction.

CoRR, May, 2025

DOGe: Defensive Output Generation for LLM Protection Against Knowledge Distillation.

CoRR, May, 2025

The Quest for Efficient Reasoning: A Data-Centric Benchmark to CoT Distillation.

Ruichen Zhang

Konstantinos N. Plataniotis

CoRR, May, 2025

Occult: Optimizing Collaborative Communication across Experts for Accelerated Parallel MoE Training and Inference.

CoRR, May, 2025

DD-Ranking: Rethinking the Evaluation of Dataset Distillation.

Baharan Mirzasoleiman

Manolis Kellis

CoRR, May, 2025

GroverGPT-2: Simulating Grover's Algorithm via Chain-of-Thought Reasoning and Quantum-Native Tokenization.

CoRR, May, 2025

A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment.

CoRR, April, 2025

Efficient MAP Estimation of LLM Judgment Performance with Prior Transfer.

CoRR, April, 2025

Are We Merely Justifying Results ex Post Facto? Quantifying Explanatory Inversion in Post-Hoc Model Explanations.

CoRR, April, 2025

Finding Fantastic Experts in MoEs: A Unified Study for Expert Dropping Strategies and Observations.

CoRR, April, 2025

More is Less: The Pitfalls of Multi-Model Synthetic Preference Data in DPO Safety Alignment.

CoRR, April, 2025

LightDefense: A Lightweight Uncertainty-Driven Defense against Jailbreaks via Shifted Token Distribution.

CoRR, April, 2025

Agents Under Siege: Breaking Pragmatic Multi-Agent LLM Systems with Optimized Prompt Attacks.

CoRR, April, 2025

ORAL: Prompting Your Large-Scale LoRAs via Conditional Recurrent Diffusion.

CoRR, March, 2025

Make Optimization Once and for All with Fine-grained Guidance.

CoRR, March, 2025

H3PIMAP: A Heterogeneity-Aware Multi-Objective DNN Mapping Framework on Electronic-Photonic Processing-in-Memory Architectures.

Krishnendu Chakrabarty

Farshad Firouzi

Jeff Zhang

Jiaqi Gu

CoRR, March, 2025

Symbolic Mixture-of-Experts: Adaptive Skill-based Routing for Heterogeneous Reasoning.

CoRR, March, 2025

NeuroSymAD: A Neuro-Symbolic Framework for Interpretable Alzheimer's Disease Diagnosis.

CoRR, March, 2025

Stable-SPAM: How to Train in 4-Bit More Stably than 16-Bit Adam.

CoRR, February, 2025

LucidAtlas: Learning Uncertainty-Aware, Covariate-Disentangled, Individualized Atlas Representations.

CoRR, February, 2025

Symbiotic Cooperation for Web Agents: Harnessing Complementary Strengths of Large and Small LLMs.

CoRR, February, 2025

Continually Evolved Multimodal Foundation Models for Cancer Prognosis.

CoRR, January, 2025

Dialogue is Better Than Monologue: Instructing Medical LLMs via Strategical Conversations.

CoRR, January, 2025

Layer-Level Self-Exposure and Patch: Affirmative Token Mitigation for Jailbreak Attack Defense.

CoRR, January, 2025

The Efficiency vs. Accuracy Trade-off: Optimizing RAG-Enhanced LLM Recommender Systems Using Multi-Head Early Exit.

CoRR, January, 2025

GroverGPT: A Large Language Model with 8 Billion Parameters for Quantum Searching.

CoRR, January, 2025

Data for "CryoNeRF: neural radiance field for homogeneous and heterogeneous cryo-EM reconstruction".

Dataset, January, 2025

A Survey on Model MoErging: Recycling and Routing Among Specialized Experts for Collaborative Learning.

Trans. Mach. Learn. Res., 2025

Pushing the Limits of Sparsity: A Bag of Tricks for Extreme Pruning.

Trans. Mach. Learn. Res., 2025

A natural language processing-based approach for early detection of heart failure onset using electronic health records.

Knowl. Based Syst., 2025

GatorCLR: Personalized predictions of patient outcomes on electronic health records using self-supervised contrastive graph representation.

J. Biomed. Informatics, 2025

Word-Sequence Entropy: Towards uncertainty estimation in free-form medical question answering applications and beyond.

Eng. Appl. Artif. Intell., 2025

Deep Learning for Accurate Diagnosis of Viral Infections through scRNA-seq Analysis: A Comprehensive Benchmark Study.

J. Data-centric Mach. Learn. Res., 2025

A Survey on Reinforcement Learning for Optimal Decision-Making and Control of Intelligent Vehicles.

CAAI Trans. Intell. Technol., 2025

A Decentralized Framework for Auditing Large Language Model Reasoning.

Proceedings of the 7th IEEE International Conference on Trust, 2025

Protecting Privacy against Membership Inference Attack with LLM Fine-tuning through Flatness.

Proceedings of the 2025 SIAM International Conference on Data Mining, 2025

One Token Embedding Is Enough to Deadlock Your Large Reasoning Model.

Mohan Zhang

Yihua Zhang

Jinghan Jia

Zhangyang (Atlas) Wang

Sijia Liu

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

BetaConform: Efficient MAP Estimation of LLM Ensemble Judgment Performance with Prior Transfer.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Mozart: Modularized and Efficient MoE Training on 3.5D Wafer-Scale Chiplet Architectures.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

IndustryEQA: Pushing the Frontiers of Embodied Question Answering in Industrial Scenarios.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert Parallelism Design.

Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

BPO: Towards Balanced Preference Optimization between Knowledge Breadth and Depth in Alignment.

Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Layer-Level Self-Exposure and Patch: Affirmative Token Mitigation for Jailbreak Attack Defense.

Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

GuideLLM: Exploring LLM-Guided Conversation with Applications in Autobiography Interviewing.

Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

RTGS: Real-Time 3D Gaussian Splatting SLAM via Multi-Level Redundancy Reduction.

Proceedings of the 58th IEEE/ACM International Symposium on Microarchitecture, 2025

Oldie but Goodie: Re-illuminating Label Propagation on Graphs with Partially Observed Features.

Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.2, 2025

A Survey on Trustworthy LLM Agents: Threats and Countermeasures.

Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.2, 2025

MerRec: A Large-scale Multipurpose Mercari Dataset for Consumer-to-Consumer Recommendation Systems.

Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.1, 2025

G-Designer: Architecting Multi-agent Communication Topologies via Graph Neural Networks.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

I2MoE: Interpretable Multimodal Interaction-aware Mixture-of-Experts.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Occult: Optimizing Collaborative Communications across Experts for Accelerated Parallel MoE Training and Inference.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Modalities Contribute Unequally: Enhancing Medical Multi-modal Learning through Adaptive Modality Token Re-balancing.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Cut the Crap: An Economical Communication Pipeline for LLM-based Multi-Agent Systems.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Graph Sparsification via Mixture of Graphs.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

PortLLM: Personalizing Evolving Large Language Models with Training-Free and Portable Model Patches.

Rana Muhammad Shahroz

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Adapt-∞: Scalable Continual Multimodal Instruction Tuning via Dynamic Data Selection.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Proactive Privacy Amnesia for Large Language Models: Safeguarding PII with Negligible Impact on Model Utility.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Composable Interventions for Language Models.

Arinbjörn Kolbeinsson

Jonathan Richard Schwarz

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Thought Graph: Balancing Specificity and Uncertainty in LLM-Based Gene Set Annotation.

Proceedings of the 13th IEEE International Conference on Healthcare Informatics, 2025

Towards Stabilized and Efficient Diffusion Transformers Through Long-Skip-Connections With Spectral Constraints.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

HoloZip: High Hologram Compression via Latent-of-Latent Coding.

Praneeth Chakravarthula

Proceedings of the IEEE International Conference on Computational Photography, 2025

AnyMAC: Cascading Flexible Multi-Agent Collaboration via Next-Agent Prediction.

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

FIER: Fine-Grained and Efficient KV Cache Retrieval for Long-context LLM Inference.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

ORAL: Prompting Your Large-Scale LoRAs via Conditional Recurrent Diffusion.

Rana Muhammad Shahroz

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

Bag of Tricks for Sparse Mixture-of-Experts: A Benchmark Across Reasoning, Efficiency, and Safety.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

MedHallu: A Comprehensive Benchmark for Detecting Medical Hallucinations in Large Language Models.

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Task-Aware Resolution Optimization for Visual Large Language Models.

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Glider: Global and Local Instruction-Driven Expert Router.

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Bit-Flip Error Resilience in LLMs: A Comprehensive Analysis and Defense Framework.

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

EQA-RM: A Generative Embodied Reward Model with Test-time Scaling.

Yuhang Chen

Zhen Tan

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Window Token Concatenation for Efficient Visual Large Language Models.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

Sparse MoE as a New Treatment: Addressing Forgetting, Fitting, Learning Issues in Multi-Modal Multi-Task Learning.

Proceedings of the Conference on Parsimony and Learning, 2025

You Only Debias Once: Towards Flexible Accuracy-Fairness Trade-offs at Inference Time.

Proceedings of the Conference on Parsimony and Learning, 2025

GraphRCG: Self-Conditioned Graph Generation.

Proceedings of the 34th ACM International Conference on Information and Knowledge Management, 2025

Enabling Few-Shot Alzheimer's Disease Diagnosis on Biomarker Data with Tabular LLMs.

Proceedings of the 16th ACM International Conference on Bioinformatics, 2025

The Efficiency vs. Accuracy Trade-off: Optimizing RAG-Enhanced LLM Recommender Systems Using Multi-Head Early Exit.

Srinivas Prasad Govindan

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

SCALE: Towards Collaborative Content Analysis in Social Science with Large Language Model Agents and Human Intervention.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

SConU: Selective Conformal Uncertainty in Large Language Models.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Agents Under Siege: Breaking Pragmatic Multi-Agent LLM Systems with Optimized Prompt Attacks.

Rana Muhammad Shahroz

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

UQ-Merge: Uncertainty Guided Multimodal Large Language Model Merging.

Proceedings of the Findings of the Association for Computational Linguistics, 2025

GRNFormer: A Biologically-Guided Framework for Integrating Gene Regulatory Networks into RNA Foundation Models.

Proceedings of the Findings of the Association for Computational Linguistics, 2025

Unveiling Privacy Risks in Multi-modal Large Language Models: Task-specific Vulnerabilities and Mitigation Challenges.

Proceedings of the Findings of the Association for Computational Linguistics, 2025

Vision Language Model Helps Private Information De-Identification in Vision Data.

Proceedings of the Findings of the Association for Computational Linguistics, 2025

Spatial Coordinates as a Cell Language: A Multi-Sentence Framework for Imaging Mass Cytometry Analysis.

Proceedings of the Findings of the Association for Computational Linguistics, 2025

In Prospect and Retrospect: Reflective Memory Management for Long-term Personalized Dialogue Agents.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

DLF: Disentangled-Language-Focused Multimodal Sentiment Analysis.

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

BrainMAP: Learning Multiple Activation Pathways in Brain Networks.

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

Evaluating topological fitness of human brain-inspired sub-circuits in Echo State Networks.

Proceedings of the AAAI Bridge Program on AI for Medicine and Healthcare, 2025

Sparse Transfer Learning Accelerates and Enhances Certified Robustness: A Comprehensive Study.

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

Visual Prompting Upgrades Neural Network Sparsification: A Data-Model Perspective.

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

Mapping from Meaning: Addressing the Miscalibration of Prompt-Sensitive Language Models.

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

Breaking the Resource Monopoly from Industries: Sustainable and Reliable LLM Serving by Recycling Outdated and Resource-Constrained GPUs.

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

Tuning-Free Accountable Intervention for LLM Deployment - a Metacognitive Approach.

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

Inductive Lottery Ticket Learning for Graph Neural Networks.

J. Comput. Sci. Technol., November, 2024

One is Not Enough: Parameter-Efficient Fine-Tuning With Multiplicative Sparse Factorization.

Ahmed Hassan Awadallah

IEEE J. Sel. Top. Signal Process., September, 2024

Single-cell RNA sequencing data imputation using bi-level feature propagation.

Briefings Bioinform., May, 2024

Unlearning Sensitive Information in Multimodal LLMs: Benchmark and Attack-Defense Evaluation.

Trans. Mach. Learn. Res., 2024

Political-LLM: Large Language Models in Political Science.

CoRR, 2024

Integrating Social Determinants of Health into Knowledge Graphs: Evaluating Prediction Bias and Fairness in Healthcare.

CoRR, 2024

Accelerating Vision Diffusion Transformers with Skip Branches.

CoRR, 2024

FM-TS: Flow Matching for Time Series Generation.

CoRR, 2024

HEXA-MoE: Efficient and Heterogeneous-aware MoE Acceleration with ZERO Computation Redundancy.

CoRR, 2024

FairSkin: Fair Diffusion for Skin Disease Image Generation.

CoRR, 2024

Harnessing Your DRAM and SSD for Sustainable and Accessible LLM Inference with Mixed-Precision and Multi-level Caching.

CoRR, 2024

GDeR: Safeguarding Efficiency, Balancing, and Robustness via Prototypical Graph Pruning.

CoRR, 2024

PortLLM: Personalizing Evolving Large Language Models with Training-Free and Portable Model Patches.

CoRR, 2024

Adapt-∞: Scalable Lifelong Multimodal Instruction Tuning via Dynamic Data Selection.

CoRR, 2024

Leveraging Social Determinants of Health in Alzheimer's Research Using LLM-Augmented Literature Mining and Knowledge Graphs.

CoRR, 2024

Cut the Crap: An Economical Communication Pipeline for LLM-based Multi-Agent Systems.

CoRR, 2024

Knowledge-Driven Feature Selection and Engineering for Genotype Data with Large Language Models.

CoRR, 2024

(PASS) Visual Prompt Locates Good Structure Sparsity through a Recurrent HyperNetwork.

CoRR, 2024

SDoH-GPT: Using Large Language Models to Extract Social Determinants of Health (SDoH).

CoRR, 2024

DLO: Dynamic Layer Operation for Efficient Vertical Scaling of LLMs.

CoRR, 2024

Cross-Lingual Multi-Hop Knowledge Editing - Benchmarks, Analysis and a Simple Contrastive Learning based Approach.

CoRR, 2024

Benchmark on Drug Target Interaction Modeling from a Structure Perspective.

CoRR, 2024

Examining Post-Training Quantization for Mixture-of-Experts: A Benchmark.

CoRR, 2024

Graph Sparsification via Mixture of Graphs.

CoRR, 2024

Hybrid Quantum-Classical Scheduling for Accelerating Neural Network Training with Newton's Gradient Descent.

CoRR, 2024

RESSA: Repair Sparse Vision-Language Models via Sparse Cross-Modality Adaptation.

Shwai He

CoRR, 2024

Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order.

CoRR, 2024

Tuning-Free Accountable Intervention for LLM Deployment - A Metacognitive Approach.

CoRR, 2024

Privacy-preserving Fine-tuning of Large Language Models through Flatness.

CoRR, 2024

GraphRCG: Self-conditioned Graph Generation via Bootstrapped Representations.

CoRR, 2024

The Wolf Within: Covert Injection of Malice into MLLM Societies via an MLLM Operative.

CoRR, 2024

Take the Bull by the Horns: Hard Sample-Reweighted Continual Training Improves LLM Generalization.

CoRR, 2024

Word-Sequence Entropy: Towards Uncertainty Estimation in Free-Form Medical Question Answering Applications and Beyond.

CoRR, 2024

MerRec: A Large-scale Multipurpose Mercari Dataset for Consumer-to-Consumer Recommendation Systems.

CoRR, 2024

GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic Evaluations.

CoRR, 2024

TrustLLM: Trustworthiness in Large Language Models.

CoRR, 2024

Thought Graph: Generating Thought Process for Biological Reasoning.

Proceedings of the Companion Proceedings of the ACM on Web Conference 2024, 2024

Enhancing Quantum Security over Federated Learning via Post-Quantum Cryptography.

Pingzhi Li

Raghuraman Krishnamoorthi

Junyu Liu

Proceedings of the 5th IEEE International Conference on Trust, 2024

Model-GLUE: Democratized LLM Scaling for A Large Model Zoo in the Wild.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

GDeR: Safeguarding Efficiency, Balancing, and Robustness via Prototypical Graph Pruning.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Flex-MoE: Modeling Arbitrary Modality Combination via the Flexible Mixture-of-Experts.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

GTBench: Uncovering the Strategic Reasoning Capabilities of LLMs via Game-Theoretic Evaluations.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

ReTA: Recursively Thinking Ahead to Improve the Strategic Reasoning of Large Language Models.

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Path-RAG: Knowledge-Guided Key Region Retrieval for Open-ended Pathology Visual Question Answering.

Proceedings of the Machine Learning for Health, 2024

Distributed UAV Beamforming Using Graph Recurrent Neural Networks.

Proceedings of the 13th IEEE Sensor Array and Multichannel Signal Processing Workshop, 2024

Two Heads Are Better Than One: Boosting Graph Sparse Training via Semantic and Topological Awareness.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Sparse Cocktail: Every Sparse Pattern Every Sparse Ratio All At Once.

Shiyu Chang

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Position: TrustLLM: Trustworthiness in Large Language Models.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Evolution-Inspired Loss Functions for Protein Representation Learning.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-Experts.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Sparse MoE with Language Guided Routing for Multilingual Machine Translation.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Glue pizza and eat rocks - Exploiting Vulnerabilities in Retrieval-Augmented Generative Models.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Cross-Lingual Multi-Hop Knowledge Editing.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

FFN-SkipLLM: A Hidden Gem for Autoregressive Decoding with Adaptive Feed Forward Skipping.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Is C4 Dataset Optimal for Pruning? An Investigation of Calibration Data for LLM Pruning.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

DALK: Dynamic Co-Augmentation of LLMs and KG to answer Alzheimer's Disease Questions with Scientific Literature.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Mew: Multiplexed Immunofluorescence Image Analysis Through an Efficient Multiplex Network.

Proceedings of the Computer Vision - ECCV 2024, 2024

Facial Affective Behavior Analysis with Instruction Tuning.

Proceedings of the Computer Vision - ECCV 2024, 2024

Contextualization Distillation from Large Language Model for Knowledge Graph Completion.

Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

Molecular Data Programming: Towards Molecule Pseudo-labeling with Systematic Weak Supervision.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Models.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

HRBP: Hardware-friendly Regrouping towards Block-based Pruning for Sparse CNN Training.

Proceedings of the Conference on Parsimony and Learning, 2024

Cross-Quality Few-Shot Transfer for Alloy Yield Strength Prediction: A New Materials Science Benchmark and A Sparsity-Oriented Optimization Framework.

Xuxi Chen

Everardo Yeriel Olivares

Proceedings of the Conference on Parsimony and Learning, 2024

Towards Instructing Disease-Drug Link Prediction with Social Determinants of Health.

Proceedings of the 15th ACM International Conference on Bioinformatics, 2024

Sparsity-Guided Holistic Explanation for LLMs with Interpretable Inference-Time Intervention.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Don't Be So Dense: Sparse-to-Sparse GAN Training Without Sacrificing Performance.

Int. J. Comput. Vis., October, 2023

Bag of Tricks for Training Deeper Graph Neural Networks: A Comprehensive Benchmark Study.

IEEE Trans. Pattern Anal. Mach. Intell., March, 2023

Troubleshooting image segmentation models with human-in-the-loop.

Mach. Learn., March, 2023

Can Pruning Improve Certified Robustness of Neural Networks?

Trans. Mach. Learn. Res., 2023

Graph Contrastive Learning: An Odyssey towards Generalizable, Scalable and Principled Representation Learning on Graphs.

IEEE Data Eng. Bull., 2023

The Counterattack of CNNs in Self-Supervised Learning: Larger Kernel Size might be All You Need.

CoRR, 2023

Rethinking PGD Attack: Is Sign Function Necessary?

CoRR, 2023

SiRA: Sparse Mixture of Low Rank Adaptation.

CoRR, 2023

H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.

CoRR, 2023

Learning imaging mechanism directly from optical microscopy observations.

CoRR, 2023

Attend Who is Weak: Pruning-assisted Medical Image Localization under Sophisticated and Implicit Imbalances.

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

QuantumSEA: In-Time Sparse Exploration for Noise Adaptive Quantum Circuits.

Proceedings of the IEEE International Conference on Quantum Computing and Engineering, 2023

Enhancing Adversarial Training via Reweighting Optimization Trajectory.

Proceedings of the Machine Learning and Knowledge Discovery in Databases: Research Track, 2023

H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

The Emergence of Essential Sparsity in Large Pre-trained Models: The Weights that Matter.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Instant Soup: Cheap Pruning Ensembles in A Single Pass Can Draw Lottery Tickets from Large Models.

Proceedings of the International Conference on Machine Learning, 2023

Graph Ladling: Shockingly Simple Parallel GNN Training without Intermediate Communication.

Proceedings of the International Conference on Machine Learning, 2023

Learning to Optimize Differentiable Games.

Proceedings of the International Conference on Machine Learning, 2023

Graph Domain Adaptation via Theory-Grounded Spectral Regularization.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

M-L2O: Towards Generalizable Learning-to-Optimize by Test-Time Fast Self-Adaptation.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Is Attention All That NeRF Needs?

Subhashini Venugopalan

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Sparsity May Cry: Let Us Fail (Current) Sparse Neural Networks Together!

Proceedings of the Eleventh International Conference on Learning Representations, 2023

More ConvNets in the 2020s: Scaling up Kernels Beyond 51x51 using Sparsity.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

HotProtein: A Novel Framework for Protein Thermostability Prediction and Editing.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Robust Mixture-of-Expert Training for Convolutional Neural Networks.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Enhancing NeRF akin to Enhancing LLMs: Generalizable NeRF Transformer with Mixture-of-View-Experts.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

AdaMV-MoE: Adaptive Multi-Task Vision Mixture-of-Experts.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Accelerable Lottery Tickets with the Mixed-Precision Quantization.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Learning to Generalize Provably in Learning to Optimize.

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

DSEE: Dually Sparsity-embedded Efficient Tuning of Pre-trained Language Models.

Xuxi Chen

Weizhu Chen

Ahmed Hassan Awadallah

Yu Cheng

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Peeling the Onion: Hierarchical Reduction of Data Redundancy for Efficient Vision Transformer Training.

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Scalable Perception-Action-Communication Loops With Convolutional and Graph Neural Networks.

IEEE Trans. Signal Inf. Process. over Networks, 2022

DANCE: DAta-Network Co-optimization for Efficient Segmentation Model Training and Inference.

ACM Trans. Design Autom. Electr. Syst., 2022

Can You Win Everything with A Lottery Ticket?

Trans. Mach. Learn. Res., 2022

Queried Unlabeled Data Improves and Robustifies Class-Incremental Learning.

Trans. Mach. Learn. Res., 2022

Adversarial Feature Augmentation and Normalization for Visual Recognition.

Trans. Mach. Learn. Res., 2022

Learning to Optimize: A Primer and A Benchmark.

J. Mach. Learn. Res., 2022

QuanGCN: Noise-Adaptive Training for Robust Quantum Graph Convolutional Networks.

CoRR, 2022

M3ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task Learning with Model-Accelerator Co-design.

CoRR, 2022

Can We Solve 3D Vision Tasks Starting from A 2D Vision Transformer?

CoRR, 2022

Is Attention All NeRF Needs?

Subhashini Venugopalan

CoRR, 2022

Neural Implicit Dictionary via Mixture-of-Expert Training.

CoRR, 2022

More ConvNets in the 2020s: Scaling up Kernels Beyond 51x51 using Sparsity.

CoRR, 2022

APP: Anytime Progressive Pruning.

CoRR, 2022

VAQF: Fully Automatic Software-hardware Co-design Framework for Low-bit Vision Transformer.

CoRR, 2022

Bringing Your Own View: Graph Contrastive Learning without Prefabricated Data Augmentations.

Proceedings of the WSDM '22: The Fifteenth ACM International Conference on Web Search and Data Mining, Virtual Event / Tempe, AZ, USA, February 21, 2022

Sandwich Batch Normalization: A Drop-In Replacement for Feature Distribution Heterogeneity.

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Advancing Model Pruning via Bi-level Optimization.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Augmentations in Hypergraph Contrastive Learning: Fabricated and Generative.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Sparse Winning Tickets are Data-Efficient Image Recognizers.

Subhashini Venugopalan

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

M³ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task Learning with Model-Accelerator Co-design.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Old can be Gold: Better Gradient Flow can Make Vanilla-GCNs Great Again.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

A Comprehensive Study on Large-Scale Graph Training: Benchmarking and Rethinking.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Randomized Channel Shuffling: Minimal-Overhead Backdoor Attack Detection without Clean Datasets.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

You Can Have Better Graph Neural Networks by Not Training Weights at All: Finding Untrained GNNs Tickets.

Mykola Pechenizkiy

Shiwei Liu

Proceedings of the Learning on Graphs Conference, 2022

Towards Robust Detection and Segmentation Using Vertical and Horizontal Adversarial Training.

Proceedings of the International Joint Conference on Neural Networks, 2022

Neural Implicit Dictionary Learning via Mixture-of-Expert Training.

Proceedings of the International Conference on Machine Learning, 2022

Universality of Winning Tickets: A Renormalization Group Perspective.

Proceedings of the International Conference on Machine Learning, 2022

Training Your Sparse Neural Network Better with Any Mask.

Proceedings of the International Conference on Machine Learning, 2022

Linearity Grafting: Relaxed Neuron Pruning Helps Certifiable Robustness.

Proceedings of the International Conference on Machine Learning, 2022

Data-Efficient Double-Win Lottery Tickets from Robust Pre-training.

Proceedings of the International Conference on Machine Learning, 2022

Coarsening the Granularity: Towards Structurally Sparse Lottery Tickets.

Proceedings of the International Conference on Machine Learning, 2022

Symbolic Learning to Optimize: Towards Interpretability and Scalability.

Proceedings of the Tenth International Conference on Learning Representations, 2022

Unified Visual Transformer Compression.

Proceedings of the Tenth International Conference on Learning Representations, 2022

Bayesian Modeling and Uncertainty Quantification for Learning to Optimize: What, Why, and How.

Proceedings of the Tenth International Conference on Learning Representations, 2022

Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice.

Proceedings of the Tenth International Conference on Learning Representations, 2022

Learning Pruning-Friendly Networks via Frank-Wolfe: One-Shot, Any-Sparsity, And No Retraining.

Proceedings of the Tenth International Conference on Learning Representations, 2022

The Unreasonable Effectiveness of Random Pruning: Return of the Most Naive Baseline for Sparse Training.

Mykola Pechenizkiy

Proceedings of the Tenth International Conference on Learning Representations, 2022

Deep Ensembling with No Overhead for either Training or Testing: The All-Round Blessings of Dynamic Sparsity.

Proceedings of the Tenth International Conference on Learning Representations, 2022

Optimizer Amalgamation.

Proceedings of the Tenth International Conference on Learning Representations, 2022

Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable.

Shaojin Ding

Proceedings of the Tenth International Conference on Learning Representations, 2022

Sparsity Winning Twice: Better Robust Generalization from More Efficient Training.

Proceedings of the Tenth International Conference on Learning Representations, 2022

Towards Lifelong Learning of Multilingual Text-to-Speech Synthesis.

Proceedings of the IEEE International Conference on Acoustics, 2022

Point Cloud Domain Adaptation via Masked Local 3D Structure Prediction.

Proceedings of the Computer Vision - ECCV 2022, 2022

DnA: Improving Few-Shot Transfer Learning with Low-Rank Decomposition and Alignment.

Proceedings of the Computer Vision - ECCV 2022, 2022

Scalable Learning to Optimize: A Learned Optimizer Can Train Big Models.

Proceedings of the Computer Vision - ECCV 2022, 2022

CADTransformer: Panoptic Symbol Spotting Transformer for CAD Drawings.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Quarantine: Sparsity Can Uncover the Trojan Attack Trigger for Free.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

The Principle of Diversity: Training Stronger Vision Transformers Calls for Reducing All Levels of Redundancy.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Aug-NeRF: Training Stronger Neural Radiance Fields with Triple-Level Physically-Grounded Augmentations.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

AutoCoG: A Unified Data-Model Co-Search Framework for Graph Neural Networks.

Proceedings of the International Conference on Automated Machine Learning, 2022

Playing Lottery Tickets with Vision and Language.

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Improving Contrastive Learning on Imbalanced Seed Data via Open-World Sampling.

CoRR, 2021

CAP: Co-Adversarial Perturbation on Weights and Features for Improving Generalization of Graph Neural Networks.

CoRR, 2021

Universality of Deep Neural Network Lottery Tickets: A Renormalization Group Perspective.

CoRR, 2021

FreeTickets: Accurate, Robust and Efficient Deep Ensemble by Training with Dynamic Sparsity.

CoRR, 2021

Playing Lottery Tickets with Vision and Language.

CoRR, 2021

Learning Transferable 3D Adversarial Cloaks for Deep Trained Detectors.

CoRR, 2021

Ultra-Data-Efficient GAN Training: Drawing A Lottery Ticket First, Then Training It Toughly.

CoRR, 2021

Sandwich Batch Normalization.

CoRR, 2021

Good Students Play Big Lottery Better.

CoRR, 2021

Sanity Checks for Lottery Tickets: Does Your Winning Ticket Really Win the Jackpot?

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Sparse Training via Boosting Pruning Plasticity with Neuroregeneration.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Improving Contrastive Learning on Imbalanced Data via Open-World Sampling.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

You are caught stealing my winning lottery ticket! Making a lottery ticket claim its ownership.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Chasing Sparsity in Vision Transformers: An End-to-End Exploration.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Data-Efficient GAN Training Beyond (Just) Augmentations: A Lottery Ticket Perspective.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Sparse and Imperceptible Adversarial Attack via a Homotopy Algorithm.

Mingkang Zhu