Joshua M. Susskind

Affiliations:
  • Apple


According to our database1, Joshua M. Susskind authored at least 84 papers between 2008 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Flexible Language Modeling in Continuous Space with Transformer-based Autoregressive Flows.
CoRR, July, 2025

TADA: Improved Diffusion Sampling with Training-free Augmented Dynamics.
CoRR, June, 2025

STARFlow: Scaling Latent Normalizing Flows for High-resolution Image Synthesis.
CoRR, June, 2025

How PARTs assemble into wholes: Learning the relative composition of images.
CoRR, June, 2025

Proxy-FDA: Proxy-based Feature Distribution Alignment for Fine-tuning Vision Foundation Models without Forgetting.
CoRR, May, 2025

Target Concrete Score Matching: A Holistic Framework for Discrete Diffusion.
CoRR, April, 2025

Scaling Laws for Native Multimodal Models.
CoRR, April, 2025

Mechanisms of Projective Composition of Diffusion Models.
CoRR, February, 2025

Parameters vs FLOPs: Scaling Laws for Optimal Sparsity for Mixture-of-Experts Language Models.
CoRR, January, 2025

Improving GFlowNets for Text-to-Image Diffusion Alignment.
Trans. Mach. Learn. Res., 2025

Denoising Autoregressive Transformers for Scalable Text-to-Image Generation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

World-consistent Video Diffusion with Explicit 3D Modeling.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Multimodal Autoregressive Pre-training of Large Vision Encoders.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
The Slingshot Effect: A Late-Stage Optimization Anomaly in Adaptive Gradient Methods.
Trans. Mach. Learn. Res., 2024

3D Shape Tokenization.
CoRR, 2024

Normalizing Flows are Capable Generative Models.
CoRR, 2024

Coordinate In and Value Out: Training Flow Transformers in Ambient Space.
CoRR, 2024

World-consistent Video Diffusion with Explicit 3D Modeling.
CoRR, 2024

TypeScore: A Text Fidelity Metric for Text-to-Image Generative Models.
CoRR, 2024

DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation.
CoRR, 2024

CTRLorALTer: Conditional LoRAdapter for Efficient 0-Shot Control & Altering of T2I Models.
CoRR, 2024

Many-to-many Image Generation with Auto-regressive Diffusion Models.
CoRR, 2024

How Far Are We from Intelligent Visual Deductive Reasoning?
CoRR, 2024

How JEPA Avoids Noisy Features: The Implicit Bias of Deep Linear Self Distillation Networks.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Kaleido Diffusion: Improving Conditional Diffusion Models with Autoregressive Latent Modeling.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Aggregate-and-Adapt Natural Language Prompts for Downstream Generalization of CLIP.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Swallowing the Bitter Pill: Simplified Scalable Conformer Generation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Data-free Distillation of Diffusion Models with Bootstrapping.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Scalable Pre-training of Large Autoregressive Image Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

What Algorithms can Transformers Learn? A Study in Length Generalization.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Overcoming the Pitfalls of Vision-Language Model Finetuning for OOD Generalization.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

LiDAR: Sensing Linear Probing Performance in Joint Embedding SSL Architectures.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Vanishing Gradients in Reinforcement Finetuning of Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Matryoshka Diffusion Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Manifold Diffusion Fields.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Generative Modeling with Phase Stochastic Bridge.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

When can transformers reason with abstract symbols?
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Pseudo-Generalized Dynamic View Synthesis from a Video.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Construction of Paired Knowledge Graph - Text Datasets Informed by Cyclic Evaluation.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Control3Diff: Learning Controllable 3D Diffusion Models from Single-view Images.
Proceedings of the International Conference on 3D Vision, 2024

2023
Generating Molecular Conformer Fields.
CoRR, 2023

Adaptivity and Modularity for Efficient Generalization Over Task Complexity.
CoRR, 2023

Is Generalized Dynamic Novel View Synthesis from Monocular Videos Possible Today?
CoRR, 2023

Generative Modeling with Phase Stochastic Bridges.
CoRR, 2023

Boolformer: Symbolic Regression of Logic Functions with Transformers.
CoRR, 2023

Value function estimation using conditional diffusion models for control.
CoRR, 2023

BOOT: Data-free Distillation of Denoising Diffusion Models with Bootstrapping.
CoRR, 2023

Manifold Diffusion Fields.
CoRR, 2023

Learning Controllable 3D Diffusion Models from Single-view Images.
CoRR, 2023

Transformers learn through gradual rank increase.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

PLANNER: Generating Diversified Paragraph via Latent Language Diffusion Model.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Stabilizing Transformer Training by Preventing Attention Entropy Collapse.
Proceedings of the International Conference on Machine Learning, 2023

NerfDiff: Single-image View Synthesis with NeRF-guided Distillation from 3D-aware Diffusion.
Proceedings of the International Conference on Machine Learning, 2023

Diffusion Probabilistic Fields.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

f-DM: A Multi-stage Diffusion Model via Progressive Signal Transformation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

MAST: Masked Augmentation Subspace Training for Generalizable Self-Supervised Priors.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022
The Slingshot Mechanism: An Empirical Study of Adaptive Optimizers and the Grokking Phenomenon.
CoRR, 2022

Efficient Embedding of Semantic Similarity in Control Policies via Entangled Bisimulation.
CoRR, 2022

Fast and Explicit Neural View Synthesis.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

GAUDI: A Neural Architect for Immersive 3D Scene Generation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Regularized Training of Nearest Neighbor Language Models.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Student Research Workshop, 2022

Position Prediction as an Effective Pretraining Strategy.
Proceedings of the International Conference on Machine Learning, 2022

Efficient Representation Learning via Adaptive Context Pooling.
Proceedings of the International Conference on Machine Learning, 2022

Learning Representation from Neural Fisher Kernel with Low-rank Approximation.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
Robust Robotic Control from Pixels using Contrastive Recurrent State-Space Models.
CoRR, 2021

Implicit Greedy Rank Learning in Autoencoders via Overparameterized Linear Networks.
CoRR, 2021

Implicit Acceleration and Feature Learning in Infinitely Wide Neural Networks with Bottlenecks.
CoRR, 2021

An Attention Free Transformer.
CoRR, 2021

On the generalization of learning-based 3D reconstruction.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Uncertainty Weighted Actor-Critic for Offline Reinforcement Learning.
Proceedings of the 38th International Conference on Machine Learning, 2021

Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene Understanding.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Unconstrained Scene Generation with Locally Conditioned Radiance Fields.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

MetricOpt: Learning To Optimize Black-Box Evaluation Metrics.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Set Distribution Networks: a Generative Model for Sets of Images.
CoRR, 2020

Equivariant Neural Rendering.
CoRR, 2020

Collegial Ensembles.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Equivariant Neural Rendering.
Proceedings of the 37th International Conference on Machine Learning, 2020

2019
Skip-Clip: Self-Supervised Spatiotemporal Representation Learning by Future Clip Order Ranking.
CoRR, 2019

Adversarial Fisher Vectors for Unsupervised Representation Learning.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Addressing the Loss-Metric Mismatch with Adaptive Loss Alignment.
Proceedings of the 36th International Conference on Machine Learning, 2019

2013
Modeling Natural Images Using Gated MRFs.
IEEE Trans. Pattern Anal. Mach. Intell., 2013

2011
Modeling the joint density of two images under a variety of transformations.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

On deep generative models with applications to recognition.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2008
Analysis-by-Synthesis by Learning to Invert Generative Black Boxes.
Proceedings of the Artificial Neural Networks, 2008


  Loading...