Bin Ren
Orcid: 0000-0002-4116-5237Affiliations:
- William & Mary, Williamsburg, VA, USA
According to our database1,
Bin Ren
authored at least 99 papers
between 2011 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2025
Mobile-3DCNN: An Acceleration Framework for Ultra-Real-Time Execution of Large 3D CNNs on Mobile Devices.
ACM Trans. Archit. Code Optim., September, 2025
HiSin: Efficient High-Resolution Sinogram Inpainting via Resolution-Guided Progressive Inference.
CoRR, June, 2025
ACM Trans. Comput. Heal., January, 2025
TMModel: Modeling Texture Memory and Mobile GPU Performance to Accelerate DNN Computations.
Proceedings of the 39th ACM International Conference on Supercomputing, 2025
Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2025
2024
Trans. Recomm. Syst., March, 2024
FCDM: Sparse-view Sinogram Inpainting with Frequency Domain Convolution Enhanced Diffusion Models.
CoRR, 2024
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024
Proceedings of the 38th ACM International Conference on Supercomputing, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
DACO: Pursuing Ultra-low Power Consumption via DNN-Adaptive CPU-GPU CO-optimization on Mobile Devices.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2024
SmartMem: Layout Transformation Elimination and Adaptation for Efficient DNN Execution on Mobile.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024
2023
ACM Comput. Surv., 2023
Decentralized Application-Level Adaptive Scheduling for Multi-Instance DNNs on Open Mobile Devices.
Proceedings of the 2023 USENIX Annual Technical Conference, 2023
Pruning Parameterization with Bi-level Optimization for Efficient Semantic Segmentation on the Edge.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Towards High-Quality and Efficient Video Super-Resolution via Spatial-Temporal Data Overfitting.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
Mobile or FPGA? A Comprehensive Evaluation on Energy Efficiency and a Unified Optimization Framework.
ACM Trans. Embed. Comput. Syst., September, 2022
MemXCT: Design, Optimization, Scaling, and Reproducibility of X-Ray Tomography Imaging.
IEEE Trans. Parallel Distributed Syst., 2022
Automatic Mapping of the Best-Suited DNN Pruning Schemes for Real-Time Mobile Acceleration.
ACM Trans. Design Autom. Electr. Syst., 2022
GRIM: A General, Real-Time Deep Learning Inference Framework for Mobile Devices Based on Fine-Grained Structured Weight Sparsity.
IEEE Trans. Pattern Anal. Mach. Intell., 2022
Brief Industry Paper: Enabling Level-4 Autonomous Driving on a Single $1k Off-the-Shelf Card.
Proceedings of the 28th IEEE Real-Time and Embedded Technology and Applications Symposium, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the 18th International Conference on Mobility, Sensing and Networking, 2022
Proceedings of the 55th IEEE/ACM International Symposium on Microarchitecture, 2022
Proceedings of the 23rd International Symposium on Quality Electronic Design, 2022
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022
Proceedings of the Computer Vision - ECCV 2022, 2022
Proceedings of the Computer Vision - ECCV 2022, 2022
2021
CoRR, 2021
CoCoPIE: enabling real-time AI on off-the-shelf mobile devices via compression-compilation co-design.
Commun. ACM, 2021
Proceedings of the ESEC/FSE '21: 29th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2021
Brief Industry Paper: Towards Real-Time 3D Object Detection for Autonomous Vehicles with Pruning Search.
Proceedings of the 27th IEEE Real-Time and Embedded Technology and Applications Symposium, 2021
Work in Progress: Mobile or FPGA? A Comprehensive Evaluation on Energy Efficiency and a Unified Optimization Framework.
Proceedings of the 27th IEEE Real-Time and Embedded Technology and Applications Symposium, 2021
DNNFusion: accelerating deep neural networks execution with advanced operator fusion.
Proceedings of the PLDI '21: 42nd ACM SIGPLAN International Conference on Programming Language Design and Implementation, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the 7th IEEE/ACM Workshop on the LLVM Compiler Infrastructure in HPC, 2021
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021
ClickTrain: efficient and accurate end-to-end deep learning training via fine-grained architecture-preserving pruning.
Proceedings of the ICS '21: 2021 International Conference on Supercomputing, 2021
Achieving on-Mobile Real-Time Super-Resolution with Neural Architecture and Pruning Search.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Proceedings of the 28th IEEE International Conference on High Performance Computing, 2021
Proceedings of the 58th ACM/IEEE Design Automation Conference, 2021
NPAS: A Compiler-Aware Framework of Unified Network Pruning and Architecture Search for Beyond Real-Time Mobile Acceleration.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Proceedings of the ASPDAC '21: 26th Asia and South Pacific Design Automation Conference, 2021
RT3D: Achieving Real-Time Execution of 3D Convolutional Neural Networks on Mobile Devices.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
A Compression-Compilation Co-Design Framework Towards Real-Time Object Detection on Mobile Devices.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
YOLObile: Real-Time Object Detection on Mobile Devices via Compression-Compilation Co-Design.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
2020
6.7ms on Mobile with over 78% ImageNet Accuracy: Unified Network Pruning and Architecture Search for Beyond Real-Time Mobile Acceleration.
CoRR, 2020
An Efficient End-to-End Deep Learning Training Framework via Fine-Grained Pattern-Based Pruning.
CoRR, 2020
Achieving Real-Time Execution of Transformer-based Large-scale Models on Mobile with Compiler-aware Neural Architecture Optimization.
CoRR, 2020
CoRR, 2020
CoCoPIE: Making Mobile AI Sweet As PIE -Compression-Compilation Co-Design Goes a Long Way.
CoRR, 2020
CoRR, 2020
BLK-REW: A Unified Block-based DNN Pruning Framework using Reweighted Regularization Method.
CoRR, 2020
CoRR, 2020
Petascale XCT: 3D image reconstruction with hierarchical communications on multi-GPU nodes.
Proceedings of the International Conference for High Performance Computing, 2020
Proceedings of the Languages and Compilers for Parallel Computing, 2020
Towards Real-Time DNN Inference on Mobile Platforms with Model Pruning and Compiler Optimization.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020
Parallelizing pruned landmark labeling: dealing with dependencies in graph algorithms.
Proceedings of the ICS '20: 2020 International Conference on Supercomputing, 2020
Proceedings of the 37th International Conference on Machine Learning, 2020
Proceedings of the GLSVLSI '20: Great Lakes Symposium on VLSI 2020, 2020
Proceedings of the Computer Vision - ECCV 2020, 2020
Proceedings of the 57th ACM/IEEE Design Automation Conference, 2020
Proceedings of the CGO '20: 18th ACM/IEEE International Symposium on Code Generation and Optimization, 2020
PatDNN: Achieving Real-Time DNN Execution on Mobile Devices with Pattern-based Weight Pruning.
Proceedings of the ASPLOS '20: Architectural Support for Programming Languages and Operating Systems, 2020
PCONV: The Missing but Desirable Sparsity in DNN Weight Pruning for Real-Time Execution on Mobile Devices.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
ACM Trans. Parallel Comput., 2019
Pruned Landmark Labeling Meets Vertex Centric Computation: A Surprisingly Happy Marriage!
CoRR, 2019
26ms Inference Time for ResNet-50: Towards Real-Time Execution of all DNNs on Smartphone.
CoRR, 2019
Proceedings of the International Conference for High Performance Computing, 2019
Transforming Query Sequences for High-Throughput B+ Tree Processing on Many-Core Processors.
Proceedings of the IEEE/ACM International Symposium on Code Generation and Optimization, 2019
2018
Graphphi: efficient parallel graph processing on emerging throughput-oriented architectures.
Proceedings of the 27th International Conference on Parallel Architectures and Compilation Techniques, 2018
2017
Exploiting Vector and Multicore Parallelism for Recursive, Data- and Task-Parallel Programs.
Proceedings of the 22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2017
Real-Time Data Analysis and Autonomous Steering of Synchrotron Light Source Experiments.
Proceedings of the 13th IEEE International Conference on e-Science, 2017
2016
ACM Trans. Archit. Code Optim., 2016
Proceedings of the 21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2016
Proceedings of the 45th International Conference on Parallel Processing, 2016
Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, 2016
2015
Proceedings of the 36th ACM SIGPLAN Conference on Programming Language Design and Implementation, 2015
Proceedings of the Languages and Compilers for Parallel Computing, 2015
Proceedings of the Languages and Compilers for Parallel Computing, 2015
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium, 2015
2014
A Portable Optimization Engine for Accelerating Irregular Data-Traversal Applications on SIMD Architectures.
ACM Trans. Archit. Code Optim., 2014
Proceedings of the 2014 International Conference on Supercomputing, 2014
Proceedings of the 2014 International Conference on Supercomputing, 2014
2013
Proceedings of the 2013 IEEE/ACM International Symposium on Code Generation and Optimization, 2013
2012
Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, 2012
2011
Translating Chapel to Use FREERIDE: A Case Study in Using an HPC Language for Data-Intensive Computing.
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011
Compiling Dynamic Data Structures in Python to Enable the Use of Multi-core and Many-core Libraries.
Proceedings of the 2011 International Conference on Parallel Architectures and Compilation Techniques, 2011