Mauricio J. Serrano

According to our database¹, Mauricio J. Serrano authored at least 42 papers between 1992 and 2022.

Collaborative distances:

Dijkstra number² of three.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2022

Accelerating Inference and Language Model Fusion of Recurrent Neural Network Transducers via End-to-End 4-bit Quantization.

[BibT_eX]

[DOI]

Andrea Fasoli

Chia-Yu Chen

Mauricio J. Serrano

Swagath Venkataramani

George Saon

Xiaodong Cui

Brian Kingsbury

Kailash Gopalakrishnan

Proceedings of the Interspeech 2022, 2022

2021

RaPiD: AI Accelerator for Ultra-low Precision Training and Inference.

[BibT_eX]

[DOI]

Proceedings of the 48th ACM/IEEE Annual International Symposium on Computer Architecture, 2021

4-Bit Quantization of LSTM-Based Speech Recognition Models.

[BibT_eX]

[DOI]

Swagath Venkataramani

Kailash Gopalakrishnan

Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

2020

Efficient AI System Design With Cross-Layer Approximate Computing.

[BibT_eX]

[DOI]

Proc. IEEE, 2020

2019

DeepTools: Compiler and Execution Runtime Extensions for RaPiD AI Accelerator.

[BibT_eX]

[DOI]

Swagath Venkataramani

Jungwook Choi

Vijayalakshmi Srinivasan

Kailash Gopalakrishnan

IEEE Micro, 2019

BlueConnect: Decomposing all-reduce for deep learning on heterogeneous network hierarchy.

[BibT_eX]

[DOI]

IBM J. Res. Dev., 2019

Efficient implementation of sparse matrix-sparse vector multiplication for large scale graph analytics.

[BibT_eX]

[DOI]

Mauricio J. Serrano

Proceedings of the 2019 IEEE High Performance Extreme Computing Conference, 2019

2018

Graph Programming Interface (GPI): A Linear Algebra Programming Model for Large Scale Graph Computations.

[BibT_eX]

[DOI]

Int. J. Parallel Program., 2018

2017

Enabling massive deep neural networks with the GraphBLAS.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE High Performance Extreme Computing Conference, 2017

2016

Efficient implementation of scatter-gather operations for large scale graph analytics.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE High Performance Extreme Computing Conference, 2016

Graph programming interface (GPI): a linear algebra programming model for large scale graph computations.

[BibT_eX]

[DOI]

Proceedings of the ACM International Conference on Computing Frontiers, CF'16, 2016

2015

Active Memory Cube: A processing-in-memory architecture for exascale systems.

[BibT_eX]

[DOI]

IBM J. Res. Dev., 2015

2014

Simple, portable and fast SIMD intrinsic programming: generic simd library.

[BibT_eX]

[DOI]

Proceedings of the 2014 Workshop on Programming models for SIMD/Vector processing, 2014

2013

Trace construction using enhanced performance monitoring.

[BibT_eX]

[DOI]

Mauricio J. Serrano

Proceedings of the Computing Frontiers Conference, 2013

2011

Improving the performance of trace-based systems by false loop filtering.

[BibT_eX]

[DOI]

Hiroshige Hayashizaki

Proceedings of the 16th International Conference on Architectural Support for Programming Languages and Operating Systems, 2011

2009

Placement optimization using data context collected during garbage collection.

[BibT_eX]

[DOI]

Mauricio J. Serrano

Xiaotong Zhuang

Proceedings of the 8th International Symposium on Memory Management, 2009

Building Approximate Calling Context from Partial Call Traces.

[BibT_eX]

[DOI]

Mauricio J. Serrano

Xiaotong Zhuang

Proceedings of the CGO 2009, 2009

2008

Perfdiff: a framework for performance difference analysis in a virtual machine environment.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Symposium on Code Generation and Optimization (CGO 2008), 2008

2007

Call-chain Software Instruction Prefetching in J2EE Server Applications.

[BibT_eX]

[DOI]

Proceedings of the 16th International Conference on Parallel Architectures and Compilation Techniques (PACT 2007), 2007

2006

Accurate, efficient, and adaptive calling context profiling.

[BibT_eX]

[DOI]

Proceedings of the ACM SIGPLAN 2006 Conference on Programming Language Design and Implementation, 2006

2004

Prefetch inection based on hardware monitoring and object metadata.

[BibT_eX]

[DOI]

Ali-Reza Adl-Tabatabai

Richard L. Hudson

Mauricio J. Serrano

Sreenivas Subramoney

Proceedings of the ACM SIGPLAN 2004 Conference on Programming Language Design and Implementation 2004, 2004

Whole-Stack Analysis and Optimization of Commercial Workloads on Server Systems.

[BibT_eX]

[DOI]

Proceedings of the Network and Parallel Computing, IFIP International Conference, 2004

2003

Stack allocation and synchronization optimizations for Java using escape analysis.

[BibT_eX]

[DOI]

ACM Trans. Program. Lang. Syst., 2003

2002

Efficiently Adapting Java Binaries in Limited Memory Contexts.

[BibT_eX]

[DOI]

Int. J. Parallel Program., 2002

Value-Profile Guided Stride Prefetching for Irregular Code.

[BibT_eX]

[DOI]

Proceedings of the Compiler Construction, 11th International Conference, 2002

2001

Characterizing the memory behavior of Java workloads: a structured view and opportunities for optimizations.

[BibT_eX]

[DOI]

Proceedings of the Joint International Conference on Measurements and Modeling of Computer Systems, 2001

[BibT_eX]

[DOI]

Vivek Sarkar

Mauricio J. Serrano

Barbara B. Simons

Proceedings of the 15th international conference on Supercomputing, 2001

A framework for efficient reuse of binary code in Java.

[BibT_eX]

[DOI]

Proceedings of the 15th international conference on Supercomputing, 2001

2000

The Jalapeño virtual machine.

[BibT_eX]

[DOI]

IBM Syst. J., 2000

Quicksilver: a quasi-static compiler for Java.

[BibT_eX]

[DOI]

Proceedings of the 2000 ACM SIGPLAN Conference on Object-Oriented Programming Systems, 2000

1999

Escape Analysis for Java.

[BibT_eX]

[DOI]

Proceedings of the 1999 ACM SIGPLAN Conference on Object-Oriented Programming Systems, 1999

Dependence Analysis for Java.

[BibT_eX]

[DOI]

Proceedings of the Languages and Compilers for Parallel Computing, 1999

The Jalapeño Dynamic Optimizing Compiler for Java.

[BibT_eX]

[DOI]

Proceedings of the ACM 1999 Conference on Java Grande, JAVA '99, San Francisco, CA, USA, 1999

1998

Thin locks: featherweight Synchronization for Java (with retrospective)

[BibT_eX]

[DOI]

Proceedings of the 20 Years of the ACM SIGPLAN Conference on Programming Language Design and Implementation 1979-1999, 1998

Thin Locks: Featherweight Synchronization for Java.

[BibT_eX]

[DOI]

Proceedings of the ACM SIGPLAN '98 Conference on Programming Language Design and Implementation (PLDI), 1998

1996

Performance Estimation in a Simultaneous Multithreading Processor.

[BibT_eX]

[DOI]

Mauricio J. Serrano

Proceedings of the MASCOTS '96, 1996

1995

Optimized code restructuring of OS/2 executables.

[BibT_eX]

[DOI]

Proceedings of the 1995 Conference of the Centre for Advanced Studies on Collaborative Research, 1995

1994

The Impact of Unresolved Branches on Branch Prediction Scheme Performance.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual International Symposium on Computer Architecture. Chicago, 1994

Performance Estimation of Multistreamed, Supersealar Processors.

[BibT_eX]

Proceedings of the 27th Annual Hawaii International Conference on System Sciences (HICSS-27), 1994

A Model for Performance Estimation in a Multistreamed Superscalar Processor.

[BibT_eX]

[DOI]

Proceedings of the Computer Performance Evaluation, 1994

1993

Optimal Architectures and Algorithms for Mesh-Connected Parallel Computers with Separable Row/Column Buses.

[BibT_eX]

[DOI]

Mauricio J. Serrano

Behrooz Parhami

IEEE Trans. Parallel Distributed Syst., 1993

1992

Optimal Aspect Ratio and Number of Separable Row/Column Buses for Mesh-Connected Parallel Computers.

[BibT_eX]

[DOI]

Mauricio J. Serrano

Behrooz Parhami

Proceedings of the 6th International Parallel Processing Symposium, 1992

Mauricio J. Serrano

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...