Weijia Shang

CoRR, 2023

Bandwidth Efficient Livestreaming in Mobile Wireless Networks: A Peer-to-Peer ACIDE Solution.

[BibT_eX]

[DOI]

Andrei Negulescu

CoRR, 2023

2019

Supernode transformation on GPGPUs.

[BibT_eX]

[DOI]

Yong Chen

Int. J. Parallel Emergent Distributed Syst., 2019

2014

A Case Study of Implementing Supernode Transformations.

[BibT_eX]

[DOI]

Johann Steinbrecher

Cesar J. Philippidis

Int. J. Parallel Program., 2014

On optimal media/video distribution in closed P2P-based IPTV networks.

[BibT_eX]

[DOI]

Hao Cui

Xiao Su

Comput. Networks, 2014

Parallelized feature extraction and acoustic model training.

[BibT_eX]

[DOI]

Haofeng Kou

Proceedings of the 19th International Conference on Digital Signal Processing, 2014

2013

ReShape: Towards a High-Level Approach to Design and Operation of Modular Reconfigurable Systems.

[BibT_eX]

[DOI]

Christopher E. Neely

Gordon J. Brebner

ACM Trans. Reconfigurable Technol. Syst., 2013

Optimized MFCC feature extraction on GPU.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

2012

On Optimizing the Longest Common Subsequence Problem by Loop Unrolling Along Wavefronts.

[BibT_eX]

[DOI]

Johann Steinbrecher

Proceedings of the 20th Euromicro International Conference on Parallel, 2012

2010

Context Adaptive Lagrange Multiplier (CALM) for Rate-Distortion Optimal Motion Estimation in Video Coding.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2010

On minimizing register usage of linearly scheduled algorithms with uniform dependencies.

[BibT_eX]

[DOI]

Cesar J. Philippidis

Comput. Lang. Syst. Struct., 2010

Flexible and Modular Support for Timing Functions in High Performance Networking Acceleration.

[BibT_eX]

[DOI]

Christopher E. Neely

Gordon J. Brebner

Proceedings of the International Conference on Field Programmable Logic and Applications, 2010

ShapeUp: A High-Level Design Approach to Simplify Module Interconnection on FPGAs.

[BibT_eX]

[DOI]

Christopher E. Neely

Gordon J. Brebner

Proceedings of the 18th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2010

2009

Compiler Optimization Pass Visualization: The Procedural Abstraction Case.

[BibT_eX]

[DOI]

Ruth Davis

ACM Trans. Comput. Educ., 2009

Optimizing the stack size of recursive functions.

[BibT_eX]

[DOI]

Comput. Lang. Syst. Struct., 2009

Optimal dissemination of layered videos in P2P-Based IPTV networks.

[BibT_eX]

[DOI]

Hao Cui

Xiao Su

Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Procedural Abstraction with Reverse Prefix Trees.

[BibT_eX]

[DOI]

Proceedings of the CGO 2009, 2009

2008

Visualization of Procedural Abstraction.

[BibT_eX]

[DOI]

Ruth Davis

Proceedings of the Fifth Program Visualization Workshop, 2008

2007

Coefficient Conversion for Transform Domain VC-1 TO H.264 Transcoding.

[BibT_eX]

[DOI]

Maria Pantoja

Nam Ling

Proceedings of the IEEE Workshop on Signal Processing Systems, 2007

Chroma Coding Efficiency Improvement with Context Adaptive Lagrange Multiplier (CALM).

[BibT_eX]

[DOI]

Proceedings of the International Symposium on Circuits and Systems (ISCAS 2007), 2007

Stack size reduction of recursive programs.

[BibT_eX]

[DOI]

Proceedings of the 2007 International Conference on Compilers, 2007

2006

Bit rate distribution for motion estimation in H.264 coding.

[BibT_eX]

[DOI]

IEEE Trans. Consumer Electron., 2006

2004

Performance Trade-offs of DCT with Variable Length Carry Chains in FPGAs.

[BibT_eX]

Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 2004

2003

On Data Locality in Supernode Transformation.

[BibT_eX]

Srinivasan Subha

Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 2003

2002

On Time Optimal Supernode Shape.

[BibT_eX]

[DOI]

IEEE Trans. Parallel Distributed Syst., 2002

Bit-level two's complement matrix multiplication.

[BibT_eX]

[DOI]

Radhika S. Grover

Qiang Li

Integr., 2002

A faster distributed arithmetic architecture for FPGAs.

[BibT_eX]

[DOI]

Radhika S. Grover

Qiang Li

Proceedings of the ACM/SIGDA International Symposium on Field Programmable Gate Arrays, 2002

2000

A Comparison of FPGA Implementations of Bit-Level and Word-Level Matrix Multipliers.

[BibT_eX]

[DOI]

Radhika S. Grover

Qiang Li

Proceedings of the Field-Programmable Logic and Applications, 2000

1998

On Supernode Transformation with Minimized Total Running Time.

[BibT_eX]

[DOI]

IEEE Trans. Parallel Distributed Syst., 1998

1996

On Uniformization of Affine Dependence Algorithms.

[BibT_eX]

[DOI]

Zhigang Chen

IEEE Trans. Computers, 1996

On Optimal Size and Shape of Supernode Transformations.

[BibT_eX]

[DOI]

Proceedings of the 1996 International Conference on Parallel Processing, 1996

1994

On Loop Transformations for Generalized Cycle Shrinking.

[BibT_eX]

[DOI]

Matthew T. O'Keefe

IEEE Trans. Parallel Distributed Syst., 1994

Algorithm-Specific Parallel Processing with Linear Processing Arrays.

[BibT_eX]

[DOI]

Adv. Comput., 1994

Queueing performance analysis of co-scheduling in a pool of processors environment.

[BibT_eX]

[DOI]

Margaret A. Schaar

Kemal Efe

Proceedings of the 8th international conference on Supercomputing, 1994

Data alignment of loop nests without nonlocal communications.

[BibT_eX]

[DOI]

Zhongliang Shu

Proceedings of the International Conference on Application Specific Array Processors, 1994

1993

Mapping of Uniform Dependence Algorithm onto Fixed Size Processor Arrays.

[BibT_eX]

[DOI]

Zhigang Chen

Proceedings of the Seventh International Parallel Processing Symposium, 1993

Dependence Analysis and Architecture Design for Bit-Level Algorithms.

[BibT_eX]

[DOI]

Benjamin W. Wah

Proceedings of the 1993 International Conference on Parallel Processing, 1993

An algorithm for accurate data dependence test.

[BibT_eX]

[DOI]

Zhaoyun Xing

Proceedings of the International Conference on Application-Specific Array Processors, 1993

1992

On Time Mapping of Uniform Dependence Algorithms into Lower Dimensional Processor Arrays.

[BibT_eX]

[DOI]

IEEE Trans. Parallel Distributed Syst., 1992

Independent Partitioning of Algorithms with Uniform Dependencies.

[BibT_eX]

[DOI]

IEEE Trans. Computers, 1992

On Uniformization of Affine Dependence Algorithms.

[BibT_eX]

[DOI]

Zhigang Chen

Proceedings of the Fourth IEEE Symposium on Parallel and Distributed Processing, 1992

Conflict-Free Scheduling of Nested Loop Algorithms on Lower Dimensional Processor Arrays.

[BibT_eX]

[DOI]

Zhenhui Yang

Proceedings of the 6th International Parallel Processing Symposium, 1992

1991

Time Optimal Linear Schedules for Algorithms with Uniform Dependencies.

[BibT_eX]

[DOI]

IEEE Trans. Computers, 1991

Generalized cycle shrinking.

[BibT_eX]

Matthew T. O'Keefe

Proceedings of the Algorithms and Parallel VLSI Architectures II, 1991

1990

Time-Optimal and Conflict-Free Mappings of Uniform Dependence Algorithms into Lower Dimensional Processor Arrays.

[BibT_eX]

Proceedings of the 1990 International Conference on Parallel Processing, 1990

1989

On the optimality of linear schedules.

[BibT_eX]

[DOI]

J. VLSI Signal Process., 1989

1988

Systematic Designs of Buffers in Macropipelines of Systolic Arrays.

[BibT_eX]

[DOI]

Benjamin W. Wah

Mokhtar Aboelaze

J. Parallel Distributed Comput., 1988

Independent Partitioning of Algorithms With Uniform Data Dependencies.

[BibT_eX]