Emilio L. Zapata

According to our database, Emilio L. Zapata authored at least 224 papers between 1989 and 2019.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2019
Improving hardware transactional memory parallelization of computational geometry algorithms using privatizing transactions.
J. Parallel Distributed Comput., 2019

2018
Privatizing transactions for Lee's algorithm in commercial hardware transactional memory.
J. Supercomput., 2018

2017
Lazy Irrevocability for Best-Effort Transactional Memory Systems.
IEEE Trans. Parallel Distributed Syst., 2017

Leveraging irrevocability to deal with signature saturation in hardware transactional memory.
J. Supercomput., 2017

Enhancing scalability in best-effort hardware transactional memory systems.
J. Parallel Distributed Comput., 2017

2016
Insights into the Fallback Path of Best-Effort Hardware Transactional Memory Systems.
Proceedings of the Euro-Par 2016: Parallel Processing, 2016

2015
Efficient Data Structure and Highly Scalable Algorithm for Total-Viewshed Computation.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2015

Conflict Detection in Hardware Transactional Memory.
Proceedings of the Transactional Memory. Foundations, Algorithms, Tools, and Applications, 2015

2014
Effective Transactional Memory Execution Management for Improved Concurrency.
ACM Trans. Archit. Code Optim., 2014

A case study of different task implementations for multioutput stages in non-trivial parallel pipeline applications.
Parallel Comput., 2014

Improving Signature Behavior by Irrevocability in Transactional Memory Systems.
Proceedings of the 26th IEEE International Symposium on Computer Architecture and High Performance Computing, 2014

2013
Hardware Signature Designs to Deal with Asymmetry in Transactional Data Sets.
IEEE Trans. Parallel Distributed Syst., 2013

Optimal tilt and orientation maps: a multi-algorithm approach for heterogeneous multicore-GPU systems.
J. Supercomput., 2013

LS-Sig: Locality-Sensitive Signatures for Transactional Memory.
IEEE Trans. Computers, 2013

Multioperand Redundant Adders on FPGAs.
IEEE Trans. Computers, 2013

Simultaneous computation of total viewshed on large high resolution grids.
Int. J. Geogr. Inf. Sci., 2013

Efficient floating-point representation for balanced codes for FPGA devices.
Proceedings of the 2013 IEEE 31st International Conference on Computer Design, 2013

2012
Redundant Floating-Point Decimal CORDIC Algorithm.
IEEE Trans. Computers, 2012

A Fast GIS-tool to Compute the Maximum Solar Energy on Very Large Terrains.
Proceedings of the International Conference on Computational Science, 2012

A data dependence test based on the projection of paths over shape graphs.
J. Parallel Distributed Comput., 2012

VLBI-resolution radio-map algorithms: Performance analysis of different levels of data-sharing on multi-socket, multi-core architectures.
Comput. Phys. Commun., 2012

On-line Decimal Adder with RBCD Representation.
Proceedings of the 23rd IEEE International Conference on Application-Specific Systems, 2012

Decimal online multioperand addition.
Proceedings of the Conference Record of the Forty Sixth Asilomar Conference on Signals, 2012

2011
High-Speed Algorithms and Architectures for Range Reduction Computation.
IEEE Trans. Very Large Scale Integr. Syst., 2011

Load Balancing versus Occupancy Maximization on Graphics Processing Units: The Generalized Hough Transform as a Case Study.
Int. J. High Perform. Comput. Appl., 2011

High-performance three-horizon composition algorithm for large-scale terrains.
Int. J. Geogr. Inf. Sci., 2011

A case study of the task-based parallel wavefront pattern.
Proceedings of the Applications, Tools and Techniques on the Road to Exascale Computing, Proceedings of the conference ParCo 2011, 31 August, 2011

Multiset signatures for transactional memory.
Proceedings of the 25th International Conference on Supercomputing, 2011, Tucson, AZ, USA, May 31, 2011

Spectral evolution simulation on leading multi-socket, multicore platforms.
Proceedings of the 18th International Conference on High Performance Computing, 2011

High-level template for the task-based parallel wavefront pattern.
Proceedings of the 18th International Conference on High Performance Computing, 2011

Unified Locality-Sensitive Signatures for Transactional Memory.
Proceedings of the Euro-Par 2011 Parallel Processing - 17th International Conference, 2011

2010
Enhanced Scaling-Free CORDIC.
IEEE Trans. Circuits Syst. I Regul. Pap., 2010

Learning a generic 3D face model from 2D image databases using incremental Structure-from-Motion.
Image Vis. Comput., 2010

Quantum computer simulation using the CUDA programming model.
Comput. Phys. Commun., 2010

Interval Filter: A Locality-Aware Alternative to Bloom Filters for Hardware Membership Queries by Interval Classification.
Proceedings of the Intelligent Data Engineering and Automated Learning, 2010

Evaluation of the Task Programming Model in the Parallelization of Wavefront Problems.
Proceedings of the 12th IEEE International Conference on High Performance Computing and Communications, 2010

2009
Analytical Model of Patching and Load Sharing in a Distributed VoD System.
Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 2009

Conflict Analysis for heap-based Data Dependence Detection.
Proceedings of the Parallel Computing: From Multicores and GPU's to Petascale, 2009

On the Automatic Detection of Heap-Induced Data Dependencies with Interprocedural Shape Analysis.
Proceedings of the ICPPW 2009, 2009

Efficient mapping on FPGA of convolution computation based on combined CSA-CPA accumulator.
Proceedings of the 16th IEEE International Conference on Electronics, 2009

Efficient image alignment using linear appearance models.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Efficient Implementation of Carry-Save Adders in FPGAs.
Proceedings of the 20th IEEE International Conference on Application-Specific Systems, 2009

Improving Signatures by Locality Exploitation for Transactional Memory.
Proceedings of the PACT 2009, 2009

2008
Pipelined Architecture for Additive Range Reduction.
J. Signal Process. Syst., 2008

Teaching the Cache Memory System Using a Reconfigurable Approach.
IEEE Trans. Educ., 2008

A Low-Latency Pipelined 2D and 3D CORDIC Processors.
IEEE Trans. Computers, 2008

An analytical model of locality-based parallel irregular reductions.
Parallel Comput., 2008

Fast clear-sky solar irradiation computation for very large digital elevation models.
Comput. Phys. Commun., 2008

Memory Locality Exploitation Strategies for FFT on the CUDA Architecture.
Proceedings of the High Performance Computing for Computational Science, 2008

Parallelizing irregular C codes assisted by interprocedural shape analysis.
Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

New SIMD instructions set for image processing applications enhancement.
Proceedings of the International Conference on Image Processing, 2008

Using Padding to Optimize Locality in Scientific Applications.
Proceedings of the Computational Science, 2008

Parallel Quantum Computer Simulation on the CUDA Architecture.
Proceedings of the Computational Science, 2008

Complete Def-Use Analysis in Recursive Programs with Dynamic Data Structures.
Proceedings of the Euro-Par 2008 Workshops, 2008

SIMD Enhancements for a Hough Transform Implementation.
Proceedings of the 11th Euromicro Conference on Digital System Design: Architectures, 2008

2007
Logotype detection to support semantic-based video annotation.
Signal Process. Image Commun., 2007

Detecting loop-carried dependences in programs with dynamic data structures.
J. Parallel Distributed Comput., 2007

Maximum and Sorted Cache Occupation Using Array Padding.
Proceedings of the 2007 International Conference on Embedded Computer Systems: Architectures, 2007

Simulating a Reconfigurable Cache System for Teaching Purposes.
Proceedings of the IEEE International Conference on Microelectronic Systems Education, 2007

Bilinear Active Appearance Models.
Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

Fast Insolation Computation in Large Territories.
Proceedings of the Computational Science, 2007

A Clustering Technique for Video Copy Detection.
Proceedings of the Pattern Recognition and Image Analysis, Third Iberian Conference, 2007

2006
A Case Study of Load Sharing Based on Popularity in Distributed VoD Systems.
IEEE Trans. Multim., 2006

SAD computation based on online arithmetic for motion estimation.
Microprocess. Microsystems, 2006

Video Cataloging Based on Robust Logotype Detection.
Proceedings of the International Conference on Image Processing, 2006

Fast Full-Search Block Matching Algorithm Motion Estimation Alternatives in FPGA.
Proceedings of the 2006 International Conference on Field Programmable Logic and Applications (FPL), 2006

Towards a Versatile Pointer Analysis Framework.
Proceedings of the Euro-Par 2006, Parallel Processing, 12th International Euro-Par Conference, Dresden, Germany, August 28, 2006

Pipelined Range Reduction for Floating Point Numbers.
Proceedings of the 2006 IEEE International Conference on Application-Specific Systems, 2006

Tracking of Linear Appearance Models Using Second Order Minimization.
Proceedings of the Advanced Concepts for Intelligent Vision Systems, 2006

2005
On the parallelization of irregular and dynamic programs.
Parallel Comput., 2005

Parallel techniques in irregular codes: cloth simulation as case of study.
J. Parallel Distributed Comput., 2005

Distributed Architecture System for Computer Performance Testing.
Proceedings of the Parallel Processing and Applied Mathematics, 2005

A New Strategy for Shape Analysis Based on Coexistent Link Sets.
Proceedings of the Parallel Computing: Current & Future Issues of High-End Computing, 2005

Reducing Cache Misses by Loop Reordering.
Proceedings of the Parallel Computing: Current & Future Issues of High-End Computing, 2005

A Novel Approach for Detecting Heap-Based Loop-Carried Dependences.
Proceedings of the 34th International Conference on Parallel Processing (ICPP 2005), 2005

On-line Multioperand Addition Based on On-line Full Adders.
Proceedings of the 16th IEEE International Conference on Application-Specific Systems, 2005

2004
CORDIC Processor for Variable-Precision Interval Arithmetic.
J. VLSI Signal Process., 2004

A Framework to Capture Dynamic Data Structures in Pointer-Based Codes.
IEEE Trans. Parallel Distributed Syst., 2004

A compiler tool to predict memory hierarchy performance of scientific codes.
Parallel Comput., 2004

Parallelization issues of a code for physically-based simulation of fabrics.
Comput. Phys. Commun., 2004

Data partitioning-based parallel irregular reductions.
Concurr. Comput. Pract. Exp., 2004

Nesting OpenMP and MPI in the Conjugate Gradient Method for Band Systems.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2004

Optimization Techniques for Irregular and Pointer-Based Programs.
Proceedings of the 12th Euromicro Workshop on Parallel, 2004

Two Hybrid Multicast Algorithms for Optimizing Resources in a Distributed VoD System.
Proceedings of the 10th International Multimedia Modeling Conference (MMM 2004), 2004

A New Dependence Test Based on Shape Analysis for Pointer-Based Codes.
Proceedings of the Languages and Compilers for High Performance Computing, 2004

Evaluation of Elementary Functions Using Multimedia Features.
Proceedings of the 18th International Parallel and Distributed Processing Symposium (IPDPS 2004), 2004

Topic 11: Numerical Algorithms.
Proceedings of the Euro-Par 2004 Parallel Processing, 2004

2003
Compiler Techniques for the Distribution of Data and Computation.
IEEE Trans. Parallel Distributed Syst., 2003

Probabilistic Miss Equations: Evaluating Memory Hierarchy Performance.
IEEE Trans. Computers, 2003

An efficient 2D deformable objects detection and location algorithm.
Pattern Recognit., 2003

Optimization techniques for parallel irregular reductions.
J. Syst. Archit., 2003

2002
A Configurable Architecture for the Wavelet Packet Transform.
J. VLSI Signal Process., 2002

An Advanced Compiler Framework for Non-Cache-Coherent Multiprocessors.
IEEE Trans. Parallel Distributed Syst., 2002

Planar object detection under scaled orthographic projection.
Pattern Recognit. Lett., 2002

Architecture for wavelet packet transform based on lifting steps.
Parallel Comput., 2002

New Shape Analysis and Interprocedural Techniques for Automatic Parallelization of C Codes.
Int. J. Parallel Program., 2002

On Improving the Performance of Data Partitioning Oriented Parallel Irregular Reductions.
Proceedings of the 10th Euromicro Workshop on Parallel, 2002

Towards Compiler Optimization of Codes Based on Arrays of Pointers.
Proceedings of the Languages and Compilers for Parallel Computing, 15th Workshop, 2002

Load sharing based on popularity in distributed video on demand systems.
Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, 2002

Polynomial Evaluation on Multimedia Processors.
Proceedings of the 13th IEEE International Conference on Application-Specific Systems, 2002

2001
Data-task parallelism for the VMEC program.
Parallel Comput., 2001

Detection of arbitrary planar shapes with 3D pose.
Image Vis. Comput., 2001

A Data-Parallel Formulation for Divide and Conquer Algorithms.
Comput. J., 2001

Using Semantical Information to Enhance the Parallel Sparse Performance.
Comput. Artif. Intell., 2001

Parallelization of an algorithm for the automatic detection of deformable objects.
Acta Cybern., 2001

Improving parallel irregular reductions using partial array expansion.
Proceedings of the 2001 ACM/IEEE conference on Supercomputing, 2001

Balanced, Locality-Based Parallel Irregular Reductions.
Proceedings of the Languages and Compilers for Parallel Computing, 2001

Data Locality Exploitation in Algorithms including Sparse Communications.
Proceedings of the 15th International Parallel & Distributed Processing Symposium (IPDPS-01), 2001

Progressive Shape Analysis for Real C Codes.
Proceedings of the 2001 International Conference on Parallel Processing, 2001

2000
Compile and Run-Time Support for the Parallelization of Sparse Matrix Updating Algorithms.
J. Supercomput., 2000

Automatic parallelization of irregular applications.
Parallel Comput., 2000

Parallel Algorithm for Fast Cloth Simulation.
Proceedings of the Vector and Parallel Processing, 2000

Parallelization of a Recursive Decoupling Method for Solving Tridiagonal Linear Systems on Distributed Memory Computer.
Proceedings of the Vector and Parallel Processing, 2000

A Data Parallel Formulation of the Barnes-Hut Method for N-Body Simulations.
Proceedings of the Applied Parallel Computing, 2000

Accurate Shape Analysis for Recursive Data Structures.
Proceedings of the Languages and Compilers for Parallel Computing, 2000

A compiler method for the parallel execution of irregular reductions in scalable shared memory multiprocessors.
Proceedings of the 14th international conference on Supercomputing, 2000

MMX-Like Architecture Extension to Support the Rotation Operation.
Proceedings of the 2000 IEEE International Conference on Multimedia and Expo, 2000

Deformable Shapes Detection by Stochastic Optimization.
Proceedings of the 2000 International Conference on Image Processing, 2000

An architecture for wavelet-packet based speech enhancement for hearing aids.
Proceedings of the IEEE International Conference on Acoustics, 2000

Detection of bidimensional shapes under global deformations.
Proceedings of the 10th European Signal Processing Conference, 2000

Fast Cloth Simulation with Parallel Computers.
Proceedings of the Euro-Par 2000, Parallel Processing, 6th International Euro-Par Conference, Munich, Germany, August 29, 2000

Improving the Sparse Parallelization Using Semantical Information at Compile-Time.
Proceedings of the Euro-Par 2000, Parallel Processing, 6th International Euro-Par Conference, Munich, Germany, August 29, 2000

FPGA Implementation of Wavelet Packet Transform with Reconfigurable Tree Structure.
Proceedings of the 26th EUROMICRO 2000 Conference, 2000

Architecture for Wavelet Packet Transform with Best Tree Searching.
Proceedings of the 12th IEEE International Conference on Application-Specific Systems, 2000

1999
An Efficient Architecture for the In-Place Fast Cosine Transform.
J. VLSI Signal Process., 1999

Bidimensional shape detection using an invariant approach.
Pattern Recognit., 1999

Memory Hierarchy Performance Prediction for Blocked Sparse Algorithms.
Parallel Process. Lett., 1999

Data-parallel support for numerical irregular problems.
Parallel Comput., 1999

Enhancing the Parallelization of Sparse Matrices through Dynamic Issues.
Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 1999

Direct mapped cache performance modeling for sparse matrix operations.
Proceedings of the Seventh Euromicro Workshop on Parallel and Distributed Processing. PDP'99, 1999

An Automatic Iteration/Data Distribution Method Based on Access Descriptors for DSMM.
Proceedings of the Languages and Compilers for Parallel Computing, 1999

Sparse Matrix Block-Cyclic Redistribution.
Proceedings of the 13th International Parallel Processing Symposium / 10th Symposium on Parallel and Distributed Processing (IPPS / SPDP '99), 1999

New shape analysis techniques for automatic parallelization of C codes.
Proceedings of the 13th international conference on Supercomputing, 1999

Access Descriptor Based Locality Analysis for Distributed-Shared Memory Multiprocessors.
Proceedings of the International Conference on Parallel Processing 1999, 1999

Planar 3D Object Detection by Using the Generalized Hough Transform.
Proceedings of the 10th International Conference on Image Analysis and Processing (ICIAP 1999), 1999

On Automatic Parallelization of Irregular Reductions on Scalable Shared Memory Systems.
Proceedings of the Euro-Par '99 Parallel Processing, 5th International Euro-Par Conference, Toulouse, France, August 31, 1999

Set Associative Cache Behavior Optimization.
Proceedings of the Euro-Par '99 Parallel Processing, 5th International Euro-Par Conference, Toulouse, France, August 31, 1999

Arithmetic Unit for the Computation of Interval Elementary Functions.
Proceedings of the 25th EUROMICRO '99 Conference, 1999

Interval Sine and Cosine Functions Computation Based on Variable-Precision CORDIC Algorithm.
Proceedings of the 14th IEEE Symposium on Computer Arithmetic (Arith-14 '99), 1999

Parallel Pivots LU Algorithm on the Cray T3E.
Proceedings of the Parallel Computation, 1999

Automatic Analytical Modeling for the Estimation of Cache Misses.
Proceedings of the 1999 International Conference on Parallel Architectures and Compilation Techniques, 1999

1998
Radix-4 Vectoring CORDIC Algorithm and Architectures.
J. VLSI Signal Process., 1998

Parallel Compensation of Scale Factor for the CORDIC Algorithm.
J. VLSI Signal Process., 1998

A novel design of a two operand normalization circuit.
IEEE Trans. Very Large Scale Integr. Syst., 1998

Parallel Implementation of DNAml Program on Message-Passing Architectures.
Parallel Comput., 1998

Computational space reduction and parallelization of a new clustering approach for large groups of sequences.
Bioinform., 1998

Modeling Set Associative Caches Behavior for Irregular Computations.
Proceedings of the 1998 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems, 1998

A Hardware Approach to Interval Arithmetic for Sine and Cosine Functions.
Proceedings of the Developments in Reliable Computing, 1998

Biological sequence analysis on distributed-shared memory multiprocessors.
Proceedings of the Sixth Euromicro Workshop on Parallel and Distributed Processing, 1998

Parallelization Strategies for the VMEC Program.
Proceedings of the Applied Parallel Computing, 1998

HPF-2 Support for Dynamic Sparse Computations.
Proceedings of the Languages and Compilers for Parallel Computing, 1998

Local Enumeration Techniques for Sparse Algorithms.
Proceedings of the 12th International Parallel Processing Symposium / 9th Symposium on Parallel and Distributed Processing (IPPS/SPDP '98), March 30, 1998

A memory system supporting the efficient SIMD computation of the two dimensional DWT.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

Cache Misses Prediction for High Performance Sparse Algorithms.
Proceedings of the Euro-Par '98 Parallel Processing, 1998

Cache Probabilistic Modeling for Basic Sparse Algebra Kernels Involving Matrices with a Non Uniform Distribution.
Proceedings of the 24th EUROMICRO '98 Conference, 1998

Parallelization of Benchmarks for Scalable Shared-Memory Multiprocessors.
Proceedings of the 1998 International Conference on Parallel Architectures and Compilation Techniques, 1998

1997
Mapping of Trellises Associated with General Encoders onto High-Performance VLSI Architectures.
J. VLSI Signal Process., 1997

Vienna-Fortran/HPF Extensions for Sparse and Irregular Problems and Their Compilation.
IEEE Trans. Parallel Distributed Syst., 1997

High-performance VLSI architecture for the Viterbi algorithm.
IEEE Trans. Commun., 1997

High Performance Rotation Architectures Based on the Radix-4 CORDIC Algorithm.
IEEE Trans. Computers, 1997

Error Analysis and Reduction for Angle Calculation Using the CORDIC Algorithm.
IEEE Trans. Computers, 1997

Lower order circle and ellipse Hough transform.
Pattern Recognit., 1997

Unified Framework for the Parallelization of Divide and Conquer Based Tridiagonal Systems.
Parallel Comput., 1997

Fast Hough Transform on Multiprocessors: A Branch and Bound Approach.
J. Parallel Distributed Comput., 1997

Mapping Tridiagonal System Algorithms onto Mesh Connected Computers.
Int. J. High Speed Comput., 1997

Parallel Computing of Semiconductor Laser Equations.
Proceedings of the Eighth SIAM Conference on Parallel Processing for Scientific Computing, 1997

Modelling Superlinear Speedup on Distributed Memory Multiprocessors.
Proceedings of the Parallel Computing: Fundamentals, 1997

Data Parallel Language Extensions for Exploiting Locality in Irregular Problems.
Proceedings of the Languages and Compilers for Parallel Computing, 1997

A Probabilistic Model for the Best-First Search B&B Algorithms.
Proceedings of the Solving Irregularly Structured Problems in Parallel, 1997

The Sparse Cyclic Distribution against its Dense Counterparts.
Proceedings of the 11th International Parallel Processing Symposium (IPPS '97), 1997

Compiler Techniques for Effective Communication on Distributed-Memory Multiprocessors.
Proceedings of the 1997 International Conference on Parallel Processing (ICPP '97), 1997

An efficient architecture for the in place fast cosine transform.
Proceedings of the 1997 International Conference on Application-Specific Systems, 1997

1996
Cordic based parallel/pipelined architecture for the Hough transform.
J. VLSI Signal Process., 1996

Unified Mixed Radix 2-4 Redundant CORDIC Processor.
IEEE Trans. Computers, 1996

Implementation and experimental evaluation of the constrained ART algorithm on a multicomputer system.
Signal Process., 1996

FFTs on Mesh Connected Computers.
Parallel Comput., 1996

Parallelization Techniques for Sparse Matrix Applications.
J. Parallel Distributed Comput., 1996

Sparse Householder QR Factorization on a Mesh.
Proceedings of the 4th Euromicro Workshop on Parallel and Distributed Processing (PDP '96), 1996

Experimental Evaluation of Efficient Sparse Matrix Distributions.
Proceedings of the 10th international conference on Supercomputing, 1996

Parallelization of irregular algorithms for shape detection.
Proceedings of the 1996 International Conference on Image Processing, 1996

High performance VLSI architecture for the trellis coded quantization.
Proceedings of the 1996 International Conference on Image Processing, 1996

Efficient Parallel Solution of a Semiconductor Laser Array Dynamics Model.
Proceedings of the High-Performance Computing and Networking, 1996

Parallel Sparse Modified Gram-Schmidt QR Decomposition.
Proceedings of the High-Performance Computing and Networking, 1996

A Parallel Pipelined Hough Transform.
Proceedings of the Euro-Par '96 Parallel Processing, 1996

High Radix Cordic Rotation Based on Selection by Rounding.
Proceedings of the Euro-Par '96 Parallel Processing, 1996

Radix-4 Vectoring Cordic Algorithm And Architectures.
Proceedings of the 1996 International Conference on Application-Specific Systems, 1996

High-Speed Viterbi Decoder: An Efficient Scheduling Method to Exploit the Pipelining.
Proceedings of the 1996 International Conference on Application-Specific Systems, 1996

1995
Constant geometry split-radix algorithms.
J. VLSI Signal Process., 1995

A fast Hough transform for segment detection.
IEEE Trans. Image Process., 1995

Data Distributions for Sparse Matrix Vector Multiplication.
Parallel Comput., 1995

A Parallel Architecture for the Self-Sorting FFT Algorithm.
J. Parallel Distributed Comput., 1995

An image-processing approach to dotplots: an X-Window-based program for interactive analysis of dotplots derived from sequence and structural data.
Comput. Appl. Biosci., 1995

Exploiting locality on parallel irregular problem computations.
Proceedings of the 3rd Euromicro Workshop on Parallel and Distributed Processing (PDP '95), 1995

Run-Time Techniques for Parallelizing Sparse Matrix Problems.
Proceedings of the Parallel Algorithms for Irregularly Structured Problems, 1995

New data-parallel language features for sparse matrix computations.
Proceedings of IPPS '95, 1995

Efficient Resolution of Sparse Indirections in Data-Parallel Compilers.
Proceedings of the 9th international conference on Supercomputing, 1995

Sparse LU factorization on the CRAY T3D.
Proceedings of the High-Performance Computing and Networking, 1995

CORDIC Architectures with Parallel Compensation of the Scale Factor.
Proceedings of the International Conference on Application Specific Array Processors (ASAP'95), 1995

Digit On-line Large Radix CORDIC Rotator.
Proceedings of the International Conference on Application Specific Array Processors (ASAP'95), 1995

Redundant CORDIC Rotator Based on Parallel Prediction.
Proceedings of the 12th Symposium on Computer Arithmetic (ARITH-12 '95), 1995

1994
Parallel Architecture for Fast Transforms with Trigonometric Kernel.
IEEE Trans. Parallel Distributed Syst., 1994

Unified Architecture for Divide and Conquer Based Tridiagonal System Solvers.
IEEE Trans. Computers, 1994

Finite Element Simulation of Semiconductor Devices on Multiprocessor Computers.
Parallel Comput., 1994

On an efficient parallelization of exhaustive sequence comparison algorithms on message passing architectures.
Comput. Appl. Biosci., 1994

3D Reconstruction of Macromolecules on Multiprocessors.
Proceedings of the Second Euromicro Workshop on Parallel and Distributed Processing, 1994

Mapping Strategies for Sequential Sequence Comparison Algorithms on LAN-Based Message Passing Architectures.
Proceedings of the High-Performance Computing and Networking, 1994

1993
Application-specific architecture for fast transforms based on the successive doubling method.
IEEE Trans. Signal Process., 1993

An Efficient Processor Allocation for Nested Parallel Loops on Distributed Memory Hypercubes.
Parallel Process. Lett., 1993

Design of a Pipelined Radix 4 CORDIC Processor.
Parallel Comput., 1993

Parallel WZ factorization on mesh multiprocessors.
Microprocess. Microprogramming, 1993

Parallel algorithm for principal components based on Hotelling's iterative procedure.
Proceedings of the 1993 Euromicro Workshop on Parallel and Distributed Processing, 1993

1992
A VLSI Constant Geometry Architecture for the Fast Hartley and Fourier Transforms.
IEEE Trans. Parallel Distributed Syst., 1992

Image reconstruction on hypercube computers: Application to electron microscopy.
Signal Process., 1992

Design of parallel algorithms for a distributed memory hypercube.
Microprocess. Microsystems, 1992

1991
Modified Gram-Schmidt QR Factorization on Hypercube SIMD Computers.
J. Parallel Distributed Comput., 1991

Design of a constant geometry fast Hartley transformer.
Proceedings of the 1991 International Conference on Acoustics, 1991

1990
Image template matching on hypercube SIMD computers.
Signal Process., 1990

Cluster validity based on the hard tendency of the fuzzy classification.
Pattern Recognit. Lett., 1990

Gaussian elimination with pivoting on hypercubes.
Parallel Comput., 1990

Parallel quadrant interlocking factorization on hypercube computers.
Parallel Comput., 1990

A VLSI Systolic Architecture for Solving DBT-Transformed Fuzzy Clustering Problems of Arbitrary Size.
Parallel Comput., 1990

Parallel Squared Error Clustering on Hypercube Arrays.
J. Parallel Distributed Comput., 1990

ACLE: A Software Package for SIMD Computer Simulation.
Comput. J., 1990

1989
Parallel fuzzy clustering on fixed size hypercube SIMD computers.
Parallel Comput., 1989

