Yong Dou

Orcid: 0000-0002-1256-8934

According to our database1, Yong Dou authored at least 265 papers between 2003 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Self-Supervised Learning-For Underwater Acoustic Signal Classification With Mixup.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2024

2023
Could ChatGPT Imagine: Content Control for Artistic Painting Generation Via Large Language Models.
J. Intell. Robotic Syst., October, 2023

ArtVerse: A Paradigm for Parallel Human-Machine Collaborative Painting Creation in Metaverses.
IEEE Trans. Syst. Man Cybern. Syst., April, 2023

Isolate Sets Based Parallel Louvain Method for Community Detection.
J. Comput. Sci. Technol., April, 2023

Can ChatGPT Boost Artistic Creation: The Need of Imaginative Intelligence for Parallel Art.
IEEE CAA J. Autom. Sinica, April, 2023

Recent Trends in Deep Learning Based Textual Emotion Cause Extraction.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Rumor detection on social media through mining the social circles with high homogeneity.
Inf. Sci., 2023

Towards Vision Transformer Unrolling Fixed-Point Algorithm: a Case Study on Image Restoration.
CoRR, 2023

Meta-Learning Based Knowledge Extrapolation for Temporal Knowledge Graph.
Proceedings of the ACM Web Conference 2023, 2023

Incorporating Structured Sentences with Time-enhanced BERT for Fully-inductive Temporal Relation Prediction.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Automatic Audio Augmentation for Requests Sub-Challenge.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

HAAN: Human Action Aware Network for Multi-label Temporal Action Detection.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Spatial and Frequency Domains Inconsistency Learning for Face Forgery Detection.
Proceedings of the Neural Information Processing - 30th International Conference, 2023

SNCSE: Contrastive Learning for Unsupervised Sentence Embedding with Soft Negative Samples.
Proceedings of the Advanced Intelligent Computing Technology and Applications, 2023

Rumor Detection Via Assessing the Spreading Propensity of Users.
Proceedings of the IEEE International Conference on Acoustics, 2023

MANet: An Architecture Adaptive Method for Sparse Matrix Format Selection.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2023

Temporal Extrapolation and Knowledge Transfer for Lifelong Temporal Knowledge Graph Reasoning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Pipeline-based Optimization Method for Large-Scale End-to-End Inference.
Proceedings of the 3rd International Conference on Artificial Intelligence, 2023

2022
A low-latency LSTM accelerator using balanced sparsity based on FPGA.
Microprocess. Microsystems, March, 2022

Hierarchical learning with backtracking algorithm based on the Visual Confusion Label Tree for large-scale image classification.
Vis. Comput., 2022

WRMatch: Improving FixMatch With Weighted Nuclear-Norm Regularization for Few-Shot Remote Sensing Scene Classification.
IEEE Trans. Geosci. Remote. Sens., 2022

An Adaptive Learning Rate Schedule for SIGNSGD Optimizer in Neural Networks.
Neural Process. Lett., 2022

Focus on Hard Categories and Hard Examples: Remote Sensing Image Scene Classification via Expert Model and Hard Example Mining.
IEEE Geosci. Remote. Sens. Lett., 2022

An automatic learning rate decay strategy for stochastic gradient descent optimization methods in neural networks.
Int. J. Intell. Syst., 2022

Multi-Outputs Is All You Need For Deblur.
CoRR, 2022

SNCSE: Contrastive Learning for Unsupervised Sentence Embedding with Soft Negative Samples.
CoRR, 2022

A pipelining strategy for accelerating convolution neural networks on ARM CPUs.
Concurr. Comput. Pract. Exp., 2022

Heterogeneous Skill Learning for Multi-agent Tasks.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Topic and Reference Guided Keyphrase Generation from Social Media.
Proceedings of the Knowledge Science, Engineering and Management, 2022

Discourse Component Recognition via Graph Neural Network in Chinese Student Argumentative Essays.
Proceedings of the Knowledge Science, Engineering and Management, 2022

CNA: A Dataset for Parsing Discourse Structure on Chinese News Articles.
Proceedings of the 34th IEEE International Conference on Tools with Artificial Intelligence, 2022

MLPs: Efficient Training of MiniGo on Large-scale Heterogeneous Computing System.
Proceedings of the 28th IEEE International Conference on Parallel and Distributed Systems, 2022

Searching Latent Sub-Goals in Hierarchical Reinforcement Learning as Riemannian Manifold Optimization.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

ROGC: Role-Oriented Graph Convolution Based Multi-Agent Reinforcement Learning.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Improving the Classification of Phonetic Segments from Raw Ultrasound Using Self-Supervised Learning and Hard Example Mining.
Proceedings of the IEEE International Conference on Acoustics, 2022

Optimizing GNN on ARM Multi-Core Processors.
Proceedings of the 24th IEEE Int Conf on High Performance Computing & Communications; 8th Int Conf on Data Science & Systems; 20th Int Conf on Smart City; 8th Int Conf on Dependability in Sensor, 2022

Optimize DGL Operations on x86-64 Multi-Core Processors.
Proceedings of the HP3C 2022: 6th International Conference on High Performance Compilation, 2022

Accelerating minimap2 for long-read sequencing on NUMA multi-core CPU.
Proceedings of the 5th International Conference on Computer Science and Software Engineering, 2022

Isolate-Set-Based In-Memory Parallel Subgraph Matching Framework.
Proceedings of the 5th International Conference on Computer Science and Software Engineering, 2022

RSGT: Relational Structure Guided Temporal Relation Extraction.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

IMCI: Integrate Multi-view Contextual Information for Fact Extraction and Verification.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Adaptive Threshold Selective Self-Attention for Chinese NER.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

2021
Multilevel parallelism optimization of stencil computations on SIMDlized NUMA architectures.
J. Supercomput., 2021

Representation learning on textual network with personalized PageRank.
Sci. China Inf. Sci., 2021

An energy-efficient convolutional neural network accelerator for speech classification based on FPGA and quantization.
CCF Trans. High Perform. Comput., 2021

A high-throughput scalable BNN accelerator with fully pipelined architecture.
CCF Trans. High Perform. Comput., 2021

Beyond AP: a new evaluation index for multiclass classification task accuracy.
Appl. Intell., 2021

COVID Edge-Net: Automated COVID-19 Lung Lesion Edge Detection in Chest CT Images.
Proceedings of the Machine Learning and Knowledge Discovery in Databases. Applied Data Science Track, 2021

Two-Stage Convolutional Neural Network for Knee Osteoarthritis Diagnosis in X-Rays.
Proceedings of 2021 International Conference on Medical Imaging and Computer-Aided Diagnosis, 2021

A Framework of Data Augmentation While Active Learning for Chinese Named Entity Recognition.
Proceedings of the Knowledge Science, Engineering and Management, 2021

Ddper: Decentralized Distributed Prioritized Experience Replay.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Graphcomm: A Graph Neural Network Based Method for Multi-Agent Reinforcement Learning.
Proceedings of the IEEE International Conference on Acoustics, 2021

Global-Localized Agent Graph Convolution for Multi-Agent Reinforcement Learning.
Proceedings of the IEEE International Conference on Acoustics, 2021

Local and Non-local Context Graph Convolutional Networks for Skeleton-Based Action Recognition.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2021, 2021

LADstackING: Stacking Ensemble Learning-based Computational Model for Predicting Potential LncRNA-disease Associations.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2021

RFC-HyPGCN: A Runtime Sparse Feature Compress Accelerator for Skeleton-Based GCNs Action Recognition Model with Hybrid Pruning.
Proceedings of the 32nd IEEE International Conference on Application-specific Systems, 2021

2020
Temporally Refined Graph U-Nets for Human Shape and Pose Estimation From Monocular Videos.
IEEE Signal Process. Lett., 2020

Non-locally Enhanced Feature Fusion Network for Aircraft Recognition in Remote Sensing Images.
Remote. Sens., 2020

Absent Multiple Kernel Learning Algorithms.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

DropFilterR: A Novel Regularization Method for Learning Convolutional Neural Networks.
Neural Process. Lett., 2020

Annealed gradient descent for deep learning.
Neurocomputing, 2020

Beyond top-<i>N</i> accuracy indicator: a comprehensive evaluation indicator of CNN models in image classification.
IET Comput. Vis., 2020

Fixed-size Objects Encoding for Visual Relationship Detection.
CoRR, 2020

Pose-Forecasting Aided Human Video Prediction With Graph Convolutional Networks.
IEEE Access, 2020

A Simplified Speaker Recognition System Based on FPGA Platform.
IEEE Access, 2020

Helping the Ineloquent Farmers: Finding Experts for Questions With Limited Text in Agricultural Q&A Communities.
IEEE Access, 2020

Objectness Consistent Representation for Weakly Supervised Object Detection.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

A High-Throughput LDPC Decoder Based on GPUs for 5G New Radio.
Proceedings of the IEEE Symposium on Computers and Communications, 2020

Towards Precise End-to-end Semi-Supervised Human Head Detection Network.
Proceedings of the 2020 International Joint Conference on Neural Networks, 2020

Attentional Fused Temporal Transformation Network for Video Action Recognition.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Learning Network Representation Through Reinforcement Learning.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Data Layout Transformation for Stencil Computations Using ARM NEON Extension.
Proceedings of the 22nd IEEE International Conference on High Performance Computing and Communications; 18th IEEE International Conference on Smart City; 6th IEEE International Conference on Data Science and Systems, 2020

Optimization and Performance Modeling of Stencil Computations on ARM Architectures.
Proceedings of the 22nd IEEE International Conference on High Performance Computing and Communications; 18th IEEE International Conference on Smart City; 6th IEEE International Conference on Data Science and Systems, 2020

End-to-end Spatial Attention Network with Feature Mimicking for Head Detection.
Proceedings of the 15th IEEE International Conference on Automatic Face and Gesture Recognition, 2020

Rethinking Segmentation Guidance for Weakly Supervised Object Detection.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Argumentation Mining on Essays at Multi Scales.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

2019
Exploring frame segmentation networks for temporal action localization.
J. Vis. Commun. Image Represent., 2019

Pair-HMM accelerator based on non-cooperative structure.
IEICE Electron. Express, 2019

IF-TTN: Information Fused Temporal Transformation Network for Video Action Recognition.
CoRR, 2019

A Novel Memory-Scheduling Strategy for Large Convolutional Neural Network on Memory-Limited Devices.
Comput. Intell. Neurosci., 2019

GBCNN: A Full GPU-Based Batch Multi-Task Cascaded Convolutional Networks.
IEEE Access, 2019

Aircraft Segmentation Based On Deep Learning framework : from extreme points to remote sensing image segmentation.
Proceedings of the IEEE Symposium Series on Computational Intelligence, 2019

A Pipelining Strategy for Accelerating Convolutional Networks on ARM Processors.
Proceedings of the Parallel Architectures, Algorithms and Programming, 2019

An Efficient Parallel Successive Cancellation List Polar Decoder Based on GPUs.
Proceedings of the 2019 IEEE Intl Conf on Parallel & Distributed Processing with Applications, 2019

Accelerated Inference Framework of Sparse Neural Network Based on Nested Bitmask Structure.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Heavy-ball Algorithms Always Escape Saddle Points.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Fast Nonlocal Diffusion by Stacking Local Filters for Image Denoising.
Proceedings of the Data Science, 2019

Towards Precise End-to-End Weakly Supervised Object Detection Network.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Spatial Attention Network for Few-Shot Learning.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2019: Deep Learning, 2019

2018
CaFPGA: An automatic generation model for CNN accelerator.
Microprocess. Microsystems, 2018

Distributed sparse bundle adjustment algorithm based on three-dimensional point partition and asynchronous communication.
Frontiers Inf. Technol. Electron. Eng., 2018

Local kernel alignment based multi-view clustering using extreme learning machine.
Neurocomputing, 2018

DropFilter: A Novel Regularization Method for Learning Convolutional Neural Networks.
CoRR, 2018

An efficient CPU-GPU hybrid parallel implementation for DVB-RCS2 receiver.
Concurr. Comput. Pract. Exp., 2018

High performance robust audio event recognition system based on FPGA platform.
Cogn. Syst. Res., 2018

A Community Detection Approach to Cleaning Extremely Large Face Database.
Comput. Intell. Neurosci., 2018

SpinMag: A New Fingerprinting Method for Robot Indoor Localization with Geomagnetic Field.
Ad Hoc Sens. Wirel. Networks, 2018

Frame Segmentation Networks for Temporal Action Localization.
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

Image Translation Between High-Resolution Remote Sensing Optical and SAR Data Using Conditional GAN.
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

Spatial Attention Network for Head Detection.
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

Visual Tree Convolutional Neural Network in Image Classification.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Visual Confusion Label Tree for Image Classification.
Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018

Temporal Pyramid Relation Network for Video-Based Gesture Recognition.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Deep Image Clustering Using Convolutional Autoencoder Embedding with Inception-Like Block.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Design and Implementation of Convolutional Neural Network Accelerator with Variable Layer-by-layer Debugging.
Proceedings of the 2018 2nd International Conference on Deep Learning Technologies, 2018

mmCNN: A Novel Method for Large Convolutional Neural Network on Memory-Limited Devices.
Proceedings of the 2018 IEEE 42nd Annual Computer Software and Applications Conference, 2018

Learning Generic Diffusion Processes for Image Restoration.
Proceedings of the British Machine Vision Conference 2018, 2018

paraSNF: An Parallel Approach for Large-Scale Similarity Network Fusion.
Proceedings of the Advanced Computer Architecture - 12th Conference, 2018

Research on Acceleration Method of Speech Recognition Training.
Proceedings of the Advanced Computer Architecture - 12th Conference, 2018

Exploring Temporal Preservation Networks for Precise Temporal Action Localization.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Throughput-Optimized FPGA Accelerator for Deep Convolutional Neural Networks.
ACM Trans. Reconfigurable Technol. Syst., 2017

Qualitative Action Recognition by Wireless Radio Signals in Human-Machine Systems.
IEEE Trans. Hum. Mach. Syst., 2017

Variational single image interpolation with time-varying regularization.
Signal Process. Image Commun., 2017

Multiple kernel learning with hybrid kernel alignment maximization.
Pattern Recognit., 2017

Airport Detection on Optical Satellite Images Using Deep Convolutional Neural Networks.
IEEE Geosci. Remote. Sens. Lett., 2017

An optimized design of CAN FD for automotive cyber-physical systems.
J. Syst. Archit., 2017

Heterogeneous blocked CPU-GPU accelerate scheme for large scale extreme learning machine.
Neurocomputing, 2017

A fast and memory saved GPU acceleration algorithm of convolutional neural networks for target detection.
Neurocomputing, 2017

Multiple kernel clustering with corrupted kernels.
Neurocomputing, 2017

Robust regularized extreme learning machine for regression using iteratively reweighted least squares.
Neurocomputing, 2017

TPC: Temporal Preservation Convolutional Networks for Precise Temporal Action Localization.
CoRR, 2017

Learning Non-local Image Diffusion for Image Denoising.
CoRR, 2017

Ranking Support Vector Machine with Kernel Approximation.
Comput. Intell. Neurosci., 2017

Adaptive Energy-Aware Computation Offloading for Cloud of Things Systems.
IEEE Access, 2017

Learning Non-local Image Diffusion for Image Denoising.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Approximate Large-scale Multiple Kernel k-means Using Deep Neural Network.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Multiple Kernel Clustering Framework with Improved Kernels.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Confusion Graph: Detecting Confusion Communities in Large Scale Image Classification.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

An FPGA-based processor for training convolutional neural networks.
Proceedings of the International Conference on Field Programmable Technology, 2017

Accuracy Evaluation of Long Short Term Memory Network Based Language Model with Fixed-Point Arithmetic.
Proceedings of the Applied Reconfigurable Computing - 13th International Symposium, 2017

Platform-Adaptive High-Throughput Surveillance Video Condensation on Heterogeneous Processor Clusters.
Proceedings of the Advanced Parallel Processing Technologies, 2017

Optimal Neighborhood Kernel Clustering with Multiple Kernels.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Multiple Kernel k-Means with Incomplete Kernels.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Coarse-Grained Architecture for Fingerprint Matching.
ACM Trans. Reconfigurable Technol. Syst., 2016

An FPGA Implementation for Solving the Large Single-Source-Shortest-Path Problem.
IEEE Trans. Circuits Syst. II Express Briefs, 2016

Affine-Transformation Parameters Regression for Face Alignment.
IEEE Signal Process. Lett., 2016

Classification of Hyperspectral Remote Sensing Image Using Hierarchical Local-Receptive-Field-Based Extreme Learning Machine.
IEEE Geosci. Remote. Sens. Lett., 2016

Relative distance features for gait recognition with Kinect.
J. Vis. Commun. Image Represent., 2016

Leveraging local receptive fields based random weights networks for hyperspectral image classification.
J. Intell. Fuzzy Syst., 2016

A novel multi-view clustering method via low-rank and matrix-induced regularization.
Neurocomputing, 2016

An efficient and effective convolutional auto-encoder extreme learning machine network for 3d feature learning.
Neurocomputing, 2016

Multi-view clustering with extreme learning machine.
Neurocomputing, 2016

PR-ELM: Parallel regularized extreme learning machine based on cluster.
Neurocomputing, 2016

Joint diversity regularization and graph regularization for multiple kernel k-means clustering via latent variables.
Neurocomputing, 2016

Weakly supervised object detection using pseudo-strong labels.
CoRR, 2016

Face Verification Algorithm with Exploiting Feature Distribution.
Proceedings of the PRICAI 2016: Trends in Artificial Intelligence, 2016

ELM based multiple kernel k-means with diversity-induced regularization.
Proceedings of the 2016 International Joint Conference on Neural Networks, 2016

Airport detection from remote sensing images using transferable convolutional neural networks.
Proceedings of the 2016 International Joint Conference on Neural Networks, 2016

Multiple Kernel Clustering with Local Kernel Alignment Maximization.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Region-based convolutional neural networks for object detection in very high resolution remote sensing images.
Proceedings of the 12th International Conference on Natural Computation, 2016

Hyperspectral image classification via kernel extreme learning machine using local receptive fields.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Localized region context and object feature fusion for people head detection.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Optimized GPU Acceleration Algorithm of Convolutional Neural Networks for Target Detection.
Proceedings of the 18th IEEE International Conference on High Performance Computing and Communications; 14th IEEE International Conference on Smart City; 2nd IEEE International Conference on Data Science and Systems, 2016

Automatic code generation of convolutional neural networks in FPGA implementation.
Proceedings of the 2016 International Conference on Field-Programmable Technology, 2016

Multiple Kernel <i>k</i>-Means Clustering with Matrix-Induced Regularization.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
面向定制结构的稀疏矩阵分块方法 (Sparse Matrix Blocking Method for Custom Architecture).
计算机科学, 2015

Urban Land Use and Land Cover Classification Using Remotely Sensed SAR Data through Deep Belief Networks.
J. Sensors, 2015

A deeply-pipelined FPGA-based SpMV accelerator with a hardware-friendly storage scheme.
IEICE Electron. Express, 2015

An efficient multi-standard QC-LDPC decoder based on the row-layered decoding algorithm.
IEICE Electron. Express, 2015

Efficient graphics processing unit based layered decoders for quasicyclic low-density parity-check codes.
Concurr. Comput. Pract. Exp., 2015

An Efficient Robust Eye Localization by Learning the Convolution Distribution Using Eye Template.
Comput. Intell. Neurosci., 2015

Accelerating Molecular Dynamics Simulations on Heterogeneous Architecture.
Proceedings of the Computer Engineering and Technology - 19th CCF Conference, 2015

Designing Parallel Sparse Matrix Transposition Algorithm Using ELLPACK-R for GPUs.
Proceedings of the Computer Engineering and Technology - 19th CCF Conference, 2015

Optimized deep belief networks on CUDA GPUs.
Proceedings of the 2015 International Joint Conference on Neural Networks, 2015

Exploring Relative Motion Features for Gait Recognition with Kinect.
Proceedings of the Neural Information Processing - 22nd International Conference, 2015

Hyperspectral image classification via local receptive fields based random weights networks.
Proceedings of the 11th International Conference on Natural Computation, 2015

Classification of Tiangong-1 hyperspectral remote sensing image via contextual sparse coding.
Proceedings of the 2015 International Conference on Machine Learning and Cybernetics, 2015

Depth enhancement via non-local means filter.
Proceedings of the Seventh International Conference on Advanced Computational Intelligence, 2015

Absent Multiple Kernel Learning.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
FPGA Implementation of a Special-Purpose VLIW Structure for Double-Precision Elementary Function.
ACM Trans. Reconfigurable Technol. Syst., 2014

CuSora: Real-time software radio using multi-core graphics processing unit.
J. Syst. Archit., 2014

Design and Implement of High Performance Crypto Coprocessor.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2014

Efficient Parallel Interference Cancellation MIMO Detector for Software Defined Radio on GPUs.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2014

An Efficient Parallel SOVA-Based Turbo Decoder for Software Defined Radio on GPU.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2014

Parallel graph traversal for FPGA.
IEICE Electron. Express, 2014

Transpose-free variable-size FFT accelerator based on-chip SRAM.
IEICE Electron. Express, 2014

Supernodal sparse Cholesky factorization on graphics processing units.
Concurr. Comput. Pract. Exp., 2014

CPU-GPU hybrid parallel strategy for cosmological simulations.
Concurr. Comput. Pract. Exp., 2014

Efficient parallel implementation of three-point viterbi decoding algorithm on CPU, GPU, and FPGA.
Concurr. Comput. Pract. Exp., 2014

A piecewise-based contrast enhancement framework for low lighting video.
Proceedings of the Proceedings IEEE International Conference on Security, 2014

Efficient parallel implementation of morphological operation on GPU and FPGA.
Proceedings of the Proceedings IEEE International Conference on Security, 2014

3D pipeline contention: Asymmetric full duplex in wireless networks.
Proceedings of the 2014 IEEE Conference on Computer Communications, 2014

Classification of land cover based on deep belief networks using polarimetric RADARSAT-2 data.
Proceedings of the 2014 IEEE Geoscience and Remote Sensing Symposium, 2014

Vehicle Recognition for Surveillance Video Using Sparse Coding.
Proceedings of the Pattern Recognition - 6th Chinese Conference, 2014

A Study on Layer Connection Strategies in Stacked Convolutional Deep Belief Networks.
Proceedings of the Pattern Recognition - 6th Chinese Conference, 2014

A high throughput K-best detector on FPGA.
Proceedings of the IEEE International Black Sea Conference on Communications and Networking, 2014

A Novel Design of Flexible Crypto Coprocessor and Its Application.
Proceedings of the Advanced Computer Architecture - 10th Annual Conference, 2014

2013
FPGA implementation of an exact dot product and its application in variable-precision floating-point arithmetic.
J. Supercomput., 2013

High-Performance Architecture for the Conjugate Gradient Solver on FPGAs.
IEEE Trans. Circuits Syst. II Express Briefs, 2013

VLIW coprocessor for IEEE-754 quadruple-precision elementary functions.
ACM Trans. Archit. Code Optim., 2013

From WiFi to WiMAX: Efficient GPU-based Parameterized Transceiver across Different OFDM Protocols.
KSII Trans. Internet Inf. Syst., 2013

Parallel Sparse Cholesky Factorization on a Heterogeneous Platform.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2013

Window Memory Layout Scheme for Alternate Row-Wise/Column-Wise Matrix Access.
IEICE Trans. Inf. Syst., 2013

High performance sparse matrix-vector multiplication on FPGA.
IEICE Electron. Express, 2013

A fully parallel truncated Viterbi decoder for Software Defined Radio on GPUs.
Proceedings of the 2013 IEEE Wireless Communications and Networking Conference (WCNC), 2013

A multi-standard efficient column-layered LDPC decoder for Software Defined Radio on GPUs.
Proceedings of the 14th IEEE Workshop on Signal Processing Advances in Wireless Communications, 2013

Design and Implementation of Novel Flexible Crypto Coprocessor and Its Application in Security Protocol.
Proceedings of the Computer Engineering and Technology - 17th CCF Conference, 2013

Direction-Optimizing Breadth-First Search on CPU-GPU Heterogeneous Platforms.
Proceedings of the 10th IEEE International Conference on High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing, 2013

Empirical Evaluation of Fixed-Point Arithmetic for Deep Belief Networks.
Proceedings of the Reconfigurable Computing: Architectures, Tools and Applications, 2013

2012
The unified accelerator architecture for RNA secondary structure prediction on FPGA.
J. Supercomput., 2012

A High Performance and Memory Efficient LU Decomposer on FPGAs.
IEEE Trans. Computers, 2012

Design and Implementation of the Parameterized Multi-Standard High-Throughput Radix-4 Viterbi Decoder on FPGA.
IEICE Trans. Commun., 2012

Optimization schemes and performance evaluation of Smith-Waterman algorithm on CPU, GPU and FPGA.
Concurr. Comput. Pract. Exp., 2012

CPU-GPU hybrid accelerating the Zuker algorithm for RNA secondary structure prediction applications.
BMC Genom., 2012

Parallelizing sparse LU decomposition on FPGAs.
Proceedings of the 2012 International Conference on Field-Programmable Technology, 2012

A self-organizing and self-adaptive French flag Organism based on lateral activation model.
Proceedings of the IEEE Congress on Evolutionary Computation, 2012

A bio-inspired self-organizing approach for multicellular embryonic architecture.
Proceedings of the 2012 NASA/ESA Conference on Adaptive Hardware and Systems, 2012

2011
FPGA-Specific Custom VLIW Architecture for Arbitrary Precision Floating-Point Arithmetic.
IEICE Trans. Inf. Syst., 2011

FPGA accelerator for protein secondary structure prediction based on the GOR algorithm.
BMC Bioinform., 2011

A high-throughput reconfigurable Viterbi decoder.
Proceedings of the 2011 International Conference on Wireless Communications & Signal Processing, 2011

Special-purposed VLIW architecture for IEEE-754 quadruple precision elementary functions on FPGA.
Proceedings of the IEEE 29th International Conference on Computer Design, 2011

VPFPAP: A Special-Purpose VLIW Processor for Variable-Precision Floating-Point Arithmetic.
Proceedings of the International Conference on Field Programmable Logic and Applications, 2011

FPGA Implementation of Variable-Precision Floating-Point Arithmetic.
Proceedings of the Advanced Parallel Processing Technologies - 9th International Symposium, 2011

Etissue: A bio-inspired match-based reconfigurable hardware architecture supporting hierarchical self-healing and self-evolution.
Proceedings of the 2011 NASA/ESA Conference on Adaptive Hardware and Systems, 2011

2010
Fine-grained parallel RNA secondary structure prediction using SCFGs on FPGA.
Parallel Comput., 2010

A Unified Co-Processor Architecture for Matrix Decomposition.
J. Comput. Sci. Technol., 2010

Fpqrna: Hardware-Accelerated Qrna Package for noncoding RNA Gene Detecting on FPGA.
J. Bioinform. Comput. Biol., 2010

FPGA accelerating double/quad-double high precision floating-point applications for ExaScale computing.
Proceedings of the 24th International Conference on Supercomputing, 2010

Automatic synthesis of processor arrays with local memories on FPGAs.
Proceedings of the International Conference on Field-Programmable Technology, 2010

High performance and memory efficient implementation of matrix multiplication on FPGAs.
Proceedings of the International Conference on Field-Programmable Technology, 2010

Blocking LU Decomposition for FPGAs.
Proceedings of the 18th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2010

2009
Loop Kernel Pipelining Mapping onto Coarse-Grained Reconfigurable Architecture for Data-Intensive Applications.
J. Softw., 2009

FPGA Accelerator for Wavelet-Based Automated Global Image Registration.
EURASIP J. Embed. Syst., 2009

A Reconfigurable Architecture for Rotation Invariant Multi-View Face Detection Based on a Novel Two-Stage Boosting Method.
EURASIP J. Adv. Signal Process., 2009

A coarse-grained reconfigurable computing architecture with loop self-pipelining.
Sci. China Ser. F Inf. Sci., 2009

Fine-grained parallel RNAalifold algorithm for RNA secondary structure prediction on FPGA.
BMC Bioinform., 2009

FPGA-based Memory-efficient Parallel RNA Secondary Structure Prediction Accelerator Using SCFGs.
Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 2009

Exploiting Fine-Grained Pipeline Parallelism for Wavefront Computations on Multicore Platforms.
Proceedings of the ICPPW 2009, 2009

FPGA accelerating three QR decomposition algorithms in the unified pipelined framework.
Proceedings of the 19th International Conference on Field Programmable Logic and Applications, 2009

A Fine-grained Pipelined Implementation of the LINPACK Benchmark on FPGAs.
Proceedings of the FCCM 2009, 2009

Fine-grained parallel application specific computing for RNA secondary structure prediction using SCFGS on FPGA.
Proceedings of the 2009 International Conference on Compilers, 2009

A Fine-Grained Pipelined Implementation for Large-Scale Matrix Inversion on FPGA.
Proceedings of the Advanced Parallel Processing Technologies, 8th International Symposium, 2009

Implementation of Rotation Invariant Multi-View Face Detection on FPGA.
Proceedings of the Advanced Parallel Processing Technologies, 8th International Symposium, 2009

2008
Rectangularly Multi-Module Memory System with Table-Based Dynamic Addressing Scheme.
Proceedings of The 2008 IEEE International Conference on Networking, 2008

DMA Performance Analysis and Multi-core Memory Optimization for SWIM Benchmark on the Cell Processor.
Proceedings of the IEEE International Symposium on Parallel and Distributed Processing with Applications, 2008

Subblock-Based BPE Scheme to Conquer Mismatch in Memory Access Pattern.
Proceedings of the 4th International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP 2008), 2008

Dynamic Configurable Floating-Point FFT Pipelines and Hybrid-Mode CORDIC on FPGA.
Proceedings of the International Conference on Embedded Software and Systems, 2008

Fine-grained parallel application specific computing for RNA secondary structure prediction on FPGA.
Proceedings of the 26th International Conference on Computer Design, 2008

Double Precision Hybrid-Mode Floating-Point FPGA CORDIC Co-processor.
Proceedings of the 10th IEEE International Conference on High Performance Computing and Communications, 2008

Dimensional Bubble Flow Control and Fully Adaptive Routing in the 2-D Mesh Network on Chip.
Proceedings of the 2008 IEEE/IPIP International Conference on Embedded and Ubiquitous Computing (EUC 2008), 2008

Families of FPGA-Based Accelerators for BLAST Algorithm with Multi-seeds Detection and Parallel Extension.
Proceedings of the Bioinformatics Research and Development, 2008

Fine-Grained Parallel Zuker Algorithm Accelerator with Storage Optimization on FPGA.
Proceedings of the International Conference on Bioinformatics & Computational Biology, 2008

Collaborative hardware/software partition of coarse-grained reconfigurable system using evolutionary ant colony optimization.
Proceedings of the 13th Asia South Pacific Design Automation Conference, 2008

Hybrid-Mode Floating-Point FPGA CORDIC Co-processor.
Proceedings of the Reconfigurable Computing: Architectures, 2008

Hardware BLAST Algorithms with Multi-seeds Detection and Parallel Extension.
Proceedings of the Reconfigurable Computing: Architectures, 2008

Multi-access memory architecture for image applications with multiple interested regions.
Proceedings of the IEEE Asia Pacific Conference on Circuits and Systems, 2008

Area and throughput trade-offs in design of arithmetic encoder for JPEG2000.
Proceedings of the IEEE Asia Pacific Conference on Circuits and Systems, 2008

Computation rotating for data reuse.
Proceedings of the 13th Asia-Pacific Computer Systems Architecture Conference, 2008

2007
FIDP: A Novel Architecture for Lifting-Based 2D DWT in JPEG2000.
Proceedings of the Advances in Multimedia Modeling, 2007

FPGA Accelerating Algorithms of Active Shape Model in People Tracking Applications.
Proceedings of the Tenth Euromicro Conference on Digital System Design: Architectures, 2007

Distributed Collaborative Partition Method of Reconfigurable SoC Using Ant Colony Optimization.
Proceedings of the 11th International Conference on Computer Supported Cooperative Work in Design, 2007

A Parameterized Architecture Model in High Level Synthesis for Image Processing Applications.
Proceedings of the 12th Conference on Asia South Pacific Design Automation, 2007

FPGA SAR Processor with Window Memory Accesses.
Proceedings of the IEEE International Conference on Application-Specific Systems, 2007

FPGA-Accelerated Molecular Dynamics Simulations: An Overview.
Proceedings of the Reconfigurable Computing: Architectures, 2007

The Implementation of a Coarse-Grained Reconfigurable Architecture with Loop Self-pipelining.
Proceedings of the Reconfigurable Computing: Architectures, 2007

Optimized Generation of Memory Structure in Compiling Window Operations onto Reconfigurable Hardware.
Proceedings of the Reconfigurable Computing: Architectures, 2007

Reducing Storage Requirements in Accelerating Algorithm of Global BioSequence Alignment on FPGA.
Proceedings of the Advanced Parallel Processing Technologies, 7th International Symposium, 2007

FPGA-Accelerated Active Shape Model for Real-Time People Tracking.
Proceedings of the Advances in Computer Systems Architecture, 2007

2006
Progress and Challenges in High Performance Computer Technology.
J. Comput. Sci. Technol., 2006

Clustering Multicast on Hypercube Network.
Proceedings of the High Performance Computing and Communications, 2006

Robust and real-time automatic target recognition using partial hausdorff distance measure on reconfigurable hardware.
Proceedings of the 2006 IEEE International Conference on Field Programmable Technology, 2006

Designing a Coarse-Grained Reconfigurable Architecture Using Loop Self-Pipelining.
Proceedings of the Advances in Computer Systems Architecture, 11th Asia-Pacific Conference, 2006

2005
64-bit floating-point FPGA matrix multiplication.
Proceedings of the ACM/SIGDA 13th International Symposium on Field Programmable Gate Arrays, 2005

RIMP: Runtime Implicit Predication.
Proceedings of the Advanced Parallel Processing Technologies, 6th International Workshop, 2005

2003
LEAP: A Data Driven Loop Engine on Array Processor.
Proceedings of the Advanced Parallel Programming Technologies, 5th International Workshop, 2003


  Loading...