Leonel Sousa

Proceedings of the Fourth Workshop on Cryptography and Security in Computing Systems, 2017

2016

Adaptive Scheduling Framework for Real-Time Video Encoding on Heterogeneous Systems.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2016

A Framework for Application-Guided Task Management on Heterogeneous Embedded Systems.

[BibT_eX]

[DOI]

ACM Trans. Archit. Code Optim., 2016

GPU-assisted HEVC intra decoder.

[BibT_eX]

[DOI]

J. Real Time Image Process., 2016

Exploiting task and data parallelism for advanced video coding on hybrid CPU + GPU platforms.

[BibT_eX]

[DOI]

J. Real Time Image Process., 2016

Method for designing two levels RNS reverse converters for large dynamic ranges.

[BibT_eX]

[DOI]

Integr., 2016

Guest Editors' Introduction.

[BibT_eX]

[DOI]

Min Chen

Yonghong Tian

Int. J. Semantic Comput., 2016

Ubiquitous Multimedia: Emerging Research on Multimedia Computing.

[BibT_eX]

[DOI]

Yonghong Tian

Min Chen

IEEE Multim., 2016

A Survey on Programmable LDPC Decoders.

[BibT_eX]

[DOI]

IEEE Access, 2016

HPC on the Intel Xeon Phi: Homomorphic Word Searching.

[BibT_eX]

[DOI]

Azadeh Alsadat Emrani Zarandi

Proceedings of the High Performance Computing for Computational Science - VECPAR 2016, 2016

Efficient HEVC decoder for heterogeneous CPU with GPU systems.

[BibT_eX]

[DOI]

Biao Wang

Mauricio Alvarez-Mesa

Proceedings of the 18th IEEE International Workshop on Multimedia Signal Processing, 2016

Area-delay-power-aware adder placement method for RNS reverse converter design.

[BibT_eX]

[DOI]

Amir Sabbagh Molahosseini

Mehdi Hosseinzadeh

Keivan Navi

Proceedings of the IEEE 7th Latin American Symposium on Circuits & Systems, 2016

Enhancing Data Parallelism of Fully Homomorphic Encryption.

[BibT_eX]

[DOI]

Azadeh Alsadat Emrani Zarandi

Proceedings of the Information Security and Cryptology - ICISC 2016 - 19th International Conference, Seoul, South Korea, November 30, 2016

High-Level Designs of Complex FIR Filters on FPGAs for the SKA.

[BibT_eX]

[DOI]

Proceedings of the 18th IEEE International Conference on High Performance Computing and Communications; 14th IEEE International Conference on Smart City; 2nd IEEE International Conference on Data Science and Systems, 2016

2015

Reverse Converter Design via Parallel-Prefix Adders: Novel Components, Methodology, and Implementations.

[BibT_eX]

[DOI]

Amir Sabbagh Molahosseini

IEEE Trans. Very Large Scale Integr. Syst., 2015

Arithmetic-Based Binary-to-RNS Converter Modulo {2n±k} for jn-bit Dynamic Range.

[BibT_eX]

[DOI]

IEEE Trans. Very Large Scale Integr. Syst., 2015

Base Transformation With Injective Residue Mapping for Dynamic Range Reduction in RNS.

[BibT_eX]

[DOI]

Thian Fatt Tay

Chip-Hong Chang

IEEE Trans. Circuits Syst. I Regul. Pap., 2015

2n RNS Scalers for Extended 4-Moduli Sets.

[BibT_eX]

[DOI]

IEEE Trans. Computers, 2015

Real-time implementation of remotely sensed hyperspectral image unmixing on GPUs.

[BibT_eX]

[DOI]

J. Real Time Image Process., 2015

Attaining performance fairness in big.LITTLE systems.

[BibT_eX]

[DOI]

Proceedings of the 12th International Workshop on Intelligent Solutions in Embedded Systems, 2015

Accelerating Phylogenetic Inference on Heterogeneous OpenCL Platforms.

[BibT_eX]

[DOI]

Lidia Kuan

Proceedings of the 2015 IEEE TrustCom/BigDataSE/ISPA, 2015

HEVC in-loop filters GPU parallelization in embedded systems.

[BibT_eX]

[DOI]

Proceedings of the 2015 International Conference on Embedded Computer Systems: Architectures, 2015

Run-Time Machine Learning for HEVC/H.265 Fast Partitioning Decision.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Symposium on Multimedia, 2015

Featuring Immediate Revocation in Mikey-Sakke (FIRM).

[BibT_eX]

[DOI]

Parashuram Chawan

Proceedings of the 2015 IEEE International Symposium on Multimedia, 2015

RNS reverse converters based on the new Chinese Remainder Theorem I.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Symposium on Circuits and Systems, 2015

High performance IP core for HEVC quantization.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Symposium on Circuits and Systems, 2015

Towards GPU HEVC intra decoding: Seizing fine-grain parallelism.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

Stretching the limits of Programmable Embedded Devices for Public-key Cryptography.

[BibT_eX]

[DOI]

Proceedings of the Second Workshop on Cryptography and Security in Computing Systems, 2015

GPU acceleration of the HEVC decoder inter prediction module.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE Global Conference on Signal and Information Processing, 2015

Programmable RNS lattice-based parallel cryptographic decryption.

[BibT_eX]

[DOI]

Proceedings of the 26th IEEE International Conference on Application-specific Systems, 2015

2014

An Efficient Scalable RNS Architecture for Large Dynamic Ranges.

[BibT_eX]

[DOI]

J. Signal Process. Syst., 2014

A Flexible Architecture for Modular Arithmetic Hardware Accelerators based on RNS.

[BibT_eX]

[DOI]

Samuel Antão

J. Signal Process. Syst., 2014

Dynamic Load Balancing for Real-Time Video Encoding on Heterogeneous CPU+GPU Systems.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2014

Efficient Method for Designing Modulo {2n ± k} Multipliers.

[BibT_eX]

[DOI]

Sorin Cotofana

J. Circuits Syst. Comput., 2014

Unified transform architecture for AVC, AVS, VC-1 and HEVC high-performance codecs.

[BibT_eX]

[DOI]

EURASIP J. Adv. Signal Process., 2014

Method for Designing Efficient Mixed Radix Multipliers.

[BibT_eX]

[DOI]

Circuits Syst. Signal Process., 2014

Cache-aware Roofline model: Upgrading the loft.

[BibT_eX]

[DOI]

IEEE Comput. Archit. Lett., 2014

On the Evaluation of Multi-core Systems with SIMD Engines for Public-Key Cryptography.

[BibT_eX]

[DOI]

Proceedings of the 26th IEEE International Symposium on Computer Architecture and High Performance Computing Workshop, 2014

Performance-Aware Task Management and Frequency Scaling in Embedded Systems.

[BibT_eX]

[DOI]

Proceedings of the 26th IEEE International Symposium on Computer Architecture and High Performance Computing, 2014

Accelerating Phylogenetic Inference on GPUs: an OpenACC and CUDA comparison.

[BibT_eX]

[DOI]

Proceedings of the International Work-Conference on Bioinformatics and Biomedical Engineering, 2014

ROM-less RNS-to-binary converter moduli {22n - 1, 22n + 1, 2n - 3, 2n + 3}.

[BibT_eX]

[DOI]

Proceedings of the 2014 International Symposium on Integrated Circuits (ISIC), 2014

Method for designing multi-channel RNS architectures to prevent power analysis SCA.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Circuits and Systemss, 2014

FEVES: Framework for Efficient Parallel Video Encoding on Heterogeneous Systems.

[BibT_eX]

[DOI]

Proceedings of the 43rd International Conference on Parallel Processing, 2014

Collaborative inter-prediction on CPU+GPU systems.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Reconfigurable data flow engine for HEVC motion estimation.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Cooperative CPU+GPU deblocking filter parallelization for high performance HEVC video codecs.

[BibT_eX]

[DOI]

Diego F. de Souza

Proceedings of the IEEE International Conference on Acoustics, 2014

Opencl parallelization of the HEVC de-quantization and inverse transform for heterogeneous platforms.

[BibT_eX]

[DOI]

Diego F. de Souza

Proceedings of the 22nd European Signal Processing Conference, 2014

Nonlinear system identification using constellation based multiple model adaptive estimators.

[BibT_eX]

[DOI]

José Jasnau Caeiro

Proceedings of the 22nd European Signal Processing Conference, 2014

SchedMon: A Performance and Energy Monitoring Tool for Modern Multi-cores.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par 2014: Parallel Processing Workshops, 2014

Combining flexibility with low power: Dataflow and wide-pipeline LDPC decoding engines in the Gbit/s era.

[BibT_eX]

[DOI]

Gabriel Falcão

Proceedings of the IEEE 25th International Conference on Application-Specific Systems, 2014

Finite-Difference in Time-Domain Scalable Implementations on CUDA and OpenCL.

[BibT_eX]

[DOI]

Lidia Kuan

Proceedings of the Numerical Computations with GPUs, 2014

2013

On the Design of RNS Reverse Converters for the Four-Moduli Set ${\bf\{2^{\mmb n}+1, 2^{\mmb n}-1, 2^{\mmb n}, 2^{{\mmb n}+1}+1\}}$.

[BibT_eX]

[DOI]

IEEE Trans. Very Large Scale Integr. Syst., 2013

A Lab Project on the Design and Implementation of Programmable and Configurable Embedded Systems.

[BibT_eX]

[DOI]

IEEE Trans. Educ., 2013

Method to Design General RNS Reverse Converters for Extended Moduli Sets.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. II Express Briefs, 2013

RNS Reverse Converters for Moduli Sets With Dynamic Ranges up to (8n+1)-bit.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. I Regul. Pap., 2013

The CRNS framework and its application to programmable and reconfigurable cryptography.

[BibT_eX]

[DOI]

ACM Trans. Archit. Code Optim., 2013

Scalable Unified Transform Architecture for Advanced Video Coding Embedded Systems.

[BibT_eX]

[DOI]

Int. J. Parallel Program., 2013

Randomised multi-modulo residue number system architecture for double-and-add to prevent power analysis side channel attacks.

[BibT_eX]

[DOI]

IET Circuits Devices Syst., 2013

Monitoring Performance and Power for Application Characterization with the Cache-Aware Roofline Model.

[BibT_eX]

[DOI]

Proceedings of the Parallel Processing and Applied Mathematics, 2013

Stressing the BER simulation of LDPC codes in the error floor region using GPU clusters.

[BibT_eX]

[DOI]

Proceedings of the ISWCS 2013, 2013

A comparison of computing architectures and parallelization frameworks based on a two-dimensional FDTD.

[BibT_eX]

[DOI]

Lidia Kuan

Proceedings of the International Conference on High Performance Computing & Simulation, 2013

An RNS-based architecture targeting hardware accelerators for modular arithmetic.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

Open the Gates: Using High-level Synthesis towards programmable LDPC decoders on FPGAs.

[BibT_eX]

[DOI]

Gabriel Falcão

Proceedings of the IEEE Global Conference on Signal and Information Processing, 2013

Accelerating the Computation of Induced Dipoles for Molecular Mechanics with Dataflow Engines.

[BibT_eX]

[DOI]

Proceedings of the 21st IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2013

High performance multi-standard architecture for DCT computation in H.264/AVC High Profile and HEVC codecs.

[BibT_eX]

[DOI]

Proceedings of the 2013 Conference on Design and Architectures for Signal and Image Processing, 2013

DARNS: A randomized multi-modulo RNS architecture for double-and-add in ECC to prevent power analysis side channel attacks.

[BibT_eX]

[DOI]

Jude Angelo Ambrose

Proceedings of the 18th Asia and South Pacific Design Automation Conference, 2013

A compact and scalable RNS architecture.

[BibT_eX]

[DOI]

Proceedings of the 24th International Conference on Application-Specific Systems, 2013

2012

Corrections to "MRC-Based RNS Reverse Converters for the Four-Moduli Sets 2n+1, 2n-1, 2n, 22n+1-1 and 2n+1, 2n-1, 22n, 22n+1-1".

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. II Express Briefs, 2012

MRC-Based RNS Reverse Converters for the Four-Moduli Sets 2n+1, 2n-1, 2n, 22n+1-1 and 2n+1, 2n-1, 22n, 22n+1-1.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. II Express Briefs, 2012

Portable LDPC Decoding on Multicores Using OpenCL [Applications Corner].

[BibT_eX]

[DOI]

IEEE Signal Process. Mag., 2012

Fine-grain parallelism using multi-core, Cell/BE, and GPU Systems.

[BibT_eX]

[DOI]

Pedro Trancoso

Alexandros Stamatakis

Guochun Shi

Volodymyr V. Kindratenko

Parallel Comput., 2012

Computation of Induced Dipoles in Molecular Mechanics Simulations Using Graphics Processors.

[BibT_eX]

[DOI]

Johannes M. Dieterich

Ricardo A. Mata

J. Chem. Inf. Model., 2012

Configurable M-factor VLSI DVB-S2 LDPC decoder architecture with optimized memory tiling design.

[BibT_eX]

[DOI]

Marco Alexandre Cravo Gomes

Joao Cacheira

EURASIP J. Wirel. Commun. Netw., 2012

RNS-Based Elliptic Curve Point Multiplication for Massive Parallel Architectures.

[BibT_eX]

[DOI]

Jean-Claude Bajard

Comput. J., 2012

Energy efficient stream-based configurable architecture for embedded platforms.

[BibT_eX]

[DOI]

Proceedings of the 2012 International Conference on Embedded Computer Systems: Architectures, 2012

On Realistic Divisible Load Scheduling in Highly Heterogeneous Distributed Systems.

[BibT_eX]

[DOI]

Proceedings of the 20th Euromicro International Conference on Parallel, 2012

Simultaneous Multi-Level Divisible Load Balancing for Heterogeneous Desktop Systems.

[BibT_eX]

[DOI]

Proceedings of the 10th IEEE International Symposium on Parallel and Distributed Processing with Applications, 2012

Multi-level Parallelization of Advanced Video Coding on Hybrid CPU+GPU Platforms.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par 2012: Parallel Processing Workshops, 2012

Hierarchical Partitioning Algorithm for Scientific Computing on Highly Heterogeneous CPU + GPU Clusters.

[BibT_eX]

[DOI]

David Clarke

Alexey L. Lastovetsky

Proceedings of the Euro-Par 2012 Parallel Processing - 18th International Conference, 2012

VLSI Reverse Converter for RNS Based on the Moduli Set.

[BibT_eX]

[DOI]

Proceedings of the 15th Euromicro Conference on Digital System Design, 2012

RNS Arithmetic Units for Modulo {2^n+-k}.

[BibT_eX]

[DOI]

Proceedings of the 15th Euromicro Conference on Digital System Design, 2012

High Performance Unified Architecture for Forward and Inverse Quantization in H.264/AVC.

[BibT_eX]

[DOI]

Proceedings of the 15th Euromicro Conference on Digital System Design, 2012

Efficient implementation of multi-moduli architectures for Binary-to-RNS conversion.

[BibT_eX]

[DOI]

Jude Angelo Ambrose

Proceedings of the 17th Asia and South Pacific Design Automation Conference, 2012

2011

Modeling and Evaluating Non-shared Memory CELL/BE Type Multi-core Architectures for Local Image and Video Processing.

[BibT_eX]

[DOI]

J. Signal Process. Syst., 2011

Massively LDPC Decoding on Multicore Architectures.

[BibT_eX]

[DOI]

IEEE Trans. Parallel Distributed Syst., 2011

A flexible architecture for the computation of direct and inverse transforms in H.264/AVC video codecs.

[BibT_eX]

[DOI]

IEEE Trans. Consumer Electron., 2011

A tutorial overview on the properties of the discrete cosine transform for encoded image and video processing.

[BibT_eX]

[DOI]

Signal Process., 2011

Parallel Computing - Special Issue.

[BibT_eX]

[DOI]

Yves Robert

Denis Trystram

Parallel Comput., 2011

CHPS: An Environment for Collaborative Execution on Heterogeneous Desktop Systems.

[BibT_eX]

[DOI]

Int. J. Netw. Comput., 2011

High throughput and scalable architecture for unified transform coding in embedded H.264/AVC video coding systems.

[BibT_eX]

[DOI]

Proceedings of the 2011 International Conference on Embedded Computer Systems: Architectures, 2011

A new approach to system identification and parameter tuning with multiple model adaptive estimators.

[BibT_eX]

[DOI]

João Carlos Martins

José Jasnau Caeiro

Proceedings of the 7th International Symposium on Image and Signal Processing and Analysis, 2011

Real-time DVB-S2 LDPC decoding on many-core GPU accelerators.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

Introduction.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par 2011 Parallel Processing - 17th International Conference, 2011

Scheduling Divisible Loads on Heterogeneous Desktop Systems with Limited Memory.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par 2011: Parallel Processing Workshops - CCPI, CGWS, HeteroPar, HiBB, HPCVirt, HPPC, HPSS, MDGS, ProPer, Resilience, UCHPC, VHPC, Bordeaux, France, August 29, 2011

Binary-to-RNS Conversion Units for moduli {2^n ± 3}.

[BibT_eX]

[DOI]

Proceedings of the 14th Euromicro Conference on Digital System Design, 2011

Virtualization for Morphable Multi-Cores.

[BibT_eX]

[DOI]

Proceedings of the ARCS 2011, 2011

2010

Measuring and Extraction of Biological Information on New Handheld Biochip-Based Microsystem.

[BibT_eX]

[DOI]

Hugo Alexandre Ferreira

IEEE Trans. Instrum. Meas., 2010

On the Modeling of New Tunnel Junction Magnetoresistive Biosensors.

[BibT_eX]

[DOI]

IEEE Trans. Instrum. Meas., 2010

A quantitative analysis of firing rate estimators: Unveiling bias sources.

[BibT_eX]

[DOI]

Neurocomputing, 2010

An improved RNS generator 2n +/- k based on threshold logic.

[BibT_eX]

[DOI]

Proceedings of the 18th IEEE/IFIP VLSI-SoC 2010, 2010

Unifying stream based and reconfigurable computing to design application accelerators.

[BibT_eX]

[DOI]

Bruno Francisco

Proceedings of the 18th IEEE/IFIP VLSI-SoC 2010, 2010

Embedded multicore architectures for LDPC decoding.

[BibT_eX]

[DOI]

Proceedings of the 2010 International Conference on Embedded Computer Systems: Architectures, 2010

Programming Cell/BE and GPUs systems for real-time video encoding.

[BibT_eX]

[DOI]

Proceedings of the Real-Time Image and Video Processing 2010, 2010

p264: open platform for designing parallel H.264/AVC video encoders on multi-core systems.

[BibT_eX]

[DOI]

António Rodrigues

Proceedings of the Network and Operating System Support for Digital Audio and Video, 2010

H.264/AVC framework for multi-core embedded video encoders.

[BibT_eX]

[DOI]

Proceedings of the 2010 International Symposium on System on Chip, SoC 2010, Tampere, 2010

An improved RNS reverse converter for the {22n+1-1, 2n, 2n-1} moduli set.

[BibT_eX]

[DOI]

Kazeem Alagbe Gbolagade

Sorin Dan Cotofana

Proceedings of the International Symposium on Circuits and Systems (ISCAS 2010), May 30, 2010

Collaborative execution environment for heterogeneous parallel systems.

[BibT_eX]

[DOI]

Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Exploiting SIMD extensions for linear image processing with OpenCL.

[BibT_eX]

[DOI]

Proceedings of the 28th International Conference on Computer Design, 2010

High-Performance Computing on Heterogeneous Systems: Database Queries on CPU and GPU.

[BibT_eX]

[DOI]

Proceedings of the High Performance Computing: From Grids and Clouds to Exascale, 2010

Arithmetic Units for RNS Moduli {2n-3} and {2n+3} Operations.

[BibT_eX]

[DOI]

Proceedings of the 13th Euromicro Conference on Digital System Design, 2010

Hardware/software co-design of H.264/AVC encoders for multi-core embedded systems.

[BibT_eX]

[DOI]

Proceedings of the 2010 Conference on Design & Architectures for Signal & Image Processing, 2010

Iterative induced dipoles computation for molecular mechanics on GPUs.

[BibT_eX]

[DOI]

Ricardo A. Mata

Proceedings of 3rd Workshop on General Purpose Processing on Graphics Processing Units, 2010

Elliptic Curve point multiplication on GPUs.

[BibT_eX]

[DOI]

Jean-Claude Bajard

Proceedings of the 21st IEEE International Conference on Application-specific Systems Architectures and Processors, 2010

Efficient Independent Component Analysis on a GPU.

[BibT_eX]

[DOI]

Rui Ramalho

Proceedings of the 10th IEEE International Conference on Computer and Information Technology, 2010

2009

A Feature Selection Algorithm for the Regularization of Neuron Models.

[BibT_eX]

[DOI]

IEEE Trans. Instrum. Meas., 2009

A Portable and Autonomous Magnetic Detection Platform for Biosensing.

[BibT_eX]

[DOI]

Verónica C. Martins

Sensors, 2009

Modelling and programming stream-based distributed computing based on the meta-pipeline approach.

[BibT_eX]

[DOI]

Int. J. Parallel Emergent Distributed Syst., 2009

Parallel LDPC Decoding on GPUs Using a Stream-Based Computing Approach.

[BibT_eX]

[DOI]

J. Comput. Sci. Technol., 2009

Neural code metrics: Analysis and application to the assessment of neural models.

[BibT_eX]

[DOI]

Neurocomputing, 2009

Development and evaluation of scalable video motion estimators on GPU.

[BibT_eX]

[DOI]

Proceedings of the IEEE Workshop on Signal Processing Systems, 2009

Applying the Stream-Based Computing Model to Design Hardware Accelerators: A Case Study.

[BibT_eX]

[DOI]

Proceedings of the Embedded Computer Systems: Architectures, 2009

On the design of distributed autonomous embedded systems for biomedical applications.

[BibT_eX]

[DOI]

Rui Ramalho

Proceedings of the 3rd International Conference on Pervasive Computing Technologies for Healthcare, 2009

CaravelaMPI: Message Passing Interface for Parallel GPU-Based Applications.

[BibT_eX]

[DOI]

Proceedings of the Eighth International Symposium on Parallel and Distributed Computing, 2009

Distributed Software Platform for Automation and Control of General Anaesthesia.

[BibT_eX]

[DOI]

Gesner Passos

Bertinho Andrade da Costa

João Miranda Lemos

Proceedings of the Eighth International Symposium on Parallel and Distributed Computing, 2009

How GPUs can outperform ASICs for fast LDPC decoding.

[BibT_eX]

[DOI]

Proceedings of the 23rd international conference on Supercomputing, 2009

Fine-grain Parallelism Using Multi-core, Cell/BE, and GPU Systems: Accelerating the Phylogenetic Likelihood Function.

[BibT_eX]

[DOI]

Pedro Trancoso

Alexandros Stamatakis

Proceedings of the ICPP 2009, 2009

Multi-core platforms for signal processing: source and channel coding.

[BibT_eX]

[DOI]

Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Parallel LDPC Decoding on the Cell/B.E. Processor.

[BibT_eX]

[DOI]

José Marinho

Proceedings of the High Performance Embedded Architectures and Compilers, 2009

Compact and Flexible Microcoded Elliptic Curve Processor for Reconfigurable Devices.

[BibT_eX]

[DOI]

Proceedings of the FCCM 2009, 2009

Preface.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par 2009, 2009

2008

Cost-Efficient SHA Hardware Accelerators.

[BibT_eX]

[DOI]

IEEE Trans. Very Large Scale Integr. Syst., 2008

Statistical Analysis of a Spike Train Distance in Poisson Models.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2008

Parallel Advanced Video Coding: Motion Estimation on Multi-cores.

[BibT_eX]

[DOI]

Scalable Comput. Pract. Exp., 2008

Massive parallel LDPC decoding on GPU.

[BibT_eX]

[DOI]

Proceedings of the 13th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2008

Edge Stream Oriented LDPC Decoding.

[BibT_eX]

[DOI]

Marco Alexandre Cravo Gomes

Proceedings of the 16th Euromicro International Conference on Parallel, 2008

Heuristic Optimization Methods for Improving Performance of Recursive General Purpose Applications on GPUs.

[BibT_eX]

[DOI]

Koichi Wada

Proceedings of the 7th International Symposium on Parallel and Distributed Computing (ISPDC 2008), 2008

Distributed Web-based Platform for Computer Architecture Simulation.

[BibT_eX]

[DOI]

Proceedings of the 7th International Symposium on Parallel and Distributed Computing (ISPDC 2008), 2008

Design and implementation of a tool for modeling and programming deadlock free meta-pipeline applications.

[BibT_eX]

[DOI]

Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

BRAM-LUT Tradeoff on a Polymorphic DES Design.

[BibT_eX]

[DOI]

Proceedings of the High Performance Embedded Architectures and Compilers, 2008

Efficient FPGA elliptic curve cryptographic processor over GF(2m).

[BibT_eX]

[DOI]

Proceedings of the 2008 International Conference on Field-Programmable Technology, 2008

On-the-fly attestation of reconfigurable hardware.

[BibT_eX]

[DOI]

Georgi Kuzmanov

Proceedings of the FPL 2008, 2008

Application Specific Programmable IP Core for Motion Estimation: Technology Comparison Targeting Efficient Embedded Co-Processing Units.

[BibT_eX]

[DOI]

Proceedings of the 11th Euromicro Conference on Digital System Design: Architectures, 2008

An RNS based Specific Processor for Computing the Minimum Sum-of-Absolute-Differences.

[BibT_eX]

[DOI]

Proceedings of the 11th Euromicro Conference on Digital System Design: Architectures, 2008

Merged Computation for Whirlpool Hashing.

[BibT_eX]

[DOI]

Proceedings of the Design, Automation and Test in Europe, 2008

A Parallel Algorithm for Advanced Video Motion Estimation on Multicore Architectures.

[BibT_eX]

[DOI]

Proceedings of the Second International Conference on Complex, 2008

Low power microarchitecture with instruction reuse.

[BibT_eX]

[DOI]

Proceedings of the 5th Conference on Computing Frontiers, 2008

Towards a Unified Model for the Retina - Static vs Dynamic Integrate and Fire Models.

[BibT_eX]

Proceedings of the First International Conference on Biomedical Electronics and Devices, 2008

2007

Reconfigurable architectures and processors for real-time video motion estimation.

[BibT_eX]

[DOI]

J. Real Time Image Process., 2007

Improving residue number system multiplication with more balanced moduli sets and enhanced modular arithmetic structures.

[BibT_eX]

[DOI]

IET Comput. Digit. Tech., 2007

Embedded Systems for Portable and Mobile Video Platforms.

[BibT_eX]

[DOI]

EURASIP J. Embed. Syst., 2007

Adaptive Motion Estimation Processor for Autonomous Video Devices.

[BibT_eX]

[DOI]

Nuno Filipe Valentim Roma

EURASIP J. Embed. Syst., 2007

Efficient Hybrid DCT-Domain Algorithm for Video Spatial Downscaling.

[BibT_eX]

[DOI]

EURASIP J. Adv. Signal Process., 2007

Caravela: A Novel Stream-Based Distributed Computing Environment.

[BibT_eX]

[DOI]

Computer, 2007

Developing and Integrating Lab Projects as Important Learning Components in an Embedded Systems Course.

[BibT_eX]

[DOI]

Francisco André Corrêa Alegria

Proceedings of the IEEE International Conference on Microelectronic Systems Education, 2007

Meta-Pipeline: A New Execution Mechanism for Distributed Pipeline Processing.

[BibT_eX]

[DOI]

Tomás Brandão

Proceedings of the 6th International Symposium on Parallel and Distributed Computing (ISPDC 2007), 2007

A New Handheld Biochip-based Microsystem.

[BibT_eX]

[DOI]

Hugo Alexandre Ferreira

Proceedings of the International Symposium on Circuits and Systems (ISCAS 2007), 2007

Generic Architecture Designed for Biomedical Embedded Systems.

[BibT_eX]

[DOI]

Proceedings of the Embedded System Design: Topics, Techniques and Trends, IFIP TC10 Working Conference: International Embedded Systems Symposium (IESS), May 30, 2007

Additive Logistic Regression Applied to Retina Modelling.

[BibT_eX]

[DOI]

Sérgio F. Martins

Proceedings of the International Conference on Image Processing, 2007

An Efficient Expectation-Maximisation Algorithm for Spike Classification.

[BibT_eX]

[DOI]

Proceedings of the 15th International Conference on Digital Signal Processing, 2007

Adaptive Motion Estimation Algorithm for H.264/AVC.

[BibT_eX]

[DOI]

Proceedings of the 15th International Conference on Digital Signal Processing, 2007

A Run-time Reconfigurable Processor for Video Motion Estimation.

[BibT_eX]

[DOI]

Miguel Ribeiro

Proceedings of the FPL 2007, 2007

Stochastic integrate-and-fire model for the retina.

[BibT_eX]

[DOI]

Sergio Capela

Proceedings of the 15th European Signal Processing Conference, 2007

Data buffering optimization methods toward a uniform programming interface for gpu-based applications.

[BibT_eX]

[DOI]

Diogo Antão

Proceedings of the 4th Conference on Computing Frontiers, 2007

Design and implementation of a stream-based distributedcomputing platform using graphics processing units.

[BibT_eX]

[DOI]

Proceedings of the 4th Conference on Computing Frontiers, 2007

Efficient Method for Magnitude Comparison in RNS Based on Two Pairs of Conjugate Moduli.

[BibT_eX]

[DOI]

Proceedings of the 18th IEEE Symposium on Computer Arithmetic (ARITH-18 2007), 2007

2006

Toward a Realistic Task Scheduling Model.

[BibT_eX]

[DOI]

Frode Eika Sandnes

IEEE Trans. Parallel Distributed Syst., 2006

A New Hand-Held Microsystem Architecture for Biological Analysis.

[BibT_eX]

[DOI]

Bertinho Andrade da Costa

João Miranda Lemos

Hugo Alexandre Ferreira

IEEE Trans. Circuits Syst. I Regul. Pap., 2006

Maestro2: Experimental Evaluation of Communication Performance Improvement Techniques in the Link Layer.

[BibT_eX]

[DOI]

J. Interconnect. Networks, 2006

Rescheduling for Optimized SHA-1 Calculation.

[BibT_eX]

[DOI]

Proceedings of the Embedded Computer Systems: Architectures, 2006

Low Power Distance Measurement Unit for Real-Time Hardware Motion Estimators.

[BibT_eX]

[DOI]

Proceedings of the Integrated Circuit and System Design. Power and Timing Modeling, 2006

Reconfigurable memory based AES co-processor.

[BibT_eX]

[DOI]

Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006

Application Specific Instruction Set Processor for Adaptive Video Motion Estimation.

[BibT_eX]

[DOI]

Proceedings of the Ninth Euromicro Conference on Digital System Design: Architectures, Methods and Tools (DSD 2006), 30 August, 2006

Improving SHA-2 Hardware Implementations.

[BibT_eX]

[DOI]

Proceedings of the Cryptographic Hardware and Embedded Systems, 2006

Configurable Embedded Core for Controlling Electro-Mechanical Systems.

[BibT_eX]

[DOI]

Rodrigo Piedade

Proceedings of the Reconfigurable Computing: Architectures and Applications, 2006

2005

Communication Contention in Task Scheduling.

[BibT_eX]

[DOI]

IEEE Trans. Parallel Distributed Syst., 2005

Corrections to "A Universal Architecture for Designing Efficient Modulo 2n+1 Multipliers".

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. I Regul. Pap., 2005

A universal architecture for designing efficient modulo 2n+1 multipliers.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. I Regul. Pap., 2005

Visual neuroprosthesis: a non invasive system for stimulating the cortex.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. I Regul. Pap., 2005

Efficient VLSI Architecture for Real-Time Motion Estimation in Advanced Video Coding.

[BibT_eX]

[DOI]

Proceedings of the Proceedings 2005 IEEE International SOC Conference, 2005

On the Implementation and Evaluation of Berkeley Sockets on Maestro2 cluster computing environment.

[BibT_eX]

[DOI]

Ricardo Guapo

Proceedings of the 4th International Symposium on Parallel and Distributed Computing (ISPDC 2005), 2005

Least squares motion estimation algorithm in the compressed DCT domain for H.26x/MPEG-x video sequences.

[BibT_eX]

[DOI]

Proceedings of the Advanced Video and Signal Based Surveillance, 2005

The Midlifekicker Microarchitecture Evaluation Metric.

[BibT_eX]

[DOI]

Stamatis Vassiliadis

Georgi Gaydadjiev

Proceedings of the 16th IEEE International Conference on Application-Specific Systems, 2005

2004

On Task Scheduling Accuracy: Evaluation Methodology and Results.

[BibT_eX]

[DOI]

J. Supercomput., 2004

List scheduling: extension for contention awareness and evaluation of node priorities for heterogeneous cluster architectures.

[BibT_eX]

[DOI]

Parallel Comput., 2004

A programmable cellular neural network circuit.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Symposium on Integrated Circuits and Systems Design, 2004

Task Scheduling: Considering the Processor Involvement in Communication.

[BibT_eX]

[DOI]

Proceedings of the 3rd International Symposium on Parallel and Distributed Computing (ISPDC 2004), 2004

Distributed Shared Memory System Based on the Maestro2 High Performance Cluster Network.

[BibT_eX]

[DOI]

Proceedings of the 3rd International Symposium on Parallel and Distributed Computing (ISPDC 2004), 2004

On the performance of Maestro2 high performance network equipment, using new improvement techniques.

[BibT_eX]

[DOI]

Proceedings of the 23rd IEEE International Performance Computing and Communications Conference, 2004

{2n+1, sn+k, sn-1}: A New RNS Moduli Set Extension.

[BibT_eX]

[DOI]

Proceedings of the 2004 Euromicro Symposium on Digital Systems Design (DSD 2004), Architectures, Methods and Tools, 31 August, 2004

2003

Automatic Synthesis of Motion Estimation Processors Based on a New Class of Hardware Architectures.

[BibT_eX]

[DOI]

J. VLSI Signal Process., 2003

Fast transcoding architectures for insertion of non-regular shaped objects in the compressed DCT-domain.

[BibT_eX]

[DOI]

Signal Process. Image Commun., 2003

An FPL Bioinspired Visual Encoding System to Stimulate Cortical Neurons in Real-Time.

[BibT_eX]

[DOI]

Francisco J. Pelayo

Antonio Martínez-Álvarez

Christian A. Morillas

Samuel F. Romero

Proceedings of the Field Programmable Logic and Application, 13th International Conference, 2003

Customisable Core-Based Architectures for Real-Time Motion Estimation on FPGAs.

[BibT_eX]

[DOI]

Proceedings of the Field Programmable Logic and Application, 13th International Conference, 2003

RDSP: A RISC DSP based on Residue Number System.

[BibT_eX]

[DOI]

Proceedings of the 2003 Euromicro Symposium on Digital Systems Design (DSD 2003), 2003

2002

Efficient and configurable full-search block-matching processors.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2002

Video coding by using the 3D zero-tree approach in the wavelet transform domain.

[BibT_eX]

[DOI]

José A. C. Salvado

Proceedings of the 14th International Conference on Digital Signal Processing, 2002

Insertion of irregular-shaped logos in the compressed DCT domain.

[BibT_eX]

[DOI]

Proceedings of the 14th International Conference on Digital Signal Processing, 2002

2001

A New Efficient VLSI Architecture for Full Search Block Matching Motion Estimation.

[BibT_eX]

Proceedings of the SOC Design Methodologies, 2001

Comparison of Contention Aware List Scheduling Heuristics for Cluster Computing.

[BibT_eX]

[DOI]

Proceedings of the 30th International Workshops on Parallel Processing (ICPP 2001 Workshops), 2001

Scheduling Task Graphs on Arbitrary Processor Architectures Considering Contention.

[BibT_eX]

[DOI]

Proceedings of the High-Performance Computing and Networking, 9th International Conference, 2001

Exploiting Unused Time Slots in List Scheduling Considering Communication Contention.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par 2001: Parallel Processing, 2001

2000

Synchronous Non-local Image Processing on Orthogonal Multiprocessor Systems.

[BibT_eX]

[DOI]

Proceedings of the Vector and Parallel Processing, 2000

A Platform Independent Parallelising Tool Based on Graph Theoretic Models.

[BibT_eX]

[DOI]

Proceedings of the Vector and Parallel Processing, 2000

In the Development and Evaluation of Specialized Processors for Computing High-Order 2-D Image Moments in Real-Time.

[BibT_eX]

[DOI]

Proceedings of the Fifth International Workshop on Computer Architectures for Machine Perception (CAMP 2000), 2000

1999

Low-power array architectures for motion estimation.

[BibT_eX]

[DOI]

Proceedings of the Third IEEE Workshop on Multimedia Signal Processing, 1999

Applying Conditional Processing to Design Low-Power Array Processors for Motion Estimation.

[BibT_eX]

[DOI]

Proceedings of the 1999 International Conference on Image Processing, 1999

On the Development of a Video CODEC for Low Bitrate Communication in General Purpose Computers.

[BibT_eX]

Proceedings of the 17th IASTED International Conference on Applied Informatics, 1999

1998

Bidirectional systolic arrays for digital recursive filters.

[BibT_eX]

[DOI]

Proceedings of the 5th IEEE International Conference on Electronics, Circuits and Systems, 1998

1997

A new orthogonal multiprocessor and its application to image processing.

[BibT_eX]

[DOI]