Tiziano De Matteis | Tiziano De Matteis

Latest

Exploring GPU-to-GPU Communication: Insights into Supercomputer Interconnects
Developing a BLAS library for the AMD AI Engine
ExDe: Design space exploration of scheduler architectures and mechanisms for serverless data-processing
FootPrinter: Quantifying Data Center Carbon Footprint
The Cost of Simplicity: Understanding Datacenter Scheduler Programming Abstractions
Streaming Task Graph Scheduling for Dataflow Architectures
Noise in the Clouds: Influence of Network Performance Variability on Application Scalability
Python FPGA Programming with Data-Centric Multi-Level Design
Productivity, Portability, Performance: Data-Centric Python
StencilFlow: Mapping Large Stencil Programs to Distributed Spatial Computing Systems
FBLAS: Streaming Linear Algebra on FPGA
Streaming Message Interface: High-Performance DistributedMemory Programming on Reconfigurable Hardware
GASSER: An Auto-Tunable System for General Sliding-Window Streaming Operators on GPUs
D2K: Scalable Community Detection in Massive Networks via Small-Diameter k-Plexes
Reducing Message Latency and CPU Utilization in the CAF Actor Framework
Simplifying self-adaptive and power-aware computing with Nornir
The RePhrase Extended Pattern Set for Data Intensive Parallel Computing
Bringing Parallel Patterns Out of the Corner: The P$^3$ARSEC Benchmark Suite
On dynamic memory allocation in sliding-window parallel patterns for streaming analytics
Parallel Continuous Preference Queries over Out-of-Order and Bursty Data Streams
Parallel Patterns for Window-Based Stateful Operators on Data Streams: An Algorithmic Skeleton Approach
Elastic Scaling for Distributed Latency-sensitive Data Stream Operators
Evaluating Concurrency Throttling and Thread Packing on SMT Multicores
Nornir: A Customisable Framework for Autonomic and Power-Aware Applications
P$^3$ARSEC: Towards Parallel Patterns Benchmarking
Proactive elasticity and energy awareness in data stream processing
A Divide-and-conquer Parallel Pattern Implementation for Multicores
Continuous Skyline Queries on Multicore Architectures
Data stream processing via code annotations
Keep Calm and React with Foresight: Strategies for Low-Latency and Energy-Efficient Elastic Data Stream Processing
Parallel Patterns for Adaptive Data Stream Processing
A Multicore Parallelization of Continuous Skyline Queries on Data Streams
Parallelizing High-Frequency Trading Applications by using C+ + 11 Attributes
Autonomic Parallel Data Stream Processing
A High-Throughput and Low-Latency Parallelization of Window-based Stream Joins on Multicores
A Lightweight Run-Time Support for Fast Dense Linear Algebra on Multi-Core
Optimizing Message-Passing on Multicore Architectures Using Hardware Multi-threading
Evaluation of Architectural Supports for Fine-Grained Synchronization Mechanisms