Choose Your Paper: Find your paper in this list to ensure that your submission goes the correct directory. Implementing a hybrid SRAM / eDRAM NUCA architecture Scalable Clustering using Multiple GPUs Coordination Mechanisms for Selfish Multi-Organization Scheduling Partial Globalization of Partitioned Address Spaces for Zero-copy Communication with Shared Memory Weighted Dynamic Scheduling for Mitigating Noise on Multicore Clusters STEAMEngine: Driving MapReduce Provisioning in the Cloud Spectral Evolution Simulation on Leading Multi-socket, Multicore Platforms High level template for the task-based parallel wavefront pattern Parallel Implementation of MOPSO on GPU using OpenCL and CUDA Robust Thread-Level Speculation Multi-model prediction for enhancing content locality in elastic server infrastructures Compute & Memory Optimizations for High-Quality Speech Recognition on Low-End GPU Processors Maximizing Throughput of Jobs with Multiple Resource Requirements Reliable and Randomized Data Distribution Strategies for Large Scale Storage Systems A Fast Centralized Computation Routing Algorithm for Self-Configuring NoC Systems Parallel Multiple Precision Division by a Single Precision Divisor A Multi-GPU Algorithm for Communication in Neuronal Network Simulations Optimizations for Message Driven Applications on Multicore Architectures Porting Irregular Reductions on Heterogeneous CPU-GPU Configurations Modelling Authorization and Execution of Video Workflows Increasing the Energy Efficiency of TLS Systems Using Intermediate Checkpointing A Multiresolution Data Model for Improving Simulation I/O Performance The Impact of Hyper-Threading on Processor Resource Utilization in Production Applications Adaptive Memory Power Management Techniques for HPC Workloads GVT Algorithms and Discrete Event Dynamics on 64K+ Processor Cores Dynamic hosting management of Web based applications over clouds Dynamic selection of tile sizes Scheduling Diverse High Performance Computing Systems With the Goal of Maximizing Utilization High Performance Cache Block Replication Using Re-Reference Probability in CMPs Hybrid Implementation of Error Diffusion Dithering A Dynamic Scheduling Framework for Emerging Heterogeneous Systems A Machine Learning-Based Approach for Thread Mapping on Transactional Memory Applications Hybrid Algorithms for List Ranking and Graph Connected Components Building Algorithmically Nonstop Fault Tolerant MPI Programs Supporting Computational Data Model Representation with High-performance I/O in Parallel netCDF Comparing archival policies for Blue Waters Highly Scalable Barriers for Future High-Performance Computing Clusters Enabling CUDA Acceleration within Virtual Machines using rCUDA Multi-threaded UPC Runtime with Network Endpoints: Design Alternatives and Evaluation on Multi-core Architectures Improving Graph Coloring on Distributed-Memory Parallel Computers
You need to sign, scan, and submit a single file, pdf, tiff, or jpeg file.