A quantitative comparison of PRAM based emulated shared memory architectures to current multicore CPUs and GPUs

Erik Hansson, Erik Alnervik, Christoph Kessler, Martti Forsell

    Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingsScientificpeer-review

    Abstract

    The performance of current multicore CPUs and GPUs is limited in computations making frequent use of com- munication/synchronization between the subtasks executed in parallel. This is because the directory-based cache systems scale weakly and/or the cost of synchronization is high. The Emulated Shared Memory (ESM) architectures relying on multithreading and efficient synchronization mechanism have been developed to solve these problems affecting both performance and pro- grammability of current machines. In this paper, we compare preliminarily the performance of three hardware implemented ESM architectures with state-of-the-art multicore CPUs and GPUs. The benchmarks are selected to cover different patterns of parallel computation and therefore reveal the performance potential of ESM architectures with respect to current multicores.
    Original languageEnglish
    Title of host publication2014 Workshop Proceedings on Architecture of Computing Systems (ARCS 2014)
    PublisherVDE Verlag
    Number of pages7
    ISBN (Print)978-3-8007-3579-2
    Publication statusPublished - 2014
    MoE publication typeA4 Article in a conference publication
    Event27th International Conference on Architecture of Computing Systems, ARCS 2014 - Lübeck, Germany
    Duration: 25 Feb 201428 Feb 2014

    Conference

    Conference27th International Conference on Architecture of Computing Systems, ARCS 2014
    Abbreviated titleARCS 2014
    Country/TerritoryGermany
    CityLübeck
    Period25/02/1428/02/14

    Fingerprint

    Dive into the research topics of 'A quantitative comparison of PRAM based emulated shared memory architectures to current multicore CPUs and GPUs'. Together they form a unique fingerprint.

    Cite this