A quantitative comparison of PRAM based emulated shared memory architectures to current multicore CPUs and GPUs

E. Hansson, E. Alnervik, C. Kessler, Martti Forsell

    Research output: Chapter in Book/Report/Conference proceeding; Conference article in proceedings; Scientific; peer-review

    Abstract

    The performance of current multicore CPUs and GPUs is limited in computations that make frequent use of communication/synchronization between the subtasks executed in parallel. This is because the directory-based cache systems scale weakly and/or the cost of synchronization is high. The Emulated Shared Memory (ESM) architectures, relying on multithreading and efficient synchronization mechanisms, have been developed to solve these problems, which affect both the performance and the programmability of current machines. In this paper, we present a preliminary comparison of the performance of three hardware-implemented ESM architectures with state-of-the-art multicore CPUs and GPUs. The benchmarks are selected to cover different patterns of parallel computation and therefore reveal the performance potential of ESM architectures with respect to current multicores.
    Original language: English
    Title of host publication: ARCS 2014 Workshop Proceedings
    Number of pages: 7
    Publication status: Published - 2014
    MoE publication type: A4 Article in a conference publication
    Event: 27th International Conference on Architecture of Computing Systems, ARCS 2014 - Lübeck, Germany
    Duration: 25 Feb 2014 to 28 Feb 2014

    Conference

    Conference: 27th International Conference on Architecture of Computing Systems, ARCS 2014
    Abbreviated title: ARCS 2014
    Country: Germany
    City: Lübeck
    Period: 25/02/14 to 28/02/14
