Inproceedings,

Overcoming single-thread performance hurdles in the core fusion reconfigurable multicore architecture

J. Mukundan, S. Ghose, R. Karmazin, E. Ípek, and J. Mart\'ınez.
Proceedings of the 26th ACM international conference on Supercomputing, page 101--110. New York, NY, USA, ACM, (2012)
DOI: 10.1145/2304576.2304592

Abstract

Though the prime target of multicore architectures is parallel and multithreaded workloads (which favors maximum core count), executing sequential code fast continues to remain critical (which benefits from maximum core size). This poses a difficult design trade-off. Core Fusion is a recently-proposed reconfigurable multicore architecture that attempts to circumvent this compromise by "fusing" groups of fundamentally independent cores into larger, more aggressive processors dynamically as needed. In this way, it accommodates highly parallel, partially parallel, multiprogrammed, and sequential codes with ease. However, the sequential performance of the original fused configuration falls quite short of an area-equivalent, monolithic, out-of-order processor. This paper effectively eliminates the fusion deficit for sequential codes by attacking two major sources of inefficiency: collective commit and instruction steering. We demonstrate in detail that these modifications allow Core Fusion to essentially match the performance of an area-equivalent monolithic out-of-order processor. The implication is that the inclusion of wide-issue cores in future multicore designs may be unnecessary.

BibTeX key: Mukundan:2012:OSP:2304576.2304592
entry type: inproceedings
address: New York, NY, USA
booktitle: Proceedings of the 26th ACM international conference on Supercomputing
year: 2012
pages: 101--110
publisher: ACM
series: ICS '12
location: San Servolo Island, Venice, Italy
acmid: 2304592
isbn: 978-1-4503-1316-2
numpages: 10
DOI: 10.1145/2304576.2304592

Users

Comments and Reviewsshow / hide

Please log in to take part in the discussion (add own reviews or comments).

Cite this publication

@inproceedings{Mukundan:2012:OSP:2304576.2304592, abstract = {Though the prime target of multicore architectures is parallel and multithreaded workloads (which favors maximum core count), executing sequential code fast continues to remain critical (which benefits from maximum core size). This poses a difficult design trade-off. Core Fusion is a recently-proposed reconfigurable multicore architecture that attempts to circumvent this compromise by "fusing" groups of fundamentally independent cores into larger, more aggressive processors dynamically as needed. In this way, it accommodates highly parallel, partially parallel, multiprogrammed, and sequential codes with ease. However, the sequential performance of the original fused configuration falls quite short of an area-equivalent, monolithic, out-of-order processor. This paper effectively eliminates the fusion deficit for sequential codes by attacking two major sources of inefficiency: collective commit and instruction steering. We demonstrate in detail that these modifications allow Core Fusion to essentially match the performance of an area-equivalent monolithic out-of-order processor. The implication is that the inclusion of wide-issue cores in future multicore designs may be unnecessary.}, acmid = {2304592}, added-at = {2012-11-07T14:58:54.000+0100}, address = {New York, NY, USA}, author = {Mukundan, Janani and Ghose, Saugata and Karmazin, Robert and \'{I}pek, Engin and Mart\'{\i}nez, Jos{\'e} F.}, biburl = {https://www.bibsonomy.org/bibtex/2ab62de053ebfdcc56a901f2a0abb33d4/ytyoun}, booktitle = {Proceedings of the 26th ACM international conference on Supercomputing}, doi = {10.1145/2304576.2304592}, interhash = {25f987b0fcfc157f21b06508bb7e3364}, intrahash = {ab62de053ebfdcc56a901f2a0abb33d4}, isbn = {978-1-4503-1316-2}, keywords = {thread}, location = {San Servolo Island, Venice, Italy}, numpages = {10}, pages = {101--110}, publisher = {ACM}, series = {ICS '12}, timestamp = {2012-11-07T14:58:54.000+0100}, title = {Overcoming single-thread performance hurdles in the core fusion reconfigurable multicore architecture}, year = 2012 }

BibSonomy

Overcoming single-thread performance hurdles in the core fusion reconfigurable multicore architecture

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on