A methodology for speeding up loop kernels by exploiting the software information and the memory architecture

Vasilios Kelefouras; Angeliki Kritikakou; Costas Goutis

doi:10.1016/j.cl.2015.01.003

Article Dans Une Revue Computer Languages, Systems and Structures Année : 2015

A methodology for speeding up loop kernels by exploiting the software information and the memory architecture

(1) , (2) , (1)

1
2

Vasilios Kelefouras

Fonction : Auteur

Department of Electrical and Computer Engineering [Patras]

Angeliki Kritikakou

Fonction : Auteur
PersonId : 974711

Energy Efficient Computing ArchItectures with Embedded Reconfigurable Resources

Costas Goutis

Fonction : Auteur

Department of Electrical and Computer Engineering [Patras]

Résumé

It is well-known that today׳s compilers and state of the art libraries have three major drawbacks. First, the compiler sub-problems are optimized separately; this is not efficient because the separate sub-problems optimization gives a different schedule for each sub-problem and these schedules cannot coexist as the refining of one, causes the degradation of another. Second, they take into account only part of the specific algorithm׳s information. Third, they take into account only a few hardware architecture parameters. These approaches cannot give an optimal solution. In this paper, a new methodology/pre-compiler is introduced, which speeds up loop kernels, by overcoming the above problems. This methodology solves four of the major scheduling sub-problems, together as one problem and not separately; these are the sub-problems of finding the schedules with the minimum numbers of (i) L1 data cache accesses, (ii) L2 data cache accesses, (iii) main memory data accesses, (iv) addressing instructions. First, the exploration space (possible solutions) is found according to the algorithm׳s information, e.g. array subscripts. Then, the exploration space is decreased by orders of magnitude, by applying constraint propagation to the software and hardware parameters. We take the C-code and the memory architecture parameters as input and we automatically produce a new faster C-code; this code cannot be obtained by applying the existing compiler transformations to the original code. The proposed methodology has been evaluated for five well-known algorithms in both general and embedded processors; it is compared with gcc and clang compilers and also with iterative compilation.

Domaines

Informatique [cs]

Angeliki Kritikakou : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01239699

Soumis le : mardi 8 décembre 2015-10:19:35

Dernière modification le : vendredi 24 mars 2023-14:53:01

Dates et versions

hal-01239699 , version 1 (08-12-2015)

Identifiants

HAL Id : hal-01239699 , version 1
DOI : 10.1016/j.cl.2015.01.003

Citer

Vasilios Kelefouras, Angeliki Kritikakou, Costas Goutis. A methodology for speeding up loop kernels by exploiting the software information and the memory architecture. Computer Languages, Systems and Structures, 2015, 41, pp.21-41. ⟨10.1016/j.cl.2015.01.003⟩. ⟨hal-01239699⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

INSTITUT-TELECOM UNIV-RENNES1 CNRS INRIA INSA-RENNES IRISA CENTRALESUPELEC IRISA-D3 INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM

101 Consultations

0 Téléchargements

A methodology for speeding up loop kernels by exploiting the software information and the memory architecture

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager