Improving Compiler-Generated Transactional Code Performance and Programmability via Language-Level Constructs

Date: Feb 1, 2023.

Candidate: Bruno Chinelato Honorio

Advisor:

Prof. Dr. Guido Costa Souza de Araujo

Abstract:

With chip manufacturers such as Intel, IBM and ARM offering native support for trans- actional memory in their instruction set architectures, memory transactions are getting increasingly popular in the industry and as a research topic. Despite this recent increase in popularity on the hardware side of transactional memory (HTM), software support for transactional memory (STM) is still scarce and the only C/C++ compiler with transac- tional support currently available, the GNU Compiler Collection (GCC), does not generate code that achieves desirable performance and it is not easy to use, making the programmer take into account how functions are used and annotate them appropriately, which is far from the ideal abstraction of TM. Modern approaches seek to combine both HTM and STM to better exploit performance. In particular, Phased TMs (PhTMs) systems execute transactions in phases, not allowing both hardware and software transactions to run concurrently to avoid coordination over- heads. One of its main issues is designing good transition mode heuristics that exploit HTM as much as possible considering the STM side problems of performance. This thesis makes 5 main contributions to transactional memory research: Firstly, it proposes a novel annotation mechanism that selectively eliminates unnecessary trans- actional memory read/write barriers from compiler generated code. It does that by annotating variables that do not need to be instrumented with a type qualifier. Using the type qualifier, speedups of up to 7x and a reduction by 95% in barriers were shown on some applications of the STAMP benchmark. Secondly, these annotations were extended to be used in Phased Transactional Memory implementations. Next, it proposes a language-level construct that eases programmability of compiler-generated transactional code. In this case, TMFree work as a C style casting on functions that do not need to be instrumented. This work automates a lot of what the programmer had to do manually, significantly decreasing the job the programmer has to do. Results show that the programmer only has to manually insert 5 of this TMFree casting total on a group of STAMP applications to make them work with no performance loss, compared to the 358 annotations that were necessary for the same results. As a fourth contribution, support for TM was added to the Clang/LLVM framework, and both annotation methods created were implemented in this new TM support. And finally, it proposes a transition mode mechanism (CTM) to better exploit both modes of phases transactional memory. It does that by using a different approach in PhTM: the use of commit throughput and cache simulation to mimic the behavior of HTM storage constraints while in STM mode. Experimental results have shown that up to 5x of speedups on some applications of the STAMP benchmark were achieved when using this new approach. This thesis, then, makes a case for transactional memory being a strong contender for parallel programming techniques, with many of the results presented being the state-of- the-art for transactional memory research.

Multimaterial and Multidomain Acoustic Topology Optimization Based on an Evolutionary Approach.

Stress-based structural topology optimization for design-dependent loads problems using the BESO method.

Related posts

Topological optimization of the adhesive of bonded joints formed by the superposition of two adherents using the cohesive zone model as failure criterion.

Stress-based structural topology optimization for design-dependent loads problems using the BESO method.

Multimaterial and Multidomain Acoustic Topology Optimization Based on an Evolutionary Approach.