Translating the ARM Neon and VFP instructions in a binary translator

被引：6

作者：

Guo, Yu-Chuan ^{[1
]}

Yang, Wuu ^{[1
]}

Chen, Jiunn-Yeu ^{[1
]}

Lee, Jenq-Kuen ^{[2
]}

机构：

[1] Natl Chiao Tung Univ, Hsinchu, Taiwan

[2] Natl Tsing Hua Univ, Hsinchu, Taiwan

来源：

SOFTWARE-PRACTICE & EXPERIENCE | 2016年 / 46卷 / 12期

关键词：

binary translation; cloud computing; LLVM; floating-point instruction; Neon; vector instruction; VFP; virtualization;

D O I：

10.1002/spe.2394

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Binary translation attempts to emulate one instruction set with another on the same or different platforms. The important technique is widely used in modern software. Vector and floating-point instructions are widely used in many applications, including multimedia, graphics, and gaming. Although these instructions are usually simulated with software in a binary translator, it is important to support them such that the host single-instruction, multiple-data (SIMD) and floating-point hardware are efficiently used during emulation. We report our design and implementation of the emulation of ARM Neon and vector floating point (VFP) instructions in the machine-code-to-low-level-virtual-machine (MC2LLVM) binary translator. The Neon and VFP instructions are first translated into carefully chosen sequences of LLVM intermediate representation (IR), and later, the IR sequences are optimized and translated into the host native binary by the existing LLVM backend. Because MC2LLVM makes use of the vector and floating-point types in LLVM IR, the generated host native binary can take full advantage of the vector and floating-point functional units, if present, of the host machine. To be fully compliant with Neon and VFP instruction sets, all the features are supported, including the flush-to-zero mode, default not a number mode, and floating-point exceptions. The experimental results show that code generated by MC2LLVM with the Neon and VFP extensions achieves an average speedup of 1.174x in SPEC 2006 benchmark suites and exhibits a floating-point throughput of 12.05x in LINPACK, compared with code generated by MC2LLVM without the Neon and VFP extensions. Furthermore, MC2LLVM is 3.36x faster than QEMU for processing Neon/VFP instructions. Copyright (c) 2016 John Wiley & Sons, Ltd.

引用

下载

页码：1591 / 1615

页数：25

共 50 条

[41] The Binary Identities of Translator in Self-translation
黄焱
海外英语, 2017, (06) : 123 - 124
[42] The job of translating 1: Interview with the translator JUAN ANTONIO VIVANCO GEFAELL
Vivanco Gefaell, Juan Antonio
ZIBALDONE-ESTUDIOS ITALIANOS DE LA TORRE DEL VIRREY, 2013, 1 (01): : 100 - 102
[43] ON TRANSLATING THE LIVING AND THE DEAD - SOME THOUGHTS OF A HEBREW-ENGLISH TRANSLATOR
HALKIN, H
PROOFTEXTS-A JOURNAL OF JEWISH LITERARY HISTORY, 1983, 3 (01): : 73 - 90
[44] Self-exile in Translating Butlerian Diaspora Translator's Notes
Wang, Nan
PHILOSOPHIA-A JOURNAL OF CONTINENTAL FEMINISM, 2019, 9 (01): : 133 - 134
[45] TRANSLATOR'S NOTE Look to the Law On Translating the Poems of Ennio Moltedo
Feitlowitz, Marguerite
WORLD LITERATURE TODAY, 2019, 93 (01) : 6 - 6
[46] Translating English modal expressions An Arab translator trainee's perspective
Eades, Domenyk
BABEL-REVUE INTERNATIONALE DE LA TRADUCTION-INTERNATIONAL JOURNAL OF TRANSLATION, 2011, 57 (03): : 283 - 304
[47] THE TRANSLATOR AT WORK + TRANSLATING FOR THE ENGLISH-NATIONAL-OPERA .3.
MILNES, R
TRACEY, E
HAMMOND, T
LLOYDJONES, D
OPERA, 1975, 26 (03): : 242 - 250
[48] Translator's note (Translating the poems of Ulalume Gonzalez de Leon)
Nelson, S
LITERARY REVIEW, 2002, 45 (03) : 618 - 618
[49] Split Loyalties The Translator between Translating Institutions' Norms and Translation Theories
Atari, Omar E.
Al-Sharafi, Abdul Gabbar
INTERPRETER AND TRANSLATOR TRAINER, 2012, 6 (02): : 195 - 217
[50] TRANSLATOR'S NOTE Lather, Rinse, Rewrite On Translating Eugene Vodolazkin
Hayden, Lisa C.
WORLD LITERATURE TODAY, 2019, 93 (02) : 6 - 6

← 1 2 3 4 5 →