Blame - README.aarch64 - platform/external/valgrind

blob: ddb25873e486143b98c6f412189547de3c29fbf8 [file] [log] [blame]

sewardj	f0c1250	2014-01-12 12:54:00 +0000	[diff] [blame]	1
sewardj	383d5d3	2014-01-13 11:50:17 +0000	[diff] [blame]	2	Status
				3	~~~~~~
sewardj	f0c1250	2014-01-12 12:54:00 +0000	[diff] [blame]	4
sewardj	383d5d3	2014-01-13 11:50:17 +0000	[diff] [blame]	5	As of Jan 2014 the trunk contains a port to AArch64 ARMv8 -- loosely,
				6	the 64-bit ARM architecture. Currently it supports integer and FP
sewardj	fc073c3	2014-01-15 14:30:24 +0000	[diff] [blame]	7	instructions and can run almost anything generated by gcc-4.8.2 -O2.
sewardj	383d5d3	2014-01-13 11:50:17 +0000	[diff] [blame]	8	The port is under active development.
sewardj	f0c1250	2014-01-12 12:54:00 +0000	[diff] [blame]	9
sewardj	383d5d3	2014-01-13 11:50:17 +0000	[diff] [blame]	10	Current limitations, as of mid-Jan 2014.
sewardj	f0c1250	2014-01-12 12:54:00 +0000	[diff] [blame]	11
sewardj	383d5d3	2014-01-13 11:50:17 +0000	[diff] [blame]	12	* threaded apps won't work, due to inadequate sys_clone() support.
				13
				14	* almost no support of vector (SIMD) instructions
				15
				16	* Integration with the built in GDB server doesn't work yet.
				17
				18	There has been extensive testing of the baseline simulation of integer
				19	and FP instructions. Memcheck is also believed to work, at least for
				20	small examples. Other tools appear to at least not crash when running
				21	/bin/date.
				22
				23
				24	Building
				25	~~~~~~~~
				26
				27	You could probably build it directly on a target OS, using the normal
				28	non-cross scheme
				29
				30	./autogen.sh ; ./configure --prefix=.. ; make ; make install
				31
				32	Development so far was however done by cross compiling, viz:
				33
				34	export CC=aarch64-linux-gnu-gcc
				35	export LD=aarch64-linux-gnu-ld
				36	export AR=aarch64-linux-gnu-ar
				37
				38	./autogen.sh
				39	./configure --prefix=`pwd`/Inst --host=aarch64-unknown-linux \
				40	--enable-only64bit
				41	make -j4
				42	make -j4 install
				43
				44	Doing this assumes that the install path (`pwd`/Inst) is valid on
				45	both host and target, which isn't normally the case. To avoid
				46	this limitation, do instead:
				47
				48	./configure --prefix=/install/path/on/target \
				49	--host=aarch64-unknown-linux \
				50	--enable-only64bit
				51	make -j4
				52	make -j4 install DESTDIR=/a/temp/dir/on/host
				53	# and then copy the contents of DESTDIR to the target.
				54
				55	See README.android for more examples of cross-compile building.
				56
				57
				58	Implementation tidying-up/TODO notes
				59	~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
sewardj	f0c1250	2014-01-12 12:54:00 +0000	[diff] [blame]	60
				61	UnwindStartRegs -- what should that contain?
				62
				63
sewardj	f0c1250	2014-01-12 12:54:00 +0000	[diff] [blame]	64	vki-arm64-linux.h: vki_sigaction_base
sewardj	f0c1250	2014-01-12 12:54:00 +0000	[diff] [blame]	65	I really don't think that __vki_sigrestore_t sa_restorer
				66	should be present. Adding it surely puts sa_mask at a wrong
				67	offset compared to (kernel) reality. But not having it causes
				68	compilation of m_signals.c to fail in hard to understand ways,
				69	so adding it temporarily.
				70
				71
				72	m_trampoline.S: what's the unexecutable-insn value? 0xFFFFFFFF
				73	is there at the moment, but 0x00000000 is probably what it should be.
				74	Also, fix indentation/tab-vs-space stuff
				75
				76
				77	./include/vki/vki-arm64-linux.h: uses __uint128_t. Should change
				78	it to __vki_uint128_t, but what's the defn of that?
				79
				80
sewardj	f0c1250	2014-01-12 12:54:00 +0000	[diff] [blame]	81	m_debuginfo/priv_storage.h: need proper defn of DiCfSI
				82
				83
				84	readdwarf.c: is this correct?
				85	#elif defined(VGP_arm64_linux)
				86	# define FP_REG 29 //???
				87	# define SP_REG 31 //???
				88	# define RA_REG_DEFAULT 30 //???
				89
				90
				91	vki-arm64-linux.h:
				92	re linux-3.10.5/include/uapi/asm-generic/sembuf.h
				93	I'd say the amd64 version has padding it shouldn't have. Check?
				94
				95
sewardj	f0c1250	2014-01-12 12:54:00 +0000	[diff] [blame]	96	syswrap-linux.c run_a_thread_NORETURN assembly sections
				97	seems like tst->os_state.exitcode has word type
				98	in which case the ppc64_linux use of lwz to read it, is wrong
				99
				100
sewardj	f0c1250	2014-01-12 12:54:00 +0000	[diff] [blame]	101	syswrap-linux.c ML_(do_fork_clone)
				102	assuming that VGP_arm64_linux is the same as VGP_arm_linux here
				103
				104
sewardj	f0c1250	2014-01-12 12:54:00 +0000	[diff] [blame]	105	dispatch-arm64-linux.S: FIXME: set up FP control state before
				106	entering generated code. Also fix screwy indentation.
				107
sewardj	383d5d3	2014-01-13 11:50:17 +0000	[diff] [blame]	108
sewardj	f0c1250	2014-01-12 12:54:00 +0000	[diff] [blame]	109	dispatcher-ery general: what's a good (predictor-friendly) way to
				110	branch to a register?
				111
				112
sewardj	f0c1250	2014-01-12 12:54:00 +0000	[diff] [blame]	113	in vki-arm64-scnums.h
				114	//#if __BITS_PER_LONG == 64 && !defined(__SYSCALL_COMPAT)
				115	Probably want to reenable that and clean up accordingly
				116
				117
sewardj	f0c1250	2014-01-12 12:54:00 +0000	[diff] [blame]	118	putIRegXXorZR: figure out a way that the computed value is actually
				119	used, so as to keep any memory reads that might generate it, alive.
				120	(else the simulation can lose exceptions). At least, for writes to
				121	the zero register generated by loads .. or .. can anything other
				122	integer instructions, that write to a register, cause exceptions?
				123
				124
sewardj	f0c1250	2014-01-12 12:54:00 +0000	[diff] [blame]	125	loads/stores: generate stack alignment checks as necessary
				126
				127
sewardj	f0c1250	2014-01-12 12:54:00 +0000	[diff] [blame]	128	fix barrier insns: ISB, DMB
				129
				130
sewardj	f0c1250	2014-01-12 12:54:00 +0000	[diff] [blame]	131	fix atomic loads/stores
				132
				133
sewardj	f0c1250	2014-01-12 12:54:00 +0000	[diff] [blame]	134	FMADD/FMSUB/FNMADD/FNMSUB: generate and use the relevant fused
				135	IROps so as to avoid double rounding
				136
				137
sewardj	f0c1250	2014-01-12 12:54:00 +0000	[diff] [blame]	138	ARM64Instr_Call getRegUsage: re-check relative to what
				139	getAllocableRegs_ARM64 makes available
				140
				141
sewardj	f0c1250	2014-01-12 12:54:00 +0000	[diff] [blame]	142	Make dispatch-arm64-linux.S save any callee-saved Q regs
				143	I think what is required is to save D8-D15 and nothing more than that.
				144
				145
sewardj	f0c1250	2014-01-12 12:54:00 +0000	[diff] [blame]	146	wrapper for __NR3264_fstat -- correct?
				147
				148
sewardj	383d5d3	2014-01-13 11:50:17 +0000	[diff] [blame]	149	PRE(sys_clone): get rid of references to vki_modify_ldt_t and the
				150	definition of it in vki-arm64-linux.h. Ditto for 32 bit arm.
sewardj	f0c1250	2014-01-12 12:54:00 +0000	[diff] [blame]	151
				152
				153	sigframe-arm64-linux.c: build_sigframe: references to nonexistent
				154	siguc->uc_mcontext.trap_no, siguc->uc_mcontext.error_code have been
				155	replaced by zero. Also in synth_ucontext.
				156
				157
sewardj	f0c1250	2014-01-12 12:54:00 +0000	[diff] [blame]	158	m_debugger.c:
				159	uregs.pstate = LibVEX_GuestARM64_get_nzcv(vex); /* is this correct? */
				160	Is that remotely correct?
				161
				162
sewardj	f0c1250	2014-01-12 12:54:00 +0000	[diff] [blame]	163	host_arm64_defs.c: emit_ARM64INstr:
				164	ARM64in_VDfromX and ARM64in_VQfromXX: use simple top-half zeroing
				165	MOVs to vector registers instead of INS Vd.D[0], Xreg, to avoid false
				166	dependencies on the top half of the register. (Or at least check
sewardj	383d5d3	2014-01-13 11:50:17 +0000	[diff] [blame]	167	the semantics of INS Vd.D[0] to see if it zeroes out the top.)
sewardj	f0c1250	2014-01-12 12:54:00 +0000	[diff] [blame]	168
				169
				170	preferredVectorSubTypeFromSize: review perf effects and decide
				171	on a types-for-subparts policy
				172
				173
sewardj	f0c1250	2014-01-12 12:54:00 +0000	[diff] [blame]	174	fold_IRExpr_Unop: add a reduction rule for this
				175	1Sto64(CmpNEZ64( Or64(GET:I64(1192),GET:I64(1184)) ))
				176	vis 1Sto64(CmpNEZ64(x)) --> CmpwNEZ64(x)
				177
				178
sewardj	f0c1250	2014-01-12 12:54:00 +0000	[diff] [blame]	179	check insn selection for memcheck-only primops:
				180	Left64 CmpwNEZ64 V128to64 V128HIto64 1Sto64 CmpNEZ64 CmpNEZ32
				181	widen_z_8_to_64 1Sto32 Left32 32HLto64 CmpwNEZ32 CmpNEZ8
				182
				183
sewardj	f0c1250	2014-01-12 12:54:00 +0000	[diff] [blame]	184	isel: get rid of various cases where zero is put into a register
				185	and just use xzr instead. Especially for CmpNEZ64/32. And for
				186	writing zeroes into the CC thunk fields.
				187
				188
sewardj	f0c1250	2014-01-12 12:54:00 +0000	[diff] [blame]	189	/* Keep this list in sync with that in iselNext below */
				190	/* Keep this list in sync with that for Ist_Exit above */
				191	uh .. they are not in sync
				192
				193
sewardj	f0c1250	2014-01-12 12:54:00 +0000	[diff] [blame]	194	very stupid:
				195	imm64 x23, 0xFFFFFFFFFFFFFFA0
				196	17 F4 9F D2 F7 FF BF F2 F7 FF DF F2 F7 FF FF F2
				197
				198
sewardj	f0c1250	2014-01-12 12:54:00 +0000	[diff] [blame]	199	valgrind.h: fix VALGRIND_ALIGN_STACK/VALGRIND_RESTORE_STACK,
				200	also add CFI annotations
sewardj	fdaf9e4	2014-01-13 00:18:51 +0000	[diff] [blame]	201
				202
sewardj	fdaf9e4	2014-01-13 00:18:51 +0000	[diff] [blame]	203	could possibly bring r29 into use, which be useful as it is
				204	callee saved
sewardj	383d5d3	2014-01-13 11:50:17 +0000	[diff] [blame]	205
				206
				207	ubfm/sbfm etc: special case cases that are simple shifts, as iropt
				208	can't always simplify the general-case IR to a shift in such cases.
sewardj	1cd6c90	2014-02-05 11:02:34 +0000	[diff] [blame^]	209
				210
				211	LDP,STP (immediate, simm7) (FP&VEC)
				212	should zero out hi parts of dst registers in the LDP case
				213
				214
				215	DUP insns: use Iop_Dup8x16, Iop_Dup16x8, Iop_Dup32x4
				216	rather than doing it "by hand"
				217
				218
				219	Any place where ZeroHI64ofV128 is used in conjunction with
				220	FP vector IROps: find a way to make sure that arithmetic on
				221	the upper half of the values is "harmless."
				222
				223
				224	math_MINMAXV: use real Iop_Cat{Odd,Even}Lanes ops rather than
				225	inline scalar code