Blame - README.aarch64 - platform/external/valgrind

Status
~~~~~~

As of Jan 2014 the trunk contains a port to AArch64 ARMv8 -- loosely,
the 64-bit ARM architecture.  Currently it supports integer and FP
instructions and can run almost anything generated by gcc-4.8.2 -O2.
The port is under active development.

Current limitations, as of mid-Jan 2014.

* threaded apps won't work, due to inadequate sys_clone() support.

* almost no support of vector (SIMD) instructions

* Integration with the built-in GDB server:
   - basically works, but breakpoints cause crashes due to the missing
     unchainXDirect_ARM64 needed by LibVEX_UnChain.
     Use --vgdb=full to bypass the problem.
   - still to do:
     arm64 xml register description files (allowing shadow registers
       to be looked at).
     ptrace invoker: currently disabled for both arm and arm64
     cpsr transfer to/from gdb to be looked at (see also arm equivalent code)

There has been extensive testing of the baseline simulation of integer
and FP instructions.  Memcheck is also believed to work, at least for
small examples.  Other tools appear to at least not crash when running
/bin/date.


Building
~~~~~~~~

You could probably build it directly on a target OS, using the normal
non-cross scheme:

  ./autogen.sh ; ./configure --prefix=.. ; make ; make install

Development so far has, however, been done by cross-compiling, viz:

  export CC=aarch64-linux-gnu-gcc
  export LD=aarch64-linux-gnu-ld
  export AR=aarch64-linux-gnu-ar

  ./autogen.sh
  ./configure --prefix=`pwd`/Inst --host=aarch64-unknown-linux \
              --enable-only64bit
  make -j4
  make -j4 install

Doing this assumes that the install path (`pwd`/Inst) is valid on
both host and target, which isn't normally the case.  To avoid
this limitation, do instead:

  ./configure --prefix=/install/path/on/target \
              --host=aarch64-unknown-linux \
              --enable-only64bit
  make -j4
  make -j4 install DESTDIR=/a/temp/dir/on/host
  # and then copy the contents of DESTDIR to the target.

See README.android for more examples of cross-compile building.


Implementation tidying-up/TODO notes
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

UnwindStartRegs -- what should that contain?


vki-arm64-linux.h: vki_sigaction_base
  I really don't think that __vki_sigrestore_t sa_restorer
  should be present.  Adding it surely puts sa_mask at a wrong
  offset compared to (kernel) reality.  But not having it causes
  compilation of m_signals.c to fail in hard to understand ways,
  so adding it temporarily.


m_trampoline.S: what's the unexecutable-insn value? 0xFFFFFFFF
  is there at the moment, but 0x00000000 is probably what it should be.
  Also, fix indentation/tab-vs-space stuff


./include/vki/vki-arm64-linux.h: uses __uint128_t.  Should change
  it to __vki_uint128_t, but what's the defn of that?


m_debuginfo/priv_storage.h: need proper defn of DiCfSI


readdwarf.c: is this correct?
  #elif defined(VGP_arm64_linux)
  # define FP_REG 29 //???
  # define SP_REG 31 //???
  # define RA_REG_DEFAULT 30 //???


vki-arm64-linux.h:
  re linux-3.10.5/include/uapi/asm-generic/sembuf.h
  I'd say the amd64 version has padding it shouldn't have.  Check?


syswrap-linux.c run_a_thread_NORETURN assembly sections
  It seems that tst->os_state.exitcode has word type, in which case
  the ppc64_linux use of lwz to read it is wrong.


syswrap-linux.c ML_(do_fork_clone)
  assuming that VGP_arm64_linux is the same as VGP_arm_linux here


dispatch-arm64-linux.S: FIXME: set up FP control state before
  entering generated code.  Also fix screwy indentation.


dispatcher-ery general: what's a good (predictor-friendly) way to
  branch to a register?


in vki-arm64-scnums.h
  //#if __BITS_PER_LONG == 64 && !defined(__SYSCALL_COMPAT)
  Probably want to reenable that and clean up accordingly


putIRegXXorZR: figure out a way that the computed value is actually
  used, so as to keep any memory reads that might generate it, alive.
  (else the simulation can lose exceptions).  At least, for writes to
  the zero register generated by loads .. or .. can any other
  integer instructions, that write to a register, cause exceptions?


loads/stores: generate stack alignment checks as necessary


fix barrier insns: ISB, DMB


fix atomic loads/stores


FMADD/FMSUB/FNMADD/FNMSUB: generate and use the relevant fused
  IROps so as to avoid double rounding


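To illustrate the double-rounding concern in the FMADD note above,
here is a minimal C sketch.  It is not VEX code; the function names
are hypothetical.  It compares a multiply-then-add, which rounds the
product to single precision before the add, with a computation that
rounds only once at the end.  The single-rounding path is emulated by
doing the arithmetic in double, which is exact for the inputs chosen
here; it is an assumption of this sketch, not a general substitute
for a fused operation.

```c
#include <assert.h>

/* Unfused: the product is rounded to float before the add, so the
   result is rounded twice.  The volatile store forces that
   intermediate rounding even if the compiler would otherwise
   contract the expression or keep extra precision. */
float madd_unfused(float a, float b, float c)
{
    volatile float product = a * b;
    return product + c;
}

/* Single rounding (emulated): the product of two floats is exact in
   double (24 + 24 significand bits <= 53).  For the example inputs
   below the double addition is exact too, so only the final
   conversion to float rounds. */
float madd_single_rounding(float a, float b, float c)
{
    double exact_product = (double)a * (double)b;
    return (float)(exact_product + (double)c);
}

/* Worked example (hypothetical inputs):
     a = 1 + 2^-12, b = 1 - 2^-13, c = -(1 + 2^-13)
   a*b = 1 + 2^-13 - 2^-25 exactly, which rounds to 1 + 2^-13 as a
   float.  The unfused result is therefore 0, whereas the correctly
   rounded result is -2^-25. */
void double_rounding_example(void)
{
    float a = 0x1.001p0f, b = 0x1.fffp-1f, c = -0x1.0008p0f;
    assert(madd_unfused(a, b, c) == 0.0f);
    assert(madd_single_rounding(a, b, c) == -0x1p-25f);
}
```

A simulation that implements FMADD as a separate multiply and add
would return the first answer where real hardware returns the second.
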
ARM64Instr_Call getRegUsage: re-check relative to what
  getAllocableRegs_ARM64 makes available


Make dispatch-arm64-linux.S save any callee-saved Q regs
  I think what is required is to save D8-D15 and nothing more than that.


wrapper for __NR3264_fstat -- correct?


PRE(sys_clone): get rid of references to vki_modify_ldt_t and the
  definition of it in vki-arm64-linux.h.  Ditto for 32 bit arm.


sigframe-arm64-linux.c: build_sigframe: references to nonexistent
  siguc->uc_mcontext.trap_no, siguc->uc_mcontext.error_code have been
  replaced by zero.  Also in synth_ucontext.


m_debugger.c:
  uregs.pstate = LibVEX_GuestARM64_get_nzcv(vex); /* is this correct? */
  Is that remotely correct?


host_arm64_defs.c: emit_ARM64Instr:
  ARM64in_VDfromX and ARM64in_VQfromXX: use simple top-half zeroing
  MOVs to vector registers instead of INS Vd.D[0], Xreg, to avoid false
  dependencies on the top half of the register.  (Or at least check
  the semantics of INS Vd.D[0] to see if it zeroes out the top.)


preferredVectorSubTypeFromSize: review perf effects and decide
  on a types-for-subparts policy


fold_IRExpr_Unop: add a reduction rule for this
  1Sto64(CmpNEZ64( Or64(GET:I64(1192),GET:I64(1184)) ))
  viz 1Sto64(CmpNEZ64(x)) --> CmpwNEZ64(x)


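The proposed fold_IRExpr_Unop rule relies on the two expressions
computing the same value.  The C sketch below models the three
primops on plain 64-bit integers to show that equivalence.  The
function names are hypothetical stand-ins, and the semantics are
assumed from the IR op names (CmpNEZ64 yields one bit, 1Sto64
sign-extends one bit, CmpwNEZ64 yields all zeroes or all ones); this
is not the VEX implementation.

```c
#include <assert.h>
#include <stdint.h>

/* Model of CmpNEZ64: a 1-bit result, 1 iff x is nonzero. */
unsigned cmp_nez64(uint64_t x)
{
    return x != 0;
}

/* Model of 1Sto64: sign-extend a 1-bit value to 64 bits,
   so 0 -> 0 and 1 -> all ones. */
uint64_t sx_1_to_64(unsigned bit)
{
    assert(bit <= 1);
    return (uint64_t)0 - (uint64_t)bit;
}

/* Model of CmpwNEZ64: all ones if x is nonzero, else zero.
   x | -x has its top bit set iff x != 0; that bit is then
   replicated across the whole word. */
uint64_t cmpw_nez64(uint64_t x)
{
    uint64_t left = x | ((uint64_t)0 - x);
    return (uint64_t)0 - (left >> 63);
}
```

For every x, sx_1_to_64(cmp_nez64(x)) equals cmpw_nez64(x), which is
what makes the rewrite safe under these assumed semantics.
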
check insn selection for memcheck-only primops:
  Left64 CmpwNEZ64 V128to64 V128HIto64 1Sto64 CmpNEZ64 CmpNEZ32
  widen_z_8_to_64 1Sto32 Left32 32HLto64 CmpwNEZ32 CmpNEZ8


isel: get rid of various cases where zero is put into a register
  and just use xzr instead.  Especially for CmpNEZ64/32.  And for
  writing zeroes into the CC thunk fields.


/* Keep this list in sync with that in iselNext below */
/* Keep this list in sync with that for Ist_Exit above */
  uh .. they are not in sync


very stupid:
  imm64 x23, 0xFFFFFFFFFFFFFFA0
  17 F4 9F D2 F7 FF BF F2 F7 FF DF F2 F7 FF FF F2


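The 16 bytes above are a MOVZ followed by three MOVKs.  Since every
16-bit chunk but the lowest is 0xFFFF, a single MOVN would do.  The
following C sketch, with hypothetical function names, counts the
instructions each strategy needs for a given 64-bit constant.  It is
an illustration of the idea only; it ignores other options such as
logical (bitmask) immediates and is not the code used by the
instruction selector.

```c
#include <assert.h>
#include <stdint.h>

/* Number of 16-bit chunks of v that differ from fill
   (fill is 0x0000 or 0xFFFF). */
unsigned chunks_differing(uint64_t v, uint16_t fill)
{
    unsigned n = 0;
    for (int shift = 0; shift < 64; shift += 16)
        if ((uint16_t)(v >> shift) != fill)
            n++;
    return n;
}

/* MOVZ for one chunk, then one MOVK per further nonzero chunk.
   At least one instruction is always needed. */
unsigned insns_movz_movk(uint64_t v)
{
    unsigned n = chunks_differing(v, 0x0000);
    return n ? n : 1;
}

/* MOVN sets one chunk and fills the rest with ones, then one MOVK
   per further chunk that is not 0xFFFF. */
unsigned insns_movn_movk(uint64_t v)
{
    unsigned n = chunks_differing(v, 0xFFFF);
    return n ? n : 1;
}

/* Cheaper of the two strategies. */
unsigned insns_best(uint64_t v)
{
    unsigned z = insns_movz_movk(v), n = insns_movn_movk(v);
    return z < n ? z : n;
}

/* Worked example for the constant in the note above. */
void imm64_example(void)
{
    assert(insns_movz_movk(0xFFFFFFFFFFFFFFA0ULL) == 4);
    assert(insns_movn_movk(0xFFFFFFFFFFFFFFA0ULL) == 1);
}
```

For 0xFFFFFFFFFFFFFFA0 the MOVZ/MOVK route costs four instructions
and the MOVN route one (MOVN x23, #0x5F).
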
valgrind.h: fix VALGRIND_ALIGN_STACK/VALGRIND_RESTORE_STACK,
  also add CFI annotations


could possibly bring r29 into use, which would be useful as it is
  callee saved


ubfm/sbfm etc: special-case those that are simple shifts, as iropt
  can't always simplify the general-case IR to a shift in such cases.


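As a sketch of what special-casing the shift forms of UBFM could look
like, the C code below gives a reference model of the 64-bit UBFM
result and a classifier that recognises the LSR and LSL aliases from
the immr/imms fields.  All names are hypothetical, and the model is
written from the architectural description of UBFM as I understand
it; it is not taken from the VEX front end.

```c
#include <assert.h>
#include <stdint.h>

typedef enum { SHIFT_NONE, SHIFT_LSR, SHIFT_LSL } ShiftKind;

/* Reference model of 64-bit UBFM Xd, Xn, #immr, #imms. */
uint64_t ubfm64(uint64_t x, unsigned immr, unsigned imms)
{
    assert(immr < 64 && imms < 64);
    if (imms >= immr) {
        /* Extract bits [imms:immr] down to bit 0. */
        unsigned w = imms - immr + 1;
        uint64_t mask = (w == 64) ? ~(uint64_t)0
                                  : (((uint64_t)1 << w) - 1);
        return (x >> immr) & mask;
    } else {
        /* Take bits [imms:0] and place them at bit (64 - immr).
           Here imms < immr <= 63, so both shifts are < 64. */
        unsigned w = imms + 1;
        uint64_t mask = ((uint64_t)1 << w) - 1;
        return (x & mask) << (64 - immr);
    }
}

/* Recognise the encodings that are plain shifts:
     imms == 63       -> LSR by immr
     imms + 1 == immr -> LSL by (63 - imms)
   On a match, *amount receives the shift distance. */
ShiftKind classify_ubfm64(unsigned immr, unsigned imms, unsigned *amount)
{
    assert(immr < 64 && imms < 64);
    if (imms == 63) {
        *amount = immr;
        return SHIFT_LSR;
    }
    if (imms + 1 == immr) {
        *amount = 63 - imms;
        return SHIFT_LSL;
    }
    return SHIFT_NONE;
}
```

When the classifier reports a shift, the front end could emit a
single shift IROp instead of the general mask-and-shift sequence.
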
LDP,STP (immediate, simm7) (FP&VEC)
  should zero out hi parts of dst registers in the LDP case


DUP insns: use Iop_Dup8x16, Iop_Dup16x8, Iop_Dup32x4
  rather than doing it "by hand"


Any place where ZeroHI64ofV128 is used in conjunction with
  FP vector IROps: find a way to make sure that arithmetic on
  the upper half of the values is "harmless."


math_MINMAXV: use real Iop_Cat{Odd,Even}Lanes ops rather than
  inline scalar code