mirrors/Odin - Odin - Kyren's Code

mirror of https://github.com/odin-lang/Odin.git synced 2026-06-19 16:42:33 +00:00

Author	SHA1	Message	Date
Brendan Punsky	4cc6977321	Merge origin/bill/rexcode: struct repack (#raw_union #packed), wasm arch Merge gingerBill's latest into bill/rexcode. His changes: minimize the Instruction/Operand structs across ISAs with packed raw-unions (+ the compiler support for #raw_union #packed), the new core:rexcode/wasm arch and wasm/module, encode() now returns (byte_count, ok) instead of a Result struct, decode_one made public, and assorted formatting/inlining. Conflict: arm64/tests/pipeline_smoke.odin CSEL test -- kept the generated 4-arg inst_csel(dst,src,src2,cond) (mnemonic_builders.odin is generated, not from Bill's branch) and adopted Bill's (byte_count, success) encode signature. Required rebuilding ./odin from the merged source for the packed-union syntax. Re-validated after the repack: regenerated all artifacts (idempotent -- no spurious churn), all 10 arches gen/builders/check/test green, and byte-compared the new arm32 BF + mips PS/MMI/DSP/R6 forms to confirm no field truncation. arm64/arm32/mips still 100%.	2026-06-18 05:44:48 -04:00
Brendan Punsky	83bdd501a3	rexcode: remove dead BFCSEL else-target scaffolding; tidy mips COPY specgen BFCSEL's else-target turned out to be the implicit fall-through, so the BF_BELSE operand encoding, the BFCSEL_ELSE_T32 relocation, and their encoder/decoder cases were never referenced by any table entry. Remove them. Also restructure the MSA COPY specgen loop so COPY_U only iterates .B/.H (COPY_U.W is mips64-only and emitted in the mips64 section), which drops the spurious 'skipped COPY_U_W' message. No functional change to any generated encode form; arm64/arm32/mips all still 100%, 461/600/281 tests green.	2026-06-18 05:29:20 -04:00
Brendan Punsky	c8851c546d	rexcode/arm32: BFCSEL -> Branch Future complete, arm32 100% BFCSEL = bf-point + true-target (hw1, like BF) + 4-bit condition at hw0[5:2], base 0xF002E001 (hw0[1] is a static marker). The else-target is the architectural fall-through, so it is not a separate operand -- BFCSEL is modelled as three operands and reproduces llvm-mc's bytes exactly (f082e003 / f102e803 / f086e003 across boff/true/cond variations). Every encodable arm32 Mnemonic now has an encode form (gap = 0). 600 tests green.	2026-06-18 04:55:26 -04:00
Brendan Punsky	808716517e	rexcode/arm32: Branch Future BF/BFL/BFLX/BFI_BR encode forms Reverse-engineered the ARMv8.1-M Branch Future T32 encoding from llvm-mc: bf-point imm4 = (label-(PC+4))/2 at hw0[10:7]; branch target val = (label-(PC+4))/2 with J at hw1[11] and imm10 at hw1[10:1]; BFLX/BFX target is Rm at hw0[3:0]. New REL_BF operand + BF_BOFF/BF_BLOC/BF_RM encodings + BF_BOFF_T32/BF_BLOC_T32 relocations with resolver. BF=0xF040E001, BFL=0xF000C001, BFLX=0xF070E001, BFI_BR=0xF060E001. Tightened the WLSTP/DLSTP masks to mark hw0[6] static (it is always 0 for valid B/H/W/D sizes) so they no longer shadow the BF register forms. Byte-exact vs llvm-mc with resolved bf-point/target offsets; 600 tests green. (BFCSEL still pending -- it adds an else-target + condition.)	2026-06-18 04:47:03 -04:00
Brendan Punsky	e4cff78a70	rexcode/arm32: document BF family as intentionally unimplemented The 5 Branch Future mnemonics (BF/BFI_BR/BFL/BFLX/BFCSEL) are left enum-only on purpose: deprecated ARMv8.1-M, not disassemblable by llvm-objdump (so unverifiable), and a correct encoder needs dual-offset PC-relative relocation infrastructure that doesn't exist. Noted in the enum for future readers.	2026-06-18 03:05:25 -04:00
Brendan Punsky	a63fb51fdd	rexcode/arm32: MVE VMLSV/VMLSVA (correct 3-bit Q regs); drop placeholders Implement VMLSV/VMLSVA (MVE multiply-subtract reduce) properly: new VN_Q_MVE (Qn at 19:17) and VM_Q_MVE (Qm at 3:1) encodings -- the actual 3-bit MVE Q fields -- with Rd at 15:12 (RDLO_A32). The earlier collision was from reusing the 4-bit VN_Q (19:16) and RD_T32 (11:8), which place the fields wrong; byte-exact vs llvm-mc now with distinct Qn/Qm/Rd. Drop three placeholder/redundant enum entries: VRINT and VPRINT (not real instructions -- llvm rejects bare 'vrint'; VPRINT is a printf-like debug pseudo-op), and VRSHL_MVE (the author's own comment marks it a placeholder; 'vrshl q,q,q' already decodes via VRSHL's MVE form). 600 tests green, verify matches llvm-mc.	2026-06-18 01:58:19 -04:00
Brendan Punsky	239dea4f55	rexcode/arm32: MVE VHCADD (saturating halving complex add) + VCMLA New MVE_ROT_HCADD (#90/#270 at bit12) and MVE_ROT_CMLA (#0/90/180/270 at bits 24:23) rotation encodings -- the rotation degrees round-trip properly (unlike the existing FCMA VCMLA which leaves it unencoded). One form each with the element-size bits left variable (MVE convention). Verify round-trips; all rotations byte-exact vs llvm-mc; 600 tests green. (VMLSV/VMLSVA reduce ops deferred: their format decode-collides with other MVE encodings given the 4-bit VN_Q vs MVE's 3-bit Qn.)	2026-06-18 01:47:44 -04:00
Brendan Punsky	55463b6719	rexcode/arm32: VMOV (ARM core register to scalar) Dd[lane], Rt New VMOV_LANE_8/16/32 encodings: Dd at bits 19:16+bit7, lane bits per element size (.8 = bit21:bit6:bit5 with bit22 size marker; .16 = bit21:bit6 with bit5 marker; .32 = bit21). Verify round-trips all three sizes; spot-checked .8 byte-exact incl. max lane; 600 tests green.	2026-06-18 01:34:48 -04:00
Brendan Punsky	5df81b5117	rexcode/arm32: VQDMULH/VQRDMULH by-scalar-lane New NEON_VM_SCALAR16/32 encodings for the Dm[lane] scalar operand: .16 places Dm in D0..D7 (bits 2:0) with the lane split bit5:bit3, .32 places Dm in D0..D15 (bits 3:0) with the lane at bit5. VQDMULH_LANE and VQRDMULH_LANE across .s16/.s32, D and Q destinations (8 forms). Verify round-trips; spot-checked byte-exact incl. max register/lane and decode-clean; 600 tests green.	2026-06-18 01:29:19 -04:00
Brendan Punsky	acc14864f3	rexcode/arm32: DCPS1/DCPS2/DCPS3 (debug change PE state) Fixed T32 encodings (0xF78F8001/2/3), no operands. Verify round-trips; 600 tests green.	2026-06-18 01:25:51 -04:00
Brendan Punsky	b2b14998f7	rexcode/arm32: VRSRA, VRECPE_F/VRSQRTE_F, VPADD_F, VCVTR VRSRA (NEON rounding shift-right-accumulate, D/Q, mirrors VSRA's raw imm6 convention), VRECPE_F/VRSQRTE_F (FP reciprocal/rsqrt estimate, D/Q), VPADD_F (FP pairwise add, f32/f16), and VCVTR (VFP convert-to-integer using the FPSCR rounding mode; s32/u32 from f32 and f64). Hand-written mirroring the existing VSRA/VRECPE/VPADD/VCVT forms. Built-in llvm round-trip verify passes; spot-checked byte-exact; 600 tests green.	2026-06-18 01:22:12 -04:00
Brendan Punsky	59750926d9	rexcode/arm32: unprivileged (translate) post-indexed loads/stores LDRT/LDRBT/STRT/STRBT (imm12) and LDRHT/STRHT/LDRSBT/LDRSHT (imm8 split): each is the corresponding post-indexed load/store with the W bit (21) set. Hand-written, reusing the existing MEM_POST_INDEX encoding. All 8 byte-exact vs llvm-mc and decode-clean; 600 tests green.	2026-06-18 01:17:34 -04:00
Brendan Punsky	6fd233f041	rexcode/arm32: NEON long/wide/compare/shift encode forms (specgen) New arm32 specgen (llvm-mc --triple=armv8a --mattr=+neon as the bits oracle, empirical masks): VADDL/VSUBL/VABAL/VABDL (Qd,Dn,Dm) and VADDW/VSUBW (Qd,Qn,Dm) across s/u 8/16/32; the compare aliases VCLE/VCLT (= VCGE/VCGT with Vn/Vm swapped) and VACLE/VACLT (= VACGE/VACGT swapped, f32); and VQRSHL shift-by-vector. 84 forms over 11 mnemonics. Built-in llvm round-trip verify passes; spot-checked byte-exact with distinct Q/D registers; 600 tests green.	2026-06-18 01:15:22 -04:00
gingerBill	c9ce8794c7	Replace `-> isa.Result` with `-> (byte_code: u32, ok: bool)`	2026-06-15 21:43:58 +01:00
Brendan Punsky	47fc72e0ba	rexcode: 100% generated mnemonic-builder coverage; drop hand-written collisions Every mnemonic with an encode form now has a generated inst_<mnem>/emit_<mnem> overload group. The per-arch generators map ALL operand types — nothing is skipped: arm64 gains shifted/extended registers (multi-param via op_shifted/op_extended), SVE Z-regs + predicates, SME tile/slice, NEON arrangements/lanes, bitmask/sysreg/pattern immediates and condition codes (427 -> 777 mnemonics); arm32 gains shifted/register-shifted regs, register lists, NEON lanes and all encoded-immediate subclasses (479 -> 592); x86 gains m80 and descriptor-table memory operands — FBLD/FBSTP, LGDT/SGDT/LIDT/SIDT, FLD/FSTP, far-indirect JMP/CALL, BOUND (1167 -> 1175). Mnemonic-specific builders are now fully generated, not hand-written: deleted the hand-written helpers the generated groups collided with — riscv inst_jal/inst_jalr, arm64 inst_b_cond/inst_cbz/inst_tbz/inst_csel, mos6502 inst_tst — and let the generators own those names (arm64 also gains inst_cbnz/tbnz/csinc/csinv/csneg). Updated the affected test call-sites. The generic operand-shape helpers (inst_r_r, inst_r_r_i, inst_ldst, ...) remain as delegation targets. Decode-only mnemonics with no encode form are correctly left without builders. ppc/ppc_vle/rsp/mos65816 were already complete. All 10 ISAs: structure + compile + tests pass; generators idempotent.	2026-06-15 12:52:10 -04:00
Brendan Punsky	1b72d425d4	rexcode: add typed per-mnemonic builders for all arches; CWD-independent regen Add generated mnemonic_builders.odin (inst_<mnem>/emit_<mnem> typed overload sets) for arm32, arm64, mips, riscv, ppc, ppc_vle, rsp, mos6502 and mos65816, matching the existing x86 builders. Each is produced by a per-arch tools/gen_mnemonic_builders.odin that walks ENCODE_FORMS and maps operand types to typed params + op_* constructors. Anchor every generator's output via #directory so regeneration is CWD-independent; previously the bare "mnemonic_builders.odin" path wrote to the current directory and misfired when run from the repo root. Wire a --builders task into build.lua (folded into 'all', covered by --idempotent, enforced by the structural invariants) and document it in the README.	2026-06-15 12:52:10 -04:00
gingerBill	9ec6f3e378	Minimize Instruction and Operand across ISAs further with `struct #raw_union #packed`	2026-06-15 14:50:55 +01:00
gingerBill	7aaef31bb3	Correct sizes of arm32 Instruction and Operand	2026-06-15 14:24:05 +01:00
gingerBill	b733f7d7a4	Use `@(rodata)` where appropriate for the table generation	2026-06-15 13:56:24 +01:00
Flāvius	a4f08f8307	Load rexcode encode/decode tables from committed binary blobs Each ISA's hand-written ENCODING_TABLE (the single source of truth) now lives in a per-arch tablegen/ metaprogram that flattens it and serializes committed binary blobs; the library #loads those into @(rodata) at compile time rather than compiling a table body. No arch keeps encoding_table.odin or decoding_tables.odin -- only a generated tables.odin loader and tables/.bin. Two-stage, type-checked pipeline: tablegen Stage A emits human-readable generated Odin, which compiles and serializes the blobs in Stage B. * encode() goes through encoding_forms(m); decoders are unchanged apart from x86's flattened 2-D index. Decode tables are byte-identical to the old ones. * build.lua: a LuaJIT driver for the metaprograms, validations, and tests, with cross-platform gating and a clear report. * Docs refreshed; the obsolete forward-looking plan in cross_arch_design.md trimmed to what was actually built. * Attribution headers added to all rexcode source files; the generators emit them so generated files keep them.	2026-06-15 07:43:29 -04:00
gingerBill	ecf9a305ee	Add `@(require_results)` to register procedures	2026-06-14 22:00:37 +01:00
gingerBill	5c9cd0146d	Add `@(require_results)` to operand procedures	2026-06-14 21:57:27 +01:00
gingerBill	611cc807cd	Add `@(require_results)` to instruction procedures	2026-06-14 21:54:24 +01:00
gingerBill	ced500fc94	Add fmt formatting to the `Instruction.operands`	2026-06-14 21:52:14 +01:00
gingerBill	176ee8c68d	Minimize arm32 decode table size	2026-06-14 19:19:11 +01:00
gingerBill	c49e296f5e	Update doc files	2026-06-14 18:24:59 +01:00
gingerBill	d6ae77b67e	`core:rexcode`	2026-06-14 16:30:18 +01:00

27 Commits