Assembly language and Forth get along pretty well. Here's a simple BusyWaiting loop written for the x86 in a Forth assembler. One instruction or macro per line:
CODE BUSYWAIT ( n -- ) 1 CHECK_POP *BEGIN EAX DECL<- CMP ALU.L#8 ,EAX 0 C, ?NE JCCB *AGAIN MOVL<- EDX ,[EDI] MOVL<- EAX ,D8[EDI] 4 C, LEAL EDI ,D8[EDI] 8 C, NEXT;Lots of notes about it.
Have fun with that one... -- EdwardKiser
By the way, I am at some point going to publish this Forth... I'm not just talking about the Forth source for the assembler; I have written an entire Forth implementation which includes this assembler, and it is the entire implementation that I am going to publish.
The general idea seems pretty handy, but I'm confused. Why are some things prefix and others postfix? E.g. "EAX DECL<-" being postfix, and "MOVL<- EDX ,[EDI]" being prefix? I must be missing some rather large issue here.
No, it's just that EAX DECL<- is a one-byte instruction. In this case, EAX is a general word that is used by lots of instructions. It pushes a constant on Forth's stack. A word like ,[EDI] will pop that off the stack and build a MOD/R/M byte. But DECL<- will pop the same value off the stack and build a decrement instruction.
It is also possible to write DECL ,EAX but that produces a two-byte instruction. DECL emits the first byte, FF, and then pushes a value on the stack. ,EAX pops that off and builds a MOD/R/M byte.
What's happening here is that I'm following the byte order of the x86 as closely as I can. I want to avoid putting lots of stuff on the stack while building an instruction. The most nearly opposite case would be to push everything on the stack and then write a single word which compiles any instruction based on its description on the stack.
I see. Hmm. Does this end up being mnemonic in practice? You don't have to memorize 1 byte vs two byte, it's just implied by the semantics of the instruction? Consistency of syntax would be nice...
I use a cheat sheet.
There is no agreed upon syntax for Forth assembly code. I've seen stuff as simple as "a b mov" to parsers that can handle MASM format verbatim. F83 for Intel chips commonly came with two assemblers, one prefix (like MASM) and one postfix (like Forth). One particularly well implemented assembler comes with Pygmy Forth. Here is a nice tutorial on building a 6809 Forth assembler:
(Well, there are eight ALU operations, and something like nine addressing modes. Using CREATE DOES> would have required me to define 72 words, and that's not counting the defining word with the CREATE and the DOES in it, whereas by breaking it apart, I need only seventeen words.) ...but you have inconsistent syntax and need a cheat sheet. :) I'm glad it works for you, but it is confusing to at least two of us (and I know Forth and x86 assembly).
As do I. -- dm
Just pointing out that there are other ways to do it that put a little more burden on the assembler wordset and less burden on the programmer.
I agree. This is the sort of thing that forces a tool to be single-use or single-user, for the sake of making the implementation easier. For reusability or wider adoption, more consistency and ease of use is important, at the cost of more implementation difficulty.
I wasn't previously familiar with this approach to assembly, though, and I think it's a brilliant idea in the right niche, so I'm glad to see any example of it. -- dm
P.S. I also would encourage you to go ahead with your intention to publish it. It's interesting to see small minimal attacks on problems.
It isn't much of a burden to think in terms of phrases instead of words. If you wanted to encode a deck of cards, it might be better to define ACE, JACK, QUEEN, KING, HEARTS, SPADES, DIAMONDS, and CLUBS. Then write ACE SPADES and 3 HEARTS and so forth. Much better than defining 52 words.
I certainly agree, but that has nothing to do with our responses. We're talking about consistency, not about pre-defining the world vs using atoms to make molecules.
Writing CMP ALU.L<- and ADD ALU.L<- and CMP ALU.B<- is exactly that; using atoms to make molecules. There are eight ALU instructions (ADD, ADC, SUB, SBB, AND, OR, XOR, CMP) and at least eleven addressing modes (ALU.L<-, ALU.L->, ALU.B<-, ALU.B->, ALU.W<-, ALU.W->, ALU.L#8, ALU.B#8, ALU.L#32, ALU.W#8, ALU.W#16). This is from memory and I may be wrong about the exact spelling of some of them. But the usage is consistent once it's been explained. The ones with pound signs require an appropriate MOD/M specification and an immediate value; the ones with arrows require an R followed by a MOD/M. Thus:
ADD ALU.L<- EAX ,[EDI] \ add long at [EDI] into EAX ADD ALU.L-> EAX ,[EDI] \ add EAX into long at [EDI] ADD ALU.W<- AX ,BX \ add BX into AX (16-bit) ADD ALU.B-> DL ,[EDI] \ add DL into byte at [EDI] ADD ALU.L#8 ,EAX 55 C, \ sign-extend 55 to 32 bits and add it into EAX ADD ALU.L#32 ,EAX 55555555 L, \ add 55555555 into EAX ADD ALU.L#32 ,D8[--] EAX*4 +EDI 3 C, 55555555 L, \ add 55555555 into [3 + EAX*4 + EDI]I just sprung the notation for SIB bytes onto you... sorry... but you can substitute XOR for ADD and everything else works the same way.
DECL is a little different but it's a different kind of instruction; it has two variants.
EAX DECL<- \ one-byte version DECL ,EAX \ two-byte version DECL ,[EDI] \ only possible with two-byte versionThe example at the top of the page is way too short to show how consistent the syntax is. There are only four or five instructions in the whole example. Instructions of similar types tend to have very similar syntaxes.
That's encouraging. I'll be interested to see your release.
Any news on this project? -- GarryHamilton
I'm interested in this as well.
However, the x86 assemblers are vastly more sophisticated than for other CPUs. The assembler I wrote for the 65816 is somewhat similar to what is described here though. All 255 opcodes have their own Forth word (e.g., LDA(,Y), STZ,X, etc), and then I place the operands down with 1, , 2, , or 3, depending on the length of the operand (1, 2, or 3 bytes, respectively). The original idea for this came from a close associate of mine on the 6502.org web forum, so this is not my original idea by any means. For example, to clear the screen on a Commodore 64:
: CLS
LDX#, 0 1, LDA#, $20 1, BEGIN, STAA,X, $0400 2, STAA,X, $0500 2, STAA,X, $0600 2, STAA,X, $0700 2, INX, 0<>, UNTIL, ;--SamuelFalvo?
One frustration with ForthAssistedHandAssembly is that it does not support 64-bit code or the REX prefix byte. This could be changed, but it will require some retooling.
There is a similar but completely un-Forth-like project on Google Code called AsmJit?: http://code.google.com/p/asmjit/ . I have looked at its documentation but cannot find where you make it call Windows API functions such as RtlAddFunctionTable or RtlInstallFunctionTableCallback. This means you cannot JIT-compile a "frame function." On 64-bit Windows systems, proper exception handling requires that only frame functions can call other functions. Frame functions have to have certain types of prologs and epilogs, and Function Table metadata has to describe them. (Leaf functions are exempt from this requirement but cannot call other functions.) The Compiler class in AsmJit? has Prolog and Epilog functions, but I don't think they have anything to do with Function Tables in x64 Windows.