LWOS: src/int.s comparison

comparison src/int.s @ 113:0aac26453849

Actually implement the 32 bit integer division algorithm correctly

author	William Astle <lost@l-w.ca>
date	Tue, 26 Dec 2023 23:37:45 -0700
parents	98b0646360e1
children

comparison

equal deleted inserted replaced

-:98b0646360e1
+:0aac26453849
 ;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;
 ; Divide 32 bit integer in fpa0 significand by 32 bit integer in fpa1 significand, both treated as unsigned. Leave
 ; quotient at fpaextra...fpaextra+3 and remainder at fpaextra+4...fpaextra+7; does not check for division by zero
 ; which will result in a quotient of 0xffffffff and a remainder will be the dividend. It will not get suck in a loop.
 ;
-; Algorithm is basically pencil and paper long division. We check to see if the divisor "goes" at each step by doing
+; Uses a 96 bit temporary at fpaextra. The high 32 bits are the result and the mid 32 bits will be the remainder. The
-; a trial subtraction without saving the result. If it doesn't go, we just loop around again. If it does go, we stash
+; remaining 32 bits are used to avoid modifing the fpa0 significand. The extra 4 bytes could be avoided by simply
-; a 1 bit in the quotient and actually do the subtraction. Then go loop around again. Doing it this way rather than
+; clobbering the fpa0 significand.
-; with an actual subtraction and then undoing it with addition saves two store instructions on the comparison saves
+;
-; having to do a restore in the no-go case which is going to be quite common with values whose upper bits are
+; The algorithm is basically the pencil and paper long division algorithm. First, the dividend is shifted left one
-; mostly zeroes, thus it makes the operations faster in that case, for integers. (Floating point is a different
+; bit into the remainder. If a carry occurs from that, the subtraction will succeed. If not, do a trial subtraction
-; problem.)
+; which is just the subtraction without saving the result. If that results in a carry, then the divisor doesn't "go"
+; at this bit position so the quotient bit will be zero. If there is no carry here or there was a carry on the shift,
+; it does "go" so the quotient bit will be one. If it went, actually do the subtraction. In either event, shift the
+; quotient bit into the accumulated quotient. Do this for all 32 bits.
+;
+; The carry has to be checked on the shift to simulate doing what is basically a 33 bit subtraction.
 util_div32      ldd fpa0+fpa.sig+2              ; copy dividend to result location
-std fpaextra+6
+std fpaextra+10
 ldd fpa0+fpa.sig
-std fpaextra+4
+std fpaextra+8
-ldb #32                         ; do 32 bits
+ldb #32                         ; do 64 bits - will give us a remainder
 stb fpa0+fpa.exp                ; save counter somewhere because we don't have enough registers
 ldd zero                        ; zero out remainder
+std fpaextra
+std fpaextra+2
 std fpaextra+4
 std fpaextra+6
-util_div32a     lsl fpaextra+3                  ; shift dividend residue into remainder
+util_div32a     lsl fpaextra+11                 ; shift residue and remainder over
-rol fpaextra+2
+rol fpaextra+10
-rol fpaextra+1
+rol fpaextra+9
-rol fpaextra
+rol fpaextra+8
 rol fpaextra+7
 rol fpaextra+6
 rol fpaextra+5
 rol fpaextra+4
-ldd fpaextra+6                  ; now subtract divisor from remainder
+pshs cc                         ; save "goes" status from shift
+bcs util_div32b                 ; brif it definitely goes
+ldd fpaextra+6                  ; do a trial subtraction
 subd fpa1+fpa.sig+2
 ldd fpaextra+4
 sbcb fpa1+fpa.sig+1
 sbca fpa1+fpa.sig
-bcs util_div32b                 ; brif it doesn't go - don't subtract or set bit
+bcs util_div32c                 ; brif it doesn't go - goes status on stack is right
-inc fpaextra+3                  ; set quotient bit
+inc ,s                          ; set "C" on the stack
-ldd fpaextra+6                  ; actually do the subtraction
+util_div32b     ldd fpaextra+6                  ; actually do the subtraction
 subd fpa1+fpa.sig+2
 std fpaextra+6
 ldd fpaextra+4
 sbcb fpa1+fpa.sig+1
 sbca fpa1+fpa.sig
 std fpaextra+4
-util_div32b     dec fpa0+fpa.exp                ; done all 32 bits?
+util_div32c     puls cc                         ; get back "goes" status
-bne util_div32a                 ; do another
+rol fpaextra+3                  ; shift quotient over and put the new bit in
+rol fpaextra+2
+rol fpaextra+1
+rol fpaextra
+dec fpa0+fpa.exp                ; done 32 bits?
+bne util_div32a                 ; brif not - handle another bit position
+rts
 *pragmapop list

Mercurial > hg > index.cgi

comparison src/int.s @ 113:0aac26453849