PiDP-8/I Software: Artifact [0f9f794363]

Artifact 0f9f79436352f679b46455c5adfb25061aec69d3:

Wiki page [PEP001.PA] by tangent 2016-12-01 15:26:00.

D 2016-12-01T15:26:00.561
L PEP001.PA
N text/x-markdown
P 867307b25b3555a75a2cfb066c5935dc604fa536
U tangent
W 15522
# PEP001.PA

## Or, How I Managed to Spend Three Days Solving FizzBuzz

[FizzBuzz](http://rosettacode.org/wiki/FizzBuzz) is a common test problem for software developers, often seen in interviews. It is [famously](https://blog.codinghorror.com/why-cant-programmers-program/) easy for a competent programmer to solve, yet somehow I — a self-styled competent programmer — managed to spend three days solving it. Let me tell you the story.

## The Challenge

I set myself the challenge of solving [Project Euler Problem #1](https://projecteuler.net/problem=1) — a close relative of FizzBuzz — in PDP-8 assembly language as a way to force myself to learn the platform. Nothing motivates like an achievable goal, particularly when one's good self-image is on the line.

When I started, I had only been playing with [my PiDP-8/I](wiki?name=Warren%27s+PiDP-8/I+System) for a few weeks. I had no prior experience with a PDP-8, or any similar machine. The closest experience I had was a little experience with 6502 assembly language on the Apple ][ series in the early to late 1980s. I'd barely figured out how to run the assembler prior to setting myself this challenge.

On the positive side of the balance scale, I've been a software developer for over 30 years now, almost all of that in a professional capacity. I've learned an [awful lot of programming languages and CPUs in that time](http://tangentsoft.com/resume/), including a few other assembly languages.

How hard could it be?

## Trap 1: No ROM, No Runtime

One universal characteristic of all personal computers is that it needs to have some sort of ROM, a fixed bit of programming that contains commonly-used routines, such as one to print a line of text to the user's display. The OS may paper over this ROM with some higher-level runtime, but there must be some base level of functionality that you can count on being present so you don't continually reinvent the wheel.

The PDP-8 was the spiritual ancestor of the personal computer: inexpensive, simple, and small, by the standards of the time. Just like its microcomputer descendants, the PDP-8 had a flat file system, and it ran BASIC, but real programmers wrote in assembly instead. The PDP-8 wasn't quite a "personal" computer, but it was as close as you could get in 1965-1974, the heyday of the PDP-8, before microcomputers ate its lunch.

One major thing the PDP-8 didn't have that microcomputers had, though, is a ROM. When you turn a PDP-8 on, the CPU initializes itself, and then *nothing else happens*. It just sits there, waiting for you to give it some code to run. Every single word of program code that the machine runs had to be put in its memory by some human. DEC shipped nothing, other than the media you need to load such programs in from. Strictly speaking, data media aren't even necessary: you can toggle programs in via [the front panel](https://en.wikipedia.org/wiki/Front_panel) if you want instead. This directly sets memory locations with values you specify, in binary. It doesn't get much more low-level than this.

So that brings me to Trap 1: How does this program print an answer to the screen? Solution? Write your own [decimal number printing](finfo?name=examples/routines/decprt.pal) and [ASCII string printing](finfo?name=examples/routines/prints.pal) routines! In assembly language. Debugged via the front panel.

There's one day gone.

## Trap 2: No Modulo Operator

The standard solution to FizzBuzz is to use the [modulo operator](https://en.wikipedia.org/wiki/Modulo_operation), which returns the remainder when dividing one number into another. 6 mod 4 is 2, for example.

Major problem: the PDP-8's stock instruction set doesn't have a modulo operator, or even a division operator. This was common on early microcomputers, too: the 6502, the 8080, and the Z80 all lacked a hardware multiplier, and certainly didn't have a divider.

Fortunately for me, DEC produced an add-on to the PDP-8/I I used for this project called the [KE8/I](http://dustyoldcomputers.com/pdp-common/reference/drawings/peripherals/ke8i.html), also known as the Extended Arithmetic Element. The PDP-8 simulator I used for this project does have an EAE feature, so I was able to use its `DVI` instruction.

But of course the standard PDP-8 assembler doesn't know about the `DVI` instruction, or any of the special flags, registers, and microcode instructions you need to use along with it. You must actually define all of this at the top of your program so that the assembler knows how to generate the necessary code!

`DVI` is an integer divide instruction that leaves the remainder in the PDP-8's accumulator, which means it effectively implements the modulo operation as a side effect of doing division. But in order to figure that out, you must read the KE8/I specification document, which describes in excruciating detail how the KE8/I is implemented at the silicon gate level.

Nowhere in this document does it simply say "divides the value following the DVI instruction into {AC:MQ}, with the quotient in MQ and the remainder in AC." It does say all of that, piece by piece over the *five* pages documenting the `DVI` instruction, but you must extract it yourself. Here, I've done that for you in a single sentence. You're welcome.

There's another day gone.

## Trap 3: No Precision

The problem I set myself states that you must add the values of the numbers that match the given criteria up, so that the web site knows you have solved it if you can provide a rather large integer in an input box.

No, I'm not going to tell you what the answer is. Go write a 5-line Python program to solve it yourself.

Me, I took 156 PDP-8 instructions to solve it, not counting space for constants, variables, and links, the latter being a dirty trick that PDP-8 assemblers use for jumping between 128-word "pages" of memory. And you thought the 64k segments of real mode x86 assembly were bad!

But I digress.

The PDP-8 is a 12-bit machine. Since the standard arithmetic operations assume you want signed integer arithmetic, this means the largest positive number you can do math on within a PDP-8 is 2047. As the problem is posed on the Project Euler site, you need to add up the matching numbers in the range 1 to 999. That means if only three numbers match, we're likely to run over that limit.

I solved this by making my program keep track of the current total and print it out as a subtotal value once the total exceeds 1024, since no single integer in the target range could possibly overflow a 12-bit signed integer. It wrote this out as a long string of additions:

ANSWER: 1064 + 1059 + 1060 + 1137 + 1158 + 1125 + 1047 + 1125 + ...

It went on like that for many lines. I ran this on the simulated PDP-8 via an OS X Terminal window, then copied the long string of additions and pasted it into [`bc(1)`](https://linux.die.net/man/1/bc) which did all the adding up on a processor many orders of magnitude more powerful than the PDP-8.

Cheating? Maybe. I briefly considered saving each subtotal to core memory, then writing a routine to add the values up in grade school fashion, right-to-left, carries and all.

I decided that, day 3 gone, that I'd done well enough.

## The Solution

Here is my winning solution to the problem, all 221 well-commented lines of it:

/ Project Euler Problem #1, Multiples of 3 and 5:
/
/ If we list all the natural numbers below 10 that are multiples of
/ 3 or 5, we get 3, 5, 6 and 9. The sum of these multiples is 23.
/ Find the sum of all the multiples of 3 or 5 below 1000.
/
/ Solution by Warren Young of tangentsoft.com, 2016.11.30
////////////////////////////////////////////////////////////////////////

////////////////////////////////////////////////////////////////////////
/ DIY ASSEMBLER INSTRUCTIONS

/ Our assembler doesn't know the EAE instructions, so teach it
DVI=7407 / integer divide .+1 into {AC:MQ}, answer in MQ
MUY=7405 / integer multiply MQ by .+1, answer in {AC:MQ}

/ Combined microcoded instruction aliases
CLR=CLA CLL / clear both AC and L
MCM=CLA MQA / move MQ to AC
TCA=CMA IAC / two's complement the accumulator
XMA=MQA MQL / exchange MQ and AC
AC1=CLA IAC / set AC to 1

//// MAIN //////////////////////////////////////////////////////////////
/ Program entry point. We purposely reinitialize global variables and
/ processor state in case we're restarting this program in-core.

PAGE 1
MAIN, CLR
TAD (3)
DCA CURR / start with 3, because we know 1 & 2 can't work
DCA TOTAL / reset total to 0
TLS / send null to terminal to get flags set right
TAD (ANSWER) / write "ANSWER: " to the terminal
JMS PRINTS
JMP MLCORE
CURR, 0 / current number we're checking
TOTAL, 0 / the answer so far; at the end, printed out

/ Constants
DECIMAL
MAX, 999 / check natural numbers CURR to MAX; must be < 2048!
STMAX, 1024 / subtotal max; avoids overflow of 12-bit signed int

OCTAL
CRLF, 15;12;0 / ASCII character values; don't forget trailing 0!
PLUS, 40;53;40;0
ANSWER, 101;116;123;127;105;122;72;40;0

//// MLCORE ////////////////////////////////////////////////////////////
/ The core of the main loop. MAIN just inits the globals and calls us.

/ Try dividing 3 into CURR first
MLCORE, CLR
TAD (3)
JMS ISMOD0
SNA / if ISMOD0 left AC = 0, CURR divided evenly by
JMP NXITER / 3, so skip 5 lest we count multiples of 15 2x

/ That didn't work out, so try dividing 5 into it
CLA
TAD (5)
JMS ISMOD0

/ Loop cleanup
NXITER, CLA
TAD CURR
TCA
TAD MAX / = 0 if CURR == MAX
SNA / if so, leave calculation loop
JMP MLDONE

CLR / CURR still < MAX, so increment CURR
TAD CURR
IAC
DCA CURR

TAD TOTAL / if TOTAL is getting too big, print...
TCA / a subtotal and zero TOTAL so we don't...
TAD STMAX / overflow the 12-bit limit
SZL
JMP MLCORE / STMAX - TOTAL > 0 so re-enter loop core
JMS SHOTOT / exceeded threshold, so display subtotal and " + "
CLA
DCA TOTAL / take advantage of free zero
TAD (PLUS)
JMS PRINTS
JMP MLCORE

/ Calculation complete. Show answer and exit gracefully.
MLDONE, JMS SHOTOT
CLA
TAD (CRLF)
JMS PRINTS
JMP ENDG

//// ISMOD0 ////////////////////////////////////////////////////////////
/ If passed AC divides evenly into CURR (in C-speak, CURR % AC == 0)
/ add CURR to TOTAL and return 0 in AC. Else, return nonzero in AC and
/ leave TOTAL untouched.

ISMOD0, 0
/ Divide CURR by DIVISOR, passed as AC
DCA DIVISOR
TAD CURR / load CURR into just-cleared AC
MQL DVI / move CURR to MQ, divide by DIVISOR...
DIVISOR,0 / ...quotient in MQ, remainder in AC
SZA
JMP I ISMOD0 / remainder nonzero, so leave early

/ Division left AC empty, so CURR divides evenly by DIVISOR!
TAD CURR / don't need to clear AC; prior test says AC == 0
TAD TOTAL
DCA TOTAL
JMP I ISMOD0

//// SHOTOT ////////////////////////////////////////////////////////////
/ Write TOTAL to terminal in decimal, nothing following.

SHOTOT,0
CLR
TAD TOTAL
JMS DECPRT / print answer on console, in decimal
TSF / wait for terminal to be ready again
JMP .-1

JMP I SHOTOT / and done

//// PRINTS ////////////////////////////////////////////////////////////

PRINTS,0
DCA SADDR / save AC as string address
PSNEXT, TAD I SADDR / load next character
SNA
JMP I PRINTS / found the null terminator; leave

TSF / wait for terminal to be ready
JMP .-1
TLS / write character to the terminal

CLA / increment string address pointer
TAD SADDR
IAC
DCA SADDR

JMP PSNEXT / look at next character
SADDR, 0

//// ENDG //////////////////////////////////////////////////////////////
// End program gracefully, either re-entering OS/8 if we can see that
// its entry point looks sane, or halting with the answer in AC so the
// user can see the answer on the front panel.

ENDG, CLA
TAD OS8ENT
TCA
TAD OS8INS1
SNA
JMP 7600 / re-enter OS/8
CLR
TAD TOTAL
HLT / not running under OS/8, so halt
OS8ENT, 7600 / OS/8 entry point
OS8INS1,4207 / first instruction at that entry point

//// DECPRT ////////////////////////////////////////////////////////////
// Decimal number printer; see examples/routines/decprt.pal

PAGE 2
DECPRT, 0
DCA VALUE /SAVE INPUT
DCA DIGIT /CLEAR
TAD CNTRZA
DCA CNTRZB /SET COUNTER TO FOUR
TAD ADDRZA
DCA ARROW /SET TABLE POINTER
SKP
DCA VALUE /SAVE
CLL
TAD VALUE
ARROW, TAD TENPWR /SUBTRACT POWER OF TEN
SZL
ISZ DIGIT /DEVELOP BCD DIGIT
SZL
JMP ARROW-3 /LOOP
CLA /HAVE BCD DIGIT
TAD DIGIT /GET DIGIT
TAD K260 /MAKE IT ASCII
TSF /OR TAD DIGIT
JMP .-1 /JMS TDIGIT(SEE 8-19-U)
TLS /TYPE DIGIT
CLA
DCA DIGIT /CLEAR
ISZ ARROW /UPDATE POINTER
ISZ CNTRZB /DONE ALL FOUR?
JMP ARROW-1 /NO: CONTINUE
JMP I DECPRT /YES: EXIT
ADDRZA, TAD TENPWR
CNTRZA, -4
TENPWR, -1750 /ONE THOUSAND
-0144 /ONE HUNDRED
-0012 /TEN
-0001 /ONE
K260, 260
VALUE, 0
DIGIT, 0
CNTRZB, 0

//// END ///////////////////////////////////////////////////////////////
/ Assembler-generated constants will appear below this in the list file
$

Allow me to repeat that in most modern programming languages, you get the same result in about 5 lines of code, and you don't need to fob the arithmetic off on `bc(1)`.

Isn't it glorious?
Z 340ef7ea4da669e58332e9e3dbaaef55