Number Systems

Dale Swanson 11-11-2007

We all grew up using a number system that has 10 digits. However, there are other number systems with any number of digits you want. To understand how these work we'll look at how our normal system works. But, before that we need to clear up the terms:

Digit: A digit is a symbol the represents a number, note that there are only 10 digits in our number system, and we combine them to make numbers. A digit is comparable to a letter, and a number is comparable to a word (note a single letter can be a word, just like a single digit can be a number).
Base X: When dealing with number systems we say base to mean how many digits a system uses. Our normal system uses 10 and is thus base 10.
Decimal: This is the normal number system we use day in and day out, it has 10 digits and thus is base 10. Dec is some Roman root for 10, and thus the name (along with decade).
Binary: This is maybe the most commonly heard other number system, base 2 (root of bi). It's very easy to represent since you only need two digits, you can use just about anything to store numbers as binary, which is why computers use binary.
Octal: This is base 8, and the only reason it gets used is because 8 is a power of 2, and binary is base 2. It's easy to convert numbers from one base to another if the bases are powers of each other.
Hexadecimal: Usually called hex, it's base 16 (hex is 6 + decimal for 10 = 16). This is probably the most commonly used number system other than decimal. Again since 16 is a power of 2 (2⁴) it's easy to convert numbers from binary to hex and back. Almost no one works with real binary, rather they work with hex representing binary.

Decimal

We have 10 digits, 0, 1, 2, 3, 4, 5, 6, 7, 8, and 9, and with those we can make any number we want. When we count we start at 0, then we go up by one to 1, then another one up to 2, again to 3, and so on until 9. At nine we have a problem, we've run out of digits, so what do we do? We add a 1 to the left and go back to 0, giving us 10. This is obvious to us since we've been using it for life, but you need to really understand what's going to to grasp how other number systems work. You may remember that there are an infinite number of 0's to the left of every number (and also to the right after the decimal place), we don't usually write these out because writing an infinite number of 0's gets old fast. You can write any number of 0's less than infinity and it is still the same number (there are still infinite more 0's assumed to the left of the 0's you write). So the number 138 = 0138 = 00000138. Back to counting we really start out with 000, then 001, then 002 ... 009 then it's easy to see why a one pops up next to the nine, since there was a zero there all along, and the next digit after zero is one. So now we have 010, then 011, 012 ... 019, then again we just rollover the one to a two, and reset the nine to zero, giving us 020, then 021 ... up to 099. You can probably guess that this is no different than just 009, in that the zero is going to go up to one and the nines are going to reset to zeros, giving us 100, then 101, etc. So now you probably understand better how counting actually works. Let's see what happens in another base.

Base 9

No one uses base nine, for anything useful, but since it's just one less than our system I'll use it to break in the concept. Now the first confusing thing about bases is that the highest digit is one less than the base you are in. In base 10 the highest digit is 9, not 10, there is no digit for 10 (it's a combo of two digits). This is because the first digit is always zero, the second digit is one, you can see how we are one higher in the digit count than what number the digit represents. In base 10 this seems obvious since we are used to it, but in base 9 it means that the highest digit is eight, not nine (if it were nine it'd be no different than base 10). So let's see what happens when we count in base 9, like we did above in base 10. Start with 000, then 001, then 002, it's the same up to 008. Now we have a problem, we've run out of digits (remember the digit 9 doesn't exist in base 9). However, this is no different than when we ran out of digits in base 10. What did we do there? We rolled over the zero to a one, and reset the nine to zero. We'll do the same here (except we reset the eight). Thus the next number after 008 is 010. The numbers continue as before 011, 012 ... 018, again we don't have a nine, so we just rollover giving 020, this pattern continues up to 088, now we have to roll over that zero to one and we have 100, then 101...etc. Counting in different bases is actually pretty easy, you just have to make a conscience effort not to use the digits that don't exist. As for how to read the numbers, the convention is to say each digit separately, and if it hasn't been established say the base you are in. To read the number 047 in base 9 you'd say "zero four seven, base nine". If you said "forty seven, base nine" it would be assumed you meant the number 47 (in base 10) in base 9, which is 52. That last part may be confusing, and that's because converting between bases is confusing. For now lets just stick to counting in the bases.

Binary

Binary is base 2, and as I said above binary is wonderful because it only needs two symbols, and thus is very easy to represent. For instance you could store a number in binary with just dots and blanks, or dots and dashes, or horizontal and vertical lines (| and -). In the real world you could store it using anything that could be measured by setting a barrier, and saying anything below it is one thing, and anything over is the other. For example you could use the length of sticks, and say any stick over a foot in length is one digit, and the other digit is any stick shorter than a foot. In computers binary is stored using measures of electricity and magnetism.

So let's count in binary. We start as always with 000. We go up by one to 001, however we already have a problem, as 0 and 1 are our only digits (base 2 only has two digits, remember the number of the base is never a digit in that base, thus 2 isn't a digit in base 2). Since we have no other way to continue we must rollover the zero, giving us 010. Now we continue, 011, but again we run out of digits, so we must rollover both ones, giving 100, then 101, then (rollover again) 110, and 111. That's all the number we can represent with three digits in binary. To continue we must rollover one of the assumed zeros to the left to get 1000, then 1001, and so on. Binary is much easier to understand if you see it in a chart which I have below.

Hexadecimal

Hexadecimal is base 16, and like I said above hex is used because it's easy to convert from binary to hex. Binary is annoying to use because it takes so many digits to represent numbers (for example the number 138 in decimal is 10001010 in binary). The higher the base the less digits you need to represent a number. Let's count in hex. Start with 000, 001, 002...009, now we have a real problem, since we have sixteen digits in hex we can't rollover to 010 yet. But, what do we use after nine to represent that digit. There are no rules, you could use anything, but by far the convention is the start using letters. So after 009 we have 00a. There are no absolute rules about this but almost always a follows right after 9 (and a or A can be used interchangeably), then the letters continue in normal order for however long you need. So we are at 00a, then 00b, 00c, 00d, 00e, 00f. F is our last digit (digit sixteen), so we must rollover to 010. Then we continue as always 011, 012... 019, 01a, 01b... 01f, (rollover) 020, 021... 0ff (rollover) 100. Again this is easier to understand when it's laid out in a chart, don't worry the chart is coming.

Base-26

Base 26 isn't really used much, however, it does have one obvious quality. It has the same number of digits as letters in the alphabet. Normally with a number system we start with zero, then use letters when we use all the normal digits. With base 26 though, I like to use only letters. 0 is a, 1 is b, and so on. This can be a bit confusing, since we aren't used to seeing a's as zeros. Thus there are an infinite amount of a's to the left of the number. To count we start with aaa (which is the same as a or aaaaaa, it's all zero), then go to aab, aac, aad, and so on. When we get to aaz we rollover to aba just as you'd expect. Since base 26 isn't really used how you represent the digits depends on the context, you can use all letters or the number letter combo like in hex. Although I don't see why you'd be using base 26 except to be able to encode your numbers as pure letters.

Base-36

If you want to use letters and numbers then base 36 is a much better choice. Base 36 uses the digits 0-9 then the letters a-z for it's digits. To count you start with 000, 001...009, 00a, 00b...00z, 010, 011...019, 01a, 01b...01z, 020. There's not much more to say about base 36, if you understood the other bases you should understand base 36.

Base-64

Base 64 is the highest base I've ever seen used at all. It has two qualities that make it useful. First, it's a power of 2, like octal or hex, meaning it's easy to convert from binary to it. Second, it uses all the letters, lower and upper case, plus all the digits, and then 2 other symbols. What the other 2 symbols are isn't really agreed upon, and really it doesn't matter. I like to use space and period, since they are the two most common none letter or digit characters you'd find in normal text. Using space can cause problems though, as multiple spaces next to each other can be quite hard to read. So in the chart below I used slash / instead. To count in base 64 we start with 000, 001...009, 00A, 00B...00Z, 00a, 00b...00z, 00/, 00., 010, 011. Again there's no real standard as to which is greater A or a, so just use whatever.

Bits and Bytes

Bits and bytes aren't a number system, so why am I bringing them up? Because computers store everything using them, and thus they go hand in hand with binary and hex. A bit is a binary digit. A bit is a single digit in binary, a 1 or 0, yes or no, true or false, positive or negative. It's the smallest amount of info you can store. There are eight bits in a byte. When you write out a byte in binary you write out 8 zeros or ones, usually you put a space between groups of eight. The number 138 in binary would be 10001010, that would be one byte. Since a byte is eight base two digits it can store any number up to 2⁸ = 256. This is the same for any number system, the highest number you can represent with d digits in base b is b^d. So with 5 digits in decimal you can store 10⁵ = 100,000 different numbers. A note here is that since you have to represent zero you can only represent up to 99,999 with five digits, although there are 100,000 numbers from 0-99,999 (count them, I'll wait).

Ok, welcome back. So now we know that a byte is eight ones or zeros called bits. A byte can represent any number from 0-255 (if you really want to be confused a byte could be considered a base 256 number). Everything on a computer is converted to numbers and then stored as a series of numbers. Text for example is stored using a standard called ASCII, in which each character (letter) is one byte (eight bits). A capital X for example is stored as the number 88 (in base 10) which is 01011000 in binary. As I said above dealing with binary becomes annoying due to its size. This is why hex is used. A single hex digit can represent four binary digits. Thus two hex digits can represent eight binary digits, and what else is eight binary digits? A byte, right. You can test this by opening up notepad typing a single letter then saving it and checking the file size. It's will be one byte. So our capital X in hex is 58. 58 is much easier to work with than 01011000, and unlike 88 (which is the decimal form of it), you can still easily convert from 58 to binary. To see what numbers represent all the letters you can look up an ASCII Chart.

I keep saying hex is easy to convert to binary, but you probably aren't convinced. Like I said a single hex digit can represent four bits (binary digit). For example E is 1110. Two hex digits represents eight bits, example E3 is 1110 0011. I spaced that binary number into two groups of four to show that the E still becomes 1110 just like it did when it was by itself, and 3 becomes 0011. No matter how many hex digits are around hex E it will always become binary 1110. Going from binary to hex the reverse is true, 1110 will always become E in hex. If you don't have a multiple of four number of bits then you just add zeros to the left (remember there are infinite zeros to the left) until you do.

Since a byte is eight bits, and eight bits is two hex digits you usually see hex in two digit pairs, which represent bytes. As an example the ASCII version of Monkey in hex would be 4D 6F 6E 6B 65 79. Each two hex digit pair represents one character, which is one byte, and eight bits. Another common use of hex is to represent colors. If you've done any html or photo editing, you've probably seen six alphanumeric codes that represent colors. These are just three hex pairs. Each pair represents a number from 0-255 and that represents how bright a color is. Any color can be made by combining three colors (the three are different for light and pigments, for light they are red, green, blue, for pigments it's red, blue, yellow, or technically magenta, cyan, yellow). If you get really close to a TV you can see that there are a bunch of little squares made up of those three colors. By displaying different levels of brightness from each of those a TV (or a monitor) can make any color it wants. Each of those squares is called a pixel. When you see those six digit codes what are are seeing is the level of brightness of red, green, and blue on a scale from 0-255, or 00-FF in hex. Thus 00FF00 is pure green (255 level brightness green, 0 brightness for blue and red). All the colors maxed out make white (FFFFFF), and all of them at zero make black (000000).

Well I think I've explained this pretty thoroughly now. You should grasp how the number systems work, and why hex is commonly used when dealing with computers. If you are still foggy here is the promised chart:

Name	Decimal	Binary	Octal	Hexadecimal	Base-26	Base-26	Base-36	Base-64	Roman
Base	10	2	8	16	26	26	36	64
zero	0	00000000	0000	00	00	AA	00	00
one	1	00000001	0001	01	01	AB	01	01	I
two	2	00000010	0002	02	02	AC	02	02	II
three	3	00000011	0003	03	03	AD	03	03	III
four	4	00000100	0004	04	04	AE	04	04	IV
five	5	00000101	0005	05	05	AF	05	05	V
six	6	00000110	0006	06	06	AG	06	06	VI
seven	7	00000111	0007	07	07	AH	07	07	VII
eight	8	00001000	0010	08	08	AI	08	08	VIII
nine	9	00001001	0011	09	09	AJ	09	09	IX
ten	10	00001010	0012	0A	0A	AK	0A	0A	X
eleven	11	00001011	0013	0B	0B	AL	0B	0B	XI
twelve	12	00001100	0014	0C	0C	AM	0C	0C	XII
thirteen	13	00001101	0015	0D	0D	AN	0D	0D	XIII
fourteen	14	00001110	0016	0E	0E	AO	0E	0E	XIV
fifteen	15	00001111	0017	0F	0F	AP	0F	0F	XV
sixteen	16	00010000	0020	10	0G	AQ	0G	0G	XVI
seventeen	17	00010001	0021	11	0H	AR	0H	0H	XVII
eighteen	18	00010010	0022	12	0I	AS	0I	0I	XVIII
nineteen	19	00010011	0023	13	0J	AT	0J	0J	XIX
twenty	20	00010100	0024	14	0K	AU	0K	0K	XX
twenty one	21	00010101	0025	15	0L	AV	0L	0L	XXI
twenty five	25	00011001	0031	19	0P	AZ	0P	0P	XXV
twenty six	26	00011010	0032	1A	10	BA	0Q	0Q	XXVI
twenty seven	27	00011011	0033	1B	11	BB	0R	0R	XXVII
twenty eight	28	00011100	0034	1C	12	BC	0S	0S	XXVIII
thirty five	35	00100011	0043	23	19	BJ	0Z	0Z	XXXV
thirty six	36	00100100	0044	24	1A	BK	10	0a	XXXVI
thirty seven	37	00100101	0045	25	1B	BL	11	0b	XXXVII
thirty eight	38	00100110	0046	26	1C	BM	12	0c	XXXVIII
sixty one	61	00111101	0075	3D	29	CJ	1P	0z	LXI
sixty two	62	00111110	0076	3E	2A	CK	1Q	0/	LXII
sixty three	63	00111111	0077	3F	2B	CL	1R	0.	LXIII
sixty four	64	01000000	0100	40	2C	CM	1S	10	LXIV
sixty five	65	01000001	0101	41	2D	CN	1T	11	LXV
sixty six	66	01000010	0102	42	2E	CO	1U	12	LXVI
one hundred twenty six	126	01111110	0176	7E	4M	DW	3I	1/	CXXVI
one hundred twenty seven	127	01111111	0177	7F	4N	DX	3J	1.	CXXVII
one hundred twenty eight	128	10000000	0200	80	4O	DY	3K	20	CXXVIII
one hundred twenty nine	129	10000001	0201	81	4P	DZ	3L	21	CXXIX
one hundred thirty	130	10000010	0202	82	50	EA	3M	22	CXXX
two hundred fifty four	254	11111110	0376	FE	9K	JU	72	3/	CCLIV
two hundred fifty five	255	11111111	0377	FF	9L	JV	73	3.	CCLV
two hundred fifty six	256	100000000	400	100	9M	JW	74	40	CCLVI
one thousand	1,000	1111101000	1750	3E8	1CC	BMM	RS	Fe	M
two thousand, five hundred	2,500	10111011100	4704	9C4	3I4	CSD	1XG	d4	MMD
five thousand	5,000	1001110001000	11610	1388	7A8	HKI	3UW	1E8	V
sixteen thousand, three hundred eighty three	16,383	11111111111111	37777	3FFF	O63	YFC	CN3	3..	XVMCCCLXXXIII
sixteen thousand, three hundred eighty four	16,384	100000000000000	40000	4000	O64	YFD	CN4	400	XVMCCCLXXXIV
seventeen thousand, four hundred fifty seven	17,457	100010000110001	42061	4431	PLB	ZVL	DGX	4Gn	XVMMCDLVII
thirty two thousand, seven hundred sixty seven	32,767	111111111111111	77777	7FFF	1MC7	BWMH	PA7	7..	XXXMMDCCLXVII
thirty two thousand, seven hundred sixty eight	32,768	1000000000000000	100000	8000	1MC8	BWMI	PA8	800	XXXMMDCCLXVIII
sixty five thousand, five hundred thirty five	65,535	1111111111111111	177777	FFFF	3IOF	DSAP	1EKF	F..	LXVDXXXV
sixty five thousand, five hundred thirty six	65,536	10000000000000000	200000	10000	3IOG	DSAQ	1EKG	G00	LXVDXXXVI
four billion, two hundred ninety four million, nine hundred sixty seven thousand, two hundred ninety five	4,294,967,295	11111111111111111111111111111111	37777777777	FFFFFFFF	DNCHBNL	NXMRLXV	1Z141Z3	3.....
four billion, two hundred ninety four million, nine hundred sixty seven thousand, two hundred ninety six	4,294,967,296	100000000000000000000000000000000	40000000000	100000000	DNCHBNM	NXMRLXW	1Z141Z4	400000

Home

Stuff I Wrote