AN codes explained
AN codes are error-correcting code that are used in arithmetic applications.[1] Arithmetic codes were commonly used in computer processors to ensure the accuracy of its arithmetic operations when electronics were more unreliable. Arithmetic codes help the processor to detect when an error is made and correct it. Without these codes, processors would be unreliable since any errors would go undetected. AN codes are arithmetic codes that are named for the integers
and
that are used to encode and decode the codewords.
These codes differ from most other codes in that they use arithmetic weight to maximize the arithmetic distance between codewords as opposed to the hamming weight and hamming distance. The arithmetic distance between two words is a measure of the number of errors made while computing an arithmetic operation. Using the arithmetic distance is necessary since one error in an arithmetic operation can cause a large hamming distance between the received answer and the correct answer.
Arithmetic Weight and Distance
The arithmetic weight of an integer
in base
is defined by
where
<
,
, and
.
[2] The arithmetic distance of a word is upper bounded by its hamming weight since any integer can be represented by its standard polynomial form of
where the
are the digits in the integer. Removing all the terms where
will simulate a
equal to its hamming weight. The arithmetic weight will usually be less than the hamming weight since the
are allowed to be negative. For example, the integer
which is
in binary has a hamming weight of
. This is a quick upper bound on the arithmetic weight since
. However, since the
can be negative, we can write
which makes the arithmetic weight equal to
.
The arithmetic distance between two integers is defined by
This is one of the primary metrics used when analyzing arithmetic codes.[3] [4]
AN Codes
AN codes are defined by integers
and
and are used to encode integers from
to
such that
<
Each choice of
will result in a different code, while
serves as a limiting factor to ensure useful properties in the distance of the code. If
is too large, it could let a codeword with a very small arithmetic weight into the code which will degrade the distance of the entire code. To utilize these codes, before an arithmetic operation is performed on two integers, each integer is multiplied by
. Let the result of the operation on the codewords be
. Note that
must also be between
to
for proper decoding. To decode, simply divide
. If
is not a factor of
, then at least one error has occurred and the most likely solution will be the codeword with the least arithmetic distance from
. As with codes using hamming distance, AN codes can correct up to
errors where
is the distance of the code.
For example, an AN code with
, the operation of adding
and
will start by encoding both operands. This results in the operation
. Then, to find the solution we divide
. As long as
>
, this will be a possible operation under the code. Suppose an error occurs in each of the binary representation of the operands such that
and
, then
. Notice that since
, the hamming weight between the received word and the correct solution is
after just
errors. To compute the arithmetic weight, we take
which can be represented as
or
. In either case, the arithmetic distance is
as expected since this is the number of errors that were made. To correct this error, an algorithm would be used to compute the nearest codeword to the received word in terms of arithmetic distance. We will not describe the algorithms in detail.
To ensure that the distance of the code will not be too small, we will define modular AN codes. A modular AN code
is a subgroup of
, where
. The codes are measured in terms of modular distance which is defined in terms of a graph with vertices being the elements of
. Two vertices
and
are connected iff
x-x'\equiv\pmc ⋅ rj\pmod{m}
where
and
<
<
,
. Then the modular distance between two words is the length of the shortest path between their nodes in the graph. The modular weight of a word is its distance from
which is equal to
wm(x)=min\{w(y)|y\inZ,y\equivx\pmod{m}\}
In practice, the value of
is typically chosen such that
since most computer arithmetic is computed
so there is no additional loss of data due to the code going out of bounds since the computer will also be out of bounds. Choosing
also tends to result in codes with larger distances than other codes.
By using modular weight with
, the AN codes will be
cyclic code.
definition: A cyclic AN code is a code
that is a subgroup of
, where
[rn-1]=\{0,1,2,...,rn-1\}
.
A cyclic AN code is a principal ideal of the ring
. There are integers
and
where
and
satisfy the definition of an AN code. Cyclic AN codes are a subset of cyclic codes and have the same properties.
Mandelbaum-Barrows Codes
The Mandelbaum-Barrows Codes are a type of cyclic AN codes introduced by D. Mandelbaum and J. T. Barrows.[5] [6] These codes are created by choosing
to be a prime number that does not divide
such that
is generated by
and
, and
. Let
be a positive integer where
and
. For example, choosing
, and
the result will be a Mandelbaum-Barrows Code such that
<
in base
.
To analyze the distance of the Mandelbaum-Barrows Codes, we will need the following theorem.
theorem: Let
be a cyclic AN code with generator
, and
Then,
\sumxwm(x)=n(\lfloor
\rfloor-\lfloor
\rfloor)
proof: Assume that each
has a unique cyclic
NAF[7] representation which is
x\equiv
ci,xri\pmod{rn-1}
We define an
matrix with elements
where
and
. This matrix is essentially a list of all the codewords in
where each column is a codeword. Since
is cyclic, each column of the matrix has the same number of zeros. We must now calculate
, which is
times the number of codewords that don't end with a
. As a property of being in cyclic NAF,
iff there is a
with
<
. Since
with
<
, then
<
. Then the number of integers that have a zero as their last bit are
. Multiplying this by the
characters in the codewords gives us a sum of the weights of the codewords of
as desired.
We will now use the previous theorem to show that the Mandelbaum-Barrows Codes are equidistant (which means that every pair of codewords have the same distance), with a distance of
proof: Let
, then
and
is not divisible by
. This implies there
\existsj(N\equiv\pmrj\pmod{B})
. Then
. This proves that
is equidistant since all codewords have the same weight as
. Since all codewords have the same weight, and by the previous theorem we know the total weight of all codewords, the distance of the code is found by dividing the total weight by the number of codewords (excluding 0).
See also
Notes and References
- Book: Peterson . W. Wesley . Jr . E. J. Weldon . Error-Correcting Codes, second edition . 15 March 1972 . MIT Press . 978-0-262-52731-6 . en.
- Clark . W. . Liang . J. . On arithmetic weight for a general radix representation of integers (Corresp.) . IEEE Transactions on Information Theory . November 1973 . 19 . 6 . 823–826 . 10.1109/TIT.1973.1055100.
- Book: Peterson . W. Wesley . Jr . E. J. Weldon . Error-Correcting Codes, second edition . 15 March 1972 . MIT Press . 978-0-262-52731-6 . en.
- Astola . J. . A note on perfect arithmetic codes (Corresp.) . IEEE Transactions on Information Theory . May 1986 . 32 . 3 . 443–445 . 10.1109/TIT.1986.1057175.
- Massey . James L. . García . Oscar N. . Error-Correcting Codes in Computer Arithmetic . Advances in Information Systems Science . 1972 . 273–326 . 10.1007/978-1-4615-9053-8_5.
- J.H. Van Lint (1982). Introduction to Coding Theory. GTM. 86. New York: Springer-Verlag.
- Clark, W. E. and Liang, J. J.: On modular weight and cyclic nonadjacent forms for arithmetic codes. IEEE Trans. Info. Theory, 20 pp. 767-770(1974)