Java - char, int conversions

Cosmic_Dust picture Cosmic_Dust · Jan 23, 2014 · Viewed 21k times · Source

In Java, the following is allowed:

char c = 'A' + 1;

Here, c will hold the value 'B'. Above, first the expression is evaluated. So 'A' gets converted to 65, the whole expression evaluates to 66, and then 66 is converted to 'B' since we are storing the value in a char.

The following, however, gives a compile-time error:

char c = 'A';
c = c + 1;

What is the explanation for how Java views the expressions differently? By the way, the following works fine too:

char c = 'A';
c++;

Answer

Radiodef picture Radiodef · Jan 23, 2014

The first example (which compiles) is special because both operands of the addition are literals.

A few definitions to start with:

  • Converting an int to char is called a narrowing primitive conversion, because char is a smaller type than int.

  • 'A' + 1 is a constant expression. A constant expression is (basically) an expression whose result is always the same and can be determined at compile-time. In particular, 'A' + 1 is a constant expression because the operands of + are both literals.

A narrowing conversion is allowed during the assignments of byte, short and char, if the right-hand side of the assignment is a constant expression:

In addition, if the expression [on the right-hand side] is a constant expression of type byte, short, char, or int:

  • A narrowing primitive conversion may be used if the variable is of type byte, short, or char, and the value of the constant expression is representable in the type of the variable.

c + 1 is not a constant expression, because c is a non-final variable, so a compile-time error occurs for the assignment. From looking at the code, we can determine that the result is always the same, but the compiler isn't allowed to do that in this case.

One interesting thing we can do is this:

final char a = 'a';
char b = a + 1;

In that case a + 1 is a constant expression, because a is a final variable which is initialized with a constant expression.

The caveat "if […] the value […] is representable in the type of the variable" means that the following would not compile:

char c = 'A' + 99999;

The value of 'A' + 99999 (which is 100064, or 0x186E0) is too big to fit in to a char, because char is an unsigned 16-bit integer.


As for the postfix ++ operator:

The type of the postfix increment expression is the type of the variable.

...

Before the addition, binary numeric promotion* is performed on the value 1 and the value of the variable. If necessary, the sum is narrowed by a narrowing primitive conversion and/or subjected to boxing conversion to the type of the variable before it is stored.

(* Binary numeric promotion takes byte, short and char operands of operators such as + and converts them to int or some other larger type. Java doesn't do arithmetic on integral types smaller than int.)

In other words, the statement c++; is mostly equivalent to:

c = (char)(c + 1);

(The difference is that the result of the expression c++, if we assigned it to something, is the value of c before the increment.)

The other increments and decrements have very similar specifications.

Compound assignment operators such as += automatically perform narrowing conversion as well, so expressions such as c += 1 (or even c += 3.14) are also allowed.