negative integer number >> 31 = -1 not 1?

Question

so, lets say I have a signed integer (couple of examples):

-1101363339 = 10111110 01011010 10000111 01110101 in binary.
-2147463094 = 10000000 00000000 01010000 01001010 in binary.
-20552      = 11111111 11111111 10101111 10111000 in binary.

now: -1101363339 >> 31 for example, should equal 1 right? but on my computer, I am getting -1. Regardless of what negative integer I pick if x = negative number, x >> 31 = -1. why? clearly in binary it should be 1.

Pretty sure this is still undefined behavior in C. Which is to say, there's no hard rule about what'll happen on other systems. — cHao, Jun 21 '13 at 00:16
@cHao I remember hearing that it is up to the implementation wether to make the shift arithmetic, however most choose to do so — aaronman, Jun 21 '13 at 00:17
@aaronman: Yeah, apparently the standard (or my copy of the draft, at least) says it's implementation-defined. Oops. :) — cHao, Jun 21 '13 at 00:21
If you're trying to use this as a test for branch-optimization or something, just mask it afterwards: `(x >> 31) & 1`. Or even better, treat it as unsigned: `(unsigned)x >> 31`. — paddy, Jun 21 '13 at 00:27
"should equal 1 right?" -- Nope. "clearly in binary it should be 1" -- Nope. See http://en.wikipedia.org/wiki/Arithmetic_shift — Jim Balter, Jun 21 '13 at 00:49

Jeff Walden · Accepted Answer · 2013-06-21T00:37:32.910

20

Per C99 6.5.7 Bitwise shift operators:

If E1 has a signed type and a negative value, the resulting value is implementation-defined.

where E1 is the left-hand side of the shift expression. So it depends on your compiler what you'll get.

edited Jun 21 '13 at 00:37

answered Jun 21 '13 at 00:17

Jeff Walden

7,008
2
38
55

@aaronman: Cause people love a good standards quote. :) – cHao Jun 21 '13 at 00:23
9

@aaronman: cause his answer is the only correct one. – liori Jun 21 '13 at 00:23
2

@liori actually his isn't even an answer to the OP – aaronman Jun 21 '13 at 00:24
1

@aaronman: Sometimes the only correct answer is "it depends". Absent information about the CPU and compiler, you don't really know what's supposed to be the right answer. – cHao Jun 21 '13 at 00:25
1

@cHao I do not think the OP cares about the c standard rather his question is referring to the arithmetic shift of a number which is what he didn't understand, all this answer is is a quote of the standard without actually explaining what arithmetic shift is – aaronman Jun 21 '13 at 00:27
2

@aaronman: This is the only correct answer as far as the C language is concerned... – Kerrek SB Jun 21 '13 at 00:29
@all so, if its arithmetic it will be replace with all 1's, if the HSB is 1. so like -1101363339 >> 31 = FF FF FF FF = -1 . right? – dgamma3 Jun 21 '13 at 00:30
@KerrekSB just because it is correct doesn't mean it is helpful, anyone can quote the standard, this is the only answer that doesn't explain what is happening which is arithmetic bit shifting – aaronman Jun 21 '13 at 00:32
@dgamma3: Depends on the implementation. Each compiler/CPU combo can do things however it pleases. On your machine, with your compiler, that's apparently how it works. – cHao Jun 21 '13 at 00:32
2

@aaronman: Because it's C. *You can't count on that even happening.* – cHao Jun 21 '13 at 00:33
@cHao how else could it be though. the only other type of shift is logical. so your saying, it will either pick logical or arithmetic ? – dgamma3 Jun 21 '13 at 00:33
3

@dgamma3 it's implementation defined, so the OP's implementation defined it to be an arithmetic shift – aaronman Jun 21 '13 at 00:34
2

@dgamma3: It might. It might just keep the sign bit and rotate the rest. It might pick a random number. C leaves it entirely up to the compiler to specify what happens when you shift a negative number. – cHao Jun 21 '13 at 00:35
2

@aaronman: This website isn't just about being useful to any one person. It's about being useful to the community. Votes reflect that. Value to an individual is reflected by "accept" bounty. – Kerrek SB Jun 21 '13 at 00:36
thanks for your help guys. C is crazy right? – dgamma3 Jun 21 '13 at 00:37
3

@dgamma3: If you were to implement signed arithmetic with sign-magnitude or one's-complement representations, what would such a shift mean? The fact that there's "no natural best choice" is the reason why it's explicitly implementation-defined. An implementation can pick whichever methods suits it best (e.g. corresponds most closely to the hardware). – Kerrek SB Jun 21 '13 at 00:39
@KerrekSB imo quoting a part of the standard that basically says do whatever here is not helpful but I guess people are entitled to their opinions – aaronman Jun 21 '13 at 00:39
1

@cHao "Because it's C. You can't count on that even happening." This is certainly correct in the grand scheme of things. However, the behavior demonstrated by the OP can be explained by more than just saying "it depends". – Code-Apprentice Jun 21 '13 at 00:42
@dgamma3 - "C is crazy right?" C is what it is. Like most mainstream languages, it is very good for some applications and very bad for others. It is harder than most to use "correctly", since it defines "correctly" in a way that is impossible for a compiler to check fully. In exchange, you get portable, almost-direct access to the metal. – Nemo Jun 21 '13 at 00:52
@Code-Guru: Hardly. The compiler can do whatever it likes. Though the behavior is generally expected to be sane, only the docs can tell you for sure what happens. What if the compiler's behavior were to set the value to -1? Any discussion of arithmetic vs logical vs whatever becomes irrelevant without details about what behavior the implementation has defined in that case. – cHao Jun 21 '13 at 00:53
@Nemo: The dialects used in the 1990s provided almost-direct access to the metal. That has become unfashionable, however. If you can't imagine any reason why, on a 32-bit system with 16-bit `short`, the function `unsigned mult(unsigned short x, unsigned short y) { return x*y;}` should ever do anything other than return the arithmetical product of x and y, you lack imagination. From the standpoint of gcc's authors, the fact a processor would behave predictably doesn't give programmers any right to expect that a compiler for that processor will do likewise. – supercat May 05 '16 at 18:53

score 11 · Answer 2 · edited Jun 21 '13 at 01:39

11

In most languages when you shift to the right it does an arithmetic shift, meaning it preserves the most significant bit. Therefore in your case you have all 1's in binary, which is -1 in decimal. If you use an unsigned int you will get the result you are looking for.

Per C 2011 6.5.7 Bitwise shift operators:

The result of E1 >> E2 is E1 right-shifted E2 bit positions. If E1 has an unsigned type or if E1 has a signed type and a nonnegative value, the value of the result is the integral part of the quotient of E1/ 2^E2. If E1 has a signed type and a negative value, the resulting value is implementation-defined.

Basically, the right-shift of a negative signed integer is implementation defined but most implementations choose to do it as an arithmetic shift.

edited Jun 21 '13 at 01:39

jxh

69,070
8
110
193

answered Jun 21 '13 at 00:15

aaronman

18,343
7
63
78

+1 For suggesting `unsigned int` as a solution – Code-Apprentice Jun 21 '13 at 00:27
@Code-Guru thanks, I honestly don't understand why your answer and the other one got downvoted when the answer being upvoted isn't even an answer – aaronman Jun 21 '13 at 00:29
Yes, the answer by @JeffWalden doesn't really explain why the OP sees this particular behavior. – Code-Apprentice Jun 21 '13 at 00:36
@aaronman. if I use an unsigned int does this mean the binary interpretation of a negative value will be different? – dgamma3 Jun 21 '13 at 00:44
@dgamma3 yes but I think if you casted to one it may do some sort of conversion for you – aaronman Jun 21 '13 at 00:49
@indiv for the sake of humanity I have done it – aaronman Jun 21 '13 at 00:51
1

@dgamma3 An *unsigned* int can't be negative. So there is no "binary interpretation of a negative value" – Code-Apprentice Jun 21 '13 at 01:00
8

@aaronman: +1, but you should stop whining about the plight of your answer. If you are answering questions, your answer will often not get picked (even if you think it is better). – jxh Jun 21 '13 at 01:01
@jxh I agree with you, but everyone can be immature once in a while – aaronman Jun 21 '13 at 01:03

Code-Apprentice · Answer 3 · 2013-06-21T00:54:13.257

5

The behavior you are seeing is called an arithmetic shift which is when right shifting extends the sign bit. This means that the MSBs will carry the same value as the original sign bit. In other words, a negative number will always be negative after a left shift operation.

Note that this behavior is implementation defined and cannot be guaranteed with a different compiler.

edited Jun 21 '13 at 00:54

answered Jun 21 '13 at 00:15

Code-Apprentice

81,660
23
145
268

5

Nope, in C right shifting a negative value is implementation-defined behavior. Also, in C the `>>>` operator doesn't exist. – Matteo Italia Jun 21 '13 at 00:18
@jxh The other bits come from what is in the previous bit, which is the definition of a left shift. Obviously the OP doesn't know about the rules for the MSB. – Code-Apprentice Jun 21 '13 at 00:19
I don't think they have `>>>` in c unfortunately – aaronman Jun 21 '13 at 00:21
@jxh "This means that the MSB will always be the same no matter how many bits you shift to the left" This statement is only guaranteed for the MSB. I can easily imagine an example where `x >>= 1` changes *every* bit of `x` except the MSB. – Code-Apprentice Jun 21 '13 at 00:21
@aaronman Looks like you are right. Java has it, but not C. – Code-Apprentice Jun 21 '13 at 00:23
@MatteoItalia Clarified that this is the behavior that the OP is seeing. – Code-Apprentice Jun 21 '13 at 00:24
@Code-Guru: ok, downvote removed (but you should also reword "a negative number will always be negative after a left shift operation", which seems to imply that this is the case on any compiler). – Matteo Italia Jun 21 '13 at 00:25
@MatteoItalia Added further clarification. – Code-Apprentice Jun 21 '13 at 00:26
@Code-Guru: I know what you are trying to say, but your explanation is a little ambiguous. For example, with an 8 bit int, your explanation could be taken as: `10000000 >> 4` becomes `10001000`. – jxh Jun 21 '13 at 00:32
@jxh If you have any suggestions to make it more clear, feel free to edit my answer. – Code-Apprentice Jun 21 '13 at 00:34
1

@Code-Guru: Much better, thanks. +1 – jxh Jun 21 '13 at 00:38
@jxh Hopefully my latest edit is more to your liking. I've tried to narrow the language to indicate that I am talking about the specific behavior demonstrated by the OP. – Code-Apprentice Jun 21 '13 at 00:38
"left shifting" -- No, this is a right shift. – Jim Balter Jun 21 '13 at 00:52
@JimBalter I live in a [a mirror world](http://en.wikipedia.org/wiki/Mirror,_Mirror_(Star_Trek:_The_Original_Series)). – Code-Apprentice Jun 21 '13 at 00:55

Matteo Italia · Answer 4 · 2013-06-21T01:57:29.243

What you are seeing is an arithmetic shift, in contrast to the bitwise shift you were expecting; i.e., the compiler, instead of "brutally" shifting the bits, is propagating the sign bit, thus dividing by 2^N.

When talking about unsigned ints and positive ints, a right shift is a very simple operation - the bits are shifted to the right by one place (inserting 0 on the left), regardless of their meaning. In such cases, the operation is equivalent to dividing by 2^N (and actually the C standard defines it like that).

The distinction comes up when talking about negative numbers. Several negative numbers representation exist, although currently for integers most commonly 2's complement representation is used.

The problem of a "brutal" bitwise shift here is, for starters, that one of the bits is used in some way to express the sign; thus, shifting the binary digits regardless of the negative integers representation can give unexpected results.

For example, commonly in 2's representation the most significant bit is 1 for negative numbers, 0 for positive numbers; applying a bitwise shift (with zeroes inserted to the left) to a negative number would (between other things) make it positive, not resulting in the (usually expected) division by 2^N

So, arithmetic shift is introduced; negative numbers represented in 2's complement have an interesting property: the division by 2^N behavior of the shift is preserved if, instead of inserting zeroes from the left, you insert bits that have the same value of the original sign bit.

In this way, signed divisions by 2^N can be performed with just a bit of extra logic in the shift, without having to resort to a fully-fledged division routine.

Now, is arithmetic shift guaranteed for signed integers? In some languages yes¹, but in C it's not like that - the behavior of the shift operators when dealing with negative integers is left as an implementation-defined detail.

As often happens, this is due to different hardware support for the operation; C is used on vastly different platforms, and, especially in the past, there was quite a difference in the "cost" of operations depending on the platform.

For example, if the processor does not provide an arithmetic right shift instruction, the compiler would be mandated to emit a much slower DIV instruction of some kind, which could be a problem in an inner loop on slower processors. For these reasons, the C standard leaves it up to the implementor to do the most appropriate thing for the current platform.

In your case, your implementation probably chose arithmetic shift because you are running on an x86 processor, that uses 2's complement arithmetic and provides both bitwise and arithmetic shift as single CPU instructions.

Actually, languages like Java even have separated arithmetic and bitwise shift operators - this is mainly due to the fact that they do not have unsigned types to e.g. store bitfields.

negative integer number >> 31 = -1 not 1?

4 Answers4