Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • Users
  • Groups
  • Search
  • Get Qt Extensions
  • Unsolved
Collapse
Brand Logo
  1. Home
  2. Qt Development
  3. General and Desktop
  4. QByteArray and char type
Forum Updated to NodeBB v4.3 + New Features

QByteArray and char type

Scheduled Pinned Locked Moved Solved General and Desktop
58 Posts 9 Posters 10.6k Views 4 Watching
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • fcarneyF fcarney

    I might be old, but I don't understand how an 8 bit char is not a byte. The definition of byte is that it is 8 bits. Is there some new definition of byte that somehow excludes char or signed 8 bits?

    S Offline
    S Offline
    stretchthebits
    wrote on last edited by
    #41

    @fcarney
    Yes, a byte = 8 bit
    The problem is, are you going to treat that as unsigned char or signed char. Because, if you are going to be performing mathematical operation on them, the sign matters. if it is just text, it does not matter.

    kshegunovK fcarneyF 2 Replies Last reply
    2
    • JonBJ JonB

      @JKSH
      We'll have to be careful. I realize this discussion will get out of hand, you know more than I do about correct definitions.

      What is your detailed definition of a byte?

      About twice a "nibble" ;-) Also, if I get a mosquito nibble it doesn't hurt so much, but if I get a mosquito byte it really itches.

      In a nutshell, I see for example in Python

      Return a new "bytes" object, which is an immutable sequence of small integers in the range 0 <= x < 256

      Wikipedia:

      The modern de facto standard of eight bits, as documented in ISO/IEC 2382-1:1993, is a convenient power of two permitting the binary-encoded values 0 through 255 for one byte

      Assuming 8-bits to keep it simple, I have always taken "byte" as meaning an unsigned quantity 0--255, as opposed to a signed one, -128--127. That is the nub. It's just that's how I see it used elsewhere.

      Can you provide a concrete example where you'd want to check that a byte is greater than 200 or whatever? (And I mean a byte, not a number, not an ASCII character)

      Nope, nothing practical :) I have an imaginary piece of hardware sending me a stream of byte values. For whatever reason (the joystick is faulty in one direction), I wish to ignore the ones larger than 200. I don't want to worry about casting/sign extension. QByteArray b; if (b.at(0) > 200) ....

      Does unsigned char fit your definition in #1?

      Yep. And I don't have to worry about sign!

      Does std::byte fit your definition in #1?

      It does when I don't look at the content. It's a bit useless when I do want to look at it (as I have to cast all over the place), So all in all it turns out it's a bit like a quantum object :)

      Do you think in common parlance that a "byte" implies to you a value between 0--255 (just assume 8-bit). Perhaps it just as much suggests -128--127 to you?

      KroMignonK Offline
      KroMignonK Offline
      KroMignon
      wrote on last edited by
      #42

      @JonB said in QByteArray and char type:

      Nope, nothing practical :) I have an imaginary piece of hardware sending me a stream of byte values. For whatever reason (the joystick is faulty in one direction), I wish to ignore the ones larger than 200. I don't want to worry about casting/sign extension. QByteArray b; if (b.at(0) > 200) ....

      This is wrong (as QByteArray::at() will return a signed value)

      QByteArray b = <something>;
      if (b.at(0) > 200) ....
      

      This is the right way to do:

      QByteArray b = <something>;
      if (quint8(b.at(0)) > 200) ....
      

      Just my 2 cts,

      It is an old maxim of mine that when you have excluded the impossible, whatever remains, however improbable, must be the truth. (Sherlock Holmes)

      JonBJ 1 Reply Last reply
      0
      • S stretchthebits

        @fcarney
        Yes, a byte = 8 bit
        The problem is, are you going to treat that as unsigned char or signed char. Because, if you are going to be performing mathematical operation on them, the sign matters. if it is just text, it does not matter.

        kshegunovK Offline
        kshegunovK Offline
        kshegunov
        Moderators
        wrote on last edited by
        #43

        @stretchthebits said in QByteArray and char type:

        The problem is, are you going to treat that as unsigned char or signed char. Because, if you are going to be performing mathematical operation on them, the sign matters. if it is just text, it does not matter.

        Or as I'd said:

        Byte doesn't imply a value per se, it's a storage unit.

        Take 4 consecutive bytes in memory, does that imply a value between 2^-32 to 2^32 - 1? Surely not, you can have at least several separate interpretations off the top of my head (packed struct assumed):
        int, unsigned int, struct { short a, b; }, char x[4] and so on. All of this is four bytes and it's the same for the single byte, the interpretation is not tied to actual storage, strictly speaking.

        @KroMignon said in QByteArray and char type:

        This is the right way to do:

        QByteArray b = <something>;
        if (quint8(b.at(0)) > 200) ....
        

        I suggest:

        if (quint8(b.at(0)) > quint8(200))
        

        so you don't get the value promoted to int for no good reason.

        Read and abide by the Qt Code of Conduct

        KroMignonK 1 Reply Last reply
        0
        • kshegunovK kshegunov

          @stretchthebits said in QByteArray and char type:

          The problem is, are you going to treat that as unsigned char or signed char. Because, if you are going to be performing mathematical operation on them, the sign matters. if it is just text, it does not matter.

          Or as I'd said:

          Byte doesn't imply a value per se, it's a storage unit.

          Take 4 consecutive bytes in memory, does that imply a value between 2^-32 to 2^32 - 1? Surely not, you can have at least several separate interpretations off the top of my head (packed struct assumed):
          int, unsigned int, struct { short a, b; }, char x[4] and so on. All of this is four bytes and it's the same for the single byte, the interpretation is not tied to actual storage, strictly speaking.

          @KroMignon said in QByteArray and char type:

          This is the right way to do:

          QByteArray b = <something>;
          if (quint8(b.at(0)) > 200) ....
          

          I suggest:

          if (quint8(b.at(0)) > quint8(200))
          

          so you don't get the value promoted to int for no good reason.

          KroMignonK Offline
          KroMignonK Offline
          KroMignon
          wrote on last edited by
          #44

          @kshegunov said in QByteArray and char type:

          I suggest:
          if (quint8(b.at(0)) > quint8(200))

          so you don't get the value promoted to int for no good reason.

          I don't see a issue with if (quint8(b.at(0)) > 200), but if (b.at(0) > 200) is wrong and will never work.

          It is an old maxim of mine that when you have excluded the impossible, whatever remains, however improbable, must be the truth. (Sherlock Holmes)

          kshegunovK 1 Reply Last reply
          1
          • S stretchthebits

            @fcarney
            Yes, a byte = 8 bit
            The problem is, are you going to treat that as unsigned char or signed char. Because, if you are going to be performing mathematical operation on them, the sign matters. if it is just text, it does not matter.

            fcarneyF Offline
            fcarneyF Offline
            fcarney
            wrote on last edited by
            #45

            @stretchthebits said in QByteArray and char type:

            The problem is, are you going to treat that as unsigned char or signed char.

            I am going to treat it as whatever storage type I need. I will cast it to what is needed for that particular piece of code. Is this discussion about having to cast the pointer? I do casting all the time from base objects to derived types. How is this any different? I am not even promoting the type. Just saying its unsigned char* now. Why is this an issue?

            C++ is a perfectly valid school of magic.

            1 Reply Last reply
            0
            • KroMignonK KroMignon

              @kshegunov said in QByteArray and char type:

              I suggest:
              if (quint8(b.at(0)) > quint8(200))

              so you don't get the value promoted to int for no good reason.

              I don't see a issue with if (quint8(b.at(0)) > 200), but if (b.at(0) > 200) is wrong and will never work.

              kshegunovK Offline
              kshegunovK Offline
              kshegunov
              Moderators
              wrote on last edited by kshegunov
              #46

              @KroMignon said in QByteArray and char type:

              I don't see a issue with if (quint8(b.at(0)) > 200), but if (b.at(0) > 200) is wrong and will never work.

              It will work, of course, and the compiler is smart enough to optimize it out it appears. In C/C++ this return value should've been promoted to int as 200 is an int literal, but I didn't take into account that the ax registers are already integers, so this is going to be pruned when optimizing. Note the finer details here: https://godbolt.org/z/6hb8bv

              Read and abide by the Qt Code of Conduct

              KroMignonK 1 Reply Last reply
              0
              • KroMignonK KroMignon

                @JonB said in QByteArray and char type:

                Nope, nothing practical :) I have an imaginary piece of hardware sending me a stream of byte values. For whatever reason (the joystick is faulty in one direction), I wish to ignore the ones larger than 200. I don't want to worry about casting/sign extension. QByteArray b; if (b.at(0) > 200) ....

                This is wrong (as QByteArray::at() will return a signed value)

                QByteArray b = <something>;
                if (b.at(0) > 200) ....
                

                This is the right way to do:

                QByteArray b = <something>;
                if (quint8(b.at(0)) > 200) ....
                

                Just my 2 cts,

                JonBJ Offline
                JonBJ Offline
                JonB
                wrote on last edited by JonB
                #47

                @KroMignon said in QByteArray and char type:

                This is wrong (as QByteArray::at() will return a signed value)

                I know it doesn't work, that's why I wrote it. This whole thread is (supposed to be) a discussion of why that is the case in something named a QByteArray.

                kshegunovK KroMignonK 2 Replies Last reply
                0
                • JonBJ JonB

                  @KroMignon said in QByteArray and char type:

                  This is wrong (as QByteArray::at() will return a signed value)

                  I know it doesn't work, that's why I wrote it. This whole thread is (supposed to be) a discussion of why that is the case in something named a QByteArray.

                  kshegunovK Offline
                  kshegunovK Offline
                  kshegunov
                  Moderators
                  wrote on last edited by
                  #48

                  I believe it's most convenient in practice. If I were to reimplement it I'd do it with a char as well. At least this way you can have comparisons with character literals without "signed/unsigned comparison" warnings.

                  Read and abide by the Qt Code of Conduct

                  1 Reply Last reply
                  1
                  • fcarneyF fcarney

                    I might be old, but I don't understand how an 8 bit char is not a byte. The definition of byte is that it is 8 bits. Is there some new definition of byte that somehow excludes char or signed 8 bits?

                    jsulmJ Offline
                    jsulmJ Offline
                    jsulm
                    Lifetime Qt Champion
                    wrote on last edited by jsulm
                    #49

                    @fcarney said in QByteArray and char type:

                    The definition of byte is that it is 8 bits

                    No, it's not. A byte is the smallest unit addressable by the CPU.
                    On most architectures it is 8bit, but not on all.
                    https://en.wikipedia.org/wiki/Byte

                    https://forum.qt.io/topic/113070/qt-code-of-conduct

                    fcarneyF 1 Reply Last reply
                    1
                    • JonBJ JonB

                      @KroMignon said in QByteArray and char type:

                      This is wrong (as QByteArray::at() will return a signed value)

                      I know it doesn't work, that's why I wrote it. This whole thread is (supposed to be) a discussion of why that is the case in something named a QByteArray.

                      KroMignonK Offline
                      KroMignonK Offline
                      KroMignon
                      wrote on last edited by KroMignon
                      #50

                      @JonB said in QByteArray and char type:

                      I know it doesn't work, that's why I wrote it. This whole thread is (supposed to be) a discussion of why that is the case in something named a QByteArray.

                      As noticed, it is called QByteArray and not QUnsignedByteArray or QSignedByteArray, so there is nothing in the name which implies signed or unsigned.
                      And I found made return signed octets a natural type, because when you use char, short or long in your code, they are per default signed. You always have to specify unsigned to got unsigned value.
                      It made also sense to me, because QByteArray are design to work in combination with strings, which are signed char

                      @jsulm Your are right Byte, at beginning was not a definition of a data structure, but since decades byte and octet have same meaning in programming world.

                      It is an old maxim of mine that when you have excluded the impossible, whatever remains, however improbable, must be the truth. (Sherlock Holmes)

                      jsulmJ kshegunovK JonBJ 3 Replies Last reply
                      1
                      • KroMignonK KroMignon

                        @JonB said in QByteArray and char type:

                        I know it doesn't work, that's why I wrote it. This whole thread is (supposed to be) a discussion of why that is the case in something named a QByteArray.

                        As noticed, it is called QByteArray and not QUnsignedByteArray or QSignedByteArray, so there is nothing in the name which implies signed or unsigned.
                        And I found made return signed octets a natural type, because when you use char, short or long in your code, they are per default signed. You always have to specify unsigned to got unsigned value.
                        It made also sense to me, because QByteArray are design to work in combination with strings, which are signed char

                        @jsulm Your are right Byte, at beginning was not a definition of a data structure, but since decades byte and octet have same meaning in programming world.

                        jsulmJ Offline
                        jsulmJ Offline
                        jsulm
                        Lifetime Qt Champion
                        wrote on last edited by
                        #51

                        @KroMignon said in QByteArray and char type:

                        but since decades byte and octet have same meaning in programming world

                        Yes, but there is no "official" specification that it has always to be 8bit. It is a "de facto standard".

                        https://forum.qt.io/topic/113070/qt-code-of-conduct

                        KroMignonK 1 Reply Last reply
                        0
                        • jsulmJ jsulm

                          @KroMignon said in QByteArray and char type:

                          but since decades byte and octet have same meaning in programming world

                          Yes, but there is no "official" specification that it has always to be 8bit. It is a "de facto standard".

                          KroMignonK Offline
                          KroMignonK Offline
                          KroMignon
                          wrote on last edited by
                          #52

                          @jsulm said in QByteArray and char type:

                          Yes, but there is no "official" specification that it has always to be 8bit. It is a "de facto standard".

                          Yes, I agree with you, but as often, it is the "de facto standard" with prevail.
                          Wenn you look at many binary protocol specification, in the most case "byte" is used instead of "octet".
                          It is wrong, but it is the reality.

                          It is an old maxim of mine that when you have excluded the impossible, whatever remains, however improbable, must be the truth. (Sherlock Holmes)

                          1 Reply Last reply
                          0
                          • KroMignonK KroMignon

                            @JonB said in QByteArray and char type:

                            I know it doesn't work, that's why I wrote it. This whole thread is (supposed to be) a discussion of why that is the case in something named a QByteArray.

                            As noticed, it is called QByteArray and not QUnsignedByteArray or QSignedByteArray, so there is nothing in the name which implies signed or unsigned.
                            And I found made return signed octets a natural type, because when you use char, short or long in your code, they are per default signed. You always have to specify unsigned to got unsigned value.
                            It made also sense to me, because QByteArray are design to work in combination with strings, which are signed char

                            @jsulm Your are right Byte, at beginning was not a definition of a data structure, but since decades byte and octet have same meaning in programming world.

                            kshegunovK Offline
                            kshegunovK Offline
                            kshegunov
                            Moderators
                            wrote on last edited by
                            #53

                            @KroMignon said in QByteArray and char type:

                            And I found made return signed octets a natural type, because when you use char, short or long in your code, they are per default signed.

                            Note that char may be signed or unsigned, this is implementation defined.

                            Read and abide by the Qt Code of Conduct

                            1 Reply Last reply
                            0
                            • KroMignonK KroMignon

                              @JonB said in QByteArray and char type:

                              I know it doesn't work, that's why I wrote it. This whole thread is (supposed to be) a discussion of why that is the case in something named a QByteArray.

                              As noticed, it is called QByteArray and not QUnsignedByteArray or QSignedByteArray, so there is nothing in the name which implies signed or unsigned.
                              And I found made return signed octets a natural type, because when you use char, short or long in your code, they are per default signed. You always have to specify unsigned to got unsigned value.
                              It made also sense to me, because QByteArray are design to work in combination with strings, which are signed char

                              @jsulm Your are right Byte, at beginning was not a definition of a data structure, but since decades byte and octet have same meaning in programming world.

                              JonBJ Offline
                              JonBJ Offline
                              JonB
                              wrote on last edited by JonB
                              #54

                              @KroMignon said in QByteArray and char type:

                              As noticed, it is called QByteArray and not QUnsignedByteArray or QSignedByteArray, so there is nothing in the name which implies signed or unsigned.

                              That is what this thread is about. I have offered a couple of examples --- I could have sought more --- of what I believe illustrates that in common parlance, and in other programming languages/libraries, the word "byte" does imply unsigned. The examples quoted a range of "0--255" where they might equally well have quoted "-128--127", but in practice they did not.

                              Maybe that's my opinion, or the opinion of some, but not shared by others.

                              At which point we have probably exhausted the debate.

                              1 Reply Last reply
                              0
                              • JonBJ JonB

                                @JKSH
                                We'll have to be careful. I realize this discussion will get out of hand, you know more than I do about correct definitions.

                                What is your detailed definition of a byte?

                                About twice a "nibble" ;-) Also, if I get a mosquito nibble it doesn't hurt so much, but if I get a mosquito byte it really itches.

                                In a nutshell, I see for example in Python

                                Return a new "bytes" object, which is an immutable sequence of small integers in the range 0 <= x < 256

                                Wikipedia:

                                The modern de facto standard of eight bits, as documented in ISO/IEC 2382-1:1993, is a convenient power of two permitting the binary-encoded values 0 through 255 for one byte

                                Assuming 8-bits to keep it simple, I have always taken "byte" as meaning an unsigned quantity 0--255, as opposed to a signed one, -128--127. That is the nub. It's just that's how I see it used elsewhere.

                                Can you provide a concrete example where you'd want to check that a byte is greater than 200 or whatever? (And I mean a byte, not a number, not an ASCII character)

                                Nope, nothing practical :) I have an imaginary piece of hardware sending me a stream of byte values. For whatever reason (the joystick is faulty in one direction), I wish to ignore the ones larger than 200. I don't want to worry about casting/sign extension. QByteArray b; if (b.at(0) > 200) ....

                                Does unsigned char fit your definition in #1?

                                Yep. And I don't have to worry about sign!

                                Does std::byte fit your definition in #1?

                                It does when I don't look at the content. It's a bit useless when I do want to look at it (as I have to cast all over the place), So all in all it turns out it's a bit like a quantum object :)

                                Do you think in common parlance that a "byte" implies to you a value between 0--255 (just assume 8-bit). Perhaps it just as much suggests -128--127 to you?

                                JKSHJ Offline
                                JKSHJ Offline
                                JKSH
                                Moderators
                                wrote on last edited by JKSH
                                #55

                                OK, we'll stick with 1 byte == 8 bits for simplicity

                                @JonB said in QByteArray and char type:

                                In a nutshell, I see for example in Python

                                Return a new "bytes" object, which is an immutable sequence of small integers in the range 0 <= x < 256

                                ...

                                I have always taken "byte" as meaning an unsigned quantity 0--255, as opposed to a signed one, -128--127.  That is the nub.  It's just that's how I see it used elsewhere.

                                ...

                                Do you think in common parlance that a "byte" implies to you a value between 0--255 (just assume 8-bit).  Perhaps it just as much suggests -128--127 to you?

                                We both agree that a byte should not be treated as a signed number -128 -- 127.

                                After this discussion and after some extra reading, I realize now that it's common for a byte to be treated as an unsigned number 0 -- 255.

                                I understand now that your definition of a byte is "an unsigned 8-bit integer". In this light, your original post makes sense: char is not a suitable datatype to store unsigned 8-bit integer, and I agree with you on this point.

                                Personally though, I prefer to think of a byte as an 8-bit blob of data, distinct from an 8-bit number. That's why I have no problem with QByteArray storing chars -- because the signedness of the implementation has no effect on the meaning of the blob. It only affects people who want to implicitly convert the blob into a number (which you do).

                                There is no unanimous consensus, however:

                                Language Byte-ish Datatype What is it?
                                C unsigned char unsigned 8-bit integer
                                C++ unsigned char unsigned 8-bit integer
                                C++ std::byte 8-bit blob
                                C# byte unsigned 8-bit integer
                                Go byte unsigned 8-bit integer
                                Java byte signed 8-bit integer
                                JavaScript (element of an ArrayBuffer) 8-bit blob
                                Python (element of a bytes-like object) unsigned 8-bit integer
                                R (element of a raw vector) 8-bit blob
                                Swift (element of Data) unsigned 8-bit integer
                                Visual Basic Byte unsigned 8-bit integer
                                Web IDL byte signed 8-bit integer
                                Web IDL octet unsigned 8-bit integer

                                (4 languages above don't let you create a singular variable with a byte type; the bytes always come in an array and extracting the byte involves conversion)

                                What is your detailed definition of a byte?

                                About twice a "nibble" ;-)  Also, if I get a mosquito nibble it doesn't hurt so much, but if I get a mosquito byte it really itches.

                                Haha, good one!

                                Qt Doc Search for browsers: forum.qt.io/topic/35616/web-browser-extension-for-improved-doc-searches

                                1 Reply Last reply
                                4
                                • kshegunovK kshegunov

                                  @KroMignon said in QByteArray and char type:

                                  I don't see a issue with if (quint8(b.at(0)) > 200), but if (b.at(0) > 200) is wrong and will never work.

                                  It will work, of course, and the compiler is smart enough to optimize it out it appears. In C/C++ this return value should've been promoted to int as 200 is an int literal, but I didn't take into account that the ax registers are already integers, so this is going to be pruned when optimizing. Note the finer details here: https://godbolt.org/z/6hb8bv

                                  KroMignonK Offline
                                  KroMignonK Offline
                                  KroMignon
                                  wrote on last edited by KroMignon
                                  #56

                                  @kshegunov said in QByteArray and char type:

                                  . In C/C++ this return value should've been promoted to int as 200 is an int literal, but I didn't take into account that the ax registers are already integers, so this is going to be pruned when optimizing.

                                  Again, I don't see any issue here, as value is unsigned, promoting it to int will not propagate sign bit.
                                  Supposing b.at(0) = 0x81, which is 129 in base 10 when unsigned or -127 in base 10 when value is signed.

                                  If promoted to int value (32 bit):

                                  • b.at(0) which is a signed value will becomes 0xFFFFFF81 = -127 in base 10 (or 4294967169 when cast to unsigned)
                                  • quint8(b.at(0)) will becomes 0x00000081 = 129 in base 10

                                  It is an old maxim of mine that when you have excluded the impossible, whatever remains, however improbable, must be the truth. (Sherlock Holmes)

                                  1 Reply Last reply
                                  0
                                  • JKSHJ Offline
                                    JKSHJ Offline
                                    JKSH
                                    Moderators
                                    wrote on last edited by
                                    #57

                                    @jsulm said in QByteArray and char type:

                                    @fcarney said in QByteArray and char type:

                                    The definition of byte is that it is 8 bits

                                    No, it's not. A byte is the smallest unit addressable by the CPU.
                                    On most architectures it is 8bit, but not on all.
                                    https://en.wikipedia.org/wiki/Byte

                                    Agreed.

                                    Desktop/mobile devs will probably only encounter 8-bit bytes. But DSP programmers often deal with 16-bit bytes: https://processors.wiki.ti.com/index.php/Byte_Accesses_with_the_C28x_CPU

                                    Qt Doc Search for browsers: forum.qt.io/topic/35616/web-browser-extension-for-improved-doc-searches

                                    1 Reply Last reply
                                    0
                                    • jsulmJ jsulm

                                      @fcarney said in QByteArray and char type:

                                      The definition of byte is that it is 8 bits

                                      No, it's not. A byte is the smallest unit addressable by the CPU.
                                      On most architectures it is 8bit, but not on all.
                                      https://en.wikipedia.org/wiki/Byte

                                      fcarneyF Offline
                                      fcarneyF Offline
                                      fcarney
                                      wrote on last edited by
                                      #58

                                      @jsulm said in QByteArray and char type:

                                      @fcarney said in QByteArray and char type:

                                      The definition of byte is that it is 8 bits

                                      No, it's not. A byte is the smallest unit addressable by the CPU.
                                      On most architectures it is 8bit, but not on all.
                                      https://en.wikipedia.org/wiki/Byte

                                      THERE ARE FOUR LIGHTS! ;-)

                                      C++ is a perfectly valid school of magic.

                                      1 Reply Last reply
                                      0

                                      • Login

                                      • Login or register to search.
                                      • First post
                                        Last post
                                      0
                                      • Categories
                                      • Recent
                                      • Tags
                                      • Popular
                                      • Users
                                      • Groups
                                      • Search
                                      • Get Qt Extensions
                                      • Unsolved