How to implement reading serial data with ANSI color codes and printing out to textbox in color

  • lukutis222
    27 Jul 2022, 04:32

    @JonB Yes, it is possible that this may match something else, but I don't think there is any other way to handle this. At least not one that I can think of.

    If there is a match for "0;33m", it is most likely an ANSI color code and not just a random serial message.

    Can you clarify what you mean by "Make sure you only outputting the match in the parentheses, not the whole thing"?
    That is what I am trying to figure out how to do right now. I am trying to exclude that color code from printing to the terminal.

    UPDATE

    I use the following Regex:

    (?<=0;33m).*?(?=0m)
    

    The expression above is looking for a match between ANSI prefix and postfix.

    07ffdd77-f8d9-4dfe-a328-077fa0b03ae6-image.png

    As you can see from the image above, it matched the string successfully, but it does not remove the unnecessary text. I am not sure how this can be done, since the function that sends the data and the highlightBlock function are completely separate.

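    void Widget::readData()
    {
        // Append everything received on the serial port to the console as-is.
        const QByteArray data = serial->serial_connection.readAll();
        ui->Console->insertPlainText(data);

        // Scroll to the newest output when the checkbox is ticked.
        if (ui->checkBox->isChecked()) {
            QScrollBar *bar = ui->Console->verticalScrollBar();
            bar->setValue(bar->maximum());
        }
    }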
    

    I insert the data into my console regardless of what it is, and then it is up to the highlighter to format it and remove the unnecessary data.

    Is it possible for the regex to completely remove the characters outside of the match? At the moment, I am simply selecting all the characters between 0;33m and 0m. Additionally, I need to not only select the data, but also completely remove the 0;33m and 0m.

    JonB
    wrote on 27 Jul 2022, 08:32 last edited by JonB
    #8

    @lukutis222 said in How to implement reading serial data with ANSI color codes and printing out to textbox in color:

    Can you clarify what do you mean Make sure you only outputting the match in the parentheses, not the whole thing?
    That is what I am trying to figure out how to do right now. I am trying to exclude that color code from printing to the terminal.

    Indeed!

    I haven't looked into exactly what you are doing/what you need to do, but: you have a pattern like abc(.*)def. What I think you are showing is that matches and what you output is the whole match, e.g. abcXYZdef. But what you want, and should be able to do, is only output the part which matched inside the parentheses each time. Am I right that you are not doing that?

    I have not looked at how you accomplish that with globalMatch() and QRegularExpressionMatchIterator. But have a look at match() and QRegularExpressionMatch, see https://doc.qt.io/qt-6/qregularexpressionmatch.html#details. See how that uses QRegularExpressionMatch::captured(1) (not 0!) to pick out the first parentheses. That is what you are wanting to do. There must/ought be some way to achieve that within each global match?

    While I think of it: there may be a better/more efficient way, but if all else fails: you get back a string of one "whole" match in the global iteration. If you now push just that through the same regular expression a second time but this time using match() you should be able to get just the parenthesized segment per the example above.

    P.S.
    Having said that. I see for globalMatch()/QRegularExpressionMatchIterator that https://doc.qt.io/qt-6/qregularexpressionmatchiterator.html#details says:

    Each result is a QRegularExpressionMatch object holding all the information for that result (including captured substrings).

    So I think it should already have accessible information about the sub-parenthesized capture groups. Ah, I think I see. In your code you use only match.capturedStart(), match.capturedLength(). That only accesses the whole match. You should be using match.captured(1) (or some number other than 0/omitted), then you will be good!
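The difference between the whole match and a capture group can be sketched outside Qt. The following is only a minimal illustration: it assumes standard C++ `std::regex` as a stand-in for QRegularExpression (so that it builds without Qt), `capturedGroups` is a hypothetical helper name, and `abc(.*?)def` is a lazy variant of the toy pattern from the post above. Group 0 plays the role of `captured(0)` and group 1 the role of `captured(1)`.

```cpp
#include <cstddef>
#include <regex>
#include <string>
#include <vector>

// Collects, for every match in `input`, the text of capture group `group`.
// Group 0 is the whole match; group 1 is the first parenthesised part.
std::vector<std::string> capturedGroups(const std::string& input,
                                        const std::regex& re,
                                        std::size_t group) {
    std::vector<std::string> out;
    for (std::sregex_iterator it(input.begin(), input.end(), re), end; it != end; ++it) {
        if (group < it->size()) {
            out.push_back((*it)[group].str());
        }
    }
    return out;
}
```

Called with group 0 on `"abcXYZdef abc123def"`, this yields the two whole matches including `abc` and `def`; called with group 1, it yields only the text between them.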

    • SimonSchroeder
      27 Jul 2022, 07:14

      Have a closer look at ANSI Escape codes: https://en.wikipedia.org/wiki/ANSI_escape_code

      Colors are always introduced with an escape character. This character is, in your case, displayed as a left arrow. According to Wikipedia, this character has the value 0x1B. In your original code example you have written this as "[1B]" to match. However, "[1B]" is four characters and not a single character with the value 0x1B (usually unprintable).

      The syntax highlighter will help with the highlighting, but I don't think it will remove any characters from what is displayed. I think, your original approach seems valid. You just have to create a string that starts with 0x1B instead of "[1B]".
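One way to act on this is to remove the escape sequences before the text ever reaches the widget, rather than relying on the highlighter. The sketch below is an illustration under stated assumptions: it uses standard C++ `std::regex` rather than Qt so that it compiles on its own, `stripSgr` is a hypothetical helper name, and it only handles color (SGR) sequences of the form ESC, `[`, digits and semicolons, `m`. It also assumes that a complete sequence arrives in one chunk, which a serial read does not guarantee.

```cpp
#include <regex>
#include <string>

// Removes ANSI SGR sequences (ESC '[' parameters 'm') from `input`.
// "\x1B" is the single escape character 0x1B, not the four characters "[1B]".
std::string stripSgr(const std::string& input) {
    static const std::regex sgr("\x1B\\[[0-9;]*m");
    return std::regex_replace(input, sgr, "");
}
```

Text that merely contains the printable characters `[1B][0;33m` is left untouched, because the pattern requires the real 0x1B byte.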

      lukutis222
      wrote on 28 Jul 2022, 05:09 last edited by
      #9

      @SimonSchroeder
      Thanks for the response. I am aware of the 1B code in front of the message; I just was not sure what to do with it. I do not have control over what messages the connected serial device is sending. This is simply the format in which the ESP32 (microcontroller) writes serial data.
      Are you suggesting that I should incorporate 1B in my regex?
      At the moment I have:

      (?<=0;33m).*?(?=0m)
      
        lukutis222
        wrote on 28 Jul 2022, 05:46 last edited by lukutis222
        #10

        @JonB
        Hello. Thank you for this information. I have been reading and learning about this for a while, but unfortunately I still cannot get this to work.

        I do not understand how match() and match.captured() work. I have put the example string and regex from the Qt docs into an online regex parser:

        QRegularExpression re("(\\d\\d) (?<name>\\w+)");
        QRegularExpressionMatch match = re.match("23 Jordan");
        if (match.hasMatch()) {
            QString number = match.captured(1); // first == "23"
            QString name = match.captured("name"); // name == "Jordan"
        }
        

        30b69bca-3fc3-4a38-9d8a-56d12b45ce75-image.png

        As you can see from the image above, the regex does not seem to match anything. I am now studying match() rather than globalMatch() and trying to find out how it works.

        Just for the sake of experimentation, I have tried the following:

            QRegularExpression regex("(?<=0;33m).*?(?=0m)", QRegularExpression::MultilineOption);
            QRegularExpressionMatch match = regex.match(text);
            if (match.hasMatch()) {
                QString matched_text = match.captured(1); 
                qDebug("matched text = %s \n",matched_text.toStdString().c_str());
                
            }
        
        

        The if condition is triggered so it finds a match but when trying to print it, it is printing "nothing"

        832dbe9a-7af4-49ab-a743-97f7fed19311-image.png

        UPDATE

        I have tried the following:

            QRegularExpression regex("(?<=0;33m).*?(?=0m)", QRegularExpression::MultilineOption);
            QRegularExpressionMatch match = regex.match(text);
            if (match.hasMatch()) {
        
                QString matched_text0 = match.captured(0);
                QString matched_text1 = match.captured(1);
                QString matched_text2 = match.captured(2);
                qDebug("matched text0 = %s \n",matched_text0.toStdString().c_str());
                qDebug("matched text1 = %s \n",matched_text1.toStdString().c_str());
                qDebug("matched text2 = %s \n",matched_text2.toStdString().c_str());
        
            }
        

        I can now successfully print the matched text, which is captured using match.captured(0).

        matched text0 = W (00:03:08.580) THERMOSTAT: CURRENT TIME: Thu Jan  1 00:03:08 1970[ 
        matched text1 =  
        matched text2 =  
        

        Is the above wrong since you suggested not using captured(0)?

        I still cannot fully understand how I can get rid of the unmatched text, since I write this data to my console using the following function:

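        void Widget::readData()
        {
            // Append everything received on the serial port to the console as-is.
            const QByteArray data = serial->serial_connection.readAll();
            ui->Console->insertPlainText(data);

            // Scroll to the newest output when the checkbox is ticked.
            if (ui->checkBox->isChecked()) {
                QScrollBar *bar = ui->Console->verticalScrollBar();
                bar->setValue(bar->maximum());
            }
        }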
        

        So regardless if the highlighter formats the data or not, the data is still there
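Since the widget shows whatever is inserted, one alternative is to split the incoming text into colored segments before insertion and drop the escape sequences at that point. This is a minimal sketch, assuming standard C++ `std::regex` in place of Qt classes; `Segment` and `splitBySgr` are hypothetical names, and mapping an SGR parameter string such as `0;33` to an actual color is left out.

```cpp
#include <cstddef>
#include <regex>
#include <string>
#include <vector>

// A run of plain text together with the SGR parameters in effect for it.
struct Segment {
    std::string sgr;   // e.g. "0;33"; empty means no code has been seen yet
    std::string text;  // text with the escape sequences removed
};

// Splits `input` at every ESC '[' parameters 'm' sequence.
std::vector<Segment> splitBySgr(const std::string& input) {
    static const std::regex sgr("\x1B\\[([0-9;]*)m");
    std::vector<Segment> out;
    std::string current;   // parameters of the most recent sequence
    std::size_t pos = 0;   // start of the text not yet emitted
    for (std::sregex_iterator it(input.begin(), input.end(), sgr), end; it != end; ++it) {
        const std::size_t start = static_cast<std::size_t>(it->position(0));
        if (start > pos) {
            out.push_back({current, input.substr(pos, start - pos)});
        }
        current = (*it)[1].str();
        pos = start + static_cast<std::size_t>(it->length(0));
    }
    if (pos < input.size()) {
        out.push_back({current, input.substr(pos)});
    }
    return out;
}
```

In a Qt application, each segment's text could then be inserted with a format chosen from its `sgr` value, for example via `QTextCursor::insertText(text, format)`, so the codes never appear in the document.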

          SimonSchroeder
          wrote on 28 Jul 2022, 07:21 last edited by
          #11

          @lukutis222 said in How to implement reading serial data with ANSI color codes and printing out to textbox in color:

          Are you suggesting that I should incorporate 1B in my regex?

          If you want to remove that character as well (which I assume you want to), then yes! I am not entirely sure about the full syntax that QRegularExpression supports, but it seems like you could include this character by its octal value as "\033". This would mean that your regular expression should be "(?<=\033[0;33m).*?(?=\033[0m)".

          Here is the explanation for your last successful try: every match has the full match as captured(0) (which might be empty if there is no match). If you want additional captures, you need to use parentheses: "(\w+) (\w+)" would match two words, where captured(0) holds the whole matched string (two words with a space in between), captured(1) holds the first word, and captured(2) holds the second word. However, the question mark in constructs such as (?<=...) and (?=...) tells the regular expression that the part inside the parentheses is not a capture. This is why in your case there is only one captured string. Have a look at captureCount(), which should be 0 (0 means that there is only the implicit group for the whole match).
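The capture-count point can be checked with a small experiment. This sketch assumes standard C++ `std::regex`, whose `mark_count()` plays the role that `captureCount()` plays for QRegularExpression. `std::regex` has no lookbehind, so the patterns here use a non-capturing group `(?:...)` and a lookahead `(?=...)` instead of the exact expression from the thread. `captureGroupCount` is a hypothetical helper name.

```cpp
#include <regex>
#include <string>

// Number of capturing groups in `pattern`, not counting the implicit
// group 0 that always holds the whole match.
unsigned captureGroupCount(const std::string& pattern) {
    return std::regex(pattern).mark_count();
}
```

Plain parentheses add to the count, while `(?:...)` and `(?=...)` do not, which is why a pattern built only from such constructs has nothing to return for group 1.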

            JonB
            wrote on 28 Jul 2022, 08:06 last edited by
            #12

            @lukutis222
            I do not claim to follow the full ins and outs of your long post. But I believe @SimonSchroeder is saying what I would: it is your use of the "advanced" (?... stuff that is probably at issue.

            Please start your testing and grabbing of (...) segments on a much simpler reg ex. Just for example:

            "(\\033[0;33m)(.*)(\\033[0m)"
            

            (I'm not 100% about \\033 vs \033, you may have to play, I believe either/both will work, for different reasons.) That should have captures 1--3. Get that principle working before you move onto a more complex expression!

              lukutis222
              wrote on 28 Jul 2022, 09:40 last edited by
              #13

              @SimonSchroeder

              Thanks for the help. Do you know how I can simulate this in an online regex tester? That would allow me to understand how your suggested expression is intended to work:

              "(?<=\033[0;33m).*?(?=\033[0m)"
              

              If I put it in my code:

                  QRegularExpression regex("(?<=\033[0;33m).*?(?=\033[0m)", QRegularExpression::MultilineOption);
                  QRegularExpressionMatchIterator i = regex.globalMatch(text);
              
                  while (i.hasNext())
                  {
                    QRegularExpressionMatch match = i.next();
                    setFormat(match.capturedStart(), match.capturedLength(), myClassFormat1);
                  }
              

              The error message is returned:

              QRegularExpressionPrivate::doMatch(): called on an invalid QRegularExpression object
              

              Trying this on https://regex101.com/ or https://www.regextester.com/ does not seem to work. Is that because regex testers online do not accept octal representations?
              String under testing:

              [1B][0;33mW (00:02:12.590) THERMOSTAT: CURRENT TIME: Thu Jan  1 00:02:12 1970[1B][0m
              

              Regex used:

              (?<=\033[0;33m).*?(?=\033[0m)
              

              Result(no match):
              0bd304cd-9348-46e5-99b9-bfa1d0584434-image.png
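A likely cause of the "invalid QRegularExpression object" message is the unescaped `[` after the escape character: in a regular expression a bare `[` opens a character class, and here it is never closed. The sketch below illustrates the same failure with standard C++ `std::regex` (an assumption made so that the example builds without Qt; `compiles` is a hypothetical helper). In Qt, `QRegularExpression::isValid()` and `errorString()` report this kind of problem.

```cpp
#include <regex>
#include <string>

// Returns true when `pattern` is accepted by the regex engine.
bool compiles(const std::string& pattern) {
    try {
        const std::regex re(pattern);
        static_cast<void>(re);  // only construction matters here
        return true;
    } catch (const std::regex_error&) {
        return false;
    }
}
```

With this helper, a pattern containing ESC followed by a bare `[0;33m` is rejected, while the same pattern with the bracket escaped as `\[` is accepted. In a C++ string literal that escape is written as `\\[`.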

                JonB
                wrote on 28 Jul 2022, 09:46 last edited by JonB
                #14

                @lukutis222
                It has red-underlined (?<=. Did you hover that to see if it says anything? You are using Javascript reg exes, have you looked to see whether that supports this construct? regex101 (or whatever online) reg exps won't be identical to Qt's, that's also why it offers different "flavors" of reg ex to try.

                I previously suggested you try a simpler example which does not use that construct. Why bother with it all when you can just use a plain (...) anyway, and write your code accordingly? I think you are making your first attempt harder than it need be. Up to you....

                  lukutis222
                  wrote on 28 Jul 2022, 09:59 last edited by lukutis222
                  #15

                  @JonB When I decided to parse ANSI color codes, I really did not expect this to be such a complex topic.

                  As you have suggested, I started experimenting with different regex expressions. The one that I am currently working with is the following:

                  (0;33m)(.*)(0m)
                  

                  My code:

                      QRegularExpression regex("(0;33m)(.*)(0m)", QRegularExpression::MultilineOption);
                      QRegularExpressionMatch match = regex.match(text);
                      if (match.hasMatch()) {
                  
                          QString matched_text0 = match.captured(0);
                          QString matched_text1 = match.captured(1);
                          QString matched_text2 = match.captured(2);
                          qDebug("matched text0 = %s \n",matched_text0.toStdString().c_str());
                          qDebug("matched text1 = %s \n",matched_text1.toStdString().c_str());
                          qDebug("matched text2 = %s \n",matched_text2.toStdString().c_str());
                      }
                  
                  

                  Is now returning:

                  matched text0 = 0;33mW (00:30:02.590) THERMOSTAT: CURRENT TIME: Thu Jan  1 00:30:02 1970[0m 
                  matched text1 = 0;33m 
                  matched text2 = W (00:30:02.590) THERMOSTAT: CURRENT TIME: Thu Jan  1 00:30:02 1970[ 
                  

                  Which I think is getting closer to what I want to achieve. matched text2 is the text that I want to highlight.

                  I am still not sure how I can now ensure that the unmatched data does not get displayed on my console.

                  940b1207-4eab-4454-bf86-f1acec086f51-image.png

                  UPDATE

                  Do you know how I can format only the specific match that I choose instead of the whole string? When I was using the global match, I used capturedStart(), but I don't think I can use that now.

                  For example, I want to format matched_text2 only. I tried the following:

                      QRegularExpression regex("(0;33m)(.*)(0m)", QRegularExpression::MultilineOption);
                      QRegularExpressionMatch match = regex.match(text);
                      if (match.hasMatch()) {
                  
                          QString matched_text0 = match.captured(0);
                          QString matched_text1 = match.captured(1);
                          QString matched_text2 = match.captured(2);
                          qDebug("matched text0 = %s \n",matched_text0.toStdString().c_str());
                          qDebug("matched text1 = %s \n",matched_text1.toStdString().c_str());
                          qDebug("matched text2 = %s \n",matched_text2.toStdString().c_str());
                          setFormat(match.capturedStart(matched_text2), match.capturedLength(matched_text2), myClassFormat1);
                      }
                  

                  However, the format is not applied to the matched text
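`capturedStart()` and `capturedLength()` take either a group number or the name of a named group, so passing the captured text itself is treated as a group name that does not exist. Passing the group index (2 here) is what selects the second pair of parentheses. The sketch below shows the same idea with standard C++ `std::regex` as a stand-in for Qt; `Span` and `groupSpan` are hypothetical names.

```cpp
#include <cstddef>
#include <regex>
#include <string>

// Offset and length of one capture group inside the searched line.
struct Span {
    std::ptrdiff_t start;   // -1 when the group did not take part in a match
    std::ptrdiff_t length;
};

// Finds the first match of `re` in `line` and reports where group `group` lies.
Span groupSpan(const std::string& line, const std::regex& re, std::size_t group) {
    std::smatch m;
    if (!std::regex_search(line, m, re) || group >= m.size() || !m[group].matched) {
        return {-1, 0};
    }
    return {m.position(group), m.length(group)};
}
```

With QRegularExpressionMatch the corresponding calls would be `match.capturedStart(2)` and `match.capturedLength(2)`, whose results can be passed to `setFormat()`.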

                    JonB
                    wrote on 28 Jul 2022, 14:21 last edited by JonB
                    #16

                    @lukutis222

                    match.capturedStart(matched_text2), match.capturedLength(matched_text2)

                    I believe you are doing totally the wrong thing here. If you want, as you do, the text matched by the second (...), the whole thing is given in match.captured(2) (your matched_text2), and that is all you should be looking at.

                      lukutis222
                      wrote on 29 Jul 2022, 04:19 last edited by lukutis222
                      #17

                      @JonB

                      I really appreciate the help, but I don't think we are on the same page, and I am not sure you fully understand what I am trying to achieve. I want to give a simpler example; perhaps you can help me understand how the highlighter is meant to remove the unwanted text, because I am still not convinced this is possible. Imagine I have the text "12345HELLO12345" that I want to send to the console.

                          const QByteArray data  = "12345HELLO12345";
                          ui->Console->insertPlainText(data);
                          syntaxhighlighter = new SyntaxHighlighter(ui->Console->document());
                      

                      Since I have attached the syntax highlighter to my console, the highlightBlock function is called as soon as the data is sent.

                      Now 2 things need to happen inside the highlightBlock function:

                      1. Discard the "12345" before and after the word "HELLO"
                      2. Format the text and make it green color.

                      void SyntaxHighlighter::highlightBlock(const QString &text)
                      {
                          QTextCharFormat myClassFormat1;
                          myClassFormat1.setFontWeight(QFont::Bold);
                          myClassFormat1.setForeground(QColorConstants::Svg::darkorange);

                          qDebug(" text inside highlightblock = %s \n", text.toStdString().c_str());

                          QRegularExpression regex("(12345)(.*)(12345)", QRegularExpression::MultilineOption);
                          QRegularExpressionMatch match = regex.match(text);
                          if (match.hasMatch()) {
                              QString matched_text0 = match.captured(0);
                              QString matched_text1 = match.captured(1);
                              QString matched_text2 = match.captured(2);
                              qDebug("matched text0 = %s \n", matched_text0.toStdString().c_str());
                              qDebug("matched text1 = %s \n", matched_text1.toStdString().c_str());
                              qDebug("matched text2 = %s \n", matched_text2.toStdString().c_str());
                          }
                      }

                      The Qt logs:

                       text inside highlightblock = 12345HELLO12345 
                      matched text0 = 12345HELLO12345 
                      matched text1 = 12345 
                      matched text2 = HELLO 
                      

                      As you can see, I have successfully matched the text that I want in matched text2. How can I ensure the "12345" is discarded from the console, and how can I format the text? My console shows:

                      16f48fff-7da3-4e41-bf7d-ccca96e1589d-image.png

                      I only want to see the text "HELLO" in green in the console.

                      I hope this very simple example is clear and shows what I am trying to achieve.

                        SimonSchroeder
                        wrote on 29 Jul 2022, 07:17 last edited by
                        #18

                        @lukutis222 said in How to implement reading serial data with ANSI color codes and printing out to textbox in color:

                        Now 2 things need to happen inside the highlightBlock function:

                        1. Discard the "12345" before and after the word "HELLO"
                        2. Format the text and make it green color.

                        QSyntaxHighlighter has one job, and that is highlighting. As I said before:

                        The syntax highlighter will help with the highlighting, but I don't think it will remove any characters from what is displayed. I think your original approach seems valid.

                        The best suggestion I have is to drop the syntax highlighter, because you first have to remove the unwanted part of the string before adding it to your ui->Console. But then the syntax highlighter has no information left for highlighting. (The syntax highlighter highlights syntax; ANSI escape codes are more like markup and cannot be handled by a syntax highlighter.)

                        I suggest that you first clean up the string (maybe replace the ANSI escape codes with HTML) and add it to the text edit with the right format. Using HTML would let you simply replace the escape codes and then use insertHtml() instead of insertPlainText(). The regular expression will still help to capture the right parts of the string and the escape codes.
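A minimal sketch of the "replace the escape codes with HTML" idea, written in plain standard C++ so it compiles without Qt. The function names and the color table are hypothetical. Only the two codes named in this thread (0;32 and 0;33) are mapped, and every sequence is assumed to have the form ESC `[` ... `m`. Other kinds of escape sequence are not handled.

```cpp
#include <cstddef>
#include <string>

// Escape the characters that have a special meaning in HTML.
std::string htmlEscape(const std::string &text)
{
    std::string out;
    for (char c : text) {
        switch (c) {
        case '&': out += "&amp;"; break;
        case '<': out += "&lt;"; break;
        case '>': out += "&gt;"; break;
        default:  out += c; break;
        }
    }
    return out;
}

// Hypothetical color table (assumption: only the codes from this thread).
const char *cssColorFor(const std::string &params)
{
    if (params == "0;32") return "green";
    if (params == "0;33") return "orange";
    return nullptr; // reset ("0") or a code this sketch does not know
}

// Turn "ESC [ params m" sequences into <span> tags and escape the rest.
std::string ansiToHtml(const std::string &input)
{
    std::string out;
    bool spanOpen = false;
    std::size_t pos = 0;
    while (pos < input.size()) {
        const std::size_t esc = input.find("\x1B[", pos);
        if (esc == std::string::npos) {
            out += htmlEscape(input.substr(pos));
            break;
        }
        out += htmlEscape(input.substr(pos, esc - pos));
        const std::size_t end = input.find('m', esc + 2);
        if (end == std::string::npos)
            break; // incomplete sequence at the end: this sketch drops it
        const std::string params = input.substr(esc + 2, end - esc - 2);
        if (spanOpen) {
            out += "</span>"; // any new sequence ends the previous color
            spanOpen = false;
        }
        if (const char *color = cssColorFor(params)) {
            out += "<span style=\"color:";
            out += color;
            out += "\">";
            spanOpen = true;
        }
        pos = end + 1;
    }
    if (spanOpen)
        out += "</span>";
    return out;
}
```

In a Qt program the result could presumably be handed to `QTextEdit::insertHtml()` via `QString::fromStdString()`. Line breaks would likely need converting to `<br>` first, since HTML collapses plain newlines.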

                          JonB
                          wrote on 29 Jul 2022, 07:22 last edited by JonB
                          #19

                          @lukutis222 said in How to implement reading serial data with ANSI color codes and printing out to textbox in color:

                          I don't think we are on the same page, and I am not sure you fully understand what I am trying to achieve

                          :) I think/hope we are!

                          "(12345)(.*)(12345)"

                          You keep talking about removing the segment(s) you do not want. In this case, remove groups 1 & 3, to leave group 2.

                          I am turning that round. I am saying why can't you implement this by simply preserving what you do want? In this case, don't worry about "removing" anything, instead only output group 2. That is what we normally do with regular expression matches. In order to end up with HELLO why do you want to worry about how to remove the 12345 which comes before and/or after when all you want to end up with is the HELLO?

                          Certainly that is what I would do from the regular expression above. Isn't QString matched_text2 = match.captured(2); all you want to end up outputting from this input?

                          BTW, just in case we are not on the same page about one thing: you are aware that you cannot actually remove anything once output has gone to the console, aren't you? You have to not-send-output in the first place if you don't want something to appear.

                          I'll also say one further thing. Although it is useful to get these regular expressions working it's clearly taking some time for you to get what you want. If all you want to do is the one reg exp you have shown so far, and you will not be expanding that to more complex/varied additional ones, you could have written this in ten minutes by abandoning reg exps and just using a loop to go through the characters in the string omitting/removing/outputting. Recognising \033[ etc. for your couple of cases without reg exps is trivially easy.
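For what it's worth, the "just loop over the characters" approach could look something like the sketch below. It is plain standard C++, not Qt, and `stripAnsiLoop` is a made-up name. It assumes every sequence to drop starts with ESC `[` and ends at the next `m`, which covers the two sequences in this thread but not other escape sequences.

```cpp
#include <cstddef>
#include <string>

// Copy `input` to the result, skipping every "ESC [ ... m" sequence.
std::string stripAnsiLoop(const std::string &input)
{
    std::string out;
    std::size_t i = 0;
    while (i < input.size()) {
        const bool startsSequence = input[i] == '\x1B'
                                    && i + 1 < input.size()
                                    && input[i + 1] == '[';
        if (startsSequence) {
            i += 2; // skip ESC and '['
            while (i < input.size() && input[i] != 'm')
                ++i; // skip the parameters, e.g. "0;33"
            if (i < input.size())
                ++i; // skip the terminating 'm'
        } else {
            out += input[i++]; // ordinary character: keep it
        }
    }
    return out;
}
```

The same loop is also a natural place to remember which color code was seen last, if the text is to be colored rather than only cleaned.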

                            lukutis222
                            wrote on 29 Jul 2022, 11:22 last edited by lukutis222
                            #20

                            @JonB , @SimonSchroeder

                            Thanks for confirming. When I say I want to remove the unwanted part, I literally mean that I want to prevent it from being printed to the console. I was hoping that regex had some hidden feature to discard the unwanted text or replace it with NULL characters or something like that. I kept asking how I can discard the unwanted text, but maybe you misunderstood what I meant by discarding. Since you have both confirmed that it is not possible with my current approach, I must choose a different approach. I can use the highlighter to highlight the text, but not to replace or discard characters.

                            @SimonSchroeder suggested replacing ANSI escape codes with HTML, although I have no clue what that means at this point. I will research that and see if it makes sense to me. It is still unclear to me how the data will be sent to the console. I assume the HTML will still show up as some strange symbols on the console?

                              JonB
                              wrote on 29 Jul 2022, 11:48 last edited by
                              #21

                              @lukutis222
                              I'm still a little lost in your question! :)

                              You receive some data from a serial port, right? And you write code to output that to the console (e.g. your ui->Console->insertPlainText(data);), right? So you are choosing what to output. You do not have to copy everything you receive, and then later try to "remove" some bits of it, right? In any case, once something has gone to the console you can't get it back/change it.

                              Consequently just outputting what you do want to output from the string you showed, rather than trying to delete what you don't want, seems simpler.

                              I will leave you with a couple of thoughts:

                              • The methods of QRegularExpression allow you to pick out captures. They do not offer to remove something from the input string.

                              • QString does have methods to remove bits of a string and return a new one. QString &QString::replace(const QRegularExpression &re, const QString &after) can be used to find occurrences of a reg exp and replace the matching part with something (including an empty string to remove it completely). And it can deal with captures too if you want that. Perhaps you would have been more comfortable starting from QString rather than from QRegularExpression?

                              • Really, to remove ANSI escape sequences you shouldn't bother recognising that start-color-then-text-then-end-color pattern, like "(\\033\\[0;33m)(.*)(\\033\\[0m)". All you should do is recognise each ANSI escape sequence and remove it, without worrying about pairing it with a closing sequence or what comes in the middle. Now, the two escape sequences you are currently interested in both match e.g.

                              \\033\\[[^m]*m

                              i.e. escape, open-square, sequence of non-m characters, m character terminator. So you could just use that to remove all matching ANSI sequences you have; it covers both your begin and end ones. Then you don't have to worry about capture groups and stuff between a start and end marker.
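As an illustration of removing every sequence with one replace, here is a small sketch using `std::regex_replace` from the standard library in place of `QString::replace(const QRegularExpression &, const QString &)`. `stripAnsiRegex` is a hypothetical name. The ESC byte is written directly into the pattern because `std::regex` does not accept the octal `\033` form.

```cpp
#include <regex>
#include <string>

// Remove every "ESC [ ... m" sequence from `input`.
std::string stripAnsiRegex(const std::string &input)
{
    // A literal ESC byte, an escaped '[', any run of non-'m' characters,
    // then the terminating 'm'.
    static const std::regex ansiSequence("\x1B\\[[^m]*m");
    return std::regex_replace(input, ansiSequence, "");
}
```

Because the pattern matches the start sequence and the reset sequence alike, no capture groups are needed. In Qt the equivalent would be a `replace()` call on the `QString` with an empty replacement string.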

                                lukutis222
                                wrote on 29 Jul 2022, 12:03 last edited by lukutis222
                                #22

                                @JonB

                                Yes, I receive a bunch of data (a lot of data, actually) from a serial port. This data will come in 3 different colors (as far as I know).

                                If the text has the 0;33m ANSI prefix, I must display it in orange.
                                If the text has the 0;32m ANSI prefix, I must display it in green.
                                If the text has the 0;34m ANSI prefix, I must display it in red.

                                I sort of understand what you mean, but one thing I am not sure about:

                                I can do some operation with QString before I send text to the console, as you have suggested. I can even remove the ANSI color code and replace it with an empty string. However, how is my highlighter supposed to know how to format the text (apply different colors), since I no longer have the ANSI code to match against?

                                Are you suggesting that I can color different messages without using the highlighter? Some pseudo code:

                                void Widget::readData()
                                {
                                    const QByteArray data = serial->serial_connection.readAll();

                                    // 1. Detect ANSI color code and replace it with empty characters

                                    // 2. Is it possible to apply different colors here instead of the highlighter?

                                    ui->Console->insertPlainText(data);
                                }
                                
                                  JonB
                                  wrote on 29 Jul 2022, 12:09 last edited by
                                  #23

                                  @lukutis222
                                  I am getting a little lost in your questions, and also a touch exhausted :)

                                  You seem to start with the ANSI escape codes, and you have shown output in the appropriate color from that. Then you want to remove those and replace them with something which shows the color, but you already had the color.... ???

                                  I don't know what class ui->Console is, so I don't know what you can do with it.

                                  I don't know what you are or are not doing with a QSyntaxHighlighter.

                                  I may be reaching my time limit on this topic... :)

                                    lukutis222
                                    wrote on 29 Jul 2022, 12:22 last edited by lukutis222
                                    #24

                                    @JonB
                                    What I am trying to do is very simple. I am not sure if I am just very bad at explaining, but I have shown it with the 12345HELLO12345 example.

                                    I have a string "12345HELLO12345" and I must do 2 things:

                                    1. DISCARD "12345"
                                    2. Color the remaining text in green color. So the output to the console should be "HELLO" in green color.

                                    For my "real world" example, replace the "12345" with the ANSI color codes. The end user does not need to see the ANSI color codes in the console but he must see the text in different colors. The remote device might send me a string:

                                     \033[0;33m HELLO FROM REMOTE DEVICE \033[0m

                                    I need to do 2 things:

                                    1. Replace the [0;33m and [0m with an empty string
                                    2. Color the remaining text ("HELLO FROM REMOTE DEVICE") orange.

                                    The remote device can send a different string:

                                     \033[0;32m THIS MESSAGE MUST BE GREEN \033[0m

                                    I need to do 2 things:

                                    1. Replace the [0;32m and [0m with an empty string
                                    2. Color the remaining text ("THIS MESSAGE MUST BE GREEN") green.

                                    I was initially trying to do it all with the syntax highlighter because I thought you said it is possible to discard (COMPLETELY REMOVE) the unwanted text. But I have just found out that this is not possible, so my initial approach was not correct.

                                    Another approach that you have suggested is to discard the unwanted data before it is sent to the console, which makes sense. However, the syntax highlighter will then have no way to tell whether a particular text must be displayed in green, yellow or red, since the ANSI color code has already been discarded before the data is sent to the console.
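One way around that problem, sketched below under stated assumptions, is to skip the highlighter and split the incoming text into runs before it reaches the console, each run carrying the color code that was in effect for it. The sketch is plain standard C++ so it compiles without Qt. `ColoredRun` and `splitColoredRuns` are hypothetical names. It assumes every sequence has the form ESC `[` ... `m`, and that `0` means "back to the default color".

```cpp
#include <cstddef>
#include <string>
#include <vector>

// One piece of text plus the color parameters in effect for it.
// An empty `params` means the default color. Hypothetical type.
struct ColoredRun {
    std::string params;
    std::string text;
};

// Split `input` at every "ESC [ params m" sequence. The sequences are
// dropped from the text and remembered as the color of the following run.
std::vector<ColoredRun> splitColoredRuns(const std::string &input)
{
    std::vector<ColoredRun> runs;
    std::string current; // parameters currently in effect
    std::size_t pos = 0;
    while (pos < input.size()) {
        const std::size_t esc = input.find("\x1B[", pos);
        const std::size_t textEnd =
            (esc == std::string::npos) ? input.size() : esc;
        if (textEnd > pos)
            runs.push_back(ColoredRun{current, input.substr(pos, textEnd - pos)});
        if (esc == std::string::npos)
            break;
        const std::size_t end = input.find('m', esc + 2);
        if (end == std::string::npos)
            break; // incomplete sequence at the end: ignored in this sketch
        current = input.substr(esc + 2, end - esc - 2);
        if (current == "0")
            current.clear(); // "ESC[0m" resets to the default color
        pos = end + 1;
    }
    return runs;
}
```

Each run could then be written to the text edit with its own format, for example through `QTextCursor::insertText(text, format)` with a `QTextCharFormat` whose foreground is chosen from `params`. Data that arrives split in the middle of a sequence would need buffering, which this sketch does not attempt.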

                                    1 Reply Last reply
                                    0
                                    • J JonB
                                      29 Jul 2022, 12:09

                                      @lukutis222
                                      I am getting a little lost in your questions, and also a touch exhausted :)

                                      You seem to start with the ANSI escape codes, and you have shown output in the appropriate color from that. Then you want to remove those and replace them with something which shows the color, but you already had the color.... ???

                                      I don't know what class ui->Console is so I don't know what you can do with it.

                                      I don't know what you are or are not doing with a QSyntaxHighlighter.

                                      I may be reaching my time limit on this topic... :)

                                      J Offline
                                      JonB
                                      wrote on 29 Jul 2022, 12:28 last edited by
                                      #25

                                      @JonB said in How to implement reading serial data with ANSI color codes and printing out to textbox in color:

                                      I don't know what class ui->Console is so I don't know what you can do with it.

                                      ?

                                      L 1 Reply Last reply 31 Jul 2022, 10:48
                                      1

                                        L Offline
                                        lukutis222
                                        wrote on 31 Jul 2022, 10:48 last edited by
                                        #26

                                        @JonB Sorry for the late response. ui->Console is a QTextEdit.

                                        mrjjM 1 Reply Last reply 31 Jul 2022, 17:35
                                        0

                                          mrjjM Offline
                                          mrjj
                                          Lifetime Qt Champion
                                          wrote on 31 Jul 2022, 17:35 last edited by mrjj
                                          #27

                                          @lukutis222
                                          Hi
                                          If you can already find and replace the ANSI codes, then using a QTextEdit should be straightforward.
                                          You simply note which color each code stands for, remove the code, and then insert the "clean" text with that color set.

                                              auto output = ui->textEdit;
                                              output->setTextColor(Qt::red); // predefined color
                                              output->insertPlainText("hello");
                                              output->setTextColor(QColor(0,255,0)); // 'custom' color
                                              output->insertPlainText("device");
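
                                          The "note the code, remove it, insert the clean text" step can be sketched without any Qt types, so it is easy to test on its own. This is only a rough sketch under assumptions: `ColoredRun` and `splitAnsiRuns` are made-up names, only `ESC [ ... m` sequences are handled, only foreground codes 30-37 and the reset code 0 are interpreted, and a sequence that is cut in half between two serial reads is not reassembled (the caller would have to buffer incomplete data).

```cpp
#include <cctype>
#include <cstddef>
#include <string>
#include <vector>

// One piece of clean text plus the SGR foreground code that was active
// for it (0 = default/reset). Hypothetical helper type, not part of Qt.
struct ColoredRun {
    int sgrColor;
    std::string text;
};

// Splits input such as "\033[0;33mHELLO\033[0m" into runs with the escape
// sequences removed. Assumption: codes 30-37 set the color, 0 (or an
// empty parameter) resets it, every other parameter is ignored.
std::vector<ColoredRun> splitAnsiRuns(const std::string &input)
{
    std::vector<ColoredRun> runs;
    int current = 0;      // color code currently in effect
    std::string pending;  // clean text collected since the last code

    auto flush = [&]() {
        if (!pending.empty()) {
            runs.push_back({current, pending});
            pending.clear();
        }
    };

    std::size_t i = 0;
    while (i < input.size()) {
        if (input[i] == '\033' && i + 1 < input.size() && input[i + 1] == '[') {
            // Scan the parameter list: digits separated by ';'.
            std::size_t j = i + 2;
            while (j < input.size()
                   && (std::isdigit(static_cast<unsigned char>(input[j]))
                       || input[j] == ';')) {
                ++j;
            }
            if (j < input.size() && input[j] == 'm') {
                flush();  // text before this code keeps the old color
                int value = 0;
                bool hasDigits = false;
                for (std::size_t k = i + 2; k <= j; ++k) {
                    if (k < j && input[k] != ';') {
                        value = value * 10 + (input[k] - '0');
                        hasDigits = true;
                    } else {
                        // End of one parameter (at ';' or at the final 'm').
                        const int param = hasDigits ? value : 0;
                        if (param == 0)
                            current = 0;
                        else if (param >= 30 && param <= 37)
                            current = param;
                        value = 0;
                        hasDigits = false;
                    }
                }
                i = j + 1;  // skip the whole escape sequence
                continue;
            }
        }
        pending += input[i];  // ordinary character (or an unrecognised ESC)
        ++i;
    }
    flush();
    return runs;
}
```

                                          In readData() you could then loop over the runs, translate `sgrColor` into a QColor of your choice (for example 33 to a yellow/orange and 32 to green), call setTextColor() with it and insertPlainText() with the run's text, in the same way as the snippet above.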
                                          
                                          


                                          1 Reply Last reply
                                          2
