Welcome to Software Development on Codidact!
Will you help us build our independent community of developers helping developers? We're small and trying to grow. We welcome questions about all aspects of software development, from design to code to QA and more. Got questions? Got answers? Got code you'd like someone to review? Please join us.
Can you have syntax highlighting for streaming text in Python?
Suppose you have a situation where text is coming 1 word at a time, and you want to quickly show it to the user. "Quickly" is not in the sense of a performance constraint, but rather we don't want to wait until the end of the stream to apply highlighting as that would take too long. The rate is somewhat slow and sometimes the stream can be quite long.
The text is markdown but can often include substantial code. It would be important for the code to be highlighted well also. Sometimes the code fences indicate language, sometimes not.
Is this feasible, and how? I am asking specifically for printing text from a Python program.
2 answers
Does bat
work for this, as well as for your other question? I see you have tagged your question as Python, but I'm not sure I understand how it's relevant.
Sources: this issue comment and also that one.
The answer is "it depends". It depends on the language, how much buffering you're willing to allow, how much you're willing to accept some approximate/incorrect syntax highlighting, and whether you'd accept control codes that correct the syntax highlighting after the fact.
If your bar for syntax highlighting was simply that language keywords, operators, and other "punctuation" are highlighted, then almost all programming languages allow this with only a small amount of buffering needed. Some amount of buffering is needed because I can't tell if "i" is part of the keyword "if" until I see the next two characters.
On the other hand, if you wanted to highlight unmatched brackets, then you may not be able to correctly highlight an opening bracket at the beginning of the file until the very end of the file.
This is an issue in most languages for many non-trivial uses of syntax highlighting. Specifically, this is an issue with markdown if you'd like to, for example, use bold text for text wrapped in **s.
As the highlighting of this paragraph indicates, whether the text following ** is made bold depends on whether there is a following ** among other rules. I didn't do any escaping in this case.
Ultimately, the only (non-trivial) solution in general is for the streaming syntax highlighting to make a guess and sometimes be wrong. If allowed, it might be possible to use control codes to edit the previously displayed output when the syntax highlighter has enough context to realize it made a mistake.
bat
, for example, just assumes **
starts a bold span and makes the following text bold even when this is incorrect.
0 comment threads