Speed up `PrintBuffer::flush()` by xilec · Pull Request #811 · gyscos/cursive

xilec · 2025-02-23T00:35:41Z

Moved persistent flag to type level. Now compiler knows it value at compile time and optimize it away. It almost have no impact on performance, but it helps to split implementation of flush_to_backend() on 2 cases and may helps to experiment and optimize in the future.
Replaced clone_from_slice() with manual copy actual cell to frozen. First of all, I want to note Cell is not Copy, and can't be copied as 1 slice of memory, therefore I suppose clone_from_slice() unfolded by compiler just in loop of clones from actual cell to frozen. Мoving this clone() to the existing loop like in PR changes usually we skip many call of clone(), because of in common case changing only part of screen, and amortize cloning by parallel execution on cpu instruction level.

By my tests of performance this changes give about 20-30% speed up

For tests I measure time of invocation PrintBuffer::flush() on different examples:

lorem - not modified example, tested just scrolling
lorem (war and peace) - changed test to just English text that fill allmost all screen, tested just scrolling
progress - not modified example, tested just changes from progress changing
progress - changed count of progress bars to 200, tested just changes from progress changing

UPDATE:

Added compare styles by value. It reduces about 3 additional microseconds (6%)

xilec · 2025-04-15T10:05:13Z

@gyscos, can you review this PR?

xilec added 2 commits February 23, 2025 07:05

Speed up PrintBuffer::flush()

7240d19

compare styles by values

dd727ea

Provide feedback