### What problem does this PR solve?

Improve the chat stream logic for NvidiaCV.

### Type of change

- [x] Refactoring