A lot of unicode characters are "problematic" when entered. Here is a proposed list of characters: https://www.tbray.org/ongoing/When/202x/2025/08/14/RFC9839
We should check what happens on each of the platforms if they get simulated and then run automatic tests.