How to fix UTF encoding for whitespaces?

How to fix UTF encoding for whitespaces?

194 160 is the UTF-8 encoding of a NO-BREAK SPACE codepoint (the same codepoint that HTML calls  ).

So it's really not a space, even though it looks like one. (You'll see it won't word-wrap, for instance.) A regular expression match for \s would match it, but a plain comparison with a space won't.

To simply replace NO-BREAK spaces you can do the following:

src = src.Replace('\u00A0', ' ');

 

奇怪的字符,看起来像是空格,实际上又不是空格

BTW-TVA / BE 0437 971 826 RPR Brussel

 

作者:Chuck Lu    GitHub    
posted @   ChuckLu  阅读(8)  评论(0编辑  收藏  举报
相关博文:
阅读排行:
· 全程不用写代码,我用AI程序员写了一个飞机大战
· DeepSeek 开源周回顾「GitHub 热点速览」
· MongoDB 8.0这个新功能碉堡了,比商业数据库还牛
· 记一次.NET内存居高不下排查解决与启示
· 白话解读 Dapr 1.15:你的「微服务管家」又秀新绝活了
历史上的今天:
2019-04-18 226. Invert Binary Tree
2019-04-18 BinaryTree
2018-04-18 Unable to update auto-refresh reference 'microsoft.codedom.providers.dotnetcompilerplatform.dll'.
2016-04-18 通过代码或者配置文件 对log4net进行配置
点击右上角即可分享
微信分享提示