您的位置:首页 > 编程语言 > Python开发

python删除所有的中文字符、非ASCII或非英文字符,检查字符串是否包含非ASCII

2017-03-25 21:34 771 查看
Your
''.join()
expression is filtering, removing anything non-ASCII; you could use a conditional expression instead:

return ''.join([i if ord(i) < 128 else ' ' for i in text])

This handles characters one by one and would still use one space per character replaced.

Your regular expression should just replace consecutive non-ASCII characters with a space:

re.sub(r'[^\x00-\x7F]+',' ', text)


re.sub(r'[^\x00-\x7f]', ' ', str)


Note the
+
there.

检查字符串是否包含非英文ASCII等:

a = "ds  dl,;sd!@)~`09历史s"
regexp = re.compile(r'[^\x00-\x7f]')
if regexp.search(a):
print('matched')
内容来自用户分享和网络整理,不保证内容的准确性,如有侵权内容,可联系管理员处理 点击这里给我发消息
标签: 
相关文章推荐