关于去除数据中特殊字符(非数字字符)的一些方法
最近数据处理时经常发现有数字中含有大量字符,执行过程报“ORA-01722: invalid number”异常,可以有一下几种方法去除:
1、replace 用法简单,写法较复杂,只能处理已知字符
With test_table1 As ( Select 1 seq_num, '2134?654?ag d35' strings From dual Union All Select 2 seq_num, '651 354a g5 dd21' strings From dual ) Select seq_num, Replace(Replace(Replace(Replace(Replace(strings, '?', ''), 'a', ''), 'g', ''), 'd', ''), ' ', '') From test_table1; ---执行结果 1 213465435 2 651354521
2、translate 用法简单,写法简单,只能处理已知字符,字符串、待查找字符,替换字符,均不能为null,否则返回null,字符串按查找顺序替换,若无则去除
With test_table1 As ( Select 1 seq_num, '2134?654?ag d35' strings From dual Union All Select 2 seq_num, '651 354a g5 dd21' strings From dual ) Select seq_num, translate(strings, '1asdfasg ?', '1') From test_table1; ---执行结果 1 213465435 2 651354521
3、regexp_replace 正则表达式 增强型replace 参数多,可根据正则式处理所有字符
With test_table1 As ( Select 1 seq_num, '2134?654?ag d35' strings From dual Union All Select 2 seq_num, '651 354a g5 dd21' strings From dual Union All Select 3 seq_num, '2134654?ag d35' strings From dual Union All Select 4 seq_num, '16?54?aasdgf78as' strings From dual Union All Select 5 seq_num, '16?!@#$%^&*()~:"+_?><|~8as' strings From dual ) Select seq_num, regexp_replace(strings, '[^0-9]', '') From test_table1; ---执行结果 1 213465435 2 651354521 3 213465435 4 165478 5 168