您的位置:首页 > 编程语言 > Java开发

Java 正则表达式基础

2012-07-03 22:05 471 查看
正则表达式是一种用于模式匹配或替换的规范。编程时经常遇到字符串查找,匹配,或者屏蔽特殊字符等等,如果采用正则表达式,则可以起到事半功倍的效果。

一、正则表达式基本语法。

1、.----句点符号

句点符号可以匹配任意单个字符,如a-z, 0-9,!,@,#,$等特殊字符,即通常意义上所说的通配符。

Pattern pattern = Pattern.compile( "leng" );

Matcher matcher = pattern.matcher( "len." );

boolean ret = matcher.matches( );

System.out.println(ret); //输出为true

2、[]----方括号

.句点符号会匹配所有的字符,匹配范围太广。[]方括号可以限定匹配范围,即只能匹配[]中限定的字符。如[ade]、 [a-z]。

Pattern pattern = Pattern.compile( "l[abc]ng" );

Matcher matcher = pattern.matcher( "leng" );

boolean ret = matcher.matches( );

System.out.println(ret); //输出为false

3、|---或符号

|符号进行的是或运算,即匹配|符号连接的字符。如(a|bc|de)

Pattern pattern = Pattern.compile( "l(a|bc|de)ng" );

Matcher matcher = pattern.matcher( "ldeng" );

boolean ret = matcher.matches( );

System.out.println(ret); //输出为true

4、匹配次数符号

*----表示匹配0次或多次

+----表示匹配1次或多次

?----表示匹配0次或1次

{n}----表示匹配n次(n为非负整数)

{m,n}----表示匹配最少m次,最多n次

Pattern pattern = Pattern.compile( "leng*" );

Matcher matcher = pattern.matcher( "len" );

boolean ret = matcher.matches( );

System.out.println(ret); //输出为true

Pattern pattern = Pattern.compile( "leng+" );

Matcher matcher = pattern.matcher( "len" );

boolean ret = matcher.matches( );

System.out.println(ret); //输出为false

Pattern pattern = Pattern.compile( "leng?" );

Matcher matcher = pattern.matcher( "len" );

boolean ret = matcher.matches( );

System.out.println(ret); //输出为true

Pattern pattern = Pattern.compile( "leng{1}" );

Matcher matcher = pattern.matcher( "leng" );

boolean ret = matcher.matches( );

System.out.println(ret); //输出为true

Pattern pattern = Pattern.compile( "leng{1,3}" );

Matcher matcher = pattern.matcher( "leng" );

boolean ret = matcher.matches( );

System.out.println(ret); //输出为true

5、^符号

1) 如果^符号用在[]方括号内部,则表示否定,即不匹配方括号内限定的字符,如[^abc]。

Pattern pattern = Pattern.compile( "[^b]*" );

Matcher matcher = pattern.matcher( "leng" );

boolean ret = matcher.matches( );

System.out.println(ret); //输出为true

2)如果^符号用在[]方括号外部,则表示起始,即匹配以^符号引导的开始字符,如^leng.

Pattern pattern = Pattern.compile( "^eng" );

Matcher matcher = pattern.matcher( "leng" );

boolean ret = matcher.matches( ); //输出为false

6、$符号

$符号用来匹配以$符号引导的结束字符,如eng$

Pattern pattern = Pattern.compile( ".*ang$" );

Matcher matcher = pattern.matcher( "lengxuegang" );

boolean ret = matcher.matches( );

System.out.println(ret); //输出为true

7、Java中常用的特殊符号

\d----表示数字:[0-9]

\D----表示非数字:[^0-9]

\w----表示字符:[a-zA-Z0-9]

\W----表示非字符:[^a-zA-Z0-9]

\s----表示空白字符:[\t\n\r\f]

\S----表示非空白字符:[^\t\n\r\f]

二、示例

1、字符串匹配。匹配以age结尾的字符串

Pattern pattern = Pattern.compile( ".*age$" );

Matcher matcher = pattern.matcher( "Java program language" );

boolean ret = matcher.matches( );

System.out.println(ret); //输出为true

2、字符串分割。将字符串以某个或某几个字符进行拆分

Pattern pattern = Pattern.compile( "[.!?]" );

String[] array = pattern.split( "Java program language. I like it!" );

for ( int i = 0; i < array.length; i++ ) {

System.out.println( array[i].trim( ) );

}

输出为:

Java program language

I like it

3、字符串替换

1)首次替换

Pattern pattern = Pattern.compile( "Java" );

Matcher matcher = pattern.matcher( "Java program language. I like Java programming");

System.out.println(matcher.replaceFirst( "C++" ));

输出为:

C++ program language. I like Java programming

2)全部替换

Pattern pattern = Pattern.compile( "Java" );

Matcher matcher = pattern.matcher( "Java program language. I like Java programming" );

System.out.println(matcher.replaceAll( "C++" ));

输出为:

C++ program language. I like C++ programming
内容来自用户分享和网络整理,不保证内容的准确性,如有侵权内容,可联系管理员处理 点击这里给我发消息
标签: