Java Read Html
2016-07-28 14:37
453 查看
Need Jar:
jsoup-1.8.1.jar
jsoup-1.8.1.jar
阅读更多
public static void main(String[] args) {
String content="";
try {
content = executeGet("qq.com","t.qq.com","pgv_pvi=2633264128; RK=AWkaEwm4WM; ptcz=b94a87c80a0a85ceec47cd63566d582d7110bb329e378f2ef066185e9b957333; pt2gguin=o0002442254; ts_refer=url.cn/sorry; wbilang_10000=zh_TW; mb_reg_quick=1; wb_regf=%3B0%3B%3Bapi.t.qq.com%3B0; pgv_info=ssid=s1646546122; ts_last=t.qq.com/snow13000521; pgv_pvid=6340134792; o_cookie=2442254; ts_uid=9369757478");
} catch (IOException e1) {
// TODO Auto-generated catch block
e1.printStackTrace();
}
org.jsoup.nodes.Document doc = (org.jsoup.nodes.Document) Jsoup.parse(content);
org.jsoup.nodes.Element element = doc.getElementById("mainWrapper");
org.jsoup.nodes.Document doc2 = (org.jsoup.nodes.Document) Jsoup.parse(element.getElementsByClass("avatar").toString());
Elements elements = doc2.select("a[href]");
String qqUrl = "";
for(org.jsoup.nodes.Element ele : elements){
qqUrl = ele.attr("href");
}
//To get region
try {
content = executeGet("qq.com","t.qq.com","pgv_pvi=2633264128; RK=AWkaEwm4WM; ptcz=b94a87c80a0a85ceec47cd63566d582d7110bb329e378f2ef066185e9b957333; pt2gguin=o0002442254; ts_refer=url.cn/sorry; wbilang_10000=zh_TW; mb_reg_quick=1; wb_regf=%3B0%3B%3Bapi.t.qq.com%3B0; pgv_info=ssid=s1646546122; ts_last=t.qq.com/snow13000521; pgv_pvid=6340134792; o_cookie=2442254; ts_uid=9369757478");
} catch (IOException e) {
// TODO Auto-generated catch block
e.printStackTrace();
}
//System.out.println(content);
if(content.length()>0){
org.jsoup.nodes.Document docAuthor = (org.jsoup.nodes.Document) Jsoup.parse(content);
Elements elementAuthors = docAuthor.getElementsByClass("ico_location");
if(elementAuthors.size()>0){
org.jsoup.nodes.Element elementAuthor = elementAuthors.get(0).nextElementSibling();
System.err.println("==QQ Region==== "+elementAuthor.text());
}
}
}
相关文章推荐
- java InputStream的三个read的区别 (参考原文:http://www.cnblogs.com/pengyingh/articles/2507207.html)
- 关于java中BufferedReader的read()及readLine()方法的使用心得
- Java 将Word文档转换Html
- Java Lucene(8):解析html页面
- Units Problem: How to read text size as custom attr from xml and set it to TextView in java code
- java 去掉html标签
- 【Java】【IO】FileInputStream read 简介
- In Java, how do I read/convert an InputStream to a String? - Stack Overflow
- http://www.blogjava.net/qileilove/archive/2012/05/10/377756.html
- 数据库连接丢失,重连 Cause: java.sql.SQLException: Could not retrieve transation read-only status server
- android中java与js通信(可以用html来做页面,进行交互)
- android 中本地java代码与html交互总结
- 【Java TCP/IP Socket】TCP Socket通信中由read返回值造成的的死锁问题(含代码)
- Java 批量删除html中注释内容的方法
- jsoup 1.6.2发布 最棒的Java HTML解析器
- 关于java中BufferedReader的read()及readLine()方法的使用心得
- javaScript JSP HTML Java CSS 注释
- java用正则去除html标签
- 《HTMLCSS设计与构建网站》(中文版)》pdf附网盘下载链接+(附一个菜鸟的java学习之路)
- 《HTMLCSS设计与构建网站》(中文版)》pdf附网盘下载链接+(附一个菜鸟的java学习之路)