Jsoup入门

1、什么是Jsoup?

jsoup 是一款Java 的HTML解析器,可直接解析某个URL地址、HTML文本内容。它提供了一套非常省力的API,可通过DOM,CSS以及类似于jQuery的操作方法来取出和操作数据。

 

2、pox.xml

 

    <!-- 添加httpclient支持 -->
  	<dependency>
	    <groupId>org.apache.httpcomponents</groupId>
	    <artifactId>httpclient</artifactId>
	    <version>4.5.2</version>
	</dependency>
	
	<!-- 添加Jsoup支持 -->
	<dependency>
	    <groupId>org.jsoup</groupId>
	    <artifactId>jsoup</artifactId>
	    <version>1.10.3</version>
	</dependency>


3、相关代码

 

 

package com.rye;

import java.util.Iterator;

import org.apache.http.HttpEntity;
import org.apache.http.client.methods.CloseableHttpResponse;
import org.apache.http.client.methods.HttpGet;
import org.apache.http.impl.client.CloseableHttpClient;
import org.apache.http.impl.client.HttpClients;
import org.apache.http.util.EntityUtils;
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;
import org.jsoup.select.Elements;

public class Demo4 {

	public static void main(String[] args) throws Exception{
		CloseableHttpClient httpclient = HttpClients.createDefault(); // 创建httpclient实例
        HttpGet httpget = new HttpGet("http://www.cnblogs.com/"); // 创建httpget实例,访问博客园
         
        CloseableHttpResponse response = httpclient.execute(httpget); // 执行get请求
        HttpEntity entity=response.getEntity(); // 获取返回实体
        String content=EntityUtils.toString(entity, "utf-8");//网页内容
        response.close(); // 关闭流和释放系统资源
        
        Document doc = Jsoup.parse(content);//Jsoup解析网页内容为文档
        Elements els = doc.getElementsByTag("title");//通过标签获取内容集合
        Element e = els.get(0);//获取第一个元素
        String text = e.text();//返回元素文本
        String html = e.html();//返回元素html
        System.out.println("text:"+text);
        System.out.println("html:"+html);
        System.out.println("========================");
        
        Element e2 = doc.getElementById("site_nav_top");//获取指定Id的元素
        String text2 = e2.text();//获取文本
        System.out.println(text2);
        System.out.println("========================");
        
        Elements els3 = doc.getElementsByClass("titlelnk");//根据样式查询
        Iterator it3 = els3.iterator();
        while(it3.hasNext()) {
        	Element el3 = (Element) it3.next();
        	String text3 = el3.text();
        	System.out.println(text3);
        }
        System.out.println("========================");
        
        Elements els4 = doc.getElementsByAttributeValue("width","48");
        for(Element e4:els4) {
        	System.out.println(e4.toString());
        }
        System.out.println("========================");
        
        //选择器查找
        Elements els5 = doc.select(".post_item .post_item_body h3 a");
        for(Element e5:els5) {
        	System.out.println("标题:"+e5.text());
        	System.out.println("博客地址:"+e5.attr("href"));//获取元素的属性的值
        }
        System.out.println("========================");
        
        Elements els6 = doc.select("a[href]");//带有href的<a>元素
        for(Element e6:els6) {
        	System.out.println("a[href]:"+e6.toString());
        }
        System.out.println("========================");
        
        Elements els7 = doc.select("img[src$=.png]");//查找img标签,具有src元素,并且以png结尾
        for(Element e7:els7) {
        	System.out.println("img[src$=.png]:"+e7.toString());
        }
        System.out.println("========================");
        
        Element linkElement = doc.select("#friend_link").first();//获取集合的第一个元素
        System.out.println("纯文本:"+linkElement.text());//只要文字去掉里头的HTML所有元素
        System.out.println("HTML:"+linkElement.html());//获取元素下所有的子元素
	}
}

 

 

运行结果:

 

text:博客园 - 开发者的网上家园
html:博客园 - 开发者的网上家园
========================
代码改变世界
========================
创建 overlay 网络 - 每天5分钟玩转 Docker 容器技术(50)
Javascript跨域后台设置拦截
Jenkins发布MVC应用程序
一步一步学习Vue(十一)
关于“工业互联网平台”建设,俺有本要奏
hexo博客MathJax公式渲染问题
使用 Router 实现的模块化,如何优雅的回到主页面
C#多线程之旅(7)——终止线程
Android异步处理技术
Docker Machine
BFC(Block Formatting Context)基础分析
python函数(6):内置函数和匿名函数
Spring-Framework 源码阅读之@Autowired和AutowiredAnnotationBeanPostProcessor
你可能需要为你的APP适配iOS11
Mybatis框架分析
细谈字符串及其格式化表达式
[Android FrameWork 6.0源码学习] View的重绘过程之Layout
Android Binder机制详解:手写IPC通信
Socket 整理以及CocoaAsyncSocket、SRWebSocket源码解析(一)
【机器学习】反向传播算法 BP
========================
<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/775365/20160228114235.png" alt="">
<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/686418/20170722174325.png" alt="">
<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/733213/20161113215823.png" alt="">
<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/418791/20130914174932.png" alt="">
<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/279374/20160822092145.png" alt="">
<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/1102323/20170803222538.png" alt="">
<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/u234895.jpg?id=28112946" alt="">
<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/414640/20160117210953.png" alt="">
<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/803699/20160923095030.png" alt="">
<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/704456/20160908115235.png" alt="">
<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/881297/20160114203735.png" alt="">
<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/1054024/20161031172341.png" alt="">
<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/672564/20150920190853.png" alt="">
<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/992994/20170504141151.png" alt="">
<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/1055208/20170725183852.png" alt="">
<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/1165242/20170630211508.png" alt="">
<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/615210/20140321143008.png" alt="">
<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/739006/20160530150513.png" alt="">
<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/1093385/20170109195643.png" alt="">
========================
标题:创建 overlay 网络 - 每天5分钟玩转 Docker 容器技术(50)
博客地址:http://www.cnblogs.com/CloudMan6/p/7280787.html
标题:Javascript跨域后台设置拦截
博客地址:http://www.cnblogs.com/woshimrf/p/js-cors.html
标题:Jenkins发布MVC应用程序
博客地址:http://www.cnblogs.com/5ishare/p/7282493.html
标题:一步一步学习Vue(十一)
博客地址:http://www.cnblogs.com/Johnzhang/p/7282584.html
标题:关于“工业互联网平台”建设,俺有本要奏
博客地址:http://www.cnblogs.com/lsjwq/p/7282220.html
标题:hexo博客MathJax公式渲染问题
博客地址:http://www.cnblogs.com/Ai-heng/p/7282110.html
标题:使用 Router 实现的模块化,如何优雅的回到主页面
博客地址:http://www.cnblogs.com/plokmju/p/7282113.html
标题:C#多线程之旅(7)——终止线程
博客地址:http://www.cnblogs.com/jackson0714/p/AbortThread.html
标题:Android异步处理技术
博客地址:http://www.cnblogs.com/hustzhb/p/7278113.html
标题:Docker Machine
博客地址:http://www.cnblogs.com/shoufengwei/p/7281873.html
标题:BFC(Block Formatting Context)基础分析
博客地址:http://www.cnblogs.com/asheng2016/p/7281784.html
标题:python函数(6):内置函数和匿名函数
博客地址:http://www.cnblogs.com/liluning/p/7280832.html
标题:Spring-Framework 源码阅读之@Autowired和AutowiredAnnotationBeanPostProcessor
博客地址:http://www.cnblogs.com/liferecord/p/7281655.html
标题:你可能需要为你的APP适配iOS11
博客地址:http://www.cnblogs.com/wetest/p/7281536.html
标题:Mybatis框架分析
博客地址:http://www.cnblogs.com/lizo/p/7281441.html
标题:细谈字符串及其格式化表达式
博客地址:http://www.cnblogs.com/Jeffding/p/7277637.html
标题:[Android FrameWork 6.0源码学习] View的重绘过程之Layout
博客地址:http://www.cnblogs.com/kezhuang/p/7280746.html
标题:Android Binder机制详解:手写IPC通信
博客地址:http://www.cnblogs.com/DoNetCoder/p/7280481.html
标题:Socket 整理以及CocoaAsyncSocket、SRWebSocket源码解析(一)
博客地址:http://www.cnblogs.com/taoxu/p/7064103.html
标题:【机器学习】反向传播算法 BP
博客地址:http://www.cnblogs.com/KID-XiaoYuan/p/7273683.html
========================
a[href]:<a href="https://www.cnblogs.com/" title="开发者的网上家园"><img src="/images/logo_small.gif" alt="博客园Logo" width="142" height="55"></a>
a[href]:<a href="https://home.cnblogs.com/">园子</a>
a[href]:<a href="https://news.cnblogs.com">新闻</a>
a[href]:<a href="https://q.cnblogs.com/">博问</a>
a[href]:<a href="https://ing.cnblogs.com/">闪存</a>
a[href]:<a href="https://group.cnblogs.com/">小组</a>
a[href]:<a href="https://wz.cnblogs.com/">收藏</a>
a[href]:<a href="https://job.cnblogs.com/">招聘</a>
a[href]:<a href="http://kb.cnblogs.com/">知识库</a>
a[href]:<a href="http://zzk.cnblogs.com/">找找看</a>
a[href]:<a href="/" class="current_nav">首页</a>
a[href]:<a href="/pick/" title="编辑精选博文">精华</a>
a[href]:<a href="/candidate/" title="候选区的博文">候选</a>
a[href]:<a href="/news/" title="新闻频道最新新闻">新闻</a>
a[href]:<a href="/following" title="我关注博客的最新博文">关注</a>
a[href]:<a href="/aggsite/mycommented" title="我评论过的博文">我评</a>
a[href]:<a href="/aggsite/mydigged" title="我推荐过的博文">我赞</a>
a[href]:<a id="posts_refresh" href="#" class="refresh" title="刷新博文列表" onclick="aggSite.loadCategoryPostList();return false">刷新</a>
a[href]:<a href="http://feed.cnblogs.com/blog/sitehome/rss"><img src="//common.cnblogs.com/images/icon_rss.gif" alt="点击订阅" style="position:relative;top:2px;" title="订阅博客园文章"></a>
a[href]:<a href="http://www.cnblogs.com/liwanliangblog/p/7275462.html" id="editor_pick_lnk" target="_blank">【编辑推荐】细思极恐:被软连接拯救的数据<span id="editor_pick_count"></span></a>
a[href]:<a href="/aggsite/headline" title="查看更多编辑推荐" class="right_more">»</a>
a[href]:<a href="http://www.cnblogs.com/sxs161028/p/7278219.html" target="_blank" title="阅读1644, 评论39, 推荐28">[最多推荐]css简单实现五角星评分、点赞收藏、展示评分(半颗星、1/3颗星)(39/1644)</a>
a[href]:<a href="/aggsite/topdigged24h" title="查看24小时推荐排行" class="right_more">»</a>
a[href]:<a href="http://www.cnblogs.com/mafly/p/allhttps.html" target="_blank" title="阅读1587, 评论8, 推荐16">[最多评论]全站 HTTPS 没你想象的那么简单(8/1587)</a>
a[href]:<a href="/aggsite/topcommented24h" title="查看24小时评论排行" class="right_more">»</a>
a[href]:<a href="http://news.cnblogs.com/n/575248/" target="_blank" title="阅读803, 评论1, 推荐5">[新闻头条]中国高铁正式进入350公里时代!(1/803)</a>
a[href]:<a href="http://news.cnblogs.com/" title="查看更多新闻" class="right_more">»</a>
a[href]:<a href="http://news.cnblogs.com/n/575224/" target="_blank" title="阅读1286, 评论11, 推荐7">[推荐新闻]Windows 10 Build 16257发布:支持眼球交互操作(11/1286)</a>
a[href]:<a href="http://news.cnblogs.com/n/recommend" title="查看更多推荐新闻" class="right_more">»</a>
a[href]:<a class="titlelnk" href="http://www.cnblogs.com/CloudMan6/p/7280787.html" target="_blank">创建 overlay 网络 - 每天5分钟玩转 Docker 容器技术(50)</a>
a[href]:<a href="http://www.cnblogs.com/CloudMan6/" target="_blank"><img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/775365/20160228114235.png" alt=""></a>
a[href]:<a href="http://www.cnblogs.com/CloudMan6/" class="lightblue">CloudMan</a>
a[href]:<a href="http://www.cnblogs.com/CloudMan6/p/7280787.html#commentform" title="" class="gray"> 评论(0)</a>
a[href]:<a href="http://www.cnblogs.com/CloudMan6/p/7280787.html" class="gray">阅读(12)</a>
a[href]:<a class="titlelnk" href="http://www.cnblogs.com/woshimrf/p/js-cors.html" target="_blank">Javascript跨域后台设置拦截</a>
a[href]:<a href="http://www.cnblogs.com/woshimrf/" target="_blank"><img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/686418/20170722174325.png" alt=""></a>
a[href]:<a href="http://www.cnblogs.com/woshimrf/" class="lightblue">Ryan.Miao</a>
a[href]:<a href="http://www.cnblogs.com/woshimrf/p/js-cors.html#commentform" title="" class="gray"> 评论(0)</a>
a[href]:<a href="http://www.cnblogs.com/woshimrf/p/js-cors.html" class="gray">阅读(37)</a>
a[href]:<a class="titlelnk" href="http://www.cnblogs.com/5ishare/p/7282493.html" target="_blank">Jenkins发布MVC应用程序</a>
a[href]:<a href="http://www.cnblogs.com/5ishare/" target="_blank"><img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/733213/20161113215823.png" alt=""></a>
a[href]:<a href="http://www.cnblogs.com/5ishare/" class="lightblue">社会主义接班人</a>
a[href]:<a href="http://www.cnblogs.com/5ishare/p/7282493.html#commentform" title="" class="gray"> 评论(0)</a>
a[href]:<a href="http://www.cnblogs.com/5ishare/p/7282493.html" class="gray">阅读(58)</a>
a[href]:<a class="titlelnk" href="http://www.cnblogs.com/Johnzhang/p/7282584.html" target="_blank">一步一步学习Vue(十一)</a>
a[href]:<a href="http://www.cnblogs.com/Johnzhang/" target="_blank"><img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/418791/20130914174932.png" alt=""></a>
a[href]:<a href="http://www.cnblogs.com/Johnzhang/" class="lightblue">JerremyZhang</a>
a[href]:<a href="http://www.cnblogs.com/Johnzhang/p/7282584.html#commentform" title="" class="gray"> 评论(0)</a>
a[href]:<a href="http://www.cnblogs.com/Johnzhang/p/7282584.html" class="gray">阅读(66)</a>
a[href]:<a class="titlelnk" href="http://www.cnblogs.com/lsjwq/p/7282220.html" target="_blank">关于“工业互联网平台”建设,俺有本要奏</a>
a[href]:<a href="http://www.cnblogs.com/lsjwq/" target="_blank"><img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/279374/20160822092145.png" alt=""></a>
a[href]:<a href="http://www.cnblogs.com/lsjwq/" class="lightblue">唯笑志在</a>
a[href]:<a href="http://www.cnblogs.com/lsjwq/p/7282220.html#commentform" title="" class="gray"> 评论(0)</a>
a[href]:<a href="http://www.cnblogs.com/lsjwq/p/7282220.html" class="gray">阅读(106)</a>
a[href]:<a class="titlelnk" href="http://www.cnblogs.com/Ai-heng/p/7282110.html" target="_blank">hexo博客MathJax公式渲染问题</a>
a[href]:<a href="http://www.cnblogs.com/Ai-heng/" target="_blank"><img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/1102323/20170803222538.png" alt=""></a>
a[href]:<a href="http://www.cnblogs.com/Ai-heng/" class="lightblue">灬習慣丶悸沫</a>
a[href]:<a href="http://www.cnblogs.com/Ai-heng/p/7282110.html#commentform" title="" class="gray"> 评论(0)</a>
a[href]:<a href="http://www.cnblogs.com/Ai-heng/p/7282110.html" class="gray">阅读(35)</a>
a[href]:<a class="titlelnk" href="http://www.cnblogs.com/plokmju/p/7282113.html" target="_blank">使用 Router 实现的模块化,如何优雅的回到主页面</a>
a[href]:<a href="http://www.cnblogs.com/plokmju/" target="_blank"><img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/u234895.jpg?id=28112946" alt=""></a>
a[href]:<a href="http://www.cnblogs.com/plokmju/" class="lightblue">承香墨影</a>
a[href]:<a href="http://www.cnblogs.com/plokmju/p/7282113.html#commentform" title="" class="gray"> 评论(0)</a>
a[href]:<a href="http://www.cnblogs.com/plokmju/p/7282113.html" class="gray">阅读(64)</a>
a[href]:<a class="titlelnk" href="http://www.cnblogs.com/jackson0714/p/AbortThread.html" target="_blank">C#多线程之旅(7)——终止线程</a>
a[href]:<a href="http://www.cnblogs.com/jackson0714/" target="_blank"><img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/414640/20160117210953.png" alt=""></a>
a[href]:<a href="http://www.cnblogs.com/jackson0714/" class="lightblue">jackson0714</a>
a[href]:<a href="http://www.cnblogs.com/jackson0714/p/AbortThread.html#commentform" title="" class="gray"> 评论(0)</a>
a[href]:<a href="http://www.cnblogs.com/jackson0714/p/AbortThread.html" class="gray">阅读(101)</a>
a[href]:<a class="titlelnk" href="http://www.cnblogs.com/hustzhb/p/7278113.html" target="_blank">Android异步处理技术</a>
a[href]:<a href="http://www.cnblogs.com/hustzhb/" target="_blank"><img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/803699/20160923095030.png" alt=""></a>
a[href]:<a href="http://www.cnblogs.com/hustzhb/" class="lightblue">行歌</a>
a[href]:<a href="http://www.cnblogs.com/hustzhb/p/7278113.html#commentform" title="2017-08-03 21:51" class="gray"> 评论(1)</a>
a[href]:<a href="http://www.cnblogs.com/hustzhb/p/7278113.html" class="gray">阅读(42)</a>
a[href]:<a class="titlelnk" href="http://www.cnblogs.com/shoufengwei/p/7281873.html" target="_blank">Docker Machine</a>
a[href]:<a href="http://www.cnblogs.com/shoufengwei/" target="_blank"><img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/704456/20160908115235.png" alt=""></a>
a[href]:<a href="http://www.cnblogs.com/shoufengwei/" class="lightblue">shoufengwei</a>
a[href]:<a href="http://www.cnblogs.com/shoufengwei/p/7281873.html#commentform" title="" class="gray"> 评论(0)</a>
a[href]:<a href="http://www.cnblogs.com/shoufengwei/p/7281873.html" class="gray">阅读(40)</a>
a[href]:<a class="titlelnk" href="http://www.cnblogs.com/asheng2016/p/7281784.html" target="_blank">BFC(Block Formatting Context)基础分析</a>
a[href]:<a href="http://www.cnblogs.com/asheng2016/" target="_blank"><img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/881297/20160114203735.png" alt=""></a>
a[href]:<a href="http://www.cnblogs.com/asheng2016/" class="lightblue">肆无忌惮sheng</a>
a[href]:<a href="http://www.cnblogs.com/asheng2016/p/7281784.html#commentform" title="" class="gray"> 评论(0)</a>
a[href]:<a href="http://www.cnblogs.com/asheng2016/p/7281784.html" class="gray">阅读(48)</a>
a[href]:<a class="titlelnk" href="http://www.cnblogs.com/liluning/p/7280832.html" target="_blank">python函数(6):内置函数和匿名函数</a>
a[href]:<a href="http://www.cnblogs.com/liluning/" target="_blank"><img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/1054024/20161031172341.png" alt=""></a>
a[href]:<a href="http://www.cnblogs.com/liluning/" class="lightblue">布吉岛丶</a>
a[href]:<a href="http://www.cnblogs.com/liluning/p/7280832.html#commentform" title="" class="gray"> 评论(0)</a>
a[href]:<a href="http://www.cnblogs.com/liluning/p/7280832.html" class="gray">阅读(51)</a>
a[href]:<a class="titlelnk" href="http://www.cnblogs.com/liferecord/p/7281655.html" target="_blank">Spring-Framework 源码阅读之@Autowired和AutowiredAnnotationBeanPostProcessor</a>
a[href]:<a href="http://www.cnblogs.com/liferecord/" target="_blank"><img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/672564/20150920190853.png" alt=""></a>
a[href]:<a href="http://www.cnblogs.com/liferecord/" class="lightblue">vOoT</a>
a[href]:<a href="http://www.cnblogs.com/liferecord/p/7281655.html#commentform" title="" class="gray"> 评论(0)</a>
a[href]:<a href="http://www.cnblogs.com/liferecord/p/7281655.html" class="gray">阅读(56)</a>
a[href]:<a class="titlelnk" href="http://www.cnblogs.com/wetest/p/7281536.html" target="_blank">你可能需要为你的APP适配iOS11</a>
a[href]:<a href="http://www.cnblogs.com/wetest/" target="_blank"><img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/992994/20170504141151.png" alt=""></a>
a[href]:<a href="http://www.cnblogs.com/wetest/" class="lightblue">腾讯WeTest</a>
a[href]:<a href="http://www.cnblogs.com/wetest/p/7281536.html#commentform" title="" class="gray"> 评论(0)</a>
a[href]:<a href="http://www.cnblogs.com/wetest/p/7281536.html" class="gray">阅读(97)</a>
a[href]:<a class="titlelnk" href="http://www.cnblogs.com/lizo/p/7281441.html" target="_blank">Mybatis框架分析</a>
a[href]:<a href="http://www.cnblogs.com/lizo/" target="_blank"><img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/1055208/20170725183852.png" alt=""></a>
a[href]:<a href="http://www.cnblogs.com/lizo/" class="lightblue">atheva</a>
a[href]:<a href="http://www.cnblogs.com/lizo/p/7281441.html#commentform" title="2017-08-03 20:22" class="gray"> 评论(1)</a>
a[href]:<a href="http://www.cnblogs.com/lizo/p/7281441.html" class="gray">阅读(166)</a>
a[href]:<a class="titlelnk" href="http://www.cnblogs.com/Jeffding/p/7277637.html" target="_blank">细谈字符串及其格式化表达式</a>
a[href]:<a href="http://www.cnblogs.com/Jeffding/" target="_blank"><img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/1165242/20170630211508.png" alt=""></a>
a[href]:<a href="http://www.cnblogs.com/Jeffding/" class="lightblue">JeffD</a>
a[href]:<a href="http://www.cnblogs.com/Jeffding/p/7277637.html#commentform" title="" class="gray"> 评论(0)</a>
a[href]:<a href="http://www.cnblogs.com/Jeffding/p/7277637.html" class="gray">阅读(73)</a>
a[href]:<a class="titlelnk" href="http://www.cnblogs.com/kezhuang/p/7280746.html" target="_blank">[Android FrameWork 6.0源码学习] View的重绘过程之Layout</a>
a[href]:<a href="http://www.cnblogs.com/kezhuang/" class="lightblue">柯壮</a>
a[href]:<a href="http://www.cnblogs.com/kezhuang/p/7280746.html#commentform" title="" class="gray"> 评论(0)</a>
a[href]:<a href="http://www.cnblogs.com/kezhuang/p/7280746.html" class="gray">阅读(94)</a>
a[href]:<a class="titlelnk" href="http://www.cnblogs.com/DoNetCoder/p/7280481.html" target="_blank">Android Binder机制详解:手写IPC通信</a>
a[href]:<a href="http://www.cnblogs.com/DoNetCoder/" target="_blank"><img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/615210/20140321143008.png" alt=""></a>
a[href]:<a href="http://www.cnblogs.com/DoNetCoder/" class="lightblue">zhoumy</a>
a[href]:<a href="http://www.cnblogs.com/DoNetCoder/p/7280481.html#commentform" title="2017-08-03 17:03" class="gray"> 评论(1)</a>
a[href]:<a href="http://www.cnblogs.com/DoNetCoder/p/7280481.html" class="gray">阅读(147)</a>
a[href]:<a class="titlelnk" href="http://www.cnblogs.com/taoxu/p/7064103.html" target="_blank">Socket 整理以及CocoaAsyncSocket、SRWebSocket源码解析(一)</a>
a[href]:<a href="http://www.cnblogs.com/taoxu/" target="_blank"><img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/739006/20160530150513.png" alt=""></a>
a[href]:<a href="http://www.cnblogs.com/taoxu/" class="lightblue">Zxiao..旭</a>
a[href]:<a href="http://www.cnblogs.com/taoxu/p/7064103.html#commentform" title="" class="gray"> 评论(0)</a>
a[href]:<a href="http://www.cnblogs.com/taoxu/p/7064103.html" class="gray">阅读(96)</a>
a[href]:<a class="titlelnk" href="http://www.cnblogs.com/KID-XiaoYuan/p/7273683.html" target="_blank">【机器学习】反向传播算法 BP</a>
a[href]:<a href="http://www.cnblogs.com/KID-XiaoYuan/" target="_blank"><img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/1093385/20170109195643.png" alt=""></a>
a[href]:<a href="http://www.cnblogs.com/KID-XiaoYuan/" class="lightblue">KID_XiaoYuan</a>
a[href]:<a href="http://www.cnblogs.com/KID-XiaoYuan/p/7273683.html#commentform" title="2017-08-03 16:22" class="gray"> 评论(2)</a>
a[href]:<a href="http://www.cnblogs.com/KID-XiaoYuan/p/7273683.html" class="gray">阅读(203)</a>
a[href]:<a href="/" class="p_1 current" onclick="aggSite.loadCategoryPostList(1,20);buildPaging(1);return false;">1</a>
a[href]:<a href="/sitehome/p/2" class="p_2 middle" onclick="aggSite.loadCategoryPostList(2,20);buildPaging(2);return false;">2</a>
a[href]:<a href="/sitehome/p/3" class="p_3 middle" onclick="aggSite.loadCategoryPostList(3,20);buildPaging(3);return false;">3</a>
a[href]:<a href="/sitehome/p/4" class="p_4 middle" onclick="aggSite.loadCategoryPostList(4,20);buildPaging(4);return false;">4</a>
a[href]:<a href="/sitehome/p/5" class="p_5 middle" onclick="aggSite.loadCategoryPostList(5,20);buildPaging(5);return false;">5</a>
a[href]:<a href="/sitehome/p/6" class="p_6 middle" onclick="aggSite.loadCategoryPostList(6,20);buildPaging(6);return false;">6</a>
a[href]:<a href="/sitehome/p/7" class="p_7 middle" onclick="aggSite.loadCategoryPostList(7,20);buildPaging(7);return false;">7</a>
a[href]:<a href="/sitehome/p/8" class="p_8 middle" onclick="aggSite.loadCategoryPostList(8,20);buildPaging(8);return false;">8</a>
a[href]:<a href="/sitehome/p/9" class="p_9 middle" onclick="aggSite.loadCategoryPostList(9,20);buildPaging(9);return false;">9</a>
a[href]:<a href="/sitehome/p/10" class="p_10 middle" onclick="aggSite.loadCategoryPostList(10,20);buildPaging(10);return false;">10</a>
a[href]:<a href="/sitehome/p/11" class="p_11 middle" onclick="aggSite.loadCategoryPostList(11,20);buildPaging(11);return false;">11</a>
a[href]:<a href="/sitehome/p/200" class="p_200 last" onclick="aggSite.loadCategoryPostList(200,20);buildPaging(200);return false;">200</a>
a[href]:<a href="/sitehome/p/2" onclick="aggSite.loadCategoryPostList(2,20);buildPaging(2);return false;">Next ></a>
a[href]:<a href="/cate/108698/">.NET技术(3)</a>
a[href]:<a href="/cate/2/">编程语言(1)</a>
a[href]:<a href="/cate/108701/">软件设计(0)</a>
a[href]:<a href="/cate/108703/">Web前端(1)</a>
a[href]:<a href="/cate/108704/">企业信息化(1)</a>
a[href]:<a href="/cate/108705/">手机开发(1)</a>
a[href]:<a href="/cate/108709/">软件工程(0)</a>
a[href]:<a href="/cate/108712/">数据库技术(1)</a>
a[href]:<a href="/cate/108724/">操作系统(0)</a>
a[href]:<a href="/cate/4/">其他分类(0)</a>
a[href]:<a href="/cate/all/">所有随笔(159)</a>
a[href]:<a href="/comment/">所有评论(9)</a>
a[href]:<a href="http://space.cnblogs.com/forum/public" target="_blank">反馈或建议</a>
a[href]:<a href="http://www.cnblogs.com/cmt/" target="_blank">官方博客</a>
a[href]:<a href="/skins.aspx" target="_blank">博客模板</a>
a[href]:<a href="http://www.blogjava.net/">Java博客</a>
a[href]:<a href="http://www.cppblog.com/">C++博客</a>
a[href]:<a href="http://www.cnblogs.com/cmt/archive/2009/09/26/1574616.html">手机版</a>
a[href]:<a href="http://click.aliyun.com/m/994/" target="_blank"><img src="//common.cnblogs.com/images/friend_link/aliyun.png" alt="阿里云"></a>
a[href]:<a href="http://www.hujiang.com" target="_blank">沪江网</a>
a[href]:<a href="http://www.chinaz.com/" target="_blank">站长之家</a>
a[href]:<a href="http://dev.yesky.com" target="_blank">天极网</a>
a[href]:<a href="http://down.admin5.com" target="_blank">A5源码下载</a>
a[href]:<a href="http://www.gcpowertools.com.cn" target="_blank">葡萄城控件</a>
a[href]:<a href="https://www.upyun.com/?md=cnblogs" target="_blank">又拍云存储</a>
a[href]:<a href="http://zt.cnblogs.com/tingyun/" target="_blank">听云APP</a>
a[href]:<a href="http://www.rongcloud.cn/" target="_blank">融云IM云</a>
a[href]:<a href="http://www.hightopo.com/cn-index.html" target="_blank">图扑软件</a>
a[href]:<a href="https://yq.aliyun.com/" target="_blank">阿里云栖社区</a>
a[href]:<a href="http://www.cnblogs.com/mipengine/" target="_blank">百度MIP博客</a>
a[href]:<a href="http://www.easemob.com" target="_blank">环信IM</a>
a[href]:<a href="http://wetest.qq.com/?from=links_cnblogs" target="_blank">腾讯WeTest</a>
a[href]:<a href="http://www.feige.ee?fm=cnblogs" target="_blank">飞鸽传书短信平台</a>
a[href]:<a href="https://www.mtyun.com/" target="_blank">美团云</a>
a[href]:<a href="/AboutUS.aspx">关于博客园</a>
a[href]:<a href="/ContactUs.aspx">联系我们</a>
a[href]:<a href="http://www.cnblogs.com/">博客园</a>
a[href]:<a href="http://www.miitbeian.gov.cn" target="_blank">沪ICP备09004260号</a>
a[href]:<a href="https://ss.knet.cn/verifyseal.dll?sn=e131108110100433392itm000000&ct=df&a=1&pa=0.25787803245785335" rel="nofollow" target="_blank"><img id="cnnic_img" src="//common.cnblogs.com/images/cnnic.png" alt="" width="64" height="23"></a>
a[href]:<a target="_blank" href="http://www.beian.gov.cn/portal/registerSystemInfo?recordcode=31011502001144" style="display:inline-block;text-decoration:none;height:20px;line-height:20px;"><img src="//common.cnblogs.com/images/ghs.png" alt=""><span style="float:left;height:20px;line-height:20px;margin: 0 5px 0 5px; color:#939393;">沪公网安备 31011502001144号</span></a>
========================
img[src$=.png]:<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/775365/20160228114235.png" alt="">
img[src$=.png]:<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/686418/20170722174325.png" alt="">
img[src$=.png]:<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/733213/20161113215823.png" alt="">
img[src$=.png]:<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/418791/20130914174932.png" alt="">
img[src$=.png]:<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/279374/20160822092145.png" alt="">
img[src$=.png]:<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/1102323/20170803222538.png" alt="">
img[src$=.png]:<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/414640/20160117210953.png" alt="">
img[src$=.png]:<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/803699/20160923095030.png" alt="">
img[src$=.png]:<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/704456/20160908115235.png" alt="">
img[src$=.png]:<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/881297/20160114203735.png" alt="">
img[src$=.png]:<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/1054024/20161031172341.png" alt="">
img[src$=.png]:<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/672564/20150920190853.png" alt="">
img[src$=.png]:<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/992994/20170504141151.png" alt="">
img[src$=.png]:<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/1055208/20170725183852.png" alt="">
img[src$=.png]:<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/1165242/20170630211508.png" alt="">
img[src$=.png]:<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/615210/20140321143008.png" alt="">
img[src$=.png]:<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/739006/20160530150513.png" alt="">
img[src$=.png]:<img width="48" height="48" class="pfs" src="//pic.cnblogs.com/face/1093385/20170109195643.png" alt="">
img[src$=.png]:<img src="//common.cnblogs.com/images/friend_link/aliyun.png" alt="阿里云">
img[src$=.png]:<img id="cnnic_img" src="//common.cnblogs.com/images/cnnic.png" alt="" width="64" height="23">
img[src$=.png]:<img src="//common.cnblogs.com/images/ghs.png" alt="">
========================
纯文本:友情链接:沪江网站长之家天极网A5源码下载葡萄城控件又拍云存储听云APP融云IM云图扑软件阿里云栖社区百度MIP博客环信IM腾讯WeTest飞鸽传书短信平台美团云
HTML:友情链接:
<a href="http://click.aliyun.com/m/994/" target="_blank"><img src="//common.cnblogs.com/images/friend_link/aliyun.png" alt="阿里云"></a>
<a href="http://www.hujiang.com" target="_blank">沪江网</a>
<a href="http://www.chinaz.com/" target="_blank">站长之家</a>
<a href="http://dev.yesky.com" target="_blank">天极网</a>
<a href="http://down.admin5.com" target="_blank">A5源码下载</a>
<a href="http://www.gcpowertools.com.cn" target="_blank">葡萄城控件</a>
<a href="https://www.upyun.com/?md=cnblogs" target="_blank">又拍云存储</a>
<a href="http://zt.cnblogs.com/tingyun/" target="_blank">听云APP</a>
<a href="http://www.rongcloud.cn/" target="_blank">融云IM云</a>
<a href="http://www.hightopo.com/cn-index.html" target="_blank">图扑软件</a>
<a href="https://yq.aliyun.com/" target="_blank">阿里云栖社区</a>
<a href="http://www.cnblogs.com/mipengine/" target="_blank">百度MIP博客</a>
<a href="http://www.easemob.com" target="_blank">环信IM</a>
<a href="http://wetest.qq.com/?from=links_cnblogs" target="_blank">腾讯WeTest</a>
<a href="http://www.feige.ee?fm=cnblogs" target="_blank">飞鸽传书短信平台</a>
<a href="https://www.mtyun.com/" target="_blank">美团云</a>

 

 

Reference:

[1] Java1234_小锋, Jsoup简介, http://blog.java1234.com/blog/articles/255.html

[2] Java1234_小锋, Jsoup HelloWorld实现, http://blog.java1234.com/blog/articles/256.html

[3] Java1234_小锋, 使用Jsoup解析出html中的img元素, http://blog.java1234.com/blog/articles/50.html

[4] Java1234_小锋, Jsoup查找DOM元素, http://blog.java1234.com/blog/articles/258.html

[5] Java1234_小锋, Jsoup使用选择器语法查找DOM元素, http://blog.java1234.com/blog/articles/260.html

[6] Java1234_小锋, Jsoup获取DOM元素属性值, http://blog.java1234.com/blog/articles/262.html

 

 

posted @ 2017-07-29 22:42  ryelqy  阅读(367)  评论(0编辑  收藏  举报