抓取页面源代码
近做一个分析网站页面功能,需要取得页面源代码进行分析div标签和js
这里先发一下抓取页面源代码方法
using System.Net;
using System.IO;
private string GetPageCodes(string Url)
{
string strHtml = "";
WebRequest web = WebRequest.Create(Url);
//WebResponse hwr = web.GetResponse();
HttpWebResponse hwr = (HttpWebResponse)web.GetResponse();
//采用流读取,并确定编码方式
Stream s = hwr.GetResponseStream();
StreamReader objReader = new StreamReader(s, Encoding.Default);
string strLine = "";
while (strLine != null)
{
strLine = objReader.ReadLine();
if (strLine != null)
{
strHtml += strLine;
}
}
return strHtml;
}
把结果赋值给用户控件
txtContext.Text = GetPageCodes("http://www.pcpop.com/");
抓取效果图如下所示: