后臺代碼如下:
? string url = "http://baoliao.cq.qq.com/pc/detail.html?id=443758s";??????????? HttpWebRequest request = (HttpWebRequest)WebRequest.Create(url);??????????? request.Accept = "*/*"; //接受任意文件??????????? request.UserAgent = "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.2; .NET CLR 1.1.4322)"; // ??????????? request.AllowAutoRedirect = true;//是否允許302??????????? request.Referer = url; //當前頁面的引用??????????? HttpWebResponse response = (HttpWebResponse)request.GetResponse();??????????? Stream stream = response.GetResponseStream();??????????? StreamReader reader = new StreamReader(stream, Encoding.GetEncoding("utf-8"));??????????? html = reader.ReadToEnd();??????????? stream.Close();??????????? text.Text = html;
?
如題,asp.net 抓取頁面內(nèi)容,如http://baoliao.cq.qq.com/pc/detail.html?id=443758這個網(wǎng)站的內(nèi)容,其他頁面的抓取都沒問題,這個網(wǎng)站好像有點特殊,他只能抓取到頁面的源代碼,但不能抓取到整個HTML,各位大神也可以打開這網(wǎng)站的源碼,也找不到內(nèi)容主體。但HTML有內(nèi)容主體,求解,怎么抓取到內(nèi)容主體。
- 0 回答
- 0 關注
- 427 瀏覽
添加回答
舉報
0/150
提交
取消