Crawler gets stuck at page 74
2016-05-26
From: Developing a Simple Crawler in Python, 7-7
Answers
Pass a timeout when calling urlopen; otherwise the call can easily hang when the network is slow.
html_downloader.py:
# coding: UTF-8
import urllib2


class HtmlDownloader(object):
    def __init__(self):
        pass

    def download(self, url):
        if url is None:
            return None
        response = urllib2.urlopen(url, timeout=5)

        if response.getcode() != 200:
            print '   --> Can not open url:', url
            return None

        html_doc = response.read()
        # print html_doc

        return html_doc
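For anyone following along on Python 3, where urllib2 no longer exists, here is a rough equivalent of the same idea. This is only a sketch, not the course's code: the standalone `download` function, the `opener` parameter (added so the function can be exercised without a network), and the choice to return None on a timeout are all my own assumptions.

```python
import socket
import urllib.error
import urllib.request


def download(url, timeout=5, opener=urllib.request.urlopen):
    """Return the page body as bytes, or None if the fetch fails.

    `opener` is a hypothetical hook for testing; by default it is
    the real urllib.request.urlopen.
    """
    if url is None:
        return None
    try:
        # Without a timeout, a stalled connection can block indefinitely.
        response = opener(url, timeout=timeout)
        if response.getcode() != 200:
            print('--> Can not open url:', url)
            return None
        return response.read()
    except (urllib.error.URLError, socket.timeout) as exc:
        # Assumption: treat a timeout or network error as "skip this page".
        print('--> Failed to fetch', url, '-', exc)
        return None
```

Returning None on failure lets the crawl loop move on to the next URL instead of stopping at one slow page; whether that is the right policy depends on how the rest of your spider treats a None result.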
2016-05-27