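Accepted answer / seU
You must have skipped the two earlier 15-minute lessons on the re module, because they explain this clearly. match() is a function in the re module: it tries to match a string against a pattern, starting at the beginning of the string, and returns a match object (or None if there is no match). group() is a method of that match object and returns the text that was matched.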
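To make the match()/group() relationship concrete, here is a minimal sketch. The helper name first_word and the sample strings are made up for illustration; only re.match() and the match object's group() method come from the standard library.

```python
import re

def first_word(text):
    """Return the leading run of word characters, or None if text does not start with one."""
    m = re.match(r"\w+", text)  # match() only tries at the start of the string
    return m.group() if m else None  # group() with no argument is the whole match

# Parenthesised groups can be read back by number; group(0) is the whole match.
date = re.match(r"(\d{4})-(\d{2})", "2016-09-01")
year, month = date.group(1), date.group(2)
```

Because match() returns None on failure, calling group() without checking first raises AttributeError, which is a common stumbling block.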
2016-09-01
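Excellent lectures. After finishing them I have a good understanding of how regular expressions are used and a first idea of how a crawler works. Many thanks to the teacher!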
2016-08-27
In [104]: i = 0
In [105]: for url in listurl:
     ...:     req = urllib.request.urlopen(url)
     ...:     buf = req.read()
     ...:     # save each image as 0.jpg, 1.jpg, ...; "with" closes the file
     ...:     with open(str(i) + '.jpg', 'wb') as f:
     ...:         f.write(buf)
     ...:     i += 1
     ...:
Thanks, teacher! I just wrote the first crawler of my life!!!
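For anyone who wants the same loop as a reusable function, here is one possible sketch. The names fetch_bytes and save_images, the directory parameter, and the 10-second timeout are my own assumptions, not something from the course; it also assumes every entry in the list is already a complete URL.

```python
import os
import urllib.request

def fetch_bytes(url, timeout=10):
    """Download one URL and return the raw response body as bytes."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return resp.read()

def save_images(urls, directory=".", fetch=fetch_bytes):
    """Save each URL's content as 0.jpg, 1.jpg, ... and return the file paths."""
    paths = []
    for i, url in enumerate(urls):  # enumerate replaces the manual i += 1 counter
        path = os.path.join(directory, f"{i}.jpg")
        with open(path, "wb") as f:  # the file is closed even if the write fails
            f.write(fetch(url))
        paths.append(path)
    return paths
```

Passing the download step in as the fetch argument is only there so the function can be tried without a network connection, for example save_images(urls, fetch=lambda u: b"fake").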
2016-08-14
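For Python 3.x, enter it like this:
In [1]: import re
In [2]: import urllib.request
In [3]: req = urllib.request.urlopen('http://idcbgp.cn/course/list')
In [4]: buf = req.read()  # bytes in Python 3
In [5]: buf = buf.decode('utf-8')  # convert to str so a str pattern can be used
In [6]: listurl = re.findall(r'src=.+\.jpg', buf)  # note: each match still starts with src=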
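One thing to watch with the pattern above: the matches include the leading src= text, and .+ is greedy, so two images on the same line come back as one long match. A possible tighter variant is sketched below. It assumes the page writes its attributes with double quotes, and the names IMG_SRC and extract_jpg_urls plus the sample HTML are invented for the example.

```python
import re

# Capture only the part between the quotes; [^"]+ cannot run past the closing quote.
IMG_SRC = re.compile(r'src="([^"]+\.jpg)"')

def extract_jpg_urls(html):
    """Return the .jpg values of double-quoted src attributes found in html."""
    return IMG_SRC.findall(html)  # with one group, findall returns the group text

sample = '<img src="http://example.com/a.jpg"><img src="http://example.com/b.jpg">'
greedy = re.findall(r'src=.+\.jpg', sample)  # the original pattern, for comparison
clean = extract_jpg_urls(sample)
```

On this sample the original pattern yields a single match spanning both tags, while the capture-group version yields the two URLs separately, ready to pass to urlopen().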
2016-08-14
For Python 3.x, please enter it like this:
import urllib.request
req = urllib.request.urlopen('http://idcbgp.cn/course/list')
buf = req.read()
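In Python 3, read() gives back bytes rather than str, so the page usually needs decoding before it is searched with a regular expression. A small sketch of one way to wrap that up follows; decode_body and fetch_text are hypothetical helper names, and falling back to UTF-8 when the server names no charset is my assumption.

```python
import urllib.request

def decode_body(raw, charset=None):
    """Decode response bytes, falling back to UTF-8 when no charset is given."""
    return raw.decode(charset or "utf-8", errors="replace")

def fetch_text(url, timeout=10):
    """Fetch a page and return it as str, using the charset from the headers if present."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        # get_content_charset() returns None when the Content-Type has no charset
        return decode_body(resp.read(), resp.headers.get_content_charset())
```

With this, buf = fetch_text('http://idcbgp.cn/course/list') would replace the separate read() and decode() steps.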
2016-08-14