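from bs4 import BeautifulSoup

# html_doc is assumed to hold the HTML defined earlier.
# from_encoding only takes effect when html_doc is bytes, not str.
soup = BeautifulSoup(html_doc, 'html.parser', from_encoding='utf-8')
print('Get all links')
links = soup.find_all('a')
for link in links:
    print(link.name, link['href'], link.get_text())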
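For anyone who wants to try the link extraction on its own, here is a minimal self-contained sketch. The sample `html_doc` and the helper name `extract_links` are made up for illustration and are not from the course code. It uses `link.get('href')` instead of `link['href']`, on the assumption that some `<a>` tags may have no `href`, in which case indexing would raise a `KeyError`.

```python
from bs4 import BeautifulSoup

# Hypothetical sample document, for illustration only.
html_doc = """
<html><body>
<a href="http://example.com/elsie" class="sister">Elsie</a>
<a href="http://example.com/lacie" class="sister">Lacie</a>
<a id="anchor-only">No href here</a>
</body></html>
"""


def extract_links(html):
    """Return (tag name, href, text) for every <a> tag; href is None if absent."""
    soup = BeautifulSoup(html, 'html.parser')
    return [(a.name, a.get('href'), a.get_text()) for a in soup.find_all('a')]


for name, href, text in extract_links(html_doc):
    print(name, href, text)
```

Returning a list of tuples rather than printing inside the loop makes the result easier to check or pass on to another step of a crawler.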
2016-04-23
Starting in main and writing out every required method and class in one go felt pretty overwhelming.
I followed along after watching, but got it wrong through my own carelessness.
If you need the code, see here: https://github.com/hisen-yuan/PythonPractice/tree/master/src/baike_spider
2016-04-21