課程
                    
                        /后端開發(fā)
                        
                            /Python
                        
                        /Python開發(fā)簡單爬蟲

結(jié)果只輸出了源網(wǎng)址，然后就craw failed

代碼對比的跟老師的一樣了

慕桂英4447524

2018-11-18

源自：Python開發(fā)簡單爬蟲 7-7

關(guān)注問題我要回答

901

操作

收起

3 回答

葬酒為安
2019-03-03

下載器導(dǎo)入改成這個import urllib.request

0 回復(fù) 有任何疑惑可以回復(fù)我~

收起回答

慕七七998
2018-12-06

我和你的錯誤一樣，去掉try塊之后，顯示html_parser中的get_text()有錯誤，

Traceback (most recent call last):
? File "G:\eclipse-workspace(JAVAEE)\Python01\baike_spider\spider_main.py", line 41, in <module>
??? obj_spider.craw(root_url)????? #啟動爬蟲
? File "G:\eclipse-workspace(JAVAEE)\Python01\baike_spider\spider_main.py", line 23, in craw
??? new_urls, new_data =self.parser.parse(new_url,html_cont)???
? File "G:\eclipse-workspace(JAVAEE)\Python01\baike_spider\html_parser.py", line 40, in parse
??? new_data = self._get_new_data(page_url,soup)
? File "G:\eclipse-workspace(JAVAEE)\Python01\baike_spider\html_parser.py", line 27, in _get_new_data
??? res_data['title'] =title_node.get_text()
AttributeError: 'NoneType' object has no attribute 'get_text'