練習(xí)爬蟲,抓取鏈家頁面信息,鏈家頁面是utf-8,print出來后中文亂碼
import requests
from bs4 import BeautifulSoup
url = 'http://nj.lianjia.com/xiaoqu/'
html = requests.get(url)
soup = BeautifulSoup(html.text,'lxml')
title = soup.title.get_text()
print(title)
得到的是“????o??°???o?o???????(????o?é????????)”這玩意,請(qǐng)問如何能正常顯示中文?
python3爬鏈家utf8頁面,中文全部是“å??京å°?å?ºäº?æ??æ?¿(å??京é?¾å®¶ç½?)”
哆啦的時(shí)光機(jī)
2019-02-24 08:11:08