#?coding:utf8
class?Html0utputer(object):
????def?__init__(self):
????????self.datas?=?[]
????def?collect_data(self,data):
????????if?data?is?None:
????????????return
????????self.datas.append(data)
????def?output_html(self):
????????fout?=?open('d:\python\output.html',?'w')
????????fout.write('<html>')
????????fout.write("<head><meta?http-equiv='content-type'?content='text/html;charset=utf-8'></head>")
????????fout.write('<body>')
????????fout.write('<table>')
????????#?ascci
????????for?data?in?self.datas:
????????????fout.write('<tr>')
????????????fout.write('<td>%s</td>'?%?data['url'])
????????????fout.write('<td>%s</td>'?%?data['title'].encode(encoding='UTF-8'))
????????????fout.write('<td>%s</td>'?%?data['summary'].encode(encoding='UTF-8'))
????????????fout.write('</tr>')
????????fout.write('</table>')
????????fout.write('</body>')
????????fout.write('</html>')
2016-08-18
data是一個對象,它有三個屬性,分別是{url:xx, title:xx, summary:xx}
datas里面存放很多這樣的data對象,data['xxx']是獲取data元素的xxx屬性
2016-08-10
self.datas = []?
把datas定義成一個數(shù)組
collectdata是把爬到的數(shù)據(jù)依次添加到數(shù)組中
后面當然要循環(huán)按照表格式取出輸出啊,為什么要用表格,當然可以不用表格的形式,你可以直接把數(shù)組轉成字符串,然后文本輸出,這個看個人需要~
2016-08-04
用來存放數(shù)據(jù)