学了一阵子Python,拿来做个什么有意思的东西呢?爬糗百好了。爬到的内容,邮件分发出去。

然后又啃了两天的wxpython,做了个简易的邮件管理界面,能够在这里添加或者删除邮件,而且一键爬虫发送。

最后,索性封装成APP吧。又试了一把py2app。简单好用。

因为是让用户自行加入和删除邮箱。所以程序一定要兼顾到各种情况:比方输入的邮箱格式不合法。输入的邮箱里包括中文字符,分隔符不正确,删除了所有邮箱然后又要发邮件等问题。

首先是QiuBai.py:爬虫,正则匹配我们想要的内容。然后将内容稍作处理返回。

#!/usr/bin/env python
# -*- coding: utf-8 -*-
import re
import urllib2
__author__ = 'Sophie2805' class FetchData(object):
def __init__(self,URL,referURL,pattern_1,pattern_2):
self.URL = URL
self.pattern_1 = pattern_1
self.patttern_2 = pattern_2
self.referURL = referURL def getHtml(self):
headers = {
'User-Agent':'Mozilla/5.0 (Windows; U; Windows NT 6.1; en-US; rv:1.9.1.6) Gecko/20091201 Firefox/3.5.6',
'Referer':self.referURL #http://www.qiushibaike.com/
}
req = urllib2.Request(
url = self.URL, #http://www.qiushibaike.com/text
headers = headers)
return urllib2.urlopen(req).read() def getData(self,html):
p_1 = re.compile(self.pattern_1)
div_content = p_1.findall(html.decode('utf8'))
row = 0
for m in range(len(div_content)):
div_content[m] = re.sub(self.patttern_2,'',div_content[m])
div_content[m] = ("---" + str(row+1) + "---" + div_content[m]).encode('utf-8')
row += 1
return div_content

接下来就是mail.py。注意。发送人的邮箱要检查下是否打开了SMTP协议

#!/usr/bin/env python
# -*- coding: utf-8 -*-
import smtplib
from email.mime.text import MIMEText
from email.header import Header __author__ = 'Sophie2805' class SMail:
def __init__(self):
self.sender = '***@qq.com'
self.receiver = ['***@163.com','***@qq.com']
self.subject = '糗百热门20条推送'
self.username = '***@qq.com'
self.password = '******' def send_mail(self,msg):
try:
self.msg = MIMEText(msg,'plain','utf-8')
self.msg['Subject'] = Header(self.subject,'utf-8')
self.msg['To'] = ','.join(self.receiver)
self.msg['From'] = self.sender
self.smtp = smtplib.SMTP()
self.smtp.connect('smtp.qq.com')
self.smtp.login(self.username,self.password)
self.smtp.sendmail(self.msg['From'],self.receiver,self.msg.as_string())
self.smtp.quit()
return '1'
except Exception, e:
return str(e)

最后,用wxpython做了个简易的收件人管理界面,Spider_QiuBai.py,实现:添加邮件、删除邮件、一键爬虫发送糗百热门20条到邮件列表。

当然,为了程序正确执行,会有各种验证,比方输入的邮箱格式是否合法,发送时检查receiver列表是否为空等

watermark/2/text/aHR0cDovL2Jsb2cuY3Nkbi5uZXQvc29waGllMjgwNQ==/font/5a6L5L2T/fontsize/400/fill/I0JBQkFCMA==/dissolve/70/gravity/Center" width="450" height="630" alt="">

程序比較简单,代码例如以下:

#!/usr/bin/env python
# -*- coding: utf-8 -*-
__author__ = 'Sophie2805' import wx
import wx.grid
from mail import *
from QiuBai import * class Frame(wx.Frame):
QiuBai = None
content = None
m = SMail()
def __init__(self, parent=None, title='Get the Hottest QiuBai!',size=(450, 630),
style=wx.DEFAULT_FRAME_STYLE ^ (wx.RESIZE_BORDER |wx.MAXIMIZE_BOX)):
wx.Frame.__init__(self, parent=parent, title=title, size=size,style = style)
self.Centre()
panel = wx.Panel(self,-1) self.text_Email = wx.StaticText(panel,1, "You want it too? Add your email below. Using ';' between multiple emails. Existing emails will not be added again.")
self.input_Email = wx.TextCtrl(panel,2)
self.button_Email = wx.Button(panel,3,label='+') '''Bind with function'''
self.button_Email.Bind(wx.EVT_BUTTON,self.add) self.text_DeleteEmail = wx.StaticText(panel,4, "Existing receivers are listed below. You can delete by double clicking them and clicking on '-' button. Please note that it can hold 20 emails at most one time.") '''grid'''
self.grid = wx.grid.Grid(panel,5)
self.grid.CreateGrid(20,1)
self.grid.EnableEditing(False)
self.grid.SetColSize(0,200)
self.grid.SetColLabelValue(0,'Emails')
for i in range(len(self.m.receiver)):
self.grid.SetCellBackgroundColour(i,0,"grey")
self.grid.SetCellValue(i,0,self.m.receiver[i]) self.grid.Bind(wx.grid.EVT_GRID_CELL_LEFT_DCLICK,self.grid_change_color) self.button_DeleteEmail = wx.Button(panel,6,label='-')
self.button_Send = wx.Button(panel,7,label='send') '''Bind with function'''
self.button_Send.Bind(wx.EVT_BUTTON,self.sendmail)
self.button_DeleteEmail.Bind(wx.EVT_BUTTON,self.d_emails) '''BoxSizer'''
'''================================================================'''
add_Email = wx.BoxSizer() #input box and add button
add_Email.Add(self.input_Email,proportion=4,flag = wx.LEFT, border = 5)
add_Email.Add(self.button_Email,proportion=1,flag=wx.LEFT,border=15) ae = wx.BoxSizer(wx.VERTICAL)
ae.Add(self.text_Email,proportion =4, flag=wx.ALL, border=5)
ae.Add(add_Email,proportion=2, flag=wx.LEFT|wx.BOTTOM|wx.RIGHT, border=5) h_b = wx.BoxSizer()#send button and delete button
h_b.Add(self.button_Send,flag=wx.LEFT,border=5)
h_b.Add(self.button_DeleteEmail,flag=wx.LEFT,border=250) de = wx.BoxSizer(wx.VERTICAL)
de.Add(self.text_DeleteEmail,proportion=2,flag=wx.ALL, border=5)
de.Add(self.grid,proportion = 4,flag=wx.LEFT, border=80)
de.Add(h_b,proportion=1,flag=wx.TOP, border=10) vBoxSizer = wx.BoxSizer(wx.VERTICAL)
vBoxSizer.Add(ae,proportion=2,flag=wx.ALL,border=5)
vBoxSizer.Add(de,proportion=4,flag=wx.LEFT|wx.BOTTOM|wx.RIGHT,border=5)
panel.SetSizer(vBoxSizer) #delete the emails that cell bk color changed to yellow
def d_emails(self,evt):
count_yellow = 0
for i in range(len(self.m.receiver)):
if self.grid.GetCellBackgroundColour(i,0)==(255, 255, 0, 255): # yellow
count_yellow += 1 if count_yellow == 0:
wx.MessageBox('Wake up!\nSelect then delete!') else:
l = len(self.m.receiver)
self.m.receiver=[]
for i in range(l):
if self.grid.GetCellBackgroundColour(i,0)==(128, 128, 128, 255):#grey
self.m.receiver.append(self.grid.GetCellValue(i,0))
self.grid.SetCellValue(i,0,'')#clear the cell's value
self.grid.SetCellBackgroundColour(i,0,'white') for i in range(len(self.m.receiver)):
self.grid.SetCellBackgroundColour(i,0,"grey")
self.grid.SetCellValue(i,0,self.m.receiver[i])
self.grid.ForceRefresh() #change cell bk color when double click the cell. The cell without value will not be changed
def grid_change_color(self,evt):
if self.grid.GetCellValue(self.grid.GridCursorRow,self.grid.GridCursorCol) != '':
if self.grid.GetCellBackgroundColour(self.grid.GridCursorRow,self.grid.GridCursorCol)==(128, 128, 128, 255):#grey
self.grid.SetCellBackgroundColour(self.grid.GridCursorRow,self.grid.GridCursorCol,'yellow')
else:
self.grid.SetCellBackgroundColour(self.grid.GridCursorRow,self.grid.GridCursorCol,'grey')
self.grid.ForceRefresh() #add the emails that user input
def add(self,evt):
x = str(self.input_Email.GetValue().encode('utf-8'))
if len(x) == 0:
wx.MessageBox('Hey dude, input something first!')
elif len(x) != len(x.decode('utf-8')):#non ASCII character contained
wx.MessageBox('Hey dude, please input English character only!')
else:
x = x.split(';')
tag = 0
for i in range(len(x)):
if len(x[i]) != 0:
if x[i] in self.m.receiver:
x[i]=''#existing emails, set to ''
elif self.validate_email(x[i]) == None:
wx.MessageBox('Hey dude, eyes wide open! \nPlease input like this: xxx@xxx.com;eee@eee.org')
tag = 1
break
if tag == 0: # all emails are valid
for item in x:
if len(item) != 0:
self.m.receiver.append(item)
for i in range(len(self.m.receiver)):
self.grid.SetCellBackgroundColour(i,0,"grey")
self.grid.SetCellValue(i,0,self.m.receiver[i])
self.input_Email.Clear()
self.grid.ForceRefresh() #validate the emails user input
def validate_email(self,s):
pattern = "^[a-zA-Z0-9\._-]+@([a-zA-Z0-9_-]+\.)+([a-zA-Z]{2,3})$"
p = re.compile(pattern)
return p.match(s) def sendmail(self,evt):
if len(self.m.receiver) == 0:
wx.MessageBox('Are you kidding me?\n Add some receivers then send!')
else:
self.QiuBai = FetchData('http://www.qiushibaike.com/text','http://www.qiushibaike.com/',
'<div class="content">[\n\s]+.+[\n\s]','[<div class="content"><br/>\n]')
self.content = self.QiuBai.getData(self.QiuBai.getHtml())
msg = '\n\n'.join(self.content)
result = self.m.send_mail(msg)
if result == '1':
wx.MessageBox('The hottest QiuBai had been sent.\n Enjoy it ^_^ !')
else:
wx.MessageBox('Some exception occurred :-(\nTry again later...\n'+result)
class GetQiuBaiApp(wx.App):
def OnInit(self):
frame = Frame()
frame.Show()
return True if __name__ == "__main__":
app = GetQiuBaiApp()
app.MainLoop()

最后就是Python转APP了:

(1)在MAC终端里输入pip install py2app安装py2app

(2)创建setup.py,我的project位于/Users/Sophie/PycharmProjects/QiuBai,所以cd到这个路径下。然后vi setup.py。在这个文件中输入例如以下内容(提前准备个icns图标。放到project路径下)

"""

py2app build script for MyApplication

Usage:

python setup.py py2app

"""

from setuptools import setup

OPTIONS = {

'iconfile':'lemon.icns'

}

setup(

app=["Spider_QiuBai.py"],

options={'py2app': OPTIONS},

setup_requires=["py2app"],

)

(3)rm -rf build dist(假设反复多次打包的话。这个命令是须要的。第一次不须要)

(4)python setup.py py2app 用这个命令打包,打包好的app会存放在该路径下的dist路径下。我的例如以下:/Users/Sophie/PycharmProjects/QiuBai/dist

watermark/2/text/aHR0cDovL2Jsb2cuY3Nkbi5uZXQvc29waGllMjgwNQ==/font/5a6L5L2T/fontsize/400/fill/I0JBQkFCMA==/dissolve/70/gravity/Center" alt="">

至此,这个小玩意儿已经做好了,当中略麻烦的可能就是正则还有界面了 Hope you enjoy it ^ ^

Python爬糗百热门20条并邮件分发+wxPython简易GUI+py2app转成可运行文件的更多相关文章

  1. 这届网友实在是太有才了!用python爬取15万条《我是余欢水》弹幕

    年初时我们用数据解读了几部热度高,但评分差强人意的国产剧,而最近正午阳光带着两部新剧来了,<我是余欢水>和<清平乐>,截止到目前为止,这两部剧在豆瓣分别为7.5分和7.9分,算 ...

  2. Python性能优化的20条建议 (转载)

    优化算法时间复杂度 算法的时间复杂度对程序的执行效率影响最大,在Python中可以通过选择合适的数据结构来优化时间复杂度,如list和set查找某一个元素的时间复杂度分别是O(n)和O(1).不同的场 ...

  3. python基础===Python性能优化的20条建议

    优化算法时间复杂度 算法的时间复杂度对程序的执行效率影响最大,在Python中可以通过选择合适的数据结构来优化时间复杂度,如list和set查找某一个元素的时间复杂度分别是O(n)和O(1).不同的场 ...

  4. Python性能优化的20条建议

    优化算法时间复杂度 算法的时间复杂度对程序的执行效率影响最大,在Python中可以通过选择合适的数据结构来优化时间复杂度,如list和set查找某一个元素的时间复杂度分别是O(n)和O(1).不同的场 ...

  5. Python爬取酷狗飙升榜前十首(100)首,写入CSV文件

    酷狗飙升榜,写入CSV文件 爬取酷狗音乐飙升榜的前十首歌名.歌手.时间,是一个很好的爬取网页内容的例子,对爬虫不熟悉的读者可以根据这个例子熟悉爬虫是如何爬取网页内容的. 需要用到的库:requests ...

  6. 萌新学习Python爬取B站弹幕+R语言分词demo说明

    代码地址如下:http://www.demodashi.com/demo/11578.html 一.写在前面 之前在简书首页看到了Python爬虫的介绍,于是就想着爬取B站弹幕并绘制词云,因此有了这样 ...

  7. Python学习-使用Python爬取陈奕迅新歌《我们》网易云热门评论

    <后来的我们>上映也有好几天了,一直没有去看,前几天还爆出退票的事件,电影的主题曲由陈奕迅所唱,特地找了主题曲<我们>的MV看了一遍,还是那个感觉.那天偶然间看到Python中 ...

  8. Python爬取十四万条书籍信息告诉你哪本网络小说更好看

    前言 本文的文字及图片来源于网络,仅供学习.交流使用,不具有任何商业用途,版权归原作者所有,如有问题请及时联系我们以作处理. 作者: TM0831 PS:如有需要Python学习资料的小伙伴可以加点击 ...

  9. 【2022知乎爬虫】我用Python爬虫爬了2300多条知乎评论!

    您好,我是 @马哥python说,一枚10年程序猿. 一.爬取目标 前些天我分享过一篇微博的爬虫: https://www.cnblogs.com/mashukui/p/16414027.html 但 ...

随机推荐

  1. Android通过泛型简化findViewById类型转换

    曾经老用findViewById,每次使用还得add cast一下今天看到一个视频(依据视频中使用的IDE判断,应该是几年前的视频了..),使用了一个方法,能够不用每次使用findViewById都去 ...

  2. hdu 5031 Lines 爆搜

    事实上嘞,这个线能够仅仅延伸一端 然后嘞,爆搜一次就能够 最后嘞,600-800ms过 本弱就是弱啊.你来打我呀-- #include<iostream> #include<cstr ...

  3. windows下PTAM的编译

    前些日子在研究PTAM,以下首先说说PTAM的编译过程,我在XP几WIN7搭配vs2010中均已測试过,都能够执行. 首先下载编译PTAM所必须的库文件.下载地址我会给出 PTAM(PTAM.zip) ...

  4. hdoj--2579--Dating with girls(2)(搜索+三维标记)

    Dating with girls(2) Time Limit: 2000/1000 MS (Java/Others)    Memory Limit: 32768/32768 K (Java/Oth ...

  5. AJAX请求 $.ajaxSetup方法的使用

    转自:https://blog.csdn.net/qq_23476319/article/details/78798885 jQuery.ajaxSetup()函数用于设置AJAX的全局默认设置. 该 ...

  6. SQL Server 忘记登录账号解决方法

    [1] 停止SQL Server 服务 和 SQL Server Agent 服务 [2] 以管理员身份打开命令行,单用户模式启动服务.(在单用户模式下启动 SQL Server 可使计算机本地 Ad ...

  7. 前端学习笔记-CSS

  8. 洛谷P3381 【模板】最小费用最大流(dijstra费用流)

    题目描述 如题,给出一个网络图,以及其源点和汇点,每条边已知其最大流量和单位流量费用,求出其网络最大流和在最大流情况下的最小费用. 输入输出格式 输入格式: 第一行包含四个正整数N.M.S.T,分别表 ...

  9. Java二维码生成与解码工具Zxing使用

    Zxing是Google研发的一款非常好用的开放源代码的二维码生成工具,目前源码托管在github上,源码地址: https://github.com/zxing/zxing 可以看到Zxing库有很 ...

  10. DataTables入门

    转载 https://blog.csdn.net/gfd54gd5f46/article/details/65938189