【Python】博客信息爬取-微信消息自动发送

1、环境安装

python -m pip install --upgrade pip

pip install bs4

pip install wxpy

pip install lxml

2、博客爬取及发送

from bs4 import BeautifulSoup

from threading import Timer

import requests

import traceback

from wxpy import *

url = ''

nIndex = 6

my_groups = None

def getMsg(nIndex):

    # 获得网址源码

    html = requests.get(url).content

    soup = BeautifulSoup(html, 'lxml') 

    # print('开始抓取')

    # title = soup.title.string

    # print(title)

    # areaall = soup.find(id='sina_keyword_ad_area2').children

    areaall = soup.select('#sina_keyword_ad_area2 p')

    # print(areaall)

    iCount = len(areaall)

    # print(iCount)

    # print(nIndex)

    msg = ""

    if iCount < nIndex:

        return msg,0

    else:

        msg = areaall[iCount - 1]

        msg = msg.get_text()

        # print(msg)

        return msg,iCount

msgTemp = ''

nNullMsg = 0

def auto_send():

    try:

        global nIndex

        global my_groups

        global msgTemp

        global nNullMsg

        msgContent,nIndexMsg = getMsg(nIndex)

        msgContent = str(msgContent).strip()

        # print(nIndexMsg)

        # nIndex += 1

        # print(msgContent)

        if len(msgContent) != 0 :

            # print(str(nIndex) + ":\t" + msgContent)

            # return

            if msgContent != msgTemp :

                if my_groups != None and len(my_groups) > 0 :

                    print("发送消息：" + msgContent)

                    my_groups[0].send(msgContent)

                    msgTemp = msgContent

            else:

                print('消息已发送')

        else:

            nNullMsg += 1

            print("没有新消息")

            if nNullMsg == 20 :

                print("恭喜发财，今日推送完毕")

                return

        # # 每隔86400秒（1天），发送1次

        t = Timer(3, auto_send)

        t.start()

    except  Exception as e:

        print(e)

        # 你的微信昵称，注意这里不是备注，也不是微信帐号

        my_friend = bot.friends().search('NetUML')[0]

        my_friend.send(u"报告老板，今日份的信息发送失败了！")

if __name__ == "__main__":

    # 初始化机器人，扫码登陆微信，适用于Windows系统

    # 初始化一个机器人对象

    bot = Bot(cache_path=True)

    my_groups = bot.groups().search('广告技术')    

    for group in my_groups:

        print(group)   

    # # Linux系统，执行登陆请调用下面的这句

    # bot = Bot(console_qr=2, cache_path="botoo.pkl")

    # 调用函数进行消息发送

    auto_send()

【Python】博客信息爬取-微信消息自动发送的更多相关文章

利用爬虫将Yuan先生的博客文章爬取下来
由于一次巧遇,我阅读了Yuan先生的一篇博客文章,感觉从Yuan先生得博客学到很多东西,很喜欢他得文章.于是我就关注了他,并且想阅读更多出自他手笔得博客文章,无奈,可能Yuan先生不想公开自己得博客吧 ...
itchat和matplotlib的结合使用爬取微信信息
前几天无意中看到了一片文章,<一件有趣的事:我用 Python 爬了爬自己的微信朋友>,这篇文章写的是使用python中的itchat爬取微信中朋友的信息,其中信息包括,昵称.性别.地理位 ...
python itchat 爬取微信好友信息
原文链接:https://mp.weixin.qq.com/s/4EXgR4GkriTnAzVxluJxmg 「itchat」一个开源的微信个人接口,今天我们就用itchat爬取微信好友信息,无图言虚 ...
使用Python爬取微信公众号文章并保存为PDF文件(解决图片不显示的问题)
前言第一次写博客,主要内容是爬取微信公众号的文章,将文章以PDF格式保存在本地. 爬取微信公众号文章(使用wechatsogou) 1.安装 pip install wechatsogou --up ...
python爬取微信公众号
爬取策略 1.需要安装python selenium模块包,通过selenium中的webdriver驱动浏览器获取Cookie的方法.来达到登录的效果 pip3 install selenium c ...
Python爬取微信好友
前言今天看到一篇好玩的文章,可以实现微信的内容爬取和聊天机器人的制作,所以尝试着实现一遍,本文记录了实现过程和一些探索的内容来源: 痴海链接: https://mp.weixin.qq.com/ ...
如何利用Python网络爬虫爬取微信朋友圈动态--附代码（下）
前天给大家分享了如何利用Python网络爬虫爬取微信朋友圈数据的上篇(理论篇),今天给大家分享一下代码实现(实战篇),接着上篇往下继续深入. 一.代码实现 1.修改Scrapy项目中的items.py ...
安居客scrapy房产信息爬取到数据可视化(下)-可视化代码
接上篇:安居客scrapy房产信息爬取到数据可视化(下)-可视化代码,可视化的实现~ 先看看保存的数据吧~ 本人之前都是习惯把爬到的数据保存到本地json文件, 这次保存到数据库后发现使用mongod ...
python爬取微信小程序（实战篇）
python爬取微信小程序(实战篇) 本文链接:https://blog.csdn.net/HeyShHeyou/article/details/90452656 展开一.背景介绍近期有需求需要抓 ...

随机推荐

C/JS_二分法查找
1. 二分法查找前提: 数据是排好序的. 题设:给出一个有序arr,从中找出key,arr的区间是array[ low , higt]. 步骤: (1)mid=(low+high)/2 (2)arr ...
解决telnet无法连接 Connection refused
telnet协议是TCP/IP协议族中的一员,是Internet远程登陆服务的标准协议和主要方式.它为用户提供了在本地计算机上完成远程主机工作的能力.在终端使用者的电脑上使用telnet程序,用它连接 ...
Asp.Net Core 404处理
在使用Asp.Net Core Mvc时 404处理整理如下一.自带404状态处理 1.控制器视图子弹404视图 NotFoundResult,NotFoundObjectResult // // ...
MySQL关于根据日期查询数据的sql语句
查询在某段日期之间的数据: select * from 数据表 where 时间字段名 BETWEEN '2016-02-01' AND '2016-02-05' 查询往前3个月的数据: selec ...
edis更新的正确方法
Redis更新的正确方法 https://www.cnblogs.com/westboy/p/8696607.html redis做缓存,怎么更新里面的数据 https://blog.csdn.net ...
UseSwagger
if [ "$UseSwagger" != "true" ]; then sed -i "s/\"UseSwagger\": tr ...
php读取ini配置文件属性
ini的内容格式如下,请根据自己的INI,格式修改下段程序. autostart = false font_size = font_color = red =================== fu ...
V-rep学习笔记：切削
V-REP allows you to perform cutting simulations. The user can model almost any type of cutting volum ...
py3下怎么用StringIO
try: from StringIO import StringIO except ImportError: from io import StringIO
http范围请求
基于范围请求可以实现断点续传和多线程分片下载 HTTP/1.1之后才支持,需要双端都支持服务端头信息中有 Accept-Ranges:bytes 表明服务器支持范围请求 curl -I &quo ...

【Python】博客信息爬取-微信消息自动发送

【Python】博客信息爬取-微信消息自动发送的更多相关文章

随机推荐

热门专题