使用python模拟登陆百度

#!/usr/bin/python

# -*- coding: utf- -*-

"""

Function:   Used to demostrate how to use Python code to emulate login baidu main page: http://www.baidu.com/

Note:       Before try to understand following code, firstly, please read the related articles:

            ()【整理】关于抓取网页，分析网页内容，模拟登陆网站的逻辑/流程和注意事项

http://www.crifan.com/summary_about_flow_process_of_fetch_webpage_simulate_login_website_and_some_notice/

            () 【教程】手把手教你如何利用工具(IE9的F12)去分析模拟登陆网站(百度首页)的内部逻辑过程

http://www.crifan.com/use_ie9_f12_to_analysis_the_internal_logical_process_of_login_baidu_main_page_website/

            () 【教程】模拟登陆网站 之 Python版

http://www.crifan.com/emulate_login_website_using_python

Version:    --

Author:     Crifan

"""

import re;

import cookielib;

import urllib;

import urllib2;

import optparse;

#------------------------------------------------------------------------------

# check all cookies in cookiesDict is exist in cookieJar or not

def checkAllCookiesExist(cookieNameList, cookieJar) :

    cookiesDict = {};

    for eachCookieName in cookieNameList :

        cookiesDict[eachCookieName] = False;

    allCookieFound = True;

    for cookie in cookieJar :

        if(cookie.name in cookiesDict) :

            cookiesDict[cookie.name] = True;

    for eachCookie in cookiesDict.keys() :

        if(not cookiesDict[eachCookie]) :

            allCookieFound = False;

            break;

    return allCookieFound;

#------------------------------------------------------------------------------

# just for print delimiter

def printDelimiter():

    print '-'*;

#------------------------------------------------------------------------------

# main function to emulate login baidu

def emulateLoginBaidu():

    print "Function: Used to demostrate how to use Python code to emulate login baidu main page: http://www.baidu.com/";

    print "Usage: emulate_login_baidu_python.py -u yourBaiduUsername -p yourBaiduPassword";

    printDelimiter();

    # parse input parameters

    parser = optparse.OptionParser();

    parser.add_option("-u","--username",action="store",type="string",default='',dest="username",help="Your Baidu Username");

    parser.add_option("-p","--password",action="store",type="string",default='',dest="password",help="Your Baidu password");

    (options, args) = parser.parse_args();

    # export all options variables, then later variables can be used

    for i in dir(options):

        exec(i + " = options." + i);

    printDelimiter();

    print "[preparation] using cookieJar & HTTPCookieProcessor to automatically handle cookies";

    cj = cookielib.CookieJar();

    opener = urllib2.build_opener(urllib2.HTTPCookieProcessor(cj));

    urllib2.install_opener(opener);

    printDelimiter();

    print "[step1] to get cookie BAIDUID";

    baiduMainUrl = "http://www.baidu.com/";

    resp = urllib2.urlopen(baiduMainUrl);

    #respInfo = resp.info();

    #print "respInfo=",respInfo;

    for index, cookie in enumerate(cj):

        print '[',index, ']',cookie;

    printDelimiter();

    print "[step2] to get token value";

    getapiUrl = "https://passport.baidu.com/v2/api/?getapi&class=login&tpl=mn&tangram=true";

    getapiResp = urllib2.urlopen(getapiUrl);

    #print "getapiResp=",getapiResp;

    getapiRespHtml = getapiResp.read();

    #print "getapiRespHtml=",getapiRespHtml;

    #bdPass.api.params.login_token='5ab690978812b0e7fbbe1bfc267b90b3';

    foundTokenVal = re.search("bdPass\.api\.params\.login_token='(?P<tokenVal>\w+)';", getapiRespHtml);

    if(foundTokenVal):

        tokenVal = foundTokenVal.group("tokenVal");

        print "tokenVal=",tokenVal;

        printDelimiter();

        print "[step3] emulate login baidu";

        staticpage = "http://www.baidu.com/cache/user/html/jump.html";

        baiduMainLoginUrl = "https://passport.baidu.com/v2/api/?login";

        postDict = {

            #'ppui_logintime': "",

            'charset'       : "utf-8",

            #'codestring'    : "",

            'token'         : tokenVal, #de3dbf1e8596642fa2ddf2921cd6257f

            'isPhone'       : "false",

            'index'         : "",

            #'u'             : "",

            #'safeflg'       : "",

            'staticpage'    : staticpage, #http%3A%2F%2Fwww.baidu.com%2Fcache%2Fuser%2Fhtml%2Fjump.html

            'loginType'     : "",

            'tpl'           : "mn",

            'callback'      : "parent.bdPass.api.login._postCallback",

            'username'      : username,

            'password'      : password,

            #'verifycode'    : "",

            'mem_pass'      : "on",

        };

        postData = urllib.urlencode(postDict);

        # here will automatically encode values of parameters

        # such as:

        # encode http://www.baidu.com/cache/user/html/jump.html into http%3A%2F%2Fwww.baidu.com%2Fcache%2Fuser%2Fhtml%2Fjump.html

        #print "postData=",postData;

        req = urllib2.Request(baiduMainLoginUrl, postData);

        # in most case, for do POST request, the content-type, is application/x-www-form-urlencoded

        req.add_header('Content-Type', "application/x-www-form-urlencoded");

        resp = urllib2.urlopen(req);

        #for index, cookie in enumerate(cj):

        #    print '[',index, ']',cookie;

        cookiesToCheck = ['BDUSS', 'PTOKEN', 'STOKEN', 'SAVEUSERID'];

        loginBaiduOK = checkAllCookiesExist(cookiesToCheck, cj);

        if(loginBaiduOK):

            print "+++ Emulate login baidu is OK, ^_^";

        else:

            print "--- Failed to emulate login baidu !"

    else:

        print "Fail to extract token value from html=",getapiRespHtml;

if __name__=="__main__":

    emulateLoginBaidu();

使用python模拟登陆百度的更多相关文章

【教程】模拟登陆百度之Java代码版
[背景] 之前已经写了教程,分析模拟登陆百度的逻辑: [教程]手把手教你如何利用工具(IE9的F12)去分析模拟登陆网站(百度首页)的内部逻辑过程然后又去用不同的语言: Python的: [教程]模 ...
模拟登陆百度以及Selenium 的基本用法
模拟登陆百度,需要依赖于selenium 模块,调用浏览器,执行python命令先来说一下这个selenium模块啦...... 本文参考内容来自 Selenium官网 SeleniumPython ...
Python模拟登陆新浪微博
上篇介绍了新浪微博的登陆过程,这节使用Python编写一个模拟登陆的程序.讲解与程序如下: 1.主函数(WeiboMain.py): import urllib2 import cookielib i ...
Python模拟登陆万能法-微博|知乎
Python模拟登陆让不少人伤透脑筋,今天奉上一种万能登陆方法.你无须精通HTML,甚至也无须精通Python,但却能让你成功的进行模拟登陆.本文讲的是登陆所有网站的一种方法,并不局限于微博与知乎,仅 ...
Python模拟登陆TAPD
因为在wiki中未找到需要的数据,查询也很迷,打算用python登录tapd抓取所需项目下的wiki数据,方便查找. 2018-9-30 19:12:44 几步走模拟登录tapd 抓取wiki页左侧 ...
Python模拟登陆淘宝并统计淘宝消费情况的代码实例分享
Python模拟登陆淘宝并统计淘宝消费情况的代码实例分享支付宝十年账单上的数字有点吓人,但它统计的项目太多,只是想看看到底单纯在淘宝上支出了多少,于是写了段脚本,统计任意时间段淘宝订单的消费情况,看 ...
Selenium模拟登陆百度贴吧
Selenium模拟登陆百度贴吧 from selenium import webdriver from time import sleep from selenium.webdriver.commo ...
python 模拟登陆，请求包含cookie信息
需求: 1.通过GET方法,访问URL地址一,传入cookie参数 2.根据地址一返回的uuid,通过POST方法,传入cooki参数实现思路: 1.理解http的GET和POST差别 (网上有很多 ...
python模拟登陆之下载
好长时间没有更新博客了,哈哈. 今天公司给了这么一个需求,现在我们需要去淘宝获取上一天的订单号,然后再根据订单号去另一个接口去获取订单详情,然后再给我展示到web! 中间涉及到的技术点有: 模拟登陆 ...

随机推荐

爬虫库之BeautifulSoup学习（二）
BeautifulSoup官方介绍文档:https://www.crummy.com/software/BeautifulSoup/bs4/doc/index.zh.html 四大对象种类: Beau ...
awk用法总结
简介 awk的命名来自于他的三位创始人Alfred Aho .Peter Weinberger 和 Brian Kernighan 的姓氏的首字母. 有多种版本:New awk(nawk),GNU a ...
html5代码如何转成小程序代码
插件源码链接:https://pan.baidu.com/s/1pGY8ZsdESaQGEzoEgpb_Rw 提取码:s1ix 放到与pages文件夹同级目录下 js中调用 //引入 var wx ...
阿里云的opensearch
官方文档:https://help.aliyun.com/document_detail/29104.html?spm=a2c4g.11186623.6.544.5215187be2TnnD
bzoj 2055: 80人环游世界【有上下界有源汇最小费用最大流】
连有上下界的边(ss,i,(0,m),0),(i',t,(0,m),0),表示从任意点开始和结束连(i,j,(0,m),d[i][j]),表示可以买票飞过去连(i,i',(v[i],v[i]),0 ...
阿里云物联网 .NET Core 客户端 | CZGL.AliIoTClient：4.1 上报位置信息
文档目录: 说明 1. 连接阿里云物联网 2. IoT 客户端 3. 订阅Topic与响应Topic 4. 设备上报属性 4.1 上报位置信息 5. 设置设备属性 6. 设备事件上报 7. 服务调用 ...
word2vec改进之Negative Sampling
训练网络时往往会对全部的神经元参数进行微调,从而让训练结果更加准确.但在这个网络中,训练参数很多,每次微调上百万的数据是很浪费计算资源的.那么Negative Sampling方法可以通过每次调整很小 ...
zookeeper 搭建
zookeeper 版本为zookeeper 3.4.8 操作系统为ubuntu 12.04 64位 zookeeper 单机搭建解压zookeeper 包 .tar.gz -C /root/sof ...
学习Mahout(二)
继续上一篇博客. 这篇博客介绍如何跑一下mahout自带的Hello world程序我将mahout 安装在/opt/hadoop/mahout-distribution-0.9 cd /opt/h ...
笔记-JavaWeb学习之旅4
JDBC: 概念:Java DataBase Connectivity Java 数据库连接,Java语言操作数据库本质:其实是官方(sun公司)定义的一套操作所有关系型数据库的规则,即接口,各个数 ...

使用python模拟登陆百度

使用python模拟登陆百度的更多相关文章

随机推荐

热门专题