Python Crawler: CAPTCHA (1), Downloading It Locally to Log In to Gushiwen (古诗文网)

I. Analysis

CAPTCHA:

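Capturing the login request: the login attempt does not need to succeed; the POST to the login endpoint can still be captured in the browser's network panel.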

Form data:
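As a rough illustration of what the captured form data looks like on the wire: when `requests` is given a dict as `data`, it sends it as a URL-encoded body, which is the same shape the browser's network panel shows. The sketch below uses only the standard library; the field names are the ones used by the login code in this article, while every value is a made-up placeholder.

```python
from urllib.parse import urlencode, parse_qs

# Placeholder values only; the field names are taken from the login form.
formdata = {
    'from': '',
    'email': 'your_account',
    'pwd': 'your_password',
    'code': 'AB12',
    'denglu': '登录',
    '__VIEWSTATE': 'abc+/=',
    '__VIEWSTATEGENERATOR': 'C93BE1AE',
}

# Encode the dict the way an HTML form submission would be encoded.
body = urlencode(formdata)
print(body)

# Decode a captured body back into fields; keep_blank_values keeps 'from='.
decoded = parse_qs(body, keep_blank_values=True)
print(decoded['__VIEWSTATE'])
```

Decoding a captured body this way makes it easy to see which fields are fixed, which are typed by the user, and which change on every page load.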

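Get the values of '__VIEWSTATEGENERATOR' and '__VIEWSTATE'. Both are hidden inputs in the login page's HTML, so they have to be read from the page before the form is posted.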
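A minimal sketch of pulling those two hidden fields out of a page, assuming they appear as `<input type="hidden">` elements with matching `id` attributes. It uses the standard library's `html.parser` so it runs without extra packages; the article's own code uses BeautifulSoup instead. `HiddenFieldParser`, `extract_hidden_fields`, and the sample markup are hypothetical and not copied from the real page.

```python
from html.parser import HTMLParser


class HiddenFieldParser(HTMLParser):
    """Collect the value of every <input type="hidden"> that has an id."""

    def __init__(self):
        super().__init__()
        self.fields = {}

    def handle_starttag(self, tag, attrs):
        if tag != 'input':
            return
        attr = dict(attrs)
        if attr.get('type') == 'hidden' and attr.get('id'):
            # A missing value attribute is stored as an empty string.
            self.fields[attr['id']] = attr.get('value') or ''


def extract_hidden_fields(html):
    parser = HiddenFieldParser()
    parser.feed(html)
    return parser.fields


# Hypothetical sample markup standing in for the login page.
sample_html = '''
<form method="post" action="./login.aspx">
  <input type="hidden" name="__VIEWSTATE" id="__VIEWSTATE" value="abc123" />
  <input type="hidden" name="__VIEWSTATEGENERATOR" id="__VIEWSTATEGENERATOR" value="C93BE1AE" />
  <input type="text" name="email" id="email" />
</form>
'''

fields = extract_hidden_fields(sample_html)
print(fields['__VIEWSTATE'])
print(fields['__VIEWSTATEGENERATOR'])
```

The visible `email` input is skipped because it is not of type `hidden`, so only the two state fields end up in the result.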

II. Code

import requests
from bs4 import BeautifulSoup

headers = {
    'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/74.0.3729.131 Safari/537.36',
}


def download_code(s):
    url = 'https://so.gushiwen.org/user/login.aspx?from=http://so.gushiwen.org/user/collect.aspx'
    r = s.get(url=url, headers=headers)
    soup = BeautifulSoup(r.text, 'lxml')
    # Get the CAPTCHA image link
    image_src = 'https://so.gushiwen.org' + soup.find('img', id='imgCode')['src']
    # print(image_src)  # https://so.gushiwen.org/RandCode.ashx
    # Fetch the image through the same session so it shares the login page's cookies
    r_image = s.get(image_src, headers=headers)
    with open('code.png', 'wb') as fp:
        fp.write(r_image.content)
    # The two hidden parameters the form requires
    viewstate_generator = soup.find('input', id='__VIEWSTATEGENERATOR')['value']
    viewstate = soup.find('input', id='__VIEWSTATE')['value']
    return viewstate_generator, viewstate


def login(viewg, view, s):
    post_url = 'https://so.gushiwen.org/user/login.aspx?from='
    # Ask the user to open code.png and type in the CAPTCHA
    code = input("Enter the CAPTCHA: ")
    formdata = {
        'pwd': 'your_password',   # replace with your own password
        'from': '',
        'email': 'your_account',  # replace with your own account
        'denglu': '登录',          # value of the login button; sent as-is
        'code': code,
        '__VIEWSTATEGENERATOR': viewg,
        '__VIEWSTATE': view,
    }
    r = s.post(url=post_url, headers=headers, data=formdata)
    # Save the response page so the login result can be inspected
    with open('古诗.html', 'w', encoding='utf8') as fp:
        fp.write(r.text)


def main():
    # Create a session so cookies persist across requests
    s = requests.Session()
    # Download the CAPTCHA to the local disk
    viewg, view = download_code(s)
    # Send the request to the POST address
    login(viewg, view, s)


if __name__ == '__main__':
    main()
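The code above builds the CAPTCHA image URL by prepending the site root to the `src` attribute, which works because the printed result shows the `src` is the root-relative path `/RandCode.ashx`. As a hedged alternative, `urllib.parse.urljoin` resolves the `src` against the page URL and also copes with relative or already-absolute values. `resolve_image_url` is a hypothetical helper name, not part of the original code.

```python
from urllib.parse import urljoin

login_url = 'https://so.gushiwen.org/user/login.aspx?from=http://so.gushiwen.org/user/collect.aspx'


def resolve_image_url(page_url, src):
    """Resolve an <img> src attribute against the URL of the page it came from."""
    return urljoin(page_url, src)


# Root-relative src, as on the login page
print(resolve_image_url(login_url, '/RandCode.ashx'))
# → https://so.gushiwen.org/RandCode.ashx
```

In `download_code`, this would replace the string concatenation with `resolve_image_url(url, soup.find('img', id='imgCode')['src'])`.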
