python爬取快手视频并批量下载
import re
import requests
import time
import json
import os
from pprint import pprintdef get(url: str) -> dict:"""title、imgs、videos"""data = {}failed = {'msg': 'failed...'}headers = {"User-Agent": "Mozilla/5.0 (iPhone; CPU iPhone OS 6_0 like Mac OS X) AppleWebKit/536.26 (KHTML, like Gecko) Version/6.0 Mobile/10A5376e Safari/8536.25","Cookie": "did=web_68e0268146694843a92700d2de49a0a6;"}# rewrite desktop urltemp = re.findall(r'live\.kuaishou\.com/u/\w+/(\w+)', url)if temp:url = 'https://c.kuaishou.com/fw/photo/{}'.format(temp[0])rep = requests.get(url, headers=headers, timeout=10)if rep.status_code != 200:return failedpage_data = re.findall(r'<script type="text/javascript">window\.pageData= (\{.*?\})</script>', rep.text)if not page_data:return failedtry:page_data = json.loads(page_data[0])except Exception:print('kuaishou loads json failed')return failedvideo_info = page_data['video']data['title'] = video_info['caption'].split(" ")[0]data['user'] = page_data['user']['name']# 获取视频try: # 如果出错,则可能是长图视频data['videos'] = [video_info['srcNoMark']]except Exception:passelse:data['videoName'] = data['title']data['msg'] = '如果快手视频下载出错请尝试更换网络'# 获取图片try: # 如果出错,则可能是普通视频;images = video_info['images']imageCDN: str = video_info['imageCDN']# 如果是长图视频,则这几项一定存在assert images is not Noneassert imageCDN is not Noneexcept Exception:passelse:if not imageCDN.startswith('http'):imageCDN = 'http://' + imageCDNdata['imgs'] = [imageCDN + i['path'] for i in images]return datadef download(name: str, filename: str, url: str):try:os.mkdir("E:\\视频\\kuaishou\\"+name)except Exception as e:passr = requests.get(url, stream=True)with open("E:\\视频\\kuaishou\\" + name + "\\" + filename + ".mp4", "wb") as mp4:for chunk in r.iter_content(chunk_size=1024 * 1024):if chunk:mp4.write(chunk)if __name__ == "__main__":with open("E:\\视频\\video.txt", 'r') as file_to_read:while True:lines = file_to_read.readline() # 整行读取数据if not lines:print("Finished!!!")breakgetVideo = get(lines)pprint(getVideo)download(getVideo['user'], getVideo['title'], getVideo['videos'][0])time.sleep(1)
需要将每个视频的网址放在一个txt文件下,比如这样:
https://live.kuaishou.com/u/zx34567studio/3xy2z95nw992rru?did=web_6949766fd3fc869e06adf4ad459aef38
https://live.kuaishou.com/u/yuege33333/3xwapaztjzqfry9?did=web_6949766fd3fc869e06adf4ad459aef38
https://live.kuaishou.com/u/yuege33333/3xk7h7k6ziam66g?did=web_6949766fd3fc869e06adf4ad459aef38
https://live.kuaishou.com/u/Xiaxia977/3xa9c6z4irtvyx9?did=web_6949766fd3fc869e06adf4ad459aef38
https://live.kuaishou.com/u/3xbyb7qjchwgeza/3x4f5xrztpqcgyy?did=web_6949766fd3fc869e06adf4ad459aef38
https://live.kuaishou.com/u/3xbyb7qjchwgeza/3x4j6dtd3w5wj7a?did=web_6949766fd3fc869e06adf4ad459aef38
https://live.kuaishou.com/u/3xi7ts3hndvw83g/3xex4u6bn2tkebw?did=web_6949766fd3fc869e06adf4ad459aef38
https://live.kuaishou.com/u/3xi7ts3hndvw83g/3xhkdfxpc9h96su?did=web_6949766fd3fc869e06adf4ad459aef38
https://live.kuaishou.com/u/3xi7ts3hndvw83g/3xcipugf9ycj65c?did=web_6949766fd3fc869e06adf4ad459aef38
https://live.kuaishou.com/u/3xhvdt6nn3a5trq/3xtsqszw74pmgv2?did=web_6949766fd3fc869e06adf4ad459aef38
https://live.kuaishou.com/u/3xhvdt6nn3a5trq/3xf6kkvhtq99wte?did=web_6949766fd3fc869e06adf4ad459aef38
https://live.kuaishou.com/u/3xhvdt6nn3a5trq/3xz8fpa74te3irq?did=web_6949766fd3fc869e06adf4ad459aef38
https://live.kuaishou.com/u/qingge702/3xj82vnsnetdnbs?did=web_6949766fd3fc869e06adf4ad459aef38
https://live.kuaishou.com/u/qingge702/3xfdpt4we8kz4ki?did=web_6949766fd3fc869e06adf4ad459aef38
https://live.kuaishou.com/u/qingge702/3xyc52kqg3c4ejg?did=web_6949766fd3fc869e06adf4ad459aef38
https://live.kuaishou.com/u/qingge702/3xtrp9ud4vw2dzs?did=web_6949766fd3fc869e06adf4ad459aef38
https://live.kuaishou.com/u/qingge702/3xnensa7atv9nb9?did=web_6949766fd3fc869e06adf4ad459aef38
https://live.kuaishou.com/u/qingge702/3xk84tibykw7mba?did=web_6949766fd3fc869e06adf4ad459aef38
https://live.kuaishou.com/u/qingge702/3xcccj4g6erdzn2?did=web_6949766fd3fc869e06adf4ad459aef38
https://live.kuaishou.com/u/qingge702/3x2vb44pyaysyd2?did=web_6949766fd3fc869e06adf4ad459aef38
https://live.kuaishou.com/u/qingge702/3x4j2gt8b8p823w?did=web_6949766fd3fc869e06adf4ad459aef38
https://live.kuaishou.com/u/qingge702/3xkzsgx9ckdwn6k?did=web_6949766fd3fc869e06adf4ad459aef38
https://live.kuaishou.com/u/qingge702/3xm85ejcsfp6n44?did=web_6949766fd3fc869e06adf4ad459aef38
https://live.kuaishou.com/u/qingge702/3xw8uj4zsp7swje?did=web_6949766fd3fc869e06adf4ad459aef38
https://live.kuaishou.com/u/3xdzbs7wti2eqc6/3xkj8u8h8up9zia?did=web_6949766fd3fc869e06adf4ad459aef38
https://live.kuaishou.com/u/3xdzbs7wti2eqc6/3xzv9cfj65z7mdw?did=web_6949766fd3fc869e06adf4ad459aef38
https://live.kuaishou.com/u/3xdzbs7wti2eqc6/3xn8yrg7kasnec6?did=web_6949766fd3fc869e06adf4ad459aef38
https://live.kuaishou.com/u/3xdzbs7wti2eqc6/3xnxhbxiw3fniby?did=web_6949766fd3fc869e06adf4ad459aef38
https://live.kuaishou.com/u/3xdzbs7wti2eqc6/3x4vmqveeccbmty?did=web_6949766fd3fc869e06adf4ad459aef38
https://live.kuaishou.com/u/3xdzbs7wti2eqc6/3xg9d2rrpw573ck?did=web_6949766fd3fc869e06adf4ad459aef38
https://live.kuaishou.com/u/3xbzgk99vs7g846/3x7hzqc8xtbe9r4?did=web_6949766fd3fc869e06adf4ad459aef38
python爬取快手视频并批量下载相关推荐
- python爬取快手视频 多线程下载
直接开始! 环境: python 2.7 + win10 工具:fiddler postman 安卓模拟器 首先,打开fiddler,fiddler作为http/https 抓包神器,这里就不多介绍. ...
- python爬取快手视频--json数据分析
打开快手主页,进行页面分析 对于快手这种平台,分析完页面代码之后,无任何想要的信息,所以,只能进行json数据的抓取,这些视频都是通过json语句传给前段,然后进行循环生成,所以,我们来看抓的json ...
- python爬取bilibili数据_如何使用Python爬取bilibili视频(详细教程)
Python爬取bilibili视频 摘要 为了解决PC端的bilibili无法下载视频的问题,使用python语言可以实现一个能够爬取bilibili某个视频资源(不包括会员视频)的程序.采用整个视 ...
- python爬取快手评论信息+快手号
python爬取快手评论信息+快手号 潦草记录一下思路 从浏览器进行抓包,分析提交参数, 1.从手机中打开快手,找到需要爬取的视频,转发复制链接,从网页中打开 2.按F12进入开发者模式,进行抓包 3 ...
- python3爬虫系列16之多线程爬取汽车之家批量下载图片
python3爬虫系列16之多线程爬取汽车之家批量下载图片 1.前言 上一篇呢,python3爬虫系列14之爬虫增速多线程,线程池,队列的用法(通俗易懂),主要介绍了线程,多线程,和两个线程池的使用. ...
- python如何爬取网页视频_快就完事了!10分钟用python爬取网站视频和图片
原标题:快就完事了!10分钟用python爬取网站视频和图片 话不多说,直接开讲!教你如何用Python爬虫爬取各大网站视频和图片. 638855753 网站分析: 我们点视频按钮,可以看到的链接是: ...
- python爬取快手粉丝榜_[Python] 【原创开源】快手爬虫,根据id批量爬取用户的所有图集和视频...
分享 写在前面代码功能如题,根据快手用户的id来爬取用户所有公开作品,包括图集和视频. 原理:其实就是利用基于chromium内核的浏览器自带的devtools对所有请求进行排查找出包含作品链接的请求 ...
- 用Python爬取Bilibili视频,难吗?
作者:Mike_Shine 来源:https://urlify.cn/2qyMBb 很多人学习python,不知道从何学起. 很多人学习python,掌握了基本语法过后,不知道在哪里寻找案例上手. 很 ...
- python爬取YB视频评论
python爬取评论 一.背景 二.整体思路 三.代码讲解 3.1 代码结构 3.2 核心代码 3.2.1 爬虫核心代码 3.2.2 界面核心代码 3.2.3 json转excel 四.爬取结果 五. ...
- Python爬取CCTV视频
文章目录 前言 一.需求 二.分析 微视频 长视频 三.处理 四.运行效果 前言 为了更好的掌握数据处理的能力,因而开启Python网络爬虫系列小项目文章. 小项目小需求驱动,每篇文章会使用两种以上的 ...
最新文章
- shell getopts命令
- jsp验证身份证号并自动计算年龄和出生日期
- hadoop配置文件说明
- 网易有道的产品总监王焱:如何利用数据分析推动产品设计
- 6 个 K8s 日志系统建设中的典型问题,你遇到过几个?
- 华为Mate20首支预告片爆出,30秒内包含太多黑科技
- mac 10.10.5 mysql_Macbook os x 10.10.5 下装mysql 一些记录
- 刀片服务器和机架服务器性能,刀片服务器与机架服务器的区别是什么 刀片服务器与机架服务器的区别介绍...
- 常见的 Java 错误及避免方法
- django,form表单,数据库增删改查
- php 设置 最大内存,修改PHP的memory_limit限制的方法分享
- OpenGL基础12:进入3D世界
- csp 201512-4 送货(hierholzer算法的递归和堆栈实现)
- 生物医学基础--人体阻抗模型
- 大一新生先学C语言编程还是先学C语言的数据结构和算法?
- 明翰英语教学系列之形容词与副词篇V0.1(持续更新)
- 关于ISO七层模型你应该知道的东西
- python numpy安装教程_python3.6下Numpy库下载与安装图文教程
- 【点云系列】基于图结构的点云快速重采样 翻译
- c语言asinh函数,C ++ STL中的asinh()函数