python selenium p_python selenium 常见问题列表

python selenium webdriver 常见问题FAQ

怎么用ChromeDriver ?

从这里下载最新的driver版本并解压

#好吧，这个命令是给linux or osx用户准备的#windows用户直接手点吧 by 乙醇

unzip chromedriver_linux32_x.x.x.x.zip

你会得到一个 chromedriver 的可执行文件. 现在用下面的代码就可以了:

driver = webdriver.Chrome(executable_path="/path/to/chromedriver")#windows如果还不行就把chrome driver扔到python的安装目录下 by乙醇

Selenium 2支持XPath 2.0吗?

Selenium把xpath处理委托给了浏览器的xpath解析引擎。所以浏览器支持什么，selenium就支持什么。如果那些奇葩的浏览器没有xpath引擎的话(IE6,7,8)，那么在这些大爷上selenium就只支持xpath1.0了。

怎样才能滚到页面的底部？

你可以用 execute_script方法来处理这个。调用原生javascript的API，这样你想滚到哪里就能滚到哪里。

下面的代码演示了如何滚到页面的最下面:

driver.execute_script("window.scrollTo(0, document.body.scrollHeight);")

window 对象

的 scrollTo . 方法可以滚到页面上的任何位置。 scrollHeight 是dom元素的通用属性。document.body.scrollHeight 会返回body元素的高度，基本上就是页面的高度了。

如何使用Firefox的profile来自动保存下载的文件

首先，你要保存的的文件类型你造么？

要搞清楚要自动下载的文件类型，用 curl就好了

curl -I URL | grep "Content-Type"

另一种方式是使用 requests _ module, 这样搞:

importrequestsprint requests.head('http://www.python.org').headers['content-type']

当你确定了content type之后，调用browser.helperApps.neverAsk.saveToDisk来设置firefox的profile就好了。

这是例子：

importosfrom selenium importwebdriver

fp=webdriver.FirefoxProfile()

fp.set_preference("browser.download.folderList",2)

fp.set_preference("browser.download.manager.showWhenStarting",False)

fp.set_preference("browser.download.dir", os.getcwd())

fp.set_preference("browser.helperApps.neverAsk.saveToDisk", "application/octet-stream")

browser= webdriver.Firefox(firefox_profile=fp)

browser.get("http://pypi.python.org/pypi/selenium")

browser.find_element_by_partial_link_text("selenium-2").click()

上例中,application/octet-stream就是content type。

browser.download.dir 指定了文件自动保存的文件夹。

如何在打开Firefox的同时打开firebug ?

首先下载Firebug XPI文件(这个就是friefox的扩展程序文件--by乙醇)，然后再调用firefox profile的add_extension方法。

from selenium importwebdriver

fp=webdriver.FirefoxProfile()

fp.add_extension(extension='firebug-1.8.4.xpi')

fp.set_preference("extensions.firebug.currentVersion", "1.8.4") #Avoid startup screen

browser = webdriver.Firefox(firefox_profile=fp)

怎么截图呢？

用webdriver提供的 save_screenshot 方法：

from selenium importwebdriver

driver=webdriver.Firefox()

driver.get('http://www.python.org/')

driver.save_screenshot('screenshot.png')

driver.quit()

如何使用默认已存在的profile启动firefox？by 乙醇

fp = webdriver.FirefoxProfile('/path/to/your/existing/profile')

browser= webdriver.Firefox(fp)

这里在windows上有个坑，就是路径分隔符在windows上是\而不是/,把这个弄明白然后指定对路径基本就可以了。

PS：这里还有另一个坑。就是如果你使用默认的profile的话，请一定关闭friefox以后再运行代码，否则会因为profile的文件锁问题而发生异常。就是说一次只能打开一个firefox实例，如果你使用默认的profile的话。

python selenium p_python selenium 常见问题列表相关推荐

python selenium p_Python爬虫(二十一)_Selenium与PhantomJS
本章将介绍使用Selenium和PhantomJS两种工具用来加载动态数据,更多内容请参考:Python学习指南 Selenium Selenium是一个Web的自动化测试工具,最初是为网站自动化测试 ...
Python 爬虫利器 Selenium 从入门到进阶
作者 | 俊欣来源 | 关于数据分析与可视化今天小编就来讲讲selenium,我们大致会讲这些内容 selenium简介与安装页面元素的定位浏览器的控制鼠标的控制键盘的控制设置元素的等待 ...
Python爬虫之selenium库使用详解
Python爬虫之selenium库使用详解本章内容如下: 什么是Selenium selenium基本使用声明浏览器对象访问页面查找元素多个元素查找元素交互操作交互动作执行JavaS ...
[Python爬虫]使用Selenium操作浏览器订购火车票
这个专题主要说的是Python在爬虫方面的应用,包括爬取和处理部分 [Python爬虫]使用Python爬取动态网页-腾讯动漫(Selenium) [Python爬虫]使用Python爬取静态网页-斗 ...
python抓取文献关键信息,python爬虫——使用selenium爬取知网文献相关信息
python爬虫--使用selenium爬取知网文献相关信息写在前面: 本文章限于交流讨论,请不要使用文章的代码去攻击别人的服务器如侵权联系作者删除文中的错误已经修改过来了,谢谢各位爬友指出错误 ...
python 模拟浏览器selenium 微信_Spider-Python爬虫之使用Selenium模拟浏览器行为
分析他的代码比较简单,主要有以下的步骤:使用BeautifulSoup库,打开百度贴吧的首页地址,再解析得到id为new_list标签底下的img标签,最后将img标签的图片保存下来. header ...
Python+selenium 自动化-selenium的版本查看和升级
Selenium版本的查看 pip show selenium查看selenium的版本. Selenium版本的升级 python -m pip install --upgrade selenium ...
Python之selenium：selenium库的简介、安装、使用方法之详细攻略
Python之selenium:selenium库的简介.安装.使用方法之详细攻略目录 selenium库的简介 1.Selenium需要一个驱动程序来与所选的浏览器交互 selenium库的安装 ...
python爬虫之selenium,谷歌无头浏览器
python爬虫之selenium和PhantomJS 主要的内容 lenium hantomjs 无头浏览器的懒加载一什么是selenium? 介绍它是python中的一个第三方库,对外提供 ...

python selenium p_python selenium 常见问题列表

python selenium p_python selenium 常见问题列表相关推荐

最新文章

热门文章