Python crawler: simulate a login with selenium and reset the cookies in requests

2016-11-22 14:44

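Problems this article solves:

1. Simulating a login with selenium.

2. Getting the cookies after the simulated login.

3. Storing those cookies in Python's requests for further crawling.

Step-by-step code:

1. Simulate the login with selenium: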

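import time
from selenium import webdriver

# The code below runs inside a method of the crawler class, which defines
# self.login_url and self.session. It uses the Selenium 2/3 API.
self.driver = webdriver.PhantomJS(executable_path="phantomjs.exe")
self.driver.get(self.login_url)
ck1 = self.driver.get_cookies()  # cookies before login, kept for comparison

# Fill in the user name (placeholder: use your own account).
elem_user = self.driver.find_element_by_xpath('//input[@id="loginname"]')
elem_user.send_keys('your_username')
time.sleep(1)

# Fill in the password (placeholder: use your own password).
elem_pwd = self.driver.find_element_by_xpath('//input[@id="nloginpwd"]')
elem_pwd.send_keys('your_password')
time.sleep(1)

# Click the submit button; click() returns None, so there is nothing to keep.
self.driver.find_element_by_xpath('//div[@class="login-btn"]/a[@id="loginsubmit"]').click()
time.sleep(3)

# If the browser has left the login page, treat the login as successful.
url = self.driver.current_url
if url != self.login_url:
    print("Login succeeded.")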
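The fixed time.sleep(3) above either wastes time or is too short when the site is slow. A minimal sketch of an alternative, assuming that a change of current_url is a good enough signal of a finished login: poll the URL until it changes or a timeout expires. The helper name wait_for_url_change and its parameters are hypothetical and not part of selenium; selenium's own WebDriverWait serves a similar purpose.

```python
import time


def wait_for_url_change(driver, old_url, timeout=10.0, interval=0.5):
    """Poll driver.current_url until it differs from old_url.

    Returns True if the URL changed within `timeout` seconds, else False.
    `driver` only needs a `current_url` attribute, as selenium drivers have.
    (Hypothetical helper, written for illustration.)
    """
    deadline = time.time() + timeout
    while time.time() < deadline:
        if driver.current_url != old_url:
            return True
        time.sleep(interval)
    # One last check in case the redirect landed during the final sleep.
    return driver.current_url != old_url
```

Inside the crawler class it could replace the last sleep and the URL comparison, for example: if wait_for_url_change(self.driver, self.login_url): print("Login succeeded.")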

2. Get the cookies after the simulated login, and 3. store them in Python's requests for further crawling:

import requests

# Map each cookie name to its value directly. Joining them as "name:value"
# and splitting on ":" again would truncate any value that contains ":".
cook_map = {item["name"]: item["value"] for item in self.driver.get_cookies()}
print(cook_map)

# Build a cookie jar from the dict and attach it to the requests session.
cookies = requests.utils.cookiejar_from_dict(cook_map, cookiejar=None, overwrite=True)
self.session.cookies = cookies
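A name-to-value dict drops the domain and path that selenium reports for each cookie. A minimal sketch that keeps them, assuming the cookie dicts have the shape returned by get_cookies() (keys "name", "value", and optionally "domain" and "path"). The function name session_from_selenium_cookies is hypothetical; the calls it relies on, requests.Session and the cookie jar's set(), are part of requests.

```python
import requests


def session_from_selenium_cookies(selenium_cookies, session=None):
    """Copy cookies from driver.get_cookies() into a requests.Session.

    Hypothetical helper: keeps each cookie's domain and path when present,
    and never splits values, so a ":" inside a value stays intact.
    """
    if session is None:
        session = requests.Session()
    for c in selenium_cookies:
        kwargs = {}
        if c.get("domain"):
            kwargs["domain"] = c["domain"]
        if c.get("path"):
            kwargs["path"] = c["path"]
        session.cookies.set(c["name"], c["value"], **kwargs)
    return session
```

Usage inside the crawler class could look like self.session = session_from_selenium_cookies(self.driver.get_cookies()), after which self.session.get(...) sends the login cookies to matching domains.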

References: http://blog.csdn.net/falseen/article/details/46962011
http://blog.csdn.net/warrior_zhang/article/details/50198699
Note:

Read the reference articles first; once you have the basics, come back to this article.
