[docker] Setting up CentOS 7.4 + Python 2.7 + selenium + Firefox + tesseract
2020-01-13 09:21
Current Docker container configuration:
- CentOS 7.4
- python 2.7.5
Target Docker container configuration:
- CentOS 7.4
- python 2.7.5
- selenium 3.141.0
- geckodriver 0.15
- firefox 56.0.2
- Pillow 6.1.0
- pytesseract 0.2.7
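Once the steps below are done, one way to confirm that the container matches the target list is to compare the installed Python package versions against the pinned ones. The following is only a sketch and not part of the original setup: `TARGET_VERSIONS`, `installed_version` and `find_mismatches` are hypothetical names, and it assumes `pkg_resources` (shipped with setuptools) is available. The lookup function is passed in as a parameter so the comparison can be tried without the packages installed.

```python
# -*- coding: utf-8 -*-
from __future__ import print_function

# Pinned Python packages taken from the target configuration above.
TARGET_VERSIONS = {
    "selenium": "3.141.0",
    "Pillow": "6.1.0",
    "pytesseract": "0.2.7",
}


def installed_version(name):
    """Return the installed version of a distribution, or None if unknown."""
    try:
        import pkg_resources  # assumption: setuptools is installed
        return pkg_resources.get_distribution(name).version
    except Exception:
        # Package (or pkg_resources itself) is missing.
        return None


def find_mismatches(targets, lookup=installed_version):
    """Return {name: (expected, found)} for packages that differ from targets."""
    mismatches = {}
    for name, expected in targets.items():
        found = lookup(name)
        if found != expected:
            mismatches[name] = (expected, found)
    return mismatches


if __name__ == "__main__":
    problems = find_mismatches(TARGET_VERSIONS)
    for name in sorted(problems):
        expected, found = problems[name]
        print("%s: expected %s, found %s" % (name, expected, found))
```

Running the script inside the container prints one line per package whose version differs from the target, and nothing if everything matches.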
Install dependencies
yum install -y zlib-devel bzip2-devel openssl-devel ncurses-devel sqlite-devel readline-devel tk-devel libffi-devel gcc gcc-c++ make wget git unzip libjpeg-devel libpng-devel giflib-devel
Create a directory for the downloaded packages
mkdir -p /usr/local/download
cd /usr/local/download
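Install pip
# The generic get-pip.py no longer supports Python 2.7, so use the 2.7-specific script
wget --no-check-certificate https://bootstrap.pypa.io/pip/2.7/get-pip.py
python get-pip.py
# Back up any existing /usr/bin/pip and link to the new one
# (assumes get-pip.py placed pip in /usr/local/bin)
mv /usr/bin/pip /usr/bin/pip_bak
ln -s /usr/local/bin/pip /usr/bin/pip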
Install the Python packages you need
pip install requests
pip install Pillow
pip install httplib2
pip install excel
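Install tesseract
# Install leptonica
cd /usr/local/download/
wget http://www.leptonica.org/source/leptonica-1.72.tar.gz
tar xvzf leptonica-1.72.tar.gz
cd leptonica-1.72/
./configure
make && make install

# Install tesseract-3.04
cd /usr/local/download/
wget https://github.com/tesseract-ocr/tesseract/archive/3.04.zip
unzip 3.04.zip && cd tesseract-3.04/
# If the archive has no configure script, run ./autogen.sh first (needs autoconf, automake, libtool)
./configure
make && make install

# Refresh the dynamic linker cache manually
ldconfig

pip install pytesseract

# Install the language data
Download the model file for the language you need from https://github.com/tesseract-ocr/tessdata (for Tesseract 3.04, the files under the repository's 3.04.00 tag should be the matching ones).
Since only phone numbers and English text need to be recognized for now, eng.traineddata alone is enough.
Move the model file to /usr/local/share/tessdata, after which recognition works.

# Example
import pytesseract
from PIL import Image

image = Image.open('bb.png')
code = pytesseract.image_to_string(image)
print(code)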
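Because the stated goal is to recognize phone numbers, raw OCR output usually needs some filtering. The sketch below is one possible approach, not the author's code: `PHONE_PATTERN`, `extract_phone_numbers` and `ocr_phone_numbers` are hypothetical names, and the pattern assumes 11-digit mobile numbers starting with 1, possibly split by single spaces or hyphens in the OCR text. The `pytesseract` and `PIL` imports sit inside the OCR function so the text helper can be tried on its own.

```python
# -*- coding: utf-8 -*-
from __future__ import print_function
import re

# Assumption: 11-digit mobile numbers starting with 1; OCR may insert a
# single space or hyphen between digits, so allow one optional separator.
PHONE_PATTERN = re.compile(r"(?<!\d)1(?:[ -]?\d){10}(?!\d)")


def extract_phone_numbers(text):
    """Return phone-number-like digit runs found in OCR text."""
    matches = PHONE_PATTERN.findall(text)
    # Strip the separators so each result is a plain 11-digit string.
    return [re.sub(r"[ -]", "", m) for m in matches]


def ocr_phone_numbers(image_path):
    """Run Tesseract on an image file and return the phone numbers found."""
    # Imported here so extract_phone_numbers works without the OCR stack.
    import pytesseract
    from PIL import Image

    image = Image.open(image_path).convert("L")  # convert to grayscale
    text = pytesseract.image_to_string(image, lang="eng")
    return extract_phone_numbers(text)


if __name__ == "__main__":
    # 'bb.png' is the sample file name used in the example above.
    print(ocr_phone_numbers("bb.png"))
```

Digit runs longer than 11 digits are rejected by the lookarounds, which avoids cutting a phone number out of a longer identifier such as an order number.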
Install selenium + Firefox + Xvfb
yum install -y Xvfb gtk3 gtk3-devel libXfont 'xorg-x11-fonts*' libgtk-3.so.0 bzip2
pip install xvfbwrapper selenium pyvirtualdisplay

# Install the browser
cd /usr/local/download/
wget https://ftp.mozilla.org/pub/firefox/releases/56.0.2/linux-x86_64/en-US/firefox-56.0.2.tar.bz2
tar xjvf firefox-56.0.2.tar.bz2
rm -f /usr/bin/firefox
ln -s /usr/local/download/firefox/firefox /usr/bin/firefox

# Install geckodriver
wget https://github.com/mozilla/geckodriver/releases/download/v0.15.0/geckodriver-v0.15.0-linux64.tar.gz
tar xvzf geckodriver-*.tar.gz
rm -f /usr/bin/geckodriver
# Use an absolute path as the symlink target; a relative target is resolved against the link's own directory
ln -s /usr/local/download/geckodriver /usr/bin/geckodriver
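Before running the test case, it can help to check that every binary installed so far is reachable on PATH, since a symlink pointing at the wrong place only shows up when selenium tries to start the browser. This is a minimal sketch under assumptions: `find_on_path` and `missing_tools` are hypothetical helpers, and the executable names in the list are the usual ones for these tools.

```python
# -*- coding: utf-8 -*-
from __future__ import print_function
import os


def find_on_path(name, path=None):
    """Return the full path of an executable called `name`, or None."""
    if path is None:
        path = os.environ.get("PATH", "")
    for directory in path.split(os.pathsep):
        if not directory:
            continue  # skip empty PATH entries
        candidate = os.path.join(directory, name)
        # isfile() follows symlinks, so a dangling link is not accepted.
        if os.path.isfile(candidate) and os.access(candidate, os.X_OK):
            return candidate
    return None


def missing_tools(names, path=None):
    """Return the subset of `names` that cannot be found on the path."""
    return [name for name in names if find_on_path(name, path) is None]


if __name__ == "__main__":
    # Assumed executable names for the tools installed in this article.
    required = ["firefox", "geckodriver", "Xvfb", "tesseract"]
    missing = missing_tools(required)
    if missing:
        print("Missing from PATH: %s" % ", ".join(missing))
    else:
        print("All required tools found")
```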
Test case:
#!/usr/bin/python
# -*- coding:utf-8 -*-
from selenium import webdriver
from pyvirtualdisplay import Display
from selenium.webdriver.firefox.firefox_binary import FirefoxBinary

# Start a virtual X display so Firefox can run without a physical screen
display = Display(visible=0, size=(800, 600))
display.start()

# Point selenium at the Firefox binary installed above
binary = FirefoxBinary('/usr/bin/firefox')
driver = webdriver.Firefox(firefox_binary=binary)
driver.get('https://www.baidu.com')
print(driver.title.encode('utf8'))

# Shut down the browser and the virtual display
driver.quit()
display.stop()
Follow the WeChat official account 西加加先生 to explore Python together.