Blog category - python
Summary: 1. Fix the problem of having to reinstall the verification app on every script run (see https://blog.csdn.net/hszxd479946/article/details/78900982) 2. Install the Appium client 3. Install the Appium Python library
Summary: Note: pip install pycryptodome
Summary: from lxml import etree import requests from urllib import request import time import os from queue import Queue import threading import re from multip
Summary: Ref: https://blog.csdn.net/weixin_43430036/article/details/84871624 # -*- coding: utf-8 -*- from urllib import request import scrapy import json from sel
Summary: chrome.exe --remote-debugging-port=9222 --user-data-dir="C:\selenum\AutomationProfile" Paste this command into the command line to open a browser listening on debugging port 9222, and do not close it (configure the PATH environment variable first, otherwise the chrome.exe command is not found) chr
Summary: from scrapy import signals import random class Test001UseragentMiddleware(object): USER_AGENT=[ "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/535.1
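The middleware excerpt above rotates user agents per request; a self-contained sketch of the same idea follows. The `FakeRequest` class is a stand-in invented here so the example runs without Scrapy — in the real middleware, `process_request` receives a `scrapy.Request`:

```python
import random

USER_AGENTS = [
    "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/535.11",
    "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36",
]

class RandomUserAgentMiddleware:
    """Assign a randomly chosen User-Agent to every outgoing request."""
    def process_request(self, request, spider):
        request.headers["User-Agent"] = random.choice(USER_AGENTS)

# Stand-in for a scrapy Request, just to show the effect:
class FakeRequest:
    def __init__(self):
        self.headers = {}

req = FakeRequest()
RandomUserAgentMiddleware().process_request(req, spider=None)
print(req.headers["User-Agent"])
```

In a real project the class is registered under `DOWNLOADER_MIDDLEWARES` in settings.py so Scrapy calls `process_request` for every request automatically.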
Summary: Commands: create a project with scrapy startproject [project name] You can start your first spider with: cd jxnsh scrapy genspider example example.com To generate the spider file, first change into the project directory, then normally run scrapy
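The command list above is truncated; the usual Scrapy workflow it describes (project name `jxnsh` taken from the excerpt) is roughly:

```shell
# create a new project, then cd into it
scrapy startproject jxnsh
cd jxnsh
# generate a spider skeleton ("example" is the spider name, example.com the allowed domain)
scrapy genspider example example.com
# run the spider by name
scrapy crawl example
```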
Summary: First version # -*- coding: utf-8 -*- import scrapy import requests from lxml import etree from selenium import webdriver from scrapy.http.response.html import H
Summary: # -*- coding: utf-8 -*- import scrapy import requests from lxml import etree from selenium import webdriver from scrapy.http.response.html
Summary: from lxml import etree import requests from urllib import request import time import os from queue import Queue import threading import re class Procu
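The truncated class name `Procu`... in the excerpt suggests a producer/consumer crawler built on `Queue` and `threading`; a minimal self-contained sketch of that pattern (with a dummy workload standing in for the real download-and-parse step) is:

```python
import threading
from queue import Queue

page_queue = Queue()
results = []
lock = threading.Lock()

def producer():
    # in the real spider this would push page URLs to crawl
    for page in range(1, 6):
        page_queue.put("https://example.com/page/%d" % page)

def consumer():
    while True:
        url = page_queue.get()
        if url is None:          # sentinel: no more work
            page_queue.task_done()
            break
        with lock:
            results.append(url)  # real code would download and parse here
        page_queue.task_done()

threads = [threading.Thread(target=consumer) for _ in range(2)]
for t in threads:
    t.start()
producer()
for _ in threads:
    page_queue.put(None)         # one sentinel per consumer thread
for t in threads:
    t.join()
print(len(results))              # 5
```

The blocking `Queue` handles the thread-safe hand-off; the lock only protects the shared `results` list.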
Summary: from selenium import webdriver from selenium.webdriver.common.action_chains import ActionChains from selenium.webdriver.common.by import By from selen
Summary: from lxml import etree import requests from urllib import request import time import os number = 0 def get_page(): for x in range(1,20): url = "https:
Summary: import re text = "apple is $20.09,orange is $100.99" #ret = re.findall(".*\$\d+\.*\d*", text) #finds every match and returns them as a list #ret = re.sub("\$","㊙", text,1) #replaces
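Completing the truncated regex snippet above, a runnable version showing both the `findall` and the `sub` calls (the pattern is tightened to match each price individually rather than the greedy `.*` prefix in the excerpt):

```python
import re

text = "apple is $20.09,orange is $100.99"

# match each price: $ followed by digits and an optional decimal part
prices = re.findall(r"\$\d+\.?\d*", text)
print(prices)       # ['$20.09', '$100.99']

# replace only the first $ (count=1), as in the excerpt's re.sub("\$","㊙", text, 1)
replaced = re.sub(r"\$", "㊙", text, count=1)
print(replaced)     # apple is ㊙20.09,orange is $100.99
```

Note the `$` must be escaped as `\$` because it is an anchor metacharacter in regular expressions.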
Summary: from bs4 import BeautifulSoup text = """ <ul id="navList" class="w1"> <li><a id="blog_nav_sitehome" class="menu" href="https://www.cnblogs.com/">博客园</
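A runnable sketch of the BeautifulSoup lookup the excerpt starts, using the same `navList` markup; `html.parser` is the stdlib parser, used here as a stand-in for whatever parser the full post chooses:

```python
from bs4 import BeautifulSoup

text = """
<ul id="navList" class="w1">
  <li><a id="blog_nav_sitehome" class="menu" href="https://www.cnblogs.com/">博客园</a></li>
</ul>
"""

soup = BeautifulSoup(text, "html.parser")
# look up the anchor by its id attribute
link = soup.find("a", id="blog_nav_sitehome")
print(link["href"])     # https://www.cnblogs.com/
print(link.get_text())  # 博客园
```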
Summary: from lxml import etree import requests #Fetching a page normally requires a request, and a request carries request headers; simply mimicking the headers is enough to reach the page content baseurl0 = "https://www.ygdy8.net" headers = { "User-Agent": "Moz
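The excerpt's point, that mimicking the request headers is usually enough, can be shown without hitting the network by preparing a request instead of sending it (`baseurl0` is kept from the excerpt; nothing is actually fetched here):

```python
import requests

baseurl0 = "https://www.ygdy8.net"
headers = {
    # mimic a normal browser User-Agent so the site serves the page
    "User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36",
}

# build and prepare the request without sending it
req = requests.Request("GET", baseurl0, headers=headers).prepare()
print(req.headers["User-Agent"])

# to actually fetch the page:
#   resp = requests.get(baseurl0, headers=headers)
#   resp.text
```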
Summary: from lxml import etree text = "<div><p>nmsl</p><span>nmsl</span></div>" def htmlstree(text): html = etree.HTML(text) result = etree.tostring
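The numbered snippet above parses an HTML fragment with lxml; a cleaned-up runnable version of the same function, with the `tostring` call completed under the assumption that the post serializes back to text:

```python
from lxml import etree

text = "<div><p>nmsl</p><span>nmsl</span></div>"

def htmlstree(text):
    # etree.HTML wraps the fragment in <html><body> and repairs broken markup
    html = etree.HTML(text)
    result = etree.tostring(html, encoding="unicode", pretty_print=True)
    return result

print(htmlstree(text))
```

`encoding="unicode"` makes `tostring` return a `str` instead of bytes, which is the convenient form for printing.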
Summary: 1. A few basic methods of the urllib library from urllib import request,parse request.urlretrieve("http://www.baidu.com","index.html") #quickly saves the page source to a local file req=request.Request("http:/
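The urllib basics in the last excerpt can be shown offline: `urlretrieve` does need the network, but building a `Request` and encoding parameters with `parse` do not (the baidu URL is kept from the excerpt; nothing is sent):

```python
from urllib import request, parse

# encode query parameters for a GET url
params = parse.urlencode({"wd": "python"})
url = "http://www.baidu.com/s?" + params
print(url)                          # http://www.baidu.com/s?wd=python

# build a Request object with a custom User-Agent; nothing is sent yet
req = request.Request(url, headers={"User-Agent": "Mozilla/5.0"})
print(req.get_header("User-agent")) # Mozilla/5.0

# sending it would be: request.urlopen(req).read()
# and request.urlretrieve(url, "index.html") saves the page to a local file
```

Note that urllib stores header names capitalized, so the lookup key is `"User-agent"`.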