wstong - 博客园

2023年3月28日

摘要： 1. 安装PyPDF2 pip3 install PyPDF2 2. 分割 from PyPDF2 import PdfReader, PdfWriter file = input() pdf_reader = PdfReader(file) for i in range(len(pdf_reade 阅读全文

posted @ 2023-03-28 23:47 wstong 阅读(165) 评论(0) 推荐(0) 编辑

2023年3月21日

python - PaddleOCR

摘要： 1. 安装 pip3 install paddleocr -i https://pypi.tuna.tsinghua.edu.cn/simple pip3 install paddlepaddle -i https://mirror.baidu.com/pypi/simple 2. 使用 from 阅读全文

posted @ 2023-03-21 19:54 wstong 阅读(134) 评论(0) 推荐(0) 编辑

2023年3月20日

python - tesseract-ocr

摘要： 1. 安装tesseract-ocr 下载链接：https://digi.bib.uni-mannheim.de/tesseract/ 安装后添加环境变量测试安装情况 2. 安装pytesseract pip3 install pytesseract -i https://pypi.tuna.ts 阅读全文

posted @ 2023-03-20 21:37 wstong 阅读(69) 评论(0) 推荐(0) 编辑

2023年3月16日

python - 获取法定节假日日历

摘要：先找一个网站，然后使用requests获取返回值并使用beautifulsoup解析，最后使用pandas导出excel文件脚本如下 import pandas as pd import requests import json from tqdm import trange from bs4 i 阅读全文

posted @ 2023-03-16 20:58 wstong 阅读(246) 评论(0) 推荐(0) 编辑

2023年3月15日

jquery.exportWord.js实现word的导出

摘要： 1. 脚本如下需要引入jquery.min.js，FileSaver.min.js和jquery.wordexport.js（注意顺序） <!DOCTYPE html> <html> <head> <script src="https://cdn.bootcdn.net/ajax/libs/jqu 阅读全文

posted @ 2023-03-15 23:36 wstong 阅读(779) 评论(2) 推荐(1) 编辑

2023年3月13日

python - ddddocr验证码识别

摘要： 1. ddddocr安装建议使用国内镜像安装 pip3 install ddddocr -i https://pypi.tuna.tsinghua.edu.cn/simple 2. 图片验证码 import ddddocr ocr = ddddocr.DdddOcr(show_ad=False) 阅读全文

posted @ 2023-03-13 21:12 wstong 阅读(2194) 评论(0) 推荐(1) 编辑

python - 多线程下载m3u8

摘要： import requests import m3u8 import os from multiprocessing.dummy import Pool from tqdm import tqdm from retry import retry from urllib.parse import ur 阅读全文

posted @ 2023-03-13 19:58 wstong 阅读(330) 评论(0) 推荐(0) 编辑

2023年3月12日

python - 操作sqlite

摘要： 1. 连接数据库和创建游标 import sqlite3 conn = sqlite3.connect("test.db") cur = conn.cursor() 2. 建表 sql = "CREATE TABLE test_table(id INTEGER PRIMARY KEY,name TE 阅读全文

posted @ 2023-03-12 11:49 wstong 阅读(51) 评论(0) 推荐(0) 编辑

2023年3月11日

python - jpg转pdf

摘要： 1. 需要先安装两个模块 pip3 install fitz pip3 install PyMuPDF 2. 脚本如下 import fitz import os from functools import cmp_to_key # 过滤掉当前目录除jpg以外的文件 def file_filter( 阅读全文

posted @ 2023-03-11 20:03 wstong 阅读(260) 评论(0) 推荐(0) 编辑

xlsx.full.min.js实现xlsx的导入与导出

摘要： 1. json转xlsx <html lang="zh"> <head> <script src="https://cdn.bootcdn.net/ajax/libs/jquery/3.6.3/jquery.min.js"></script> <script src="https://cdn.boo 阅读全文

posted @ 2023-03-11 11:53 wstong 阅读(2081) 评论(0) 推荐(0) 编辑

wstong2052

公告