Python编程快速上手-组织文件
-
shutil模块
复制文件和文件夹
shutil.copy() 复制一个文件,shutil.copytree()将复制整个文件夹
>>> import shutil, os >>> os.chdir('C:\\PyProjects') >>> shutil.copy('C:\\PyProjects\\this.txt','C:\\PyProjects\\this_copy.txt') 'C:\\PyProjects\\this_copy.txt' >>> shutil.copytree('bacon','bacon_copy') 'bacon_copy'
文件和文件夹的移动与改名
shutil.move(source, destination)
>>> import shutil >>> dir <built-in function dir> >>> import shutil >>> shutil.move('C:\\PyProjects\\bacon.txt','C:\\PyProjects\\bacon') 'C:\\PyProjects\\bacon\\bacon.txt'
改名
>>> shutil.move('C:\\PyProjects\\bacon\\bacon.txt','C:\\PyProjects\\bacon\\new_bacon.txt') 'C:\\PyProjects\\bacon\\new_bacon.txt'
永久删除文件和文件夹
利用os模块中的函数,可以删除一个文件或一个空文件夹,但利用shutil模块,可以删除一个文件夹及其所有的内容。
- 用os.unlink(path)将删除path处的文件
- os.rmdir(path) 将删除path处的文件夹,该文件夹必须为空,没有任何文件和文件夹
- shutil.rmtree(path)将删除path处的文件夹,它包含所有文件和文件夹都会被删除
import os for filename in os.listdir('C:\\PyProjects'): if filename.endswith('.txt'): print(filename) #os.unlink(filename)
C:\PyProjects>C:/Python37/python3.exe c:/PyProjects/demo.py bacon.txt this.txt
用send2trash模式安全地删除
安装send2trash
C:\Users\ElonTian>pip3 install send2trash Collecting send2trash Downloading Send2Trash-1.8.0-py3-none-any.whl (18 kB) Installing collected packages: send2trash Successfully installed send2trash-1.8.0 WARNING: You are using pip version 21.1.2; however, version 22.0.4 is available. You should consider upgrading via the 'c:\python37\python3.exe -m pip install --upgrade pip' command.
利用send2trash,比Python常规的删除函数要安全得多,因为它将文件夹和文件发送到计算机的垃圾箱或回收站,而不是永久删除。
>>> import send2trash >>> os.chdir('C:\\PyProjects') >>> baconFile = open('bacon.txt', 'a') >>> baconFile.write('Bacon is not a vegetable.') 25 >>> baconFile.close() >>> send2trash.send2trash('bacon.txt') >>>
send2trash.send2trash()函数删除文件和文件夹,虽然它将文件发送到垃圾箱,稍后能够恢复它们,但这不像永久删除文件,不会释放磁盘空间。这时,os和shutil可以删除文件和文件夹,释放磁盘空间。注意,send2trash()函数只能将文件送到垃圾箱,不能从中恢复文件。
-
遍历目录树
os.walk() 函数被传入一个字符串值,即一个文件夹的路径。在一个for循环语句中使用os.walk()函数,遍历目录树。os.walk()在循环的每次迭代中,返回3个值:
- 1、当前文件夹名称的字符串
- 2、当前文件夹中子文件夹的列表
- 3、当前文件夹中文件的字符串的列表
import os for folderName, subfolders, filenames in os.walk('C:\\PyProjects'): print('The current folders is ' + folderName) for subfolder in subfolders: print('SUBFOLDER OF ' + folderName + ': ' + subfolder) for filename in filenames: print('FILE INSIDE ' + folderName + ': ' + filename) print('')
-
用zipfile模块压缩文件
利用zipfile模块中的函数,Python程序可以创建和打开(解压)ZIP文件。假定,example.zip的文件,可以从http://nostarch.com/automatestuff/下载文件。
读取ZIP文件
要读取ZIP文件的内容,首先必须创建一个ZipFile对象 。调用zipfile.ZipFile()函数,向它传入一个字符串,表示.zip文件的文件名,请注意,zipfile是Python模块的名称,ZipFile() 是函数的名称。
>>> import zipfile, os >>> os.chdir('C:\\PyProjects') >>> exampleZip = zipfile.ZipFile('example.zip') >>> exampleZip.namelist() ['spam.txt', 'cats/', 'cats/catnames.txt', 'cats/zophie.jpg'] >>> spamInfo = exampleZip.getinfo('spam.txt') >>> spamInfo.file_size 13908 >>> spamInfo.compress_size 3828 >>> 'Compressed file is %sx smaller!' % (round(spamInfo.file_size / spamInfo.compress_size, 2)) 'Compressed file is 3.63x smaller!' >>> exampleZip.close()
ZipFile对象的namelist()方法,返回ZIP文件中包含的所有文件和文件夹的字符串的列表。ZipInfo对象有自己的属性,诸如表示字节数的file_size和compress_size。分别表示原来文件大小和压缩后文件大小。ZipFile 对象表示整个归档文件,而ZipInfo 对象则保存该归档文件中每个文件的有用信息。
从ZIP文件中解压缩
extractall() 方法从ZIP文件解压缩所有文件和文件夹,放到当前工作目录
>>> import zipfile, os >>> os.chdir('C:\\PyProjects') >>> exampleZip = zipfile.ZipFile('example.zip') >>> exampleZip.extractall() >>> exampleZip.close()
ZipFile对象的extract()方法从ZIP文件中解压缩单个文件
>>> exampleZip = zipfile.ZipFile('example.zip') >>> exampleZip.extract('spam.txt') 'C:\\PyProjects\\spam.txt' >>> exampleZip.extract('spam.txt', 'C:\\PyProjects\\cats') 'C:\\PyProjects\\cats\\spam.txt' >>> exampleZip.close()
传递给extract()的字符串,必须匹配namelist()返回的字符串列表中的一个。或者可以向extract() 传递第二个参数,将文件解压缩到指定的文件夹,而不是当前工作目录。如果第二个参数指定的文件夹不存在,Python就会创建它,extract() 的返回值是被压缩后文件的绝对路径。
创建和添加到ZIP文件
zipfile.ZIP_DEFLATED,指定deflate压缩算法,对各种类型的数据很有效。
>>> import zipfile >>> newZip = zipfile.ZipFile('new.zip', 'w') >>> newZip.write('spam.txt', compress_type=zipfile.ZIP_DEFLATED) >>> newZip.close()
-
将一个文件夹备份到一个zip文件
完整代码
#!/usr/bin/env python # backupToZip.py - Copies an entire folder and its contents into a ZIP file whose filename increments. import zipfile, os def backupToZip(folder): # Backup the entire contents of "folder" into a ZIP file. folder = os.path.abspath(folder) # make sure folder is absolute # Figure out the filename this code shoulde use based on # what files already exist. number = 1 while True: zipFilename = os.path.basename(folder) + '_' + str(number) + '.zip' if not os.path.exists(zipFilename): break number = number + 1 # TODO: Create the ZIP file. print('Creating %s...' % (zipFilename)) backupZip = zipfile.ZipFile(zipFilename, 'w') # TODO: Walk the entire folder tree and compress the files in each folder. for foldername, subfolders, filenames in os.walk(folder): print('Adding files in %s...' % (foldername)) # Add the current folder to the ZIP file. backupZip.write(foldername) # Add all the files in this folder to the ZIP file. for filename in filenames: if filename.startswith(os.path.basename(folder) + '_') and filename.endswith('.zip'): continue # don't backup the backup ZIP file backupZip.write(os.path.join(foldername, filename)) backupZip.close() print('Done.') backupToZip('C:\\PyProjects')
运行结果:
C:\PyProjects>C:/Python37/python3.exe c:/PyProjects/demo.py Creating PyProjects_1.zip... Adding files in C:\PyProjects... Adding files in C:\PyProjects\.vscode... Adding files in C:\PyProjects\zabbix... Adding files in C:\PyProjects\__pycache__... Done.
第1步:弄清楚ZIP文件的名称
定义backupToZip()函数,它只接收一个参数,即folder。这个参数是一个字符串路径,指向需要备份的文件夹。该函数将决定它创建的ZIP文件使用什么文件名,然后创建该文件,遍历folder文件夹,将每个子文件夹和文件添加到ZIP文件中。
os.path.exists() 检查备份文件是否存在,第一个不存在的文件名将导致循环break,因此它会发现新ZIP文件的文件名
import zipfile, os def backupToZip(folder): # Backup the entire contents of "folder" into a ZIP file. folder = os.path.abspath(folder) # make sure folder is absolute # Figure out the filename this code shoulde use based on # what files already exist. number = 1 while True: zipFilename = os.path.basename(folder) + '_' + str(number) + '.zip' if not os.path.exists(zipFilename): break number = number + 1 # TODO: Create the ZIP file. # TODO: Walk the entire folder tree and compress the files in each folder. print('Done.') backupToZip('C:\\PyProjects')
第2步:创建新ZIP文件
zipfile.ZipFile()创建ZIP文件
#!/usr/bin/env python # backupToZip.py - Copies an entire folder and its contents into a ZIP file --snip-- while True: zipFilename = os.path.basename(folder) + '_' + str(number) + '.zip' if not os.path.exists(zipFilename): break number = number + 1 # TODO: Create the ZIP file. print('Creating %s...' % (zipFilename)) backupZip = zipfile.ZipFile(zipFilename, 'w') # TODO: Walk the entire folder tree and compress the files in each folder. print('Done.') backupToZip('C:\\PyProjects')
第3步:遍历目录树并添加到ZIP文件
os.walk()函数,列出文件夹以及文件夹中的每个文件。
#!/usr/bin/env python # backupToZip.py - Copies an entire folder and its contents into a ZIP file whose filename increments. --snip-- # Walk the entire folder tree and compress the files in each folder. for foldername, subfolders, filenames in os.walk(folder): # 每次迭代,它将返回这次迭代当前的文件夹名称、这个文件夹的子文件夹,已经这个文件夹中的文件名 print('Adding files in %s...' % (foldername)) # Add the current folder to the ZIP file. backupZip.write(foldername) #在for循环中,该文件夹被添加到ZIP文件 # Add all the files in this folder to the ZIP file. for filename in filenames: #遍历filenames列表中的每个文件,每个文件都被添加到ZIP文件中,以前生成的备份ZIP除外 if filename.startswith(os.path.basename(folder) + '_') and filename.endswith('.zip'): continue # don't backup the backup ZIP file backupZip.write(os.path.join(foldername, filename)) backupZip.close() print('Done.') backupToZip('C:\\PyProjects')