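Python 3.x Basics 3: File I/O
Two ways to open a file
1. Call open() directly and assign the result to a variable. This returns a file handle, but the file is not closed automatically, so fd.close() has to be called explicitly.
- file = open('filename', 'mode', encoding='encoding')
fd = open('../config/file1.txt','r',encoding='utf-8')
2. Use a with statement. The file is closed automatically when the block exits, and several files can be opened in one statement. This is the recommended form.
with open('../config/file1.txt','r',encoding='utf-8') as fd1, \
     open('../config/file2.txt','r',encoding='utf-8') as fd2:
    print("I had open two files")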
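The difference between the two styles can be checked through the handle's closed attribute. The following is a minimal sketch; the scratch file demo.txt in a temporary directory is an assumption made so the example does not depend on ../config/file1.txt.

```python
import os
import tempfile

# Hypothetical scratch file so the sketch is self-contained.
with tempfile.TemporaryDirectory() as tmp_dir:
    path = os.path.join(tmp_dir, "demo.txt")

    # Style 1: plain open(); the caller has to close the handle explicitly.
    fd = open(path, "w", encoding="utf-8")
    try:
        fd.write("hello")
    finally:
        fd.close()
    closed_after_manual_close = fd.closed

    # Style 2: with statement; the handle is closed when the block exits.
    with open(path, "r", encoding="utf-8") as fd1:
        content = fd1.read()
        closed_inside_with = fd1.closed
    closed_after_with = fd1.closed

print(content, closed_inside_with, closed_after_with)
```

The try/finally in style 1 is what the with statement does implicitly, which is why the second form is usually preferred.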
The 8 mode characters for opening a file
========= ===============================================================
Character Meaning
--------- ---------------------------------------------------------------
'r'       open for reading (default)
'w'       open for writing, truncating the file first
'x'       create a new file and open it for writing
'a'       open for writing, appending to the end of the file if it exists
'b'       binary mode
't'       text mode (default)
'+'       open a disk file for updating (reading and writing)
'U'       universal newline mode (deprecated)
========= ===============================================================
1. 'r': the default mode, so the argument can be omitted. The file must already exist and is opened read-only; writing raises an error.
>>> fd = open('../config/file1.txt','r',encoding='utf-8')
>>> fd.write('java c rubby')
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
io.UnsupportedOperation: not writable
2. 'w': truncates the existing file first, then writes. The handle is not readable. The file is created if it does not exist.
>>> fd = open('../config/file1.txt','w',encoding='utf-8')
>>> print(fd.read())
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
io.UnsupportedOperation: not readable
>>> fd.write('java rubby go')
13
>>> fd.close()
>>> fd = open('../config/file1.txt','r',encoding='utf-8')
>>> fd.read()
'java rubby go'
3. 'x': creates a new file and opens it for writing. If the file already exists, an error is raised.
>>> fd = open('../config/file21.txt','x',encoding='utf-8' )
>>> fd.read()
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
io.UnsupportedOperation: not readable
>>> fd.write('123456')
6
>>> fd.close()
>>> fd = open('../config/file21.txt','r',encoding='utf-8')
>>> fd.read()
'123456'
>>> fd.close()
>>> fd = open('../config/file21.txt','x',encoding='utf-8')
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
FileExistsError: [Errno 17] File exists: '../config/file21.txt'
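Because 'x' fails when the file exists, it can serve as a "create only once" guard. The sketch below assumes a hypothetical helper named create_once and a scratch directory; neither comes from the original notes.

```python
import os
import tempfile


def create_once(path, text):
    """Write text to path only if the file does not exist yet.

    Returns True if the file was created, False if it was already there.
    (Hypothetical helper, for illustration only.)
    """
    try:
        with open(path, "x", encoding="utf-8") as fd:
            fd.write(text)
    except FileExistsError:
        return False
    return True


with tempfile.TemporaryDirectory() as tmp_dir:
    target = os.path.join(tmp_dir, "file21.txt")
    first_attempt = create_once(target, "123456")         # creates the file
    second_attempt = create_once(target, "overwritten?")  # refused, file exists
    with open(target, "r", encoding="utf-8") as fd:
        final_content = fd.read()

print(first_attempt, second_attempt, final_content)
```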
4. 'a': appends written content to the end of the file
>>> fd = open('../config/file1.txt','a',encoding='utf-8')
>>> fd.write('linux windows aix')
17
>>> fd.close()
>>> fd = open('../config/file1.txt','r',encoding='utf-8')
>>> fd.read()
'java rubby golinux windows aix'
5. 'b': binary mode, for non-text data such as mp3 or wav media files. It must be combined with exactly one of the read/write modes.
>>> fd = open('/tmp/Front_Right.wav','b')
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
ValueError: Must have exactly one of create/read/write/append mode and at most one plus
>>> fd = open('/tmp/Front_Right.wav','rb')
>>> fd1 = open('/tmp/Front_Right.wav','wb')    # caution: 'wb' truncates the file
>>> fd2 = open('/tmp/Front_Right.wav','ab')
>>> fd2 = open('/tmp/Front_Right.wav','w+b')   # caution: 'w+b' truncates as well
>>> fd2 = open('/tmp/Front_Right.wav','r+b')
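In binary mode the handle works with bytes objects rather than str. A minimal sketch follows, assuming a scratch file sample.bin instead of the real wav file so that nothing valuable gets truncated.

```python
import os
import tempfile

with tempfile.TemporaryDirectory() as tmp_dir:
    path = os.path.join(tmp_dir, "sample.bin")  # hypothetical scratch file

    # 'wb' creates (or truncates) the file and accepts bytes, not str.
    with open(path, "wb") as fd:
        written = fd.write(b"\x00\x01\x02ABC")

    # 'rb' returns bytes; binary mode takes no encoding argument.
    with open(path, "rb") as fd:
        data = fd.read()

    # Text has to be encoded explicitly before it is written in binary mode.
    with open(path, "ab") as fd:
        fd.write("杭州".encode("utf-8"))

    size_on_disk = os.path.getsize(path)

print(written, data, size_on_disk)
```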
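6. 't': text mode. It is the default, so a plain open() reads text ('rt').
7. '+': opens a disk file for updating (reading and writing)
- r+: opens an existing file for reading and writing without truncating it
>>> fd = open('../config/file1.txt','r+',encoding='utf-8')
>>> fd.read()
'java rubby golinux windows aix'
>>> fd.write("mage")    # read() left the pointer at the end, so this appends
4
>>> fd.seek(0)
0
>>> fd.read()
'java rubby golinux windows aixmage'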
- w+: opens the file for reading and writing; an existing file is truncated, a missing file is created
>>> fd = open('../config/file4.txt','w+',encoding='utf-8')
>>> fd.write('guangzhou')
9
>>> fd.seek(0)
0
>>> fd.read()
'guangzhou'
>>> fd.seek(0)
0
>>> fd.write('hangzhou')    # overwrites the first 8 characters only
8
>>> fd.seek(0)
0
>>> fd.read()
'hangzhouu'
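- a+: opens the file for reading and appending; if it exists the pointer starts at the end, otherwise a new file is created
>>> fd = open('../config/file4.txt','a+',encoding='utf-8')
>>> fd.read()
''
>>> fd.seek(0)
0
>>> fd.read()
'hangzhouu'
>>> fd.close()
>>> fd = open('../config/file4.txt','a+',encoding='utf-8')
>>> fd.write('beijing')
7
>>> fd.read()
''
>>> fd.seek(0)
0
>>> fd.read()
'hangzhouubeijing'
- rb+, wb+, ab+: the same as above, except that the data read and written is bytes
8. 'U': universal newline mode. Deprecated, and no longer accepted as of Python 3.11.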
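The three update modes differ mainly in what happens to existing content and where the first write lands. The sketch below compares them side by side; reopen_and_write is a hypothetical helper and the file lives in a temporary directory.

```python
import os
import tempfile


def reopen_and_write(mode, initial, extra):
    """Create a file holding `initial`, reopen it with `mode`, write `extra`,
    and return the resulting file content. (Hypothetical helper.)"""
    with tempfile.TemporaryDirectory() as tmp_dir:
        path = os.path.join(tmp_dir, "demo.txt")
        with open(path, "w", encoding="utf-8") as fd:
            fd.write(initial)
        with open(path, mode, encoding="utf-8") as fd:
            fd.write(extra)
        with open(path, "r", encoding="utf-8") as fd:
            return fd.read()


results = {
    mode: reopen_and_write(mode, "guangzhou", "XY")
    for mode in ("r+", "w+", "a+")
}

# r+ overwrites from the start, w+ truncates first, a+ appends at the end.
for mode, content in results.items():
    print(mode, repr(content))
```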
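Pointer position
1. f.tell() reports the current position of the file pointer. In binary mode this is a byte offset; in text mode it is an opaque number meant to be passed back to seek(), which for this ASCII-only file happens to match the character count.
2. f.seek() moves the file pointer; f.seek(0) returns to the start of the file
>>> fd = open('../config/file4.txt','r',encoding='utf-8')
>>> fd.tell()
0
>>> fd.seek(0)
0
>>> fd.tell()
0
>>> fd.read(1)
'h'
>>> fd.tell()
1
>>> fd.read(2)
'an'
>>> fd.tell()
3
>>> fd.seek(0)
0
>>> fd.readline()
'hangzhouubeijing\n'
>>> fd.tell()
17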
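seek() also takes a second argument that selects the reference point. Relative seeks with a non-zero offset are only reliable in binary mode, so this sketch opens a scratch file (an assumption, not the file4.txt used above) with 'rb'.

```python
import io
import os
import tempfile

with tempfile.TemporaryDirectory() as tmp_dir:
    path = os.path.join(tmp_dir, "demo.bin")  # hypothetical scratch file
    with open(path, "wb") as fd:
        fd.write(b"hangzhou")

    with open(path, "rb") as fd:
        start = fd.tell()              # position right after opening
        fd.seek(4)                     # absolute offset from the start
        middle = fd.read(2)
        fd.seek(-3, io.SEEK_END)       # 3 bytes before the end of the file
        tail = fd.read()
        end = fd.tell()                # equals the file size once all is read
        fd.seek(-2, io.SEEK_CUR)       # step back 2 bytes from the current position
        last_two = fd.read()

print(start, middle, tail, end, last_two)
```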
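Four read-related methods. Reading starts at the current pointer position (the start of the file right after opening) and leaves the pointer just past the data that was read.
1. fd.read(size)
- size is optional; it is an integer giving the number of characters to read
- Without size, the whole remaining content is read into memory and the pointer is left at the end of the file
- Avoid calling it without size on large files, since everything is held in memory
- Returns a str
>>> fd = open('../config/file4.txt','r',encoding='utf-8')
>>> fd.read()
'hangzhouubeijing'
>>> fd.seek(0)
0
>>> fd.read(1)
'h'
>>> fd.read(2)
'an'
>>> fd.read(3)
'gzh'
>>> fd.seek(0)
0
>>> fd.read(6)
'hangzh'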
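For large files, read(size) can be called in a loop so that only one chunk is in memory at a time. A minimal sketch, assuming a hypothetical helper count_characters and a generated test file:

```python
import os
import tempfile


def count_characters(path, chunk_size=4096):
    """Count the characters of a text file without loading it all at once.
    (Hypothetical helper; chunk_size is an arbitrary choice.)"""
    total = 0
    with open(path, "r", encoding="utf-8") as fd:
        while True:
            chunk = fd.read(chunk_size)
            if not chunk:          # an empty string means end of file
                break
            total += len(chunk)
    return total


with tempfile.TemporaryDirectory() as tmp_dir:
    path = os.path.join(tmp_dir, "big.txt")
    with open(path, "w", encoding="utf-8") as fd:
        fd.write("hangzhou\n" * 1000)
    total_chars = count_characters(path, chunk_size=128)

print(total_chars)
```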
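2. fd.readline(size)
- Reads one line per call by default; size limits the number of characters, as for read()
- Uses little memory
- Each returned line keeps its trailing newline (the last line of the file may not have one); an empty string means end of file
>>> fd = open('../config/file4.txt','r',encoding='utf-8')
>>> fd.readline()
'hangzhouubeijing\n'
>>> fd.readline()
'shenzhen\n'
>>> fd.readline()
'shanghai\n'
>>> fd.readline(1)
'a'
>>> fd.readline(2)
'nh'
>>> fd.readline()
'ui\n'
>>> fd.readline()
'guangdong\n'
>>> fd.readline()
'zhejiang'
>>> fd.readline()
''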
3. fd.readlines(size)
- Reads the remaining lines into a list. The argument is a size hint, not an index: once the total length of the lines read so far exceeds it, no further lines are read.
>>> fd = open('../config/file4.txt','r',encoding='utf-8')
>>> fd.readlines()
['hangzhouubeijing\n', 'shenzhen\n', 'shanghai\n', 'anhui\n', 'guangdong\n', 'zhejiang']
>>> fd.seek(0)
0
>>> fd.readlines(1)
['hangzhouubeijing\n']
>>> fd.readlines()
['shenzhen\n', 'shanghai\n', 'anhui\n', 'guangdong\n', 'zhejiang']
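The effect of the hint is easier to see with lines of known length. In this sketch (scratch file and contents are assumptions) every full line is 9 characters long, so a hint of 1 stops after one line and a hint of 10 stops after two.

```python
import os
import tempfile

with tempfile.TemporaryDirectory() as tmp_dir:
    path = os.path.join(tmp_dir, "cities.txt")  # hypothetical scratch file
    with open(path, "w", encoding="utf-8") as fd:
        fd.write("hangzhou\nshenzhen\nshanghai\nanhui\n")

    with open(path, "r", encoding="utf-8") as fd:
        first = fd.readlines(1)      # the first line alone already exceeds the hint
        next_two = fd.readlines(10)  # 9 characters do not exceed 10, 18 do
        rest = fd.readlines()        # no hint: everything that is left

print(first, next_two, rest)
```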
4. fd.readable()
- Returns a boolean telling whether the file can be read
>>> fd = open('../config/file4.txt','r',encoding='utf-8')
>>> fd.readable()
True
Iterating over the file object in a loop (any line-by-line traversal can be done this way). Each line keeps its trailing newline, so print() shows a blank line after it.
>>> fd = open('../config/file4.txt','r',encoding='utf-8')
>>> for line in fd:
...     print(line)
...
hangzhouubeijing

shenzhen

shanghai

anhui

guangdong

zhejiang
>>> fd.seek(0)
0
>>> for index,line in enumerate(fd.readlines()):
...     print(index,line)
...
0 hangzhouubeijing

1 shenzhen

2 shanghai

3 anhui

4 guangdong

5 zhejiang
>>>
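Iterating the file object directly avoids building the full list that readlines() returns, and stripping the newline removes the blank lines from the output. A minimal sketch on a hypothetical scratch file:

```python
import os
import tempfile

with tempfile.TemporaryDirectory() as tmp_dir:
    path = os.path.join(tmp_dir, "cities.txt")  # hypothetical scratch file
    with open(path, "w", encoding="utf-8") as fd:
        fd.write("hangzhou\nshenzhen\nzhejiang")  # last line has no newline

    lines_seen = []
    with open(path, "r", encoding="utf-8") as fd:
        # enumerate() works on the file object itself; start=1 gives line numbers.
        for number, line in enumerate(fd, start=1):
            lines_seen.append((number, line.rstrip("\n")))

for number, text in lines_seen:
    print(number, text)
```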
Other methods
close(self, /): closes the open file
- | Flush and close the IO object.
- |
- | This method has no effect if the file is already closed.
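detach(self, /): separates the text layer from its underlying binary buffer and returns that buffer, for example so it can be re-wrapped with a different encoding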
- | Separate the underlying buffer from the TextIOBase and return it.
- | After the underlying buffer has been detached, the TextIO is in an
- | unusable state.
>>> fd.detach()
<_io.BufferedReader name='../config/file4.txt'>
>>> fd.read()
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
ValueError: underlying buffer has been detached
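One plausible use of detach() is to re-wrap the same buffer with another encoding. The sketch below is an illustration under assumptions (scratch file, deliberately wrong initial encoding), not the only way to do this.

```python
import io
import os
import tempfile

with tempfile.TemporaryDirectory() as tmp_dir:
    path = os.path.join(tmp_dir, "demo.txt")  # hypothetical scratch file
    with open(path, "w", encoding="utf-8") as fd:
        fd.write("杭州")

    # Opened with the wrong encoding on purpose.
    text_fd = open(path, "r", encoding="latin-1")
    raw_buffer = text_fd.detach()  # text_fd is unusable from here on

    # Wrap the same binary buffer again, this time with the right encoding.
    rewrapped = io.TextIOWrapper(raw_buffer, encoding="utf-8")
    try:
        content = rewrapped.read()
    finally:
        rewrapped.close()          # also closes the underlying buffer

    try:
        text_fd.read()
        detached_error = False
    except ValueError:
        detached_error = True

print(content, detached_error)
```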
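fileno(self, /): returns the file descriptor, the integer handle that OS-level functions such as os.fstat() and os.fsync() accept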
- | Returns underlying file descriptor if one exists.
- | OSError is raised if the IO object does not use a file descriptor.
>>> fd = open('../config/file4.txt','r',encoding='utf-8')
>>> fd.fileno()
4
>>> fd = open('../config/filexxx.txt','w+',encoding='utf-8')
>>> fd.fileno()
3
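The descriptor is mostly useful for passing to functions in the os module. A small sketch, assuming a scratch file, that asks the operating system for the file size through the descriptor:

```python
import os
import tempfile

with tempfile.TemporaryDirectory() as tmp_dir:
    path = os.path.join(tmp_dir, "demo.txt")  # hypothetical scratch file
    with open(path, "w+", encoding="utf-8") as fd:
        fd.write("guangzhou")
        fd.flush()                       # make sure the bytes reached the OS
        descriptor = fd.fileno()         # small integer chosen by the OS
        size_via_descriptor = os.fstat(descriptor).st_size

print(descriptor, size_via_descriptor)
```

The actual descriptor number depends on what else the process has open, which is why the transcript above shows 4 and 3.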
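flush(self, /): pushes buffered writes out immediately instead of waiting for the buffer to fill. This does not make writing faster; it makes the data visible sooner, as in the progress bar below.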
- | Flush write buffers, if applicable.
- | This is not implemented for read-only and non-blocking streams.
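import time
import sys

# Print a 40-character progress bar, one '#' every 0.1 seconds.
for i in range(40):
    sys.stdout.write("#")
    sys.stdout.flush()    # without this the characters may appear all at once
    time.sleep(0.1)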
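The same buffering applies to files on disk. In the sketch below (scratch file assumed) the file size is checked before and after flush(); os.fsync() is added as the extra step that asks the operating system to commit its own cache to the device.

```python
import os
import tempfile

with tempfile.TemporaryDirectory() as tmp_dir:
    path = os.path.join(tmp_dir, "log.txt")  # hypothetical scratch file
    fd = open(path, "w", encoding="utf-8")
    try:
        fd.write("abc")
        # The text normally still sits in Python's buffer at this point.
        size_before_flush = os.path.getsize(path)

        fd.flush()                 # hand the buffered data to the operating system
        size_after_flush = os.path.getsize(path)

        os.fsync(fd.fileno())      # ask the OS to write its cache to the device
    finally:
        fd.close()

print(size_before_flush, size_after_flush)
```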
isatty(self, /): whether the stream is connected to a terminal device
- | Return whether this is an 'interactive' stream.
- | Return False if it can't be determined.
seekable(self, /)
- | Return whether object supports random access.
- | If False, seek(), tell() and truncate() will raise OSError.
- | This method may need to do a test seek().
truncate(self, pos=None, /)
- | Truncate file to size bytes.
- | File pointer is left unchanged. Size defaults to the current IO
- | position as reported by tell(). Returns the new size.
>>> fd = open('../config/file4.txt','r+',encoding='utf-8')
>>> fd.truncate()    # the pointer is at 0 right after opening, so this empties the file
0
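Passing an explicit size keeps the first part of the file instead of emptying it. A minimal sketch on a hypothetical scratch file; note that the size is counted in bytes, which equals characters only for ASCII content.

```python
import os
import tempfile

with tempfile.TemporaryDirectory() as tmp_dir:
    path = os.path.join(tmp_dir, "demo.txt")  # hypothetical scratch file
    with open(path, "w", encoding="utf-8") as fd:
        fd.write("hangzhoubeijing")

    with open(path, "r+", encoding="utf-8") as fd:
        new_size = fd.truncate(8)   # keep only the first 8 bytes
        fd.seek(0)
        remaining = fd.read()

print(new_size, remaining)
```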
writable(self, /): whether the file was opened in a mode that allows writing
- | Return whether object was opened for writing.
>>> fd = open('../config/file4.txt','r+',encoding='utf-8')
>>> fd.writable()
True
>>> fd = open('../config/file1.txt','r',encoding='utf-8')
>>> fd.writable()
False
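readable() and writable() together summarize what each mode allows. The sketch below collects both flags for the six text modes on a hypothetical scratch file.

```python
import os
import tempfile

capabilities = {}
with tempfile.TemporaryDirectory() as tmp_dir:
    path = os.path.join(tmp_dir, "demo.txt")  # hypothetical scratch file
    with open(path, "w", encoding="utf-8") as fd:
        fd.write("hangzhou")                  # make sure 'r' and 'r+' can open it

    for mode in ("r", "w", "a", "r+", "w+", "a+"):
        with open(path, mode, encoding="utf-8") as fd:
            capabilities[mode] = (fd.readable(), fd.writable())

for mode, (can_read, can_write) in capabilities.items():
    print(mode, can_read, can_write)
```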
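Two ways to modify a file:
1. Read the whole file into memory, make the changes, then overwrite the source file
2. Read the file line by line, write each modified line to a new file, then replace the old file with the new one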
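Both approaches can be sketched as a simple string replacement. The function names, the ".new" suffix, and the sample content are assumptions made for illustration; os.replace() is used here for the final swap.

```python
import os
import tempfile


def replace_in_memory(path, old, new, encoding="utf-8"):
    """Approach 1: read everything, modify it, overwrite the source file."""
    with open(path, "r", encoding=encoding) as fd:
        content = fd.read()
    with open(path, "w", encoding=encoding) as fd:
        fd.write(content.replace(old, new))


def replace_line_by_line(path, old, new, encoding="utf-8"):
    """Approach 2: stream lines into a new file, then swap it in."""
    new_path = path + ".new"  # hypothetical name for the temporary copy
    with open(path, "r", encoding=encoding) as src, \
         open(new_path, "w", encoding=encoding) as dst:
        for line in src:
            dst.write(line.replace(old, new))
    os.replace(new_path, path)  # the new file takes the old file's name


with tempfile.TemporaryDirectory() as tmp_dir:
    path = os.path.join(tmp_dir, "cities.txt")
    with open(path, "w", encoding="utf-8") as fd:
        fd.write("hangzhou\nshenzhen\nhangzhou\n")

    replace_in_memory(path, "hangzhou", "beijing")
    with open(path, "r", encoding="utf-8") as fd:
        after_first = fd.read()

    replace_line_by_line(path, "shenzhen", "shanghai")
    with open(path, "r", encoding="utf-8") as fd:
        after_second = fd.read()

print(after_first, after_second, sep="")
```

The first approach is shorter but holds the whole file in memory; the second keeps memory use flat and leaves the original untouched until the swap.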
Exercise 1: implement sed-style replacement
Exercise 2: modify an haproxy configuration file