Day 2 - Python Basics 2: Lists, Dictionaries, Sets

1. List and tuple operations

Creating a list

names = ['tom','jack','sky','snow','may']

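1. Slicing

# The start index is included and the end index is excluded; indexes count from left to right

names = ['tom','jack','sky','snow','may']
print(names)
# Indexing and slicing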
print(names[0])
print(names[2])
print(names[-1])

print(names[0:2])
print(names[:2])

print(names[-3:-1])  # the end index is excluded, so the last element is not returned
print(names[-3:])

print(names[0::2])  # step of 2: take every other element


Output:
['tom', 'jack', 'sky', 'snow', 'may']
tom
sky
may
['tom', 'jack']
['tom', 'jack']
['sky', 'snow']
['sky', 'snow', 'may']
['tom', 'sky', 'may']

2. Appending

names = ['tom','jack','sky','snow','may']
print(names)

names.append('yoyo')
print(names)

3. Inserting

names = ['tom','jack','sky','snow','may']
print(names)

names.insert(2,'ayda')
print(names)

4. Modifying

names = ['tom','jack','sky','snow','may']
print(names)

names[1] = 'yoyo'
print(names)

5. Deleting

names = ['tom','jack','sky','snow','may']
print(names)
#first method
names.remove('tom')
print(names)
#second method
del names[1]
print(names)
#third method
names.pop()
print(names)
names.pop(0)
print(names)

Output:
['tom', 'jack', 'sky', 'snow', 'may']
['jack', 'sky', 'snow', 'may']
['jack', 'snow', 'may']
['jack', 'snow']
['snow']

6. Extending

names = ['tom','jack','sky','snow','may']
print(names)
names2 = [1,2,3]
names.extend(names2)
print(names,names2)

Output:
['tom', 'jack', 'sky', 'snow', 'may']
['tom', 'jack', 'sky', 'snow', 'may', 1, 2, 3] [1, 2, 3]

7. Counting

names = ['tom','jack','sky','snow','may','may']
print(names)
print(names.count('may'))


Output:
['tom', 'jack', 'sky', 'snow', 'may', 'may']
2

8. Sorting and reversing

names = ['tom','jack','sky','snow','may','may']
print(names)

names.sort()
print(names)
names.reverse()
print(names)

Output:
['tom', 'jack', 'sky', 'snow', 'may', 'may']
['jack', 'may', 'may', 'sky', 'snow', 'tom']
['tom', 'snow', 'sky', 'may', 'may', 'jack']

9. Finding an index

names = ['tom','jack','sky','snow','may','may']
print(names)

print(names.index('may'))


Output:
['tom', 'jack', 'sky', 'snow', 'may', 'may']
4   # index() returns the position of the first match

10. Copying

names = ['tom','jack','sky','snow','may','may']
print(names)

names_c = names.copy()
print(names_c)

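# There are two kinds of copy: shallow and deep
import copy
person = ['name', ['saving', 100]]
# Shallow copy: the nested list is shared, so changing the saving amount through the copy also changes the original; replacing the name only changes the copy
p1 = copy.copy(person)

# Deep copy: nested objects are copied as well, so the original is never affected
p2 = copy.deepcopy(person)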
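
The difference between the two kinds of copy is easiest to see by mutating the original afterwards. The following is a minimal sketch; the record contents ('tom', 'saving', 100) are sample values chosen for illustration.

```python
import copy

# Sample record: a name plus a nested [label, amount] list.
person = ['tom', ['saving', 100]]

shallow = copy.copy(person)      # new outer list, but the inner list is shared
deep = copy.deepcopy(person)     # new outer list and a new inner list

person[0] = 'jack'               # rebinding a top-level slot: no copy is affected
person[1][1] = 50                # mutating the inner list: visible through the shallow copy

print(person)    # ['jack', ['saving', 50]]
print(shallow)   # ['tom', ['saving', 50]]
print(deep)      # ['tom', ['saving', 100]]
```

Note that names.copy() on a list is also a shallow copy, so it behaves like copy.copy here.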

  

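Tuples

A tuple is much like a list: it also stores a sequence of values. The difference is that it cannot be modified once it has been created, which is why it is also called a read-only list.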
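
A short sketch of what "read-only" means in practice. The variable name ids and its values are made up for this example.

```python
ids = ('a001', 'a002', 'a001')   # sample data for illustration

print(ids.count('a001'))   # 2
print(ids.index('a002'))   # 1

try:
    ids[0] = 'b001'        # tuples do not support item assignment
    modified = True
except TypeError as err:
    modified = False
    print('cannot modify:', err)
```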

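Exercise

Program: shopping cart

Requirements:

  1. After the program starts, ask the user for their salary, then print the product list
  2. Let the user buy products by product number
  3. After the user picks a product, check whether the balance is sufficient; if it is, deduct the price, otherwise show a warning
  4. The user can quit at any time; on exit, print the purchased products and the remaining balance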
commodity = [['iphone',5800],['mac pro',12000],['book',50],['shoes',599],['water',3]]
print(['%s:%s'%(i,j) for i,j in commodity])
salary = input('please input your salary:')
good_cart = []
if salary.isdigit():
    salary = int(salary)

    while True :
        # list every product with its index
        for index,(name,price) in enumerate(commodity):
            print(index,name,price)
        goods_order = input('please input what you want to buy (q to quit):')
        if goods_order.isdigit():
            goods_order = int(goods_order)
            print('please wait.. ')
            if 0 <= goods_order < len(commodity):
                if salary >= commodity[goods_order][1]:
                    good_cart.append(commodity[goods_order])
                    salary-=commodity[goods_order][1]
                    print('you have bought:\n', '\n'.join(['%s:%s'%(i,j) for i,j in good_cart]),'\nyour money :%s'%salary)
                else:
                    print('\033[41;1m your current money is not enough!  your money :%s\033[0m'%salary)
            else:
                print('error key...')
        elif goods_order == 'q':
            # requirement 4: show the purchases and the balance before leaving
            print('you have bought:', good_cart, 'your money :%s'%salary)
            print('exit now...')
            exit()
        else :
            print('error key...')

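 Notes:

  • enumerate(commodity): yields (index, item) tuples
  • goods_order.isdigit(): checks whether the string consists only of digits
  • \033[41;1m colored text \033[0m: ANSI escape codes; the text between them is shown in bold on a red background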
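
The three notes above can be tried in isolation. This is a small sketch rather than part of the exercise; the helper name highlight is hypothetical and the product list is shortened.

```python
commodity = [['iphone', 5800], ['book', 50]]

# enumerate yields (index, item) pairs; unpacking avoids i[0], i[1][0] style indexing.
rows = ['%d %s %d' % (idx, name, price)
        for idx, (name, price) in enumerate(commodity)]
print('\n'.join(rows))

# isdigit() is True only when every character is a digit,
# so negative numbers, decimals and the empty string are rejected.
checks = {text: text.isdigit() for text in ['3', '-3', '3.5', '']}
print(checks)

def highlight(text):
    """Wrap text in the ANSI codes from the note: bold (1) on a red background (41)."""
    return '\033[41;1m' + text + '\033[0m'

print(highlight('not enough money'))
```

Because isdigit() rejects '-3', the cart program never has to handle a negative product number coming from user input.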

 

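2. String operations

Property: strings are immutable

name.capitalize()  capitalizes the first letter
name.casefold()   converts all uppercase letters to lowercase
name.center(50,"-")  centers name in a field of width 50, padding both sides with "-"
name.count('arnol')  counts how many times 'arnol' occurs
name.encode()  encodes the string into bytes
name.endswith("Li")  checks whether the string ends with "Li"
 "Alex\tLi".expandtabs(10)  returns 'Alex      Li'; each \t is expanded with spaces up to the next multiple of 10 columns
 name.find('A')  searches for 'A'; returns its index if found, -1 otherwise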

format :
    >>> msg = "my name is {}, and age is {}"
    >>> msg.format("alex",22)
    'my name is alex, and age is 22'
    >>> msg = "my name is {1}, and age is {0}"
    >>> msg.format("alex",22)
    'my name is 22, and age is alex'
    >>> msg = "my name is {name}, and age is {age}"
    >>> msg.format(age=22,name="ale")
    'my name is ale, and age is 22'
format_map
    >>> msg.format_map({'name':'alex','age':22})
    'my name is alex, and age is 22'


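msg.index('a')  returns the index of the first 'a' in the string (raises ValueError if it is absent)
'9aA'.isalnum()   True

'9'.isdigit()  whether every character is a digit
name.isnumeric()  whether every character is numeric
name.isprintable()  whether every character is printable
name.isspace()  whether the string contains only whitespace
name.istitle()  whether the string is title-cased
name.isupper()  whether all cased characters are uppercase
 "|".join(['alex','jack','rain'])
'alex|jack|rain'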


maketrans
    >>> intab = "aeiou"  #This is the string having actual characters. 
    >>> outtab = "12345" #This is the string having corresponding mapping character
    >>> trantab = str.maketrans(intab, outtab)
    >>> 
    >>> text = "this is string example....wow!!!"  # named text so the built-in str is not shadowed
    >>> text.translate(trantab)
    'th3s 3s str3ng 2x1mpl2....w4w!!!'

 msg.partition('is')   returns ('my name ', 'is', ' {name}, and age is {age}') 

 >>> "alex li, chinese name is lijie".replace("li","LI",1)
     'alex LI, chinese name is lijie'

 msg.swapcase()  swaps upper and lower case


 >>> msg.zfill(40)
'00000my name is {name}, and age is {age}'



>>> n4 = "Hello 2orld"
>>> n4.ljust(40,"-")
'Hello 2orld-----------------------------'
>>> n4.rjust(40,"-")
'-----------------------------Hello 2orld'


>>> b="ddefdsdff_哈哈" 
>>> b.isidentifier() # checks whether the string is a valid identifier, i.e. follows the variable naming rules
True

  

 

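3. Dictionary operations

A dictionary is a key-value data type. It works like the dictionaries we used at school: you look up a word by its strokes or letters to find the page with the details.

1. Add, delete, update, look up

'''Dictionary operations'''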

info = {'stu001':'jack',
        'stu002':'tom',
        'stu003':'mack'
        }

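#Add, update
'''added if the key does not exist'''
info['stu004'] = 'meimei Han'
print(info)
'''updated if the key exists'''
info['stu001'] = 'jack chen'
print(info)


#Delete
info.pop('stu002')
del info['stu004']
print(info)
'''popitem removes the most recently inserted pair (an arbitrary pair before Python 3.7)'''
info.popitem()

#Look up
'''Both forms return the value, but get returns None for a missing key instead of raising KeyError'''
print(info.get('stu001'))
print(info['stu001'])
'''The standard way to test whether a key exists'''
print('stu001' in info)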
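
To make the difference between the lookup styles concrete, here is a minimal sketch with a key that is not in the dictionary. The key 'stu009' and the fallback 'unknown' are made-up values.

```python
info = {'stu001': 'jack', 'stu003': 'mack'}

print(info.get('stu009'))             # None: the key is missing, no exception
print(info.get('stu009', 'unknown'))  # a fallback value can be supplied

try:
    info['stu009']                    # subscript access raises on a missing key
    raised = False
except KeyError:
    raised = True

removed = info.pop('stu009', None)    # pop also accepts a default for missing keys
print(raised, removed)
```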

 

2. Multi-level nesting

av_catalog = {
    "欧美":{
        "www.youporn.com": ["很多免费的,世界最大的","质量一般"],
        "www.pornhub.com": ["很多免费的,也很大","质量比yourporn高点"],
        "letmedothistoyou.com": ["多是自拍,高质量图片很多","资源不多,更新慢"],
        "x-art.com":["质量很高,真的很高","全部收费,屌比请绕过"]
    },
    "日韩":{
        "tokyo-hot":["质量怎样不清楚,个人已经不喜欢日韩范了","听说是收费的"]
    },
    "大陆":{
        "1024":["全部免费,真好,好人一生平安","服务器在国外,慢"]
    }
}

av_catalog["大陆"]["1024"][1] += ",可以用爬虫爬下来"
print(av_catalog["大陆"]["1024"])
# output
['全部免费,真好,好人一生平安', '服务器在国外,慢,可以用爬虫爬下来']

  

3. Other methods

#values
>>> info.values()
dict_values(['LongZe Luola', 'XiaoZe Maliya'])

#keys
>>> info.keys()
dict_keys(['stu1102', 'stu1103'])


#setdefault
>>> info.setdefault("stu1106","Alex")
'Alex'
>>> info
{'stu1102': 'LongZe Luola', 'stu1103': 'XiaoZe Maliya', 'stu1106': 'Alex'}
>>> info.setdefault("stu1102","龙泽萝拉")
'LongZe Luola'
>>> info
{'stu1102': 'LongZe Luola', 'stu1103': 'XiaoZe Maliya', 'stu1106': 'Alex'}


#update 
>>> info
{'stu1102': 'LongZe Luola', 'stu1103': 'XiaoZe Maliya', 'stu1106': 'Alex'}
>>> b = {1:2,3:4, "stu1102":"龙泽萝拉"}
>>> info.update(b)
>>> info
{'stu1102': '龙泽萝拉', 1: 2, 3: 4, 'stu1103': 'XiaoZe Maliya', 'stu1106': 'Alex'}

#items
info.items()
dict_items([('stu1102', '龙泽萝拉'), (1, 2), (3, 4), ('stu1103', 'XiaoZe Maliya'), ('stu1106', 'Alex')])


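# Build a dict from a list of keys with one default value. Pitfall: the same default object is shared by every key, so avoid mutable defaults such as a list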
>>> dict.fromkeys([1,2,3],'testd')
{1: 'testd', 2: 'testd', 3: 'testd'}
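
The pitfall with dict.fromkeys only shows up when the default value is mutable. A short sketch of the problem and of one common workaround (a dict comprehension, which is a substitute technique and not something the notes above prescribe):

```python
shared = dict.fromkeys([1, 2, 3], [])
shared[1].append('x')          # mutates the single list shared by every key
print(shared)                  # {1: ['x'], 2: ['x'], 3: ['x']}

independent = {key: [] for key in [1, 2, 3]}   # a fresh list per key
independent[1].append('x')
print(independent)             # {1: ['x'], 2: [], 3: []}
```

With an immutable default such as the string 'testd' the sharing is harmless, because assigning to one key rebinds only that key.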

  

4. Looping over a dictionary

 

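#Method 1
for key in info:
    print(key,info[key])

#Method 2
for k,v in info.items(): # Python 2 builds a full list of pairs first, so avoid it for large dicts there; Python 3 returns a lightweight view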
    print(k,v)
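
In Python 3, items() does not copy the data. A minimal sketch that shows the view following later changes to the dictionary (the sample entries reuse the keys from the examples above):

```python
info = {'stu1102': 'LongZe Luola', 'stu1103': 'XiaoZe Maliya'}

view = info.items()            # a view object, not a list
info['stu1106'] = 'Alex'       # added after the view was created

pairs = list(view)             # the view reflects the new entry
print(type(view).__name__, len(pairs))
```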

  

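Exercise

Program: three-level menu

Requirements:

  1. Print a three-level menu: province, city, county
  2. Allow going back to the previous level
  3. Allow quitting the program at any time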
data = {
    '北京':{
        "昌平":{
            "沙河":["oldboy","test"],
            "天通苑":["链家地产","我爱我家"]
        },
        "朝阳":{
            "望京":["奔驰","陌陌"],
            "国贸":{"CICC","HP"},
            "东直门":{"Advent","飞信"},
        },
        "海淀":{},
    },
    '山东':{
        "德州":{},
        "青岛":{},
        "济南":{}
    },
    '广东':{
        "东莞":{},
        "常熟":{},
        "佛山":{},
    },
}
exit_flag = False
while not exit_flag:
    for i in data:
        print(i)
    choice1 = input('please input your 1 choice:')
    if choice1 in data:
        while not exit_flag:
            for i2 in data[choice1]:
                print(i2)
            choice2 = input('please input your 2 choice:')
            if choice2 in data[choice1]:
                while not exit_flag :
                    for i3 in data[choice1][choice2]:
                        print(i3)
                    choice3 = input('please input your 3 choice:')
                    if choice3 in data[choice1][choice2]:
                        for i4 in data[choice1][choice2][choice3]:
                            print(i4)
                        choice4 = input('please input your 4 choice:')
                        if choice4 == 'b':
                            pass
                        elif choice4 == 'q':
                            exit_flag = True
                    elif choice3 == 'b':
                        break
                    elif choice3 == 'q':
                        exit_flag = True
            elif choice2 == 'b':
                break
            elif choice2 == 'q':
                exit_flag = True
    elif choice1 == 'b':
        pass
    elif choice1 == 'q':
        exit_flag = True

  

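Homework: improved shopping cart
Requirements:
  User entry:
  1. Product information is stored in a file
  2. Purchased products and the balance are recorded
  Merchant entry:
  1. Can add products and change product prices
# Read the product information (same file name that is written back on exit)
f = open('goods_sql.txt', 'r')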
goods = []
for i in f.readlines():
    goods.append(i.split('\n')[0].split(','))
f.close()

user_type = input('0.merchant\t\t1.user\nplease choose your type:')
if user_type == '0' :
    # merchant entry

    while True :
        order = input('0.add product\t\t1.change product price\nplease input your choice:' )
        if order == '0':
            # add a product
            print('add product'.center(50,'-'))
            while True :
                add_goods = input('please input your goods (name,price):')
                if add_goods == 'q':
                    break
                else:
                    add_goods = add_goods.strip().split(',')
                    print('>>>add %s to the shop...'%add_goods)
                    goods.append(add_goods)

        elif order == '1':
            # change a price
            print('product list'.center(50,'-'))
            for index,item in enumerate(goods) :
                print(index, item)
            while True :
                update_order = input('please input your choice(index,price):')
                if update_order == 'q':
                    break
                else:
                    update_order = update_order.strip().split(',')
                    print('>>>now update %s price to %s'%(goods[int(update_order[0])],update_order[1]))
                    goods[int(update_order[0])][1] = update_order[1]
        elif order == 'q':
            # save the changes and quit
            print(goods)
            f = open('goods_sql.txt','w+')
            for item in goods:
                print(','.join(item))
                f.write(','.join(item))
                f.write('\n')
            f.close()
            break
        else:
            print('error type')
elif user_type == '1':
    # user entry
    shopping_cart = []
    user_f = open('user.txt','a+')   # a+ creates the file if it does not exist yet
    user_f.seek(0)                   # a+ starts at the end, so rewind before reading
    user_data = user_f.readlines()
    user_f.close()
    if user_data :
        # load the saved balance and purchases
        salary = int(user_data[0].strip())
        user_data.pop(0)
        for i in user_data :
            shopping_cart.append(i.strip().split(','))
        print('you have bought %s, your balance: %s'%(shopping_cart,salary))
    else:
        # first run: ask for the salary
        salary = input('please input your salary:')
        salary = int(salary)

    print('product list'.center(50,'-'))
    for index,item in enumerate(goods):
        print(index,item)
    while True :
        shopping_order = input('please what you want to buy:')
        if shopping_order.isdigit():
            shopping_order = int(shopping_order)
            if shopping_order < len(goods):   # valid indexes stop at len(goods) - 1
                if int(goods[shopping_order][1]) <= salary :
                    shopping_cart.append(goods[shopping_order])
                    salary -= int(goods[shopping_order][1])
                    print('>>>add %s to shopping cart,your balance :%s' % (goods[shopping_order], salary))
                else:
                    print('your balance is not enough, balance: %s' % salary)
            else:
                print('error goods code!')
        elif shopping_order == 'q':
            # save the user's record
            user_f = open('user.txt','w')
            user_f.write(str(salary))
            user_f.write('\n')
            for i in shopping_cart:
                user_f.write(','.join(i))
                user_f.write('\n')
            user_f.close()
            break
        else:
            print('shopping order type error!')


else :
    print('error type')

  

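4. Set operations

A set is an unordered collection of unique items. Its main uses are:

  • Deduplication: turning a list into a set removes duplicates automatically
  • Relationship tests: intersection, difference, union and so on between two groups of data

Common operations

s = set([3,5,9,10])      # create a set of numbers
  
t = set("Hello")         # create a set of unique characters


a = t | s          # union of t and s
  
b = t & s          # intersection of t and s
  
c = t - s          # difference (items in t but not in s)
  
d = t ^ s          # symmetric difference (items in t or s, but not in both)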
  
   
  
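Basic operations:
  
t.add('x')            # add one item
  
s.update([10,37,42])  # add several items to s
  
   
  
remove() deletes one item (it raises KeyError if the item is missing):
  
t.remove('H')
  
  
len(s)
number of items in the set
  
x in s
tests whether x is a member of s
  
x not in s
tests whether x is not a member of s
  
s.issubset(t)
s <= t
tests whether every element of s is in t
  
s.issuperset(t)
s >= t
tests whether every element of t is in s
  
s.union(t)
s | t
returns a new set with every element of s and t
  
s.intersection(t)
s & t
returns a new set with the elements common to s and t
  
s.difference(t)
s - t
returns a new set with the elements that are in s but not in t
  
s.symmetric_difference(t)
s ^ t
returns a new set with the elements that are in exactly one of s and t
  
s.copy()
returns a shallow copy of s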
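
The two main uses of sets, deduplication and relationship tests, fit in a few lines. This is a small sketch with made-up sample data; sorted() is used only to get a stable display order, since sets are unordered.

```python
names = ['tom', 'jack', 'tom', 'may', 'jack']
unique = set(names)                 # duplicates collapse automatically
print(sorted(unique))               # ['jack', 'may', 'tom']

s = {3, 5, 9, 10}
t = {5, 10, 11}

union = s | t                       # in either set
common = s & t                      # in both sets
only_s = s - t                      # in s but not in t
exactly_one = s ^ t                 # in exactly one of the two
print(sorted(union), sorted(common), sorted(only_s), sorted(exactly_one))

# remove() raises KeyError for a missing item; discard() silently does nothing.
t.discard(99)
try:
    t.remove(99)
    remove_failed = False
except KeyError:
    remove_failed = True
```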

  

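5. File operations

Workflow for working with a file

  1. Open the file and assign the resulting file handle to a variable
  2. Operate on the file through the handle
  3. Close the file

Given the following file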

Somehow, it seems the love I knew was always the most destructive kind
不知为何,我经历的爱情总是最具毁灭性的的那种
Yesterday when I was young
昨日当我年少轻狂
The taste of life was sweet
生命的滋味是甜的
As rain upon my tongue
就如舌尖上的雨露
I teased at life as if it were a foolish game
我戏弄生命 视其为愚蠢的游戏
The way the evening breeze
就如夜晚的微风
May tease the candle flame
逗弄蜡烛的火苗
The thousand dreams I dreamed
我曾千万次梦见
The splendid things I planned
那些我计划的绚丽蓝图
I always built to last on weak and shifting sand
但我总是将之建筑在易逝的流沙上
I lived by night and shunned the naked light of day
我夜夜笙歌 逃避白昼赤裸的阳光
And only now I see how the time ran away
事到如今我才看清岁月是如何匆匆流逝
Yesterday when I was young
昨日当我年少轻狂
So many lovely songs were waiting to be sung
有那么多甜美的曲儿等我歌唱
So many wild pleasures lay in store for me
有那么多肆意的快乐等我享受
And so much pain my eyes refused to see
还有那么多痛苦 我的双眼却视而不见
I ran so fast that time and youth at last ran out
我飞快地奔走 最终时光与青春消逝殆尽
I never stopped to think what life was all about
我从未停下脚步去思考生命的意义
And every conversation that I can now recall
如今回想起的所有对话
Concerned itself with me and nothing else at all
除了和我相关的 什么都记不得了
The game of love I played with arrogance and pride
我用自负和傲慢玩着爱情的游戏
And every flame I lit too quickly, quickly died
所有我点燃的火焰都熄灭得太快
The friends I made all somehow seemed to slip away
所有我交的朋友似乎都不知不觉地离开了
And only now I'm left alone to end the play, yeah
只剩我一个人在台上来结束这场闹剧
Oh, yesterday when I was young
噢 昨日当我年少轻狂
So many, many songs were waiting to be sung
有那么那么多甜美的曲儿等我歌唱
So many wild pleasures lay in store for me
有那么多肆意的快乐等我享受
And so much pain my eyes refused to see
还有那么多痛苦 我的双眼却视而不见
There are so many songs in me that won't be sung
我有太多歌曲永远不会被唱起
I feel the bitter taste of tears upon my tongue
我尝到了舌尖泪水的苦涩滋味
The time has come for me to pay for yesterday
终于到了付出代价的时间 为了昨日
When I was young
当我年少轻狂

  

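Basic operations

f = open('test.txt','r',encoding='utf-8')  # open the file; encoding sets the codec used to decode it. Without it the platform default is used (gbk on Chinese Windows), which fails on a UTF-8 file
print(f.readline())  # read one line
print('divider'.center(50,'-'))
print(f.read()) # read everything that is left; avoid this on large files
print('divider'.center(50,'-'))
f.seek(0)    # move the pointer back to the start
print(f.readlines())  # read all the content; each line becomes one list element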

  

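File open modes:

  • r, read-only mode (the default).
  • w, write-only mode. [not readable; creates the file if missing; erases the content if it exists]
  • a, append mode. [not readable; creates the file if missing; only appends if it exists]

"+" means the file can be read and written at the same time

  • r+, read and write. [readable; writable; the file must already exist and is not truncated]
  • w+, write and read. [truncates the file first, then allows reading back what was written]
  • a+, append and read. [writes always go to the end of the file]

"U" means that \r, \n and \r\n are all converted to \n when reading (used together with r or r+). This is a Python 2 feature: Python 3 text mode already does this by default, and the "U" flag was removed in Python 3.11.

  • rU
  • r+U

"b" means binary mode (for example when sending an ISO image over FTP). In Python 2 it made no difference on Linux but was required on Windows for binary files; in Python 3 it is needed on every platform to read and write bytes instead of str.

  • rb
  • wb
  • ab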
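
The behaviour of the main modes can be checked on a scratch file. The sketch below assumes a temporary directory created with tempfile; the file name modes_demo.txt is arbitrary.

```python
import os
import tempfile

path = os.path.join(tempfile.mkdtemp(), 'modes_demo.txt')  # scratch file for the demo

with open(path, 'w', encoding='utf-8') as f:    # creates or truncates the file
    f.write('first\n')

with open(path, 'a', encoding='utf-8') as f:    # appends at the end
    append_is_readable = f.readable()           # False: plain 'a' cannot read
    f.write('second\n')

with open(path, 'r+', encoding='utf-8') as f:   # read and write, no truncation
    f.write('FIRST')                            # overwrites the first five characters in place

with open(path, 'r', encoding='utf-8') as f:
    content = f.read()
print(repr(content))                            # 'FIRST\nsecond\n'

with open(path, 'w+', encoding='utf-8') as f:   # truncates first, then allows reading
    after_truncate = f.read()
print(repr(after_truncate))                     # ''
```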

Other methods

    def close(self): # real signature unknown; restored from __doc__
        """
        Close the file.
        
        A closed file cannot be used for further I/O operations.  close() may be
        called more than once without error.
        """
        pass

    def fileno(self, *args, **kwargs): # real signature unknown
        """ Return the underlying file descriptor (an integer). """
        pass

    def isatty(self, *args, **kwargs): # real signature unknown
        """ True if the file is connected to a TTY device. """
        pass

    def read(self, size=-1): # known case of _io.FileIO.read
        """
        Note: this may not read everything back.
        Read at most size bytes, returned as bytes.
        
        Only makes one system call, so less data may be returned than requested.
        In non-blocking mode, returns None if no data is available.
        Return an empty bytes object at EOF.
        """
        return ""

    def readable(self, *args, **kwargs): # real signature unknown
        """ True if file was opened in a read mode. """
        pass

    def readall(self, *args, **kwargs): # real signature unknown
        """
        Read all data from the file, returned as bytes.
        
        In non-blocking mode, returns as much as is immediately available,
        or None if no data is available.  Return an empty bytes object at EOF.
        """
        pass

    def readinto(self): # real signature unknown; restored from __doc__
        """ Same as RawIOBase.readinto(). """
        pass # rarely needed directly: it reads bytes into a pre-allocated buffer

    def seek(self, *args, **kwargs): # real signature unknown
        """
        Move to new file position and return the file position.
        
        Argument offset is a byte count.  Optional argument whence defaults to
        SEEK_SET or 0 (offset from start of file, offset should be >= 0); other values
        are SEEK_CUR or 1 (move relative to current position, positive or negative),
        and SEEK_END or 2 (move relative to end of file, usually negative, although
        many platforms allow seeking beyond the end of a file).
        
        Note that not all file objects are seekable.
        """
        pass

    def seekable(self, *args, **kwargs): # real signature unknown
        """ True if file supports random-access. """
        pass

    def tell(self, *args, **kwargs): # real signature unknown
        """
        Current file position.
        
        Can raise OSError for non seekable files.
        """
        pass

    def truncate(self, *args, **kwargs): # real signature unknown
        """
        Truncate the file to at most size bytes and return the truncated size.
        
        Size defaults to the current file position, as returned by tell().
        The current file position is changed to the value of size.
        """
        pass

    def writable(self, *args, **kwargs): # real signature unknown
        """ True if file was opened in a write mode. """
        pass

    def write(self, *args, **kwargs): # real signature unknown
        """
        Write bytes b to file, return number written.
        
        Only makes one system call, so not all of the data may be written.
        The number of bytes actually written is returned.  In non-blocking mode,
        returns None if the write would block.
        """
        pass

  

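  • seek(): moves the pointer to the given position
  • tell(): reports the current position of the pointer
  • truncate(size): cuts the file down to its first size bytes, counted from the start of the file regardless of the cursor; with no argument it cuts at the current position
  • flush(): forces buffered data to be written out to disk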
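
A minimal sketch of these four methods on a scratch file. Binary mode is used here on purpose so that every position is a plain byte offset; the file name seek_demo.bin is arbitrary.

```python
import os
import tempfile

path = os.path.join(tempfile.mkdtemp(), 'seek_demo.bin')  # scratch file for the demo

with open(path, 'w+b') as f:
    f.write(b'0123456789')
    f.flush()                              # push Python's buffer to the operating system
    size_on_disk = os.path.getsize(path)   # 10 once the buffer has been flushed
    end_pos = f.tell()                     # 10: the pointer sits after the last byte

    f.seek(0)                              # back to the start
    head = f.read(4)                       # b'0123'
    pos_after_read = f.tell()              # 4

    f.truncate(6)                          # keep only the first 6 bytes of the file
    f.seek(0)
    remaining = f.read()                   # b'012345'

print(size_on_disk, end_pos, head, pos_after_read, remaining)
```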

Progress bar (refreshed in real time with flush):

import sys,time
for i in range(20):
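    sys.stdout.write('#')    # stdout is standard output, i.e. what appears on the screen
    sys.stdout.flush()    # without this line there is no progress effect: the output is buffered and printed all at once at the end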
    time.sleep(1)

  

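  • read() and readlines() both load the whole file into memory at once, which is not acceptable for large files. The best approach is to iterate over the file object, which reads one line at a time:

    for line in f :
        print(line)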
    

      

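  • In r+ and w+ mode a write overwrites whatever is at the current position; a disk write can only replace existing content, never insert in the middle. To modify a file, write a modified copy:

    f = open('test.txt','r',encoding='utf-8')
    f_new = open('test1.txt.bak','w',encoding='utf-8')
    for line in f :
        if "都不知不觉地离开了" in line :
            line = line.replace('都不知不觉地离开了','----都不知不觉地死去了----')
        f_new.write(line)
    f.close()
    f_new.close()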
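
The copy-and-modify approach can be wrapped in a helper that also swaps the new file into place afterwards. This is a sketch under assumptions: the function name replace_in_file and the '.bak' suffix are hypothetical, and os.replace is used for the final swap, which the notes above do not do.

```python
import os
import tempfile

def replace_in_file(path, old, new):
    """Rewrite the file line by line through a temporary copy, replacing old with new."""
    tmp_path = path + '.bak'          # hypothetical temporary name
    with open(path, 'r', encoding='utf-8') as src, \
         open(tmp_path, 'w', encoding='utf-8') as dst:
        for line in src:              # one line in memory at a time
            dst.write(line.replace(old, new))
    os.replace(tmp_path, path)        # put the modified copy in place of the original

# Demo on a scratch file.
demo_path = os.path.join(tempfile.mkdtemp(), 'demo.txt')
with open(demo_path, 'w', encoding='utf-8') as f:
    f.write('Yesterday when I was young\nThe taste of life was sweet\n')

replace_in_file(demo_path, 'sweet', 'bitter')

with open(demo_path, encoding='utf-8') as f:
    result = f.read()
print(result)
```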
    

      

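The with statement

To avoid forgetting to close a file after opening it, use a context manager:

with open('log','r') as f:
     
    ...

With this form, the file is closed and its resources are released automatically when the with block finishes.

Since Python 2.7, with can also manage several files at once: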

with open('log1') as obj1, open('log2') as obj2:
    pass
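
A quick way to confirm that with really closes the file, even when the block fails, is to check the closed attribute afterwards. A minimal sketch on a scratch file (the file name and the simulated error are made up):

```python
import os
import tempfile

path = os.path.join(tempfile.mkdtemp(), 'with_demo.txt')  # scratch file for the demo

with open(path, 'w', encoding='utf-8') as f:
    f.write('hello\n')
closed_after_block = f.closed          # True: closed as soon as the block ends

try:
    with open(path, encoding='utf-8') as g:
        raise ValueError('simulated failure inside the block')
except ValueError:
    pass
closed_after_error = g.closed          # True: closed even though the block raised

print(closed_after_block, closed_after_error)
```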

 

Exercises

Program 1: a simple version of the shell sed replace function

#encoding:utf-8
#author:Arnol
import sys

# sys.argv holds the script name followed by the command-line arguments
get_order = sys.argv

print(get_order)
def replace(old_str,new_str):
    # copy home.txt to nome1.txt line by line, replacing old_str with new_str
    with open('home.txt','r',encoding='utf-8') as f,\
         open('nome1.txt','w',encoding='utf-8') as f_new:
        for line in f:
            if old_str in line:
                line = line.replace(old_str,new_str)
            f_new.write(line)

# usage: python script.py OLD NEW
replace(get_order[1],get_order[2])

Program 2: modifying an haproxy configuration file

Requirements:

1. Look up
    Input: www.oldboy.org
    Return all records under that backend

2. Create
    Input:
        arg = {
            'backend': 'www.oldboy.org',
            'record':{
                'server': '100.1.7.9',
                'weight': 20,
                'maxconn': 30
            }
        }

3. Delete
    Input:
        arg = {
            'backend': 'www.oldboy.org',
            'record':{
                'server': '100.1.7.9',
                'weight': 20,
                'maxconn': 30
            }
        }

Original configuration file:

global       
        log 127.0.0.1 local2
        daemon
        maxconn 256
        log 127.0.0.1 local2 info
defaults
        log global
        mode http
        timeout connect 5000ms
        timeout client 50000ms
        timeout server 50000ms
        option  dontlognull

listen stats :8888
        stats enable
        stats uri       /admin
        stats auth      admin:1234

frontend oldboy.org
        bind 0.0.0.0:80
        option httplog
        option httpclose
        option  forwardfor
        log global
        acl www hdr_reg(host) -i www.oldboy.org
        use_backend www.oldboy.org if www

backend www.oldboy.org
        server 100.1.7.9 100.1.7.9 weight 20 maxconn 3000

#encoding:utf-8
#author:Arnol
print('choose an operation'.center(50,'-'),'\n1.look up\n2.add\n3.delete')
order = input('please input what you want:')
if order =='1':
    # look up
    find_str = input('please input you want to find:')
    with open('config.txt','r',encoding='utf-8') as f :
        for line in f:
            if 'backend' in line[0:7]:
                if find_str in line:
                    line = line.rsplit()
                    record = f.readline().lstrip().split()
                    record = {record[0]:[record[1],record[2]],
                              record[3]:record[4],
                              record[5]:record[6]}
                    info = {line[0]:line[1],
                             'record':record}
                    print(info)
            pass
elif order =='2':
    # create
    info = {'backend':None,
            'record':{'server':[],'weight':None,'maxconn':None}}
    info['backend']=input('please input backend:')
    info['record']['server'].append(input('please input server:'))
    info['record']['weight'] = input('please input weight:')
    info['record']['maxconn'] = input('please input maxconn:')
    print(info)
    with open('config.txt','a',encoding='utf-8') as f:
        f.write('backend %s\n'%info['backend'])
        f.write(' '*7)
        f.write('server {server} weight {weight} maxconn {maxconn}'\
                .format(server=' '.join(info['record']['server']),\
                        weight=info['record']['weight'],\
                        maxconn=info['record']['maxconn']))

elif order == '3':   # elif, so that choices 1 and 2 do not fall through to 'error order'
    # delete
    del_str = input('please input want to del backend:')
    with open('config.txt','r',encoding='utf-8') as f,\
         open('config.bak','w',encoding='utf-8') as f_new :
        for line in f:
            f_new.write(line)
    with open('config.txt', 'w', encoding='utf-8') as f, \
         open('config.bak', 'r', encoding='utf-8') as f_new:
        for line in f_new:
            if 'backend' in line[0:7]:
                if del_str in line:
                    f_new.readline()
                    continue
            f.write(line)
else:
    print('error order')

  

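6. Character encoding and conversion

Detailed articles:

http://www.cnblogs.com/yuanchenqi/articles/5956943.html

http://www.diveintopython3.net/strings.html

Need to know:

1. In Python 2 the default encoding is ASCII; in Python 3 source files default to UTF-8 and str is Unicode

2. Unicode has several encodings: UTF-32 (always 4 bytes per character), UTF-16 (2 bytes for most characters, 4 for those outside the Basic Multilingual Plane) and UTF-8 (1 to 4 bytes). UTF-16 is widely used for in-memory strings (for example by Windows and Java), while files are usually stored as UTF-8 because it saves space

3. In Python 3, encode converts the encoding and also turns the str into bytes; decode turns the bytes back into a str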
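
The points above can be checked directly in Python 3. A small sketch; the sample text '你好' and the emoji code point are chosen for illustration.

```python
text = '你好'                          # a str: two Unicode characters

utf8 = text.encode('utf-8')            # str -> bytes
gbk = text.encode('gbk')
print(utf8)                            # b'\xe4\xbd\xa0\xe5\xa5\xbd'
print(len(text), len(utf8), len(gbk))  # 2 6 4

# Converting GBK bytes to UTF-8 bytes goes through str: decode, then encode.
roundtrip = gbk.decode('gbk').encode('utf-8')

# UTF-16 needs 4 bytes for characters outside the Basic Multilingual Plane.
emoji = '\U0001F600'
utf16_sizes = (len('你'.encode('utf-16-le')), len(emoji.encode('utf-16-le')))
utf32_size = len('你'.encode('utf-32-le'))

# Decoding with the wrong codec fails instead of silently guessing.
try:
    utf8.decode('ascii')
    decode_failed = False
except UnicodeDecodeError:
    decode_failed = True
```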

 

 The figure above applies to Python 2 only

In Python 2:

#-*-coding:utf-8-*-
__author__ = 'Alex Li'

import sys
print(sys.getdefaultencoding())


msg = "我爱北京天安门"
msg_gb2312 = msg.decode("utf-8").encode("gb2312")
gb2312_to_gbk = msg_gb2312.decode("gbk").encode("gbk")

print(msg)
print(msg_gb2312)
print(gb2312_to_gbk)

In Python 3:

#-*-coding:gb2312 -*-   # this line can also be removed
__author__ = 'Alex Li'

import sys
print(sys.getdefaultencoding())


msg = "我爱北京天安门"
#msg_gb2312 = msg.decode("utf-8").encode("gb2312")
msg_gb2312 = msg.encode("gb2312") # str is already Unicode, so no decode is needed first
gb2312_to_unicode = msg_gb2312.decode("gb2312")
gb2312_to_utf8 = msg_gb2312.decode("gb2312").encode("utf-8")

print(msg)
print(msg_gb2312)
print(gb2312_to_unicode)
print(gb2312_to_utf8)

 

 

 

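Review

1. Sets

Main uses: 

  1. Deduplication
  2. Relationship tests: intersection, difference, union, symmetric difference

2. Tuples

A read-only list with only two methods, count and index

Use: data that should not be modified can be stored as a tuple, for example a list of ID card numbers

3. Dictionaries

Key-value pairs

Properties:

  1. Unordered (historically; since Python 3.7 a dict keeps insertion order)
  2. Keys are unique
  3. Lookups are fast, much faster than in a list
  4. Uses more memory than a list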

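Why are lookups so fast? Because a dict is a hash type. So what is a hash?

A hash algorithm maps a binary value of any length to a shorter, fixed-length binary value, called the hash value. A hash value is a compact numeric representation of a piece of data. With a cryptographic hash, changing even one letter of a text produces a different hash, and finding two different inputs with the same hash is computationally infeasible, so hash values can be used to verify data integrity. Hashes are generally used for fast lookup and in cryptographic algorithms.

A dict stores its keys in a hash table. When you look up data[key], Python first hashes the key to a number, uses that number to compute a slot in the hash table, and checks whether that slot holds the key. If it does, the slot gives the memory address of the matching value, and the value is read from there.

Why is this faster than a list? Finding an item in a list means comparing against elements one by one, which takes time proportional to the length of the list, while a hash lookup jumps almost directly to the right slot no matter how many keys are stored.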
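
The idea of jumping to a slot computed from the hash can be sketched with a toy table. This is an illustration only, not how CPython implements dict: CPython uses open addressing and resizes the table, while this sketch uses fixed buckets with chaining, and the class name ToyTable is hypothetical.

```python
class ToyTable:
    """A tiny hash table: a fixed number of buckets, each a list of [key, value] pairs."""

    def __init__(self, size=8):
        self.buckets = [[] for _ in range(size)]

    def _bucket(self, key):
        # hash(key) picks the bucket directly; no scan over the other buckets is needed.
        return self.buckets[hash(key) % len(self.buckets)]

    def put(self, key, value):
        bucket = self._bucket(key)
        for pair in bucket:
            if pair[0] == key:       # key already present: update in place
                pair[1] = value
                return
        bucket.append([key, value])

    def get(self, key):
        for stored_key, value in self._bucket(key):
            if stored_key == key:
                return value
        raise KeyError(key)


table = ToyTable()
table.put('stu001', 'jack')
table.put('stu002', 'tom')
table.put('stu001', 'jack chen')     # same key: the value is replaced, not duplicated
print(table.get('stu001'), table.get('stu002'))
```

Only the pairs in one bucket are ever compared, which is why the cost of a lookup stays roughly constant as long as the buckets stay short.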

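4. Character encoding

 

Python 2 first

  1. The default encoding in Python 2 is ASCII
  2. The encoding declaration at the top of a file tells the interpreter which encoding to use when reading the source into memory. In memory the code is stored as bytes, and the same text becomes a different byte sequence under each encoding
  3. With #-*-coding:utf-8-*- declared at the top of the file you can write Chinese. Without it, Python processes the source as ASCII and fails; with it, the source is treated as UTF-8
  4. With #-*-coding:utf-8-*- in place, a variable declared as name=u"大保健" is a unicode string; without the u prefix the string is a UTF-8 byte string
  5. To convert UTF-8 to GBK, first decode the UTF-8 into unicode, then encode it as GBK

Now Python 3

  1. The default source encoding in Python 3 is UTF-8, so you can write Chinese directly with no declaration at the top of the file. Nicely done
  2. Strings you declare are Unicode by default, not UTF-8 (in Python 2 you had to add the u prefix to get that). To get GBK, just call your_str.encode("gbk")
  3. In Python 3, your_str.encode("gbk") also does one more thing: the encoded data becomes bytes. Python 3 draws a clear line between str and bytes. You can think of bytes as the raw binary stream. It does not show up as 010101 because Python displays it in a readable form such as b'\xe4\xbd\xa0', so that you can work with the data instead of staring at raw bits and converting them by hand
  4. Python 2 also has bytes, but there it is just an alias for str (str in Python 2 is bytes; str in Python 3 is unicode). In both 2 and 3, data going from disk to memory follows the same path: binary 010101 --> bytes such as b'\xe4\xbd\xa0\xe5\xa5\xbd' --> text you can read, decoded with the specified encoding

Encoding comes up most often in web scraping. Websites use all sorts of encodings; the trend is toward UTF-8, but things are still mixed, so scraping pages means converting between encodings. Life is getting better, though, and we can look forward to a world with no transcoding.

Finally, encoding is a real pain and nobody likes it.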

 

posted @ 2017-05-24 15:46  痴道三