您的位置:首页 > 编程语言 > Python开发

Python 迭代删除重复项,集合删除重复项

2015-09-29 16:21 495 查看
1. 迭代删除重复项:先排序列表项,然后通过新迭代(not in)去除重复项,分片打印

def sanitize(time_string):
if '-' in time_string:
splitter = '-'
elif ':' in time_string:
splitter = ':'
else:
return (time_string)
(mins, secs)=time_string.split(splitter)
return(mins + '.' + secs)

with open ('james.txt') as jas: data = jas.readline()
james=data.strip().split(',')

print('before sort and senitize, unique for james',james)
james=sorted ([sanitize(t) for t in james])
unique_james=[]
for each_t in james:
if each_t not in unique_james:
unique_james.append(each_t)
print('First 3 time for james',unique_james[0:3])

=========== RESTART: C:\Users\eric\Documents\Python\kelly\kelly.py ===========
before sort and senitize, unique for james ['2-34', '3:21', '2.34', '2.45', '3.01', '2:01', '2:01', '3:10', '2-22']
First 3 time for james ['2.01', '2.22', '2.34']


2. 集合删除重复项:先set创建集合去除重复项,然后进行排序,分片打印

def sanitize(time_string):
if '-' in time_string:
splitter = '-'
elif ':' in time_string:
splitter = ':'
else:
return (time_string)
(mins, secs)=time_string.split(splitter)
return(mins + '.' + secs)

with open ('james.txt') as jas: data = jas.readline()
james=data.strip().split(',')

print('before sort and senitize, unique for james',james)
james=sorted (set([sanitize(t) for t in james]))
print('First 3 time for james',james[0:3])

=========== RESTART: C:\Users\eric\Documents\Python\kelly\kelly.py ===========
before sort and senitize, unique for james ['2-34', '3:21', '2.34', '2.45', '3.01', '2:01', '2:01', '3:10', '2-22']
First 3 time for james ['2.01', '2.22', '2.34']


3.精简代码,创建一个小函数rmspace去除空白符,通过函数调用分片打印

def sanitize(time_string):
if '-' in time_string:
splitter = '-'
elif ':' in time_string:
splitter = ':'
else:
return (time_string)
(mins, secs)=time_string.split(splitter)
return(mins + '.' + secs)

def rmspace(file):
with open(file) as fo: data=fo.readline()
return data.strip().split(',')

james=rmspace('james.txt')
print('before sort and senitize, unique for james',james)
print('First 3 time for james',sorted(set([sanitize(t) for t in james]))[0:3])

=========== RESTART: C:\Users\eric\Documents\Python\kelly\kelly.py ===========
before sort and senitize, unique for james ['2-34', '3:21', '2.34', '2.45', '3.01', '2:01', '2:01', '3:10', '2-22']
First 3 time for james ['2.01', '2.22', '2.34']
内容来自用户分享和网络整理,不保证内容的准确性,如有侵权内容,可联系管理员处理 点击这里给我发消息
标签: