这是我的代码:
[abc
是多维列表
输入列表:abc = [['word1_word1.5','word2'],['word1_word1.5','word2']]
for i in range(len(abc)):
for j in range(len(abc[i])):
if "word1_word1.5" in abc[i][j]:
newword = abc[i][j].split("_")
// Replace the abc[i][j] with "word1" and "word1.5"
预期输出:
abc = [['word1','word1.5','word2'],['word1','word1.5','word2']]
给出此:
>>> abc = [['word1_word1.5', 'word2', 'word3', 'word4_word4.5'], ['word5_word5.5', 'word2', 'word3', 'word1_word1.5']]
分割每个单词:
>>> [[w.split('_') for w in l] for l in abc]
[[['word1', 'word1.5'], ['word2'], ['word3'], ['word4', 'word4.5']], [['word5', 'word5.5'], ['word2'], ['word3'], ['word1', 'word1.5']]]
连接子列表:
>>> from itertools import chain
>>> [list(chain.from_iterable(w.split('_') for w in l)) for l in abc]
[['word1', 'word1.5', 'word2', 'word3', 'word4', 'word4.5'], ['word5', 'word5.5', 'word2', 'word3', 'word1', 'word1.5']]
仅当w
出现在xyz
中时才分割:
>>> xyz = {'word1_word1.5', 'word5_word5.5', 'word4_word4.5'}
>>> [list(chain.from_iterable(w.split('_') if w in xyz else [w] for w in l)) for l in abc]
[['word1', 'word1.5', 'word2', 'word3', 'word4', 'word4.5'], ['word5', 'word5.5', 'word2', 'word3', 'word1', 'word1.5']]
for lst in abc:
for n in range(len(lst)):
if lst[n] in xyz:
wrd = lst.pop(n)
lst += wrd.split("_")
这将xyz
中存在的单词替换为abc
内的拆分版本>
for item in abc:
for ele in item:
if ele in xyz:
split_word = ele.split("_")