将字符串拆分为大小均匀的块

问题描述 投票:0回答:10

我怎么能把像

'aaaaaaaaaaaaaaaaaaaaaaa'
这样的字符串 并将其分成 4 个长度元组,例如 (
aaaa
,
aaaa
,
aaaa
)

python string tuples
10个回答
45
投票

使用

textwrap.wrap

>>> import textwrap
>>> s = 'aaaaaaaaaaaaaaaaaaaaaaa'
>>> textwrap.wrap(s, 4)
['aaaa', 'aaaa', 'aaaa', 'aaaa', 'aaaa', 'aaa']

24
投票

使用列表理解,生成器表达式:

>>> s = 'aaaaaaaaaaaaaaaaaaaaaaa'
>>> [s[i:i+4] for i in range(0, len(s), 4)]
['aaaa', 'aaaa', 'aaaa', 'aaaa', 'aaaa', 'aaa']

>>> tuple(s[i:i+4] for i in range(0, len(s), 4))
('aaaa', 'aaaa', 'aaaa', 'aaaa', 'aaaa', 'aaa')

>>> s = 'a bcdefghi j'
>>> tuple(s[i:i+4] for i in range(0, len(s), 4))
('a bc', 'defg', 'hi j')

5
投票

另一个使用正则表达式的解决方案:

>>> s = 'aaaaaaaaaaaaaaaaaaaaaaa'
>>> import re
>>> re.findall('[a-z]{4}', s)
['aaaa', 'aaaa', 'aaaa', 'aaaa', 'aaaa']
>>>

4
投票

你可以使用石斑鱼食谱

zip(*[iter(s)]*4)

In [113]: s = 'aaaaaaaaaaaaaaaaaaaaaaa'

In [114]: [''.join(item) for item in zip(*[iter(s)]*4)]
Out[114]: ['aaaa', 'aaaa', 'aaaa', 'aaaa', 'aaaa']

请注意,如果字符串包含空格,

textwrap.wrap
可能不会将
s
拆分为长度为 4 的字符串:

In [43]: textwrap.wrap('I am a hat', 4)
Out[43]: ['I am', 'a', 'hat']

石斑鱼食谱比使用更快

textwrap

In [115]: import textwrap

In [116]: %timeit [''.join(item) for item in zip(*[iter(s)]*4)]
100000 loops, best of 3: 2.41 µs per loop

In [117]: %timeit textwrap.wrap(s, 4)
10000 loops, best of 3: 32.5 µs per loop

石斑鱼食谱可以与任何迭代器一起使用,而

textwrap
仅适用于字符串。


1
投票
s = 'abcdefghi'

k - 字符串部分的编号

k = 3

parts - 存储部分字符串的列表

parts = [s[i:i+k] for i in range(0, len(s), k)]

部分 --> ['abc', 'def', 'ghi']


0
投票
s = 'abcdef'

我们需要拆分成 2

[s[pos:pos+2] for pos,i in enumerate(list(s)) if pos%2 == 0]

答案:

['ab', 'cd', 'ef']

0
投票

我觉得这个方法比较简单。但是消息长度必须用 split_size 分割。或者必须在消息中添加字母。例子:message = "lorem ipsum_" 那么添加的字母就可以删除了

message = "lorem ipsum"

array = []

temp = ""

split_size = 3

for i in range(1, len(message) + 1):
    temp += message[i - 1]

    if i % split_size == 0:
        array.append(temp)
        temp = ""

print(array)

输出: ['lor', 'em', 'ips']


0
投票

这是给定问题的另一种可能的解决方案:

def split_by_length(text, width):
    width = max(1, width)
    chunk = ""
    for v in text:
        chunk += v
        if len(chunk) == width:
            yield chunk
            chunk = ""

    if chunk:
        yield chunk

if __name__ == '__main__':
    x = "123456789"
    for i in range(20):
        print(i, list(split_by_length(x, i)))

输出:

0 ['1', '2', '3', '4', '5', '6', '7', '8', '9']
1 ['1', '2', '3', '4', '5', '6', '7', '8', '9']
2 ['12', '34', '56', '78', '9']
3 ['123', '456', '789']
4 ['1234', '5678', '9']
5 ['12345', '6789']
6 ['123456', '789']
7 ['1234567', '89']
8 ['12345678', '9']
9 ['123456789']
10 ['123456789']
11 ['123456789']
12 ['123456789']
13 ['123456789']
14 ['123456789']
15 ['123456789']
16 ['123456789']
17 ['123456789']
18 ['123456789']
19 ['123456789']

0
投票

孩子的方式

def wrap(string, max_width):
    i=0
    strings = []
    s = ""
    for x in string:
        i+=1
        if i == max_width:
            s = s + x
            strings.append(s)
            s = ""
            i = 0
        else:
            s = s + x
    strings.append(s)
    return strings

wrap('ABCDEFGHIJKLIMNOQRSTUVWXYZ',4)
# output: ['ABCD', 'EFGH', 'IJKL', 'IMNO', 'QRST', 'UVWX', 'YZ']

0
投票

Ashwini Chaudhary 是对的。 textwrap 模块很棒。想从维基百科获取信息?安装 wikipedia 模块(pip 或其他),然后导入它和 textwrap 模块。

下面的简单脚本给出了维基百科关于 pyhton 编程语言的页面的摘要:

import wikipedia, textwrap
python_summary = wikipedia.summary("Python (programming language)")
char_limit = 100
# Makes a list of strings
line_list = textwrap.wrap(python_summary, char_limit)
for line in line_list:
    # Words wrap around line breaks,
    # they are not cut off. Awsome!
    print(line)
# You will not be diappointed!
© www.soinside.com 2019 - 2024. All rights reserved.