搜索带有部分名称的文件并添加到字典中

问题描述 投票:0回答:1

大家好我将文件保存在某个目录中。

dir ="/user/raw/samples"

而且我需要将所有文件以某些字符串结尾到字典dic_x和其他字典dic_z

以下是dir中的文件名:

SAP_XCV_1980_H7LYVDRXX_2_ALPH_C_cR_c1_v1_0_FCDSSA_DDDD_S14_L002_R1_001_MM_1.fz.gz
SAP_XCV_1980_H7LYVDRXX_2_ALPH_C_cR_c1_v1_0_FCDSSA_DDDD_S14_L002_R2_001_MM_1.fz.gz
SAP_XCV_1980_H7LYVDRXX_2_ALPH_C_cR_c1_v1_0_FCDSSA_DDDD_S14_L001_R2_001_MM_1.fz.gz
SAP_XCV_1980_H7LYVDRXX_2_ALPH_C_cR_c1_v1_0_FCDSSA_DDDD_S14_L001_R1_001_MM_1.fz.gz

我有两本词典

 import sys
import os
import subprocess
import glob
import fnmatch
import re
    listF = ['ALPH']
    FH_sample = listF
    dic_x = {}
    dic_z = {}

    for line in FH_sample:
        dic_x[line.strip().split('\n')[0]] =[]
        dic_z[line.strip().split('\n')[0]] =[]

        for root, sub, file in os.walk(dir):
            for name in file:
                if name.endswith(".fz.gz"):
                    m1 = re.search('(.*)_R1', file)
                    m2 = re.search('(.*)_R2', file)
                if m1 and m1.group(1) in dic_x:
                    dic_x[m1.group(1)].append(file)
                if m2 and m2.group(1) in dic_z:
                    dic_z[m2.group(1)].append(file)

但是,我无法使用上面的代码行获取字典中的所有值。我需要像下面这样的词典,

dic_x = {'ALPH': ['SAP_XCV_1980_H7LYVDRXX_2_ALPH_C_cR_c1_v1_0_FCDSSA_DDDD_S14_L001_R1_001_MM_1.fz.gz','SAP_XCV_1980_H7LYVDRXX_2_ALPH_C_cR_c1_v1_0_FCDSSA_DDDD_S14_L002_R1_001_MM_1.fz.gz']}

dic_z = {'ALPH': ['SAP_XCV_1980_H7LYVDRXX_2_ALPH_C_cR_c1_v1_0_FCDSSA_DDDD_S14_L001_R2_001_MM_1.fz.gz','SAP_XCV_1980_H7LYVDRXX_2_ALPH_C_cR_c1_v1_0_FCDSSA_DDDD_S14_L002_R2_001_MM_1.fz.gz']}

目前dic_xdic_z看起来像这样,

dic_z = {'ALPH': []}
dic_x = {'ALPH': []}
python dictionary os.walk
1个回答
1
投票

您可以使用列表

import os
import fnmatch
path = "test_images/"
x_list = ([f for f in os.listdir(path) if fnmatch.fnmatch(f, "*.png")])
z_list= ([f for f in os.listdir(path) if fnmatch.fnmatch(f, "*.bmp")])
dic_x = {'ALPH' : x_list }
dic_z = {'ALPH': z_list}

在你的情况下,你可以用*.png替换*R1_001_MM_1.fz.gz*.bmp*R2_001_MM_1.fz.gz'。还有test_images//user/raw/samples/

© www.soinside.com 2019 - 2024. All rights reserved.