将大字符串解析为字段的Pythonic方法

问题描述 投票:0回答:1

我有数百个包含电子邮件字符串的文件。文件名如下所示:

00005.34bcaad58ad5f598f5d6af8cfa0c0465 --
00250.c7603b27a45284d12b49adf767b2b6fa --
00249.b9183324a9726e8b6c8779045a921243 --
00248.9599b06d2d2c08b57ff1de06316d66c0 --
00247.42534d5df0700cb2adf240556c539947 --
00246.fdaacadac7143848978ea0af07eed070 --

这些文件的内容与下面的代码片段类似

Return-Path: <[email protected]>
Received: from lockergnome.com (sprocket.lockergnome.com [130.94.96.247])
    by dogma.slashnull.org (8.11.6/8.11.6) with SMTP id g6IKksJ07017
    for <[email protected]>; Thu, 18 Jul 2002 21:46:54 +0100
X-Mailer: ListManager Web Interface
Date: Thu, 18 Jul 2002 09:55:22 -0500
Subject: [Lockergnome Windows Daily]  Sticker Courtesy
To: [email protected]
From: Lockergnome Windows Daily <[email protected]>
List-Unsubscribe: <mailto:[email protected]>
List-Subscribe: <mailto:[email protected]>
List-Owner: <mailto:[email protected]>
X-URL: <http://www.lockergnome.com/>
X-List-Host: Lockergnome <http://www.lockergnome.com/>
Reply-To: [email protected]
Sender: [email protected]
Message-Id: <LISTMANAGERSQL-2534368-1682723-2002.07.18-09.57.34--qqqqqqqqqq-lg#[email protected]>
MIME-Version: 1.0
Content-Type: text/html; charset=us-ascii
...

文件中的实际内容没有任何规则或标准(我已经很难解码它们了)。

Python 有没有办法将其解析为类似于字典的东西?我不是很挑剔,尽管一本字典会很可爱。真的,我只是不想编写一个庞大的自定义解析器。我尝试了几个标准(例如 json.loads),但我还没有找到任何通用的标准。

python parsing data-manipulation
1个回答
0
投票
© www.soinside.com 2019 - 2024. All rights reserved.