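I have a .txt file, and I want to build a regular expression that can parse a certain kind of data from it.
I have tried, but I couldn't get the result I want.
The data is a table, and other files may have the same format.
I am adding the data here; please treat it as the contents of the .txt file.
Any help would be appreciated.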
Tribhuwan Diagnostic Centre | HOSPITALROAD, Morne)
East Champaran- 845401 (Bihar)
(FULLY AUTOMATED & COMPUTERISED LAB) Mob. :+9162046 29003
Name HAJAN sadshaj Booking Date 22/s/2020
G/A male 18 Yrs Reporting Date 22/05/2020
Lab No. 10203693 Sample Collected At Lab
Ref. By Dr. I.C.U
; UVLO
Test Name Value Unit Biological Ref Interval
COMPLETE BLOOD COUNT (CBC)
TOTAL LEUCOCYTES COUNT (TLC) 23160 cells/cmm 4000 - 11000
DIFFERENTIAL LEUCOCYTES COUNT (DLC)
NEUTROPHILS 93.4 % 45.0 - 65.0
LYMPHOCYTES 3.3 % 20.0 - 45.0
MONOCYTES 3.1 % 4.0 - 10.0
EOSINOPHILS 0.2 % 0.0 - 5.0
BASOPHILS 0.0 % 0.0-1.0
ABSOLUTE NEUTROPHILS 21620.0 3000.0 - 7000.0
ABSOLUTE LYMPHOCYTES 750.0 800.0 - 4000.0
ABSOLUTE MONOCYTES 730.0 0.0 - 1200.0
ABSOLUTE EOSINOPHILS 50.0 0.0 - 500.0
ABSOLUTE BASOPHILS 10.0 0.0 - 100.0
RBC COUNT 4.31 Millions/cmm 3.80 - 5.80
HAEMOGLOBIN (Hb) 13.1 gm/dl 11.0 - 16.5
P.C.V/HCT 41.2 % 35.0 - 50.0
MCV 95.5 fl. 80.0 - 97.0
MCH 30.3 Picogram 26.5 - 35.5
MCHC 31.8 g/dl 31.5-35.5
RDW / SD 49.7 FI 37.0 - 54.0
RDW / CV 12.3 % 10.0 - 15.0
PLATELET COUNT 148000 /cmm 150000 - 450000
PDW 17.0 fl 10.0 - 18.0
MPV 13.3 fl 6.5 - 11.7
PCT 0.198 % 0.108 - 0.282
Le
_
I only want the first two columns from this.
My desired output (test name, value):
TOTAL LEUCOCYTES COUNT (TLC) 23160
DIFFERENTIAL LEUCOCYTES COUNT (DLC)
NEUTROPHILS 93.4
LYMPHOCYTES 3.3
MONOCYTES 3.1
EOSINOPHILS 0.2
BASOPHILS 0.0
ABSOLUTE NEUTROPHILS 21620.0
ABSOLUTE LYMPHOCYTES 750.0
ABSOLUTE MONOCYTES 730.0
ABSOLUTE EOSINOPHILS 50.0
ABSOLUTE BASOPHILS 10.0
RBC COUNT 4.31
HAEMOGLOBIN (Hb) 13.1
P.C.V/HCT 41.2
MCV 95.5
MCH 30.3
MCHC 31.8
RDW / SD 49.7
RDW / CV 12.3
PLATELET COUNT 148000
PDW 17.0
MPV 13.3
PCT 0.198
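You can do what you want with Python's regex library. I started writing a regular expression for your problem, but I haven't finished it. I will update my post once I am satisfied with it.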
^\s*([a-zA-Z()\/. ]+?)\s+(\d+(?:\.\d+)?)
You can easily write and test regular expressions on regex101.com, which lets you see visually how they match while you debug them.
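
As a rough sketch of how a pattern like this could be applied with Python's `re` module: the snippet below assumes the table always starts right after a line beginning with `Test Name`, and that a value is a plain integer or decimal. The names `SAMPLE`, `ROW`, and `extract_rows` are made up for illustration, and `SAMPLE` is only a handful of lines copied from the report in the question.

```python
import re

# A few lines copied from the report in the question (not the whole file).
SAMPLE = """\
Lab No. 10203693 Sample Collected At Lab
Test Name Value Unit Biological Ref Interval
COMPLETE BLOOD COUNT (CBC)
TOTAL LEUCOCYTES COUNT (TLC) 23160 cells/cmm 4000 - 11000
DIFFERENTIAL LEUCOCYTES COUNT (DLC)
NEUTROPHILS 93.4 % 45.0 - 65.0
BASOPHILS 0.0 % 0.0-1.0
ABSOLUTE NEUTROPHILS 21620.0 3000.0 - 7000.0
P.C.V/HCT 41.2 % 35.0 - 50.0
RDW / SD 49.7 FI 37.0 - 54.0
PLATELET COUNT 148000 /cmm 150000 - 450000
Le
_
"""

# Group 1: test name (letters, spaces, parentheses, '/', '.'), matched lazily
# so it stops before the first number. Group 2: integer or decimal value.
ROW = re.compile(r"^\s*([A-Za-z()/. ]+?)\s+(\d+(?:\.\d+)?)(?=\s|$)")


def extract_rows(text):
    """Return (test name, value) pairs found after the 'Test Name' header."""
    rows = []
    in_table = False
    for line in text.splitlines():
        if line.startswith("Test Name"):
            # Assumption: everything after this header line is table data.
            in_table = True
            continue
        if not in_table:
            continue
        match = ROW.match(line)
        if match:
            rows.append((match.group(1), match.group(2)))
    return rows


for name, value in extract_rows(SAMPLE):
    print(name, value)
```

Skipping everything before the header keeps lines such as `Lab No. 10203693` out of the result, since they would otherwise match the same pattern. Heading rows without a value, such as `DIFFERENTIAL LEUCOCYTES COUNT (DLC)`, are dropped by this sketch; if you need them in the output, you would have to handle non-matching lines separately.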