我正在使用lxml读取带有以下内容的xml文件:
tree = etree.parse(r'C:\Users\xxx\Desktop\misc work\xmledit\SalesTransactionCustom.xml')
并获得一个类似的xml文件:
<?xml version="1.0" encoding="UTF-8"?>
<ProcessSalesTransactionCustom xmlns="http://schema.xxxx.com/xxxxx/2" releaseID="9.2">
<ApplicationArea>
<Sender>
<LogicalID>xxxxxx.file.syncxxxxx5salesinvoice</LogicalID>
<ComponentID>External</ComponentID>
<ConfirmationCode>OnError</ConfirmationCode>
</Sender>
<CreationDateTime>2020-04-16T14:50:26.976Z</CreationDateTime>
<BODID>xxxx-nid:xxxxx:1001::Default_1001#320000:?SalesTransactionCustom&verb=Process</BODID>
</ApplicationArea>
<DataArea>
<Process>
<TenantID>infor</TenantID>
<AccountingEntityID>4710</AccountingEntityID>
<LocationID>S_4710</LocationID>
<ActionCriteria>
<ActionExpression actionCode="Add"/>
</ActionCriteria>
</Process>
<SalesTransactionCustom>
<FinancialBatch>
<TransactionDate>2019-09-27T00:00:00</TransactionDate>
<BatchReference>KUKS_20190928052427</BatchReference>
</FinancialBatch>
<TransactionHeader>
<TransactionType>HEI</TransactionType>
<SalesInvoice>
<Invoice>19001160</Invoice>
<BusinessPartner>417B00</BusinessPartner>
<DocumentDate>2019-09-27T00:00:00</DocumentDate>
<DueDate>2019-11-20T00:00:00</DueDate>
<Amount>152248.80</Amount>
<Currency>EUR</Currency>
<TaxCountry>DK</TaxCountry>
<TaxCode>BESIT</TaxCode>
<NonFinalizedTransaction>
<TransactionReference>417B00 PC210LCI-11</TransactionReference>
<LedgerAccount>50000400</LedgerAccount>
<Dimension1>100</Dimension1>
<Dimension2>KUK</Dimension2>
<Dimension3/>
<Dimension4/>
<Dimension5/>
<Dimension6/>
<Dimension7/>
<Dimension8/>
<TaxAmount>0.00</TaxAmount>
<DebitCreditFlag>credit</DebitCreditFlag>
<Amount>152248.80</Amount>
</NonFinalizedTransaction>
</SalesInvoice>
</TransactionHeader>
<TransactionHeader>
<TransactionType>HEI</TransactionType>
<SalesInvoice>
<Invoice>19001161</Invoice>
<BusinessPartner>412600</BusinessPartner>
<DocumentDate>2019-09-27T00:00:00</DocumentDate>
<DueDate>2019-11-20T00:00:00</DueDate>
<Amount>113848.17</Amount>
<Currency>EUR</Currency>
<TaxCountry>AT</TaxCountry>
<TaxCode>GBSI</TaxCode>
<NonFinalizedTransaction>
<TransactionReference>412600 PC210NLC-11</TransactionReference>
<LedgerAccount>50000400</LedgerAccount>
<Dimension1>100</Dimension1>
<Dimension2>KUK</Dimension2>
<Dimension3/>
<Dimension4/>
<Dimension5/>
<Dimension6/>
<Dimension7/>
<Dimension8/>
<TaxAmount>0.00</TaxAmount>
<DebitCreditFlag>credit</DebitCreditFlag>
<Amount>113848.17</Amount>
</NonFinalizedTransaction>
</SalesInvoice>
</TransactionHeader>
</SalesTransactionCustom>
</DataArea>
</ProcessSalesTransactionCustom>
我有一个类似pandas的数据框(这里第一行是列名:]
Tag Old Value New Value
BusinessPartner 417B00 BPE000104
BusinessPartner 412600 BPE000153
LedgerAccount 50000400 108092200
我想参考此熊猫数据框替换xml中元素的属性。我希望能够找到标记和旧值的组合,并将属性替换为新值。我该如何使用lxml和pandas?
提前谢谢您
为什么不只将XML
读取为字符串并执行str.replace
?
tag = df.Tag; old = df.Old; new = df.New
for i in range(len(tag)):
_str = _str.replace("<"+tag[i]+">"+old[i]+"</"+tag[i]+">","<"+tag[i]+">"+new[i]+"</"+tag[i]+">")