使用使用python3登录网站(Mediawiki)的请求

问题描述 投票:4回答:1

我想使用Python3中的Requests库登录到Mediawiki网站。

我已经寻找了类似的主题,但最终的解决方案不起作用。

我要使用的登录公式如下:

<form name="userlogin" class="mw-ui-vform" method="post" action="/index.php?title=Spezial:Anmelden&amp;action=submitlogin&amp;type=login&amp;returnto=Hauptseite">
    <section class="mw-form-header"></section>

    <div class="mw-ui-vform-field">
        <label for='wpName1'> Benutzername </label>
        <input class="loginText mw-ui-input" id="wpName1" tabindex="1" autofocus="" placeholder="Gib deinen Benutzernamen ein" name="wpName"/>
    </div>

    <div class="mw-ui-vform-field">
        <label for='wpPassword1'> Passwort </label>
        <input class="loginPassword mw-ui-input" id="wpPassword1" tabindex="2" placeholder="Gib dein Passwort ein" type="password" name="wpPassword" />
    </div>

    <div class="mw-ui-vform-field">
        <div class="mw-ui-checkbox">
            <input name="wpRemember" type="checkbox" value="1" id="wpRemember" tabindex="4">
            <label for="wpRemember"> Angemeldet bleiben</label>
        </div>
    </div>

    <div class="mw-ui-vform-field">
        <input id="wpLoginAttempt" name="wpLoginAttempt" tabindex="6" type="submit" value="Anmelden" class="mw-ui-button mw-ui-constructive" />
    </div>

    <div class="mw-ui-vform-field mw-form-related-link-container" id="mw-userlogin-help">
        <a href="https://www.mediawiki.org/wiki/Special:MyLanguage/Help:Logging_in">Hilfe beim Anmelden</a>
    </div>
    <div class="mw-ui-vform-field mw-form-related-link-container">
        <a href="/index.php/Spezial:Passwort_neu_vergeben" title="Spezial:Passwort neu vergeben">Passwort vergessen?</a>
    </div>
    <input type="hidden" value="9a57c2f503220304d0cc80633bad12d1" name="wpLoginToken" />
</form>

我的代码如下:

import requests
import re
a=('user','pass')

postloginurl='https://example.com/index.php?title=Spezial:Anmelden&amp;action=submitlogin&amp;type=login&amp;returnto=Hauptseite'
requesturl='https://example.com/index.php/Hauptseite'   


with requests.Session() as session:
  session.auth=a

  #finding the logintoken for the formular
  get = session.get(postloginurl)
  m=re.search('value="(?P<logintoken>.*)" name="wpLoginToken"',get.text)
  logintoken=m.group('logintoken')

  #fill out the login formular
  name=input('name:')
  password=input('password:')
  payload={'action': 'login', 'wpName':name, 'wpPassword':password, 'wpRemember':'1' , 'wpLoginAttempt':'Anmelden', 'wpLoginToken':logintoken, 'submit':'submit'}
  post = session.post(postloginurl, data=payload)

  #test if I'm logged in
  r = session.get(requesturl)
  print(r.text)
  #here it says: ""wgUserName":null" in the head, so I'm not logged in

该站点还使用身份验证,因此我需要身份验证部分。我还尝试管理logintoken,每次启动新会话时都会更改。

你们是否知道什么地方可能是错误的,或者我可以通过测试来发现我的错误?

python python-3.x python-requests mediawiki
1个回答
0
投票

您必须在登录请求中添加headers。无需在session.auth中添加“用户”和“传递”。

import requests
import bs4 as bs

login_page_url = 'https://www.mediawiki.org/w/index.php?title=Special:UserLogin&returnto=MediaWiki'
login_url = 'https://www.mediawiki.org/w/index.php?title=Special:UserLogin&returnto=MediaWiki'

# Create Session.
session = requests.session()

# Get HTML of Login page.
response = session.get(login_page_url)

# Extract token from HTML using BeautifulSoup.
soup = bs.BeautifulSoup(response.text, 'lxml')
wpLoginToken = soup.find('input', {'name': 'wpLoginToken'}).get('value')

payload = {
    "wpName":"YOUR_USERNAME",
    "wpPassword":"YOUR_PASSWORD",
    "wpRemember":"1", "authAction":"login",
    "wpLoginToken":wpLoginToken}

headers = {
    'Host': 'www.mediawiki.org','Origin': 'https://www.mediawiki.org', 'Content-Type': 'application/x-www-form-urlencoded',
    'User-Agent': 'Mozilla/5.0 (X11; Ubuntu; Linux x86_64; rv:71.0) Gecko/20100101 Firefox/71.0',
    'Accept': 'text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8',
    'Accept-Language': 'en-US,en;q=0.5', 'Accept-Encoding': 'gzip, deflate, br',
    'Referer': 'https://www.mediawiki.org/w/index.php?title=Special:UserLogin&returnto=MediaWiki',
}

# Login request.
response = session.post(login_url, data=payload,)
print(response.status_code, response.text)

并且可以在response.text中找到您的用户名(“ wgUserName”:“ YOUR_USERNAME”)

© www.soinside.com 2019 - 2024. All rights reserved.