如何通过 SOCKS 代理使 python 请求工作

问题描述 投票:0回答:9

我在我的 Python 脚本中使用了很棒的 Requests 库:

import requests
r = requests.get("http://example.com")
print(r.text)

我想使用 SOCKS 代理,我该怎么做? Requests 似乎只支持 HTTP 代理。

python proxy socks python-requests
9个回答
179
投票

现代方式:

pip install -U 'requests[socks]'

然后

import requests

resp = requests.get('http://go.to', 
                    proxies=dict(http='socks5://user:pass@host:port',
                                 https='socks5://user:pass@host:port'))

74
投票

如果有人尝试了所有这些旧答案,但仍然遇到以下问题:

requests.exceptions.ConnectionError: 
   SOCKSHTTPConnectionPool(host='myhost', port=80): 
   Max retries exceeded with url: /my/path 
   (Caused by NewConnectionError('<requests.packages.urllib3.contrib.socks.SOCKSConnection object at 0x106812bd0>: 
   Failed to establish a new connection: 
   [Errno 8] nodename nor servname provided, or not known',))

这可能是因为,默认情况下,

requests
配置为解析连接的本地端的 DNS 查询。

尝试将您的代理 URL 从

socks5://proxyhost:1234
更改为
socks5h://proxyhost:1234
。请注意额外的
h
(它代表主机名解析)。

PySocks 包模块默认是进行远程解析,我不确定为什么它们的集成有如此隐晦的分歧,但正如一些评论者指出的那样,curl 是这样工作的


67
投票

自2016年4月29日发布的

requests
版本2.10.0开始,
requests
支持SOCKS。

需要PySocks,可以使用

pip install pysocks
安装。

使用示例:

import requests
proxies = {'http': "socks5://myproxy:9191"}
requests.get('http://example.org', proxies=proxies)

22
投票

你需要安装pysocks,我的版本是1.0,代码对我有用:

import socket
import socks
import requests
ip='localhost' # change your proxy's ip
port = 0000 # change your proxy's port
socks.setdefaultproxy(socks.PROXY_TYPE_SOCKS5, ip, port)
socket.socket = socks.socksocket
url = u'http://ajax.googleapis.com/ajax/services/search/images?v=1.0&q=inurl%E8%A2%8B'
print(requests.get(url).text)

7
投票

一旦 python

requests
SOCKS5
拉取请求合并,它将像使用
proxies
字典一样简单:

更新:PR 已合并。

#proxy
        # SOCKS5 proxy for HTTP/HTTPS
        proxies = {
            'http' : "socks5://myproxy:9191",
            'https' : "socks5://myproxy:9191"
        }

        #headers
        headers = {

        }

        url='http://example.com/'
        res = requests.get(url, headers=headers, proxies=proxies)

请参阅 SOCKS 代理支持

另一个选项,如果您无法等待

request
准备好,当您无法使用
requesocks
时 - 就像在 GoogleAppEngine 上由于缺少
pwd
内置模块一样,可以使用前面提到的 PySocks上图:

  1. 从存储库中获取
    socks.py
    文件并将副本放入根文件夹中;
  2. 添加
    import socks
    import socket

此时,请在使用

urllib2
之前配置并绑定套接字 - 在以下示例中:

import urllib2
import socket
import socks

socks.set_default_proxy(socks.SOCKS5, "myprivateproxy.example",port=9050)
socket.socket = socks.socksocket
res=urllib2.urlopen(url).read()

6
投票

您可以使用

https_proxy
环境变量运行脚本。

  1. 如有必要,安装袜子支撑。
pip install PySocks
pip install pysocks5
  1. 设置环境变量
export https_proxy=socks5://<hostname or ip>:<port>
  1. 运行您的脚本。此示例使用代理发出请求并显示 IP 地址:
echo Your real IP
python -c 'import requests;print(requests.get("http://ipinfo.io/ip").text)'

echo IP with socks-proxy
python -c 'import requests;print(requests.get("https://ipinfo.io/ip").text)'

4
投票
# SOCKS5 proxy for HTTP/HTTPS
proxiesDict = {
    'http' : "socks5://1.2.3.4:1080",
    'https' : "socks5://1.2.3.4:1080"
}

# SOCKS4 proxy for HTTP/HTTPS
proxiesDict = {
    'http' : "socks4://1.2.3.4:1080",
    'https' : "socks4://1.2.3.4:1080"
}

# HTTP proxy for HTTP/HTTPS
proxiesDict = {
    'http' : "1.2.3.4:1080",
    'https' : "1.2.3.4:1080"
}

3
投票

我在urllib3中安装了pysocks和monkey patched create_connection,如下所示:

import socks
import socket
socks.setdefaultproxy(socks.PROXY_TYPE_SOCKS4, "127.0.0.1", 1080)

def create_connection(address, timeout=socket._GLOBAL_DEFAULT_TIMEOUT,
                      source_address=None, socket_options=None):
    """Connect to *address* and return the socket object.

    Convenience function.  Connect to *address* (a 2-tuple ``(host,
    port)``) and return the socket object.  Passing the optional
    *timeout* parameter will set the timeout on the socket instance
    before attempting to connect.  If no *timeout* is supplied, the
    global default timeout setting returned by :func:`getdefaulttimeout`
    is used.  If *source_address* is set it must be a tuple of (host, port)
    for the socket to bind as a source address before making the connection.
    An host of '' or port 0 tells the OS to use the default.
    """

    host, port = address
    if host.startswith('['):
        host = host.strip('[]')
    err = None
    for res in socket.getaddrinfo(host, port, 0, socket.SOCK_STREAM):
        af, socktype, proto, canonname, sa = res
        sock = None
        try:
            sock = socks.socksocket(af, socktype, proto)

            # If provided, set socket level options before connecting.
            # This is the only addition urllib3 makes to this function.
            urllib3.util.connection._set_socket_options(sock, socket_options)

            if timeout is not socket._GLOBAL_DEFAULT_TIMEOUT:
                sock.settimeout(timeout)
            if source_address:
                sock.bind(source_address)
            sock.connect(sa)
            return sock

        except socket.error as e:
            err = e
            if sock is not None:
                sock.close()
                sock = None

    if err is not None:
        raise err

    raise socket.error("getaddrinfo returns an empty list")

# monkeypatch
urllib3.util.connection.create_connection = create_connection

3
投票

我可以在 Linux 上做到这一点。

$ pip3 install --user 'requests[socks]'
$ https_proxy=socks5://<hostname or ip>:<port> python3 -c \
> 'import requests;print(requests.get("https://httpbin.org/ip").text)'
© www.soinside.com 2019 - 2024. All rights reserved.