"TypeError: string indices must be integers" when getting stock data from Yahoo Finance using Pandas Datareader

Question (votes: 0, answers: 6)
import pandas_datareader

end = "2022-12-15"
start = "2022-12-15"
stock_list = ["TATAELXSI.NS"]

data = pandas_datareader.get_data_yahoo(symbols=stock_list, start=start, end=end)

print(data)

When I run this code, I get the error

"TypeError: string indices must be integers"

Edit: I have updated the code and now pass a list as the symbols argument, but it still shows the same error.

Error:

Traceback (most recent call last):
  File "C:\Users\Deepak Shetter\PycharmProjects\100DAYSOFPYTHON\mp3downloader.py", line 7, in <module>
    data = pandas_datareader.get_data_yahoo(symbols=[TATAELXSI], start=start, end=end)
  File "C:\Users\Deepak Shetter\PycharmProjects\100DAYSOFPYTHON\venv\lib\site-packages\pandas_datareader\data.py", line 80, in get_data_yahoo
    return YahooDailyReader(*args, **kwargs).read()
  File "C:\Users\Deepak Shetter\PycharmProjects\100DAYSOFPYTHON\venv\lib\site-packages\pandas_datareader\base.py", line 258, in read
    df = self._dl_mult_symbols(self.symbols)
  File "C:\Users\Deepak Shetter\PycharmProjects\100DAYSOFPYTHON\venv\lib\site-packages\pandas_datareader\base.py", line 268, in _dl_mult_symbols
    stocks[sym] = self._read_one_data(self.url, self._get_params(sym))
  File "C:\Users\Deepak Shetter\PycharmProjects\100DAYSOFPYTHON\venv\lib\site-packages\pandas_datareader\yahoo\daily.py", line 153, in _read_one_data
    data = j["context"]["dispatcher"]["stores"]["HistoricalPriceStore"]
TypeError: string indices must be integers
python yahoo-finance pandas-datareader
6 Answers
18 votes

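None of the solutions reported here so far worked for me. According to the discussion here, Yahoo changed its API, which broke compatibility with earlier pandas-datareader versions.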
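To make the error message itself less mysterious: the failing line in the traceback indexes a parsed JSON object several levels deep. Below is a minimal sketch, assuming (this is my reading of the linked discussion, not something the traceback confirms) that after Yahoo's change the "stores" entry arrives as an opaque string instead of a nested dict. Both payloads are made up for illustration.

```python
import json

# Hypothetical payloads; the real Yahoo page data is far larger.
old_payload = json.loads(
    '{"context": {"dispatcher": {"stores": {"HistoricalPriceStore": {"prices": []}}}}}'
)
new_payload = json.loads(
    '{"context": {"dispatcher": {"stores": "opaque-encrypted-blob"}}}'
)


def read_price_store(j):
    # Same lookup chain as the line shown in the traceback.
    return j["context"]["dispatcher"]["stores"]["HistoricalPriceStore"]


print(read_price_store(old_payload))  # works: "stores" is a dict

try:
    read_price_store(new_payload)
except TypeError as exc:
    # "stores" is now a str, and a str cannot be indexed with a str key.
    error_message = str(exc)
    print("TypeError:", error_message)
```

If that assumption holds, no change to the symbols, start, or end arguments can help, because the failure happens while parsing the response.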

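A fix was reported in the same GitHub thread, implemented in a pull request by GitHub user raphi6. I can confirm that the pull request works. The version from the pull request can be installed with the following 3 lines: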

conda install pycryptodome pycryptodomex
conda uninstall pandas-datareader
pip install git+https://github.com/raphi6/pandas-datareader.git@ea66d6b981554f9d0262038aef2106dda7138316

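The pycrypto* packages are dependencies that I had to install to make it work. Note that I use the commit hash here rather than the branch name, because the branch is named Yahoo!_Issue#952 and the hash character causes problems when pip is used this way.

All of these commands can also be run with pip instead of conda (see Update 1 below).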

Update 1

To try this on Google Colab (as shown here):

! pip install pycryptodome pycryptodomex
! pip uninstall --yes pandas-datareader
! pip install git+https://github.com/raphi6/pandas-datareader.git@ea66d6b981554f9d0262038aef2106dda7138316

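Update 2 (27 December 2022)

Although I could not get it to work last week, I tried again and can confirm that the pdr_override() workaround mentioned by Nikhil Mulley below now works (at least with yfinance 0.2.3 and pandas-datareader 0.10.0).

Original answer (works, but takes more lines of code)

A fix was reported in the same GitHub thread, implemented in a pull request by GitHub user raphi6. I can confirm that the pull request works. Detailed installation instructions for the pull request can be found here and are copied below for completeness: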

git clone https://github.com/raphi6/pandas-datareader.git
cd pandas-datareader
conda uninstall pandas-datareader
conda install pycryptodome pycryptodomex
git checkout 'Yahoo!_Issue#952'
python setup.py install --record installed_files.txt

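The --record argument in the install command saves a list of the installed files so that they can easily be uninstalled later (following this SO thread). The pycrypto* packages are dependencies that I had to install to make it work.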
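For the later uninstall, the recorded list can be walked and each file deleted. This is a minimal sketch, assuming installed_files.txt holds one absolute path per line, as setup.py --record is expected to write; remove_recorded_files is a hypothetical helper name, not part of any package.

```python
import os


def remove_recorded_files(record_path):
    """Delete every file listed in a setup.py --record file; return removed paths."""
    removed = []
    with open(record_path, encoding="utf-8") as record:
        for line in record:
            path = line.strip()
            # Skip blank lines and entries that are already gone.
            if path and os.path.isfile(path):
                os.remove(path)
                removed.append(path)
    return removed


# Usage, run from the directory that contains the record file:
# remove_recorded_files("installed_files.txt")
```

Only files are removed; directories that become empty are left in place and can be cleaned up by hand.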


12 votes

This is not an answer, but I think the problem lies in the pdr data reader fetching from Yahoo itself:

>>> import pandas_datareader as dtr
>>> from datetime import datetime
>>> initial_portfolio=['AAPL', 'MA', 'F', 'MSFT', '^GSPC']
>>> startdate = datetime(2022,12,1)
>>> enddate=datetime(2022,12,10)
>>> stock_data=dtr.yahoo.daily.YahooDailyReader(initial_portfolio,start=startdate,end=enddate).read()

Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "lib/python3.9/site-packages/pandas_datareader/base.py", line 258, in read
    df = self._dl_mult_symbols(self.symbols)
  File "lib/python3.9/site-packages/pandas_datareader/base.py", line 268, in _dl_mult_symbols
    stocks[sym] = self._read_one_data(self.url, self._get_params(sym))
  File "lib/python3.9/site-packages/pandas_datareader/yahoo/daily.py", line 153, in _read_one_data
    data = j["context"]["dispatcher"]["stores"]["HistoricalPriceStore"]
TypeError: string indices must be integers

A short-term fix might be to use the yfinance override and see whether that helps until Yahoo Finance restores its data feed:

Python 3.9.1 (default, Dec 28 2020, 11:22:14)
[Clang 11.0.0 (clang-1100.0.33.17)] on darwin
Type "help", "copyright", "credits" or "license" for more information.
>>> from pandas_datareader import data as pdr
>>> import yfinance as yf
>>> yf.pdr_override()
>>> y_symbols = ['SCHAND.NS', 'TATAPOWER.NS', 'ITC.NS']
>>> from datetime import datetime
>>> startdate = datetime(2022,12,1)
>>> enddate = datetime(2022,12,15)
>>> data = pdr.get_data_yahoo(y_symbols, start=startdate, end=enddate)
[*********************100%***********************]  3 of 3 completed
>>> data
             Adj Close                                Close                           ...        Open                             Volume
                ITC.NS   SCHAND.NS TATAPOWER.NS      ITC.NS   SCHAND.NS TATAPOWER.NS  ...      ITC.NS   SCHAND.NS TATAPOWER.NS    ITC.NS SCHAND.NS TATAPOWER.NS
Date                                                                                  ...
2022-12-01  339.549988  195.949997   224.850006  339.549988  195.949997   224.850006  ...  341.700012  191.600006   225.250000  16630417    544485      7833074
2022-12-02  337.149994  196.600006   225.250000  337.149994  196.600006   225.250000  ...  339.350006  196.000000   225.449997   8388835    122126      7223274
2022-12-05  336.750000  191.050003   224.199997  336.750000  191.050003   224.199997  ...  337.649994  200.850006   225.250000   9716390    107294     10750610
2022-12-06  337.299988  196.399994   228.800003  337.299988  196.399994   228.800003  ...  334.100006  191.000000   224.199997   6327430    102911     20071039
2022-12-07  340.100006  187.350006   225.850006  340.100006  187.350006   225.850006  ...  338.500000  198.000000   228.800003   9813208    122772      7548312
2022-12-08  338.399994  181.850006   225.050003  338.399994  181.850006   225.050003  ...  340.200012  186.000000   226.000000   6200447    114147      7507975
2022-12-09  341.399994  176.899994   219.399994  341.399994  176.899994   219.399994  ...  339.750000  183.899994   225.899994   8132228    179660     13087278
2022-12-12  343.200012  177.350006   217.699997  343.200012  177.350006   217.699997  ...  341.000000  177.750000   219.750000  11214662    133507      8858525
2022-12-13  345.600006  178.449997   218.850006  345.600006  178.449997   218.850006  ...  344.500000  179.350006   218.800003  10693426     74873      7265105
2022-12-14  345.399994  179.149994   222.699997  345.399994  179.149994   222.699997  ...  346.000000  180.449997   219.800003   7379878     32085      9179593

[10 rows x 18 columns]
>>>
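The frame printed above has two column levels, the field on top and the ticker below. Here is a small sketch of how to slice such a frame, using a synthetic stand-in built from a few of the values shown, so it runs without network access. It assumes the (field, ticker) ordering seen in this output; other yfinance versions may arrange the levels differently.

```python
import pandas as pd

# Synthetic stand-in for the frame printed above: columns are (field, ticker) pairs.
# The two rows reuse Close and Volume values from that output.
columns = pd.MultiIndex.from_product(
    [["Close", "Volume"], ["ITC.NS", "SCHAND.NS"]]
)
data = pd.DataFrame(
    [
        [339.549988, 195.949997, 16630417, 544485],
        [337.149994, 196.600006, 8388835, 122126],
    ],
    index=pd.to_datetime(["2022-12-01", "2022-12-02"]),
    columns=columns,
)

# One field for all tickers: select on the first column level.
closes = data["Close"]

# All fields for one ticker: cross-section on the second column level.
itc = data.xs("ITC.NS", axis=1, level=1)

print(closes)
print(itc)
```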

11 votes

My solution was simple: I just replaced pandas_datareader.get_data_yahoo with yf.download. Don't worry about the result, because yf.download also returns a DataFrame. The simplified syntax is:

import yfinance as yf

df = yf.download(your_ticks_or_a_tick_list, start=start_date, end=end_date)

But remember to clear the output of all cells and restart the kernel, otherwise you will get an error.
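If get_data_yahoo is called in many places, one way to make the swap in a single spot is a thin wrapper. This is only a sketch: get_prices, its downloader parameter, and fake_download are hypothetical names introduced here, and the stand-in exists purely so the wrapper can be tried without yfinance or network access.

```python
def get_prices(symbols, start, end, downloader=None):
    """Fetch price history via yfinance.download, or via an injected stand-in."""
    if downloader is None:
        # Imported lazily so this module still loads where yfinance is absent.
        import yfinance as yf
        downloader = yf.download
    return downloader(symbols, start=start, end=end)


def fake_download(symbols, start=None, end=None):
    # Stand-in used to exercise the wrapper offline; returns its arguments.
    return {"symbols": symbols, "start": start, "end": end}


result = get_prices(
    ["TATAELXSI.NS"], "2022-12-01", "2022-12-15", downloader=fake_download
)
print(result)

# Real call (needs yfinance installed and network access):
# df = get_prices(["TATAELXSI.NS"], "2022-12-01", "2022-12-15")
```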


4 votes

Switch to yfinance instead; it worked for me:

import datetime as dt
import yfinance as yf

company = 'TATAELXSI.NS'

# Define the start and end dates
start = dt.datetime(2020, 1, 1)
end = dt.datetime(2022, 1, 1)

# Read the stock price data
data = yf.download(company, start=start, end=end)

data.tail(10)
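One thing to watch when porting the code from the question: it uses the same date for start and end. As far as I know, yfinance documents end as exclusive, so such a request would cover no trading day at all. Here is a small sketch of shifting the end date by one day; end_inclusive is a hypothetical helper name.

```python
import datetime as dt


def end_inclusive(last_day):
    """Return the `end` value to pass so that `last_day` itself is included."""
    return last_day + dt.timedelta(days=1)


last_day = dt.date(2022, 12, 15)
end = end_inclusive(last_day)
print(end)  # 2022-12-16

# Hypothetical call, assuming yfinance is installed and imported as yf:
# data = yf.download("TATAELXSI.NS", start=last_day, end=end)
```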

0 votes

The solution that worked for me on 25 September 2023

I replaced

import pandas_datareader as web
web.DataReader('AMZN', 'yahoo', start, end)

with

import yfinance
yfinance.download('AMZN', start, end)

-4 votes

Updating yfinance worked for me.
