Select the td whose 'data-reactid' attribute has the value '.romjx8c48.1.0.5.1:1.4.0.3.1.0.0.0.0.1.0.0.0.0.1.$company-data-Revenues ($M).1' and read its text.
from bs4 import BeautifulSoup
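# Simplified sample markup; only the data-reactid on the revenue cell is needed for the lookup.
html = """
<table>
  <tr><th></th><th>$ millions</th><th>% change</th></tr>
  <tr>
    <td>Revenues ($M)</td>
    <td data-reactid=".romjx8c48.1.0.5.1:1.4.0.3.1.0.0.0.0.1.0.0.0.0.1.$company-data-Revenues ($M).1">$135,987</td>
    <td>27.1%</td>
  </tr>
  <tr>
    <td>Profits ($M)</td>
    <td>$2,371.0</td>
    <td>297.8%</td>
  </tr>
</table>
"""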
soup = BeautifulSoup(html, 'html.parser')
print(soup.find('td', {'data-reactid': '.romjx8c48.1.0.5.1:1.4.0.3.1.0.0.0.0.1.0.0.0.0.1.$company-data-Revenues ($M).1'}).text)
Output:
$135,987
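The full 'data-reactid' string is generated by React and its long numeric prefix may change between page loads. As a hedged alternative, the sketch below matches only the readable tail of the attribute with a regular expression. The helper name find_by_reactid_suffix, the shortened '.abc' prefixes, and the Profits id are made up for illustration; whether the real page keeps a stable suffix is an assumption.

```python
import re
from bs4 import BeautifulSoup

# Hypothetical sample markup: the ".abc.N" prefixes and the Profits id are
# invented for this sketch; only the Revenues suffix comes from the answer above.
SAMPLE_HTML = """
<table>
  <tr><td>Revenues ($M)</td>
      <td data-reactid=".abc.0.$company-data-Revenues ($M).1">$135,987</td>
      <td>27.1%</td></tr>
  <tr><td>Profits ($M)</td>
      <td data-reactid=".abc.1.$company-data-Profits ($M).1">$2,371.0</td>
      <td>297.8%</td></tr>
</table>
"""

def find_by_reactid_suffix(html, suffix):
    """Return the text of the first <td> whose data-reactid ends with suffix."""
    soup = BeautifulSoup(html, "html.parser")
    # re.escape keeps '$', '(', ')' and '.' literal; the trailing '$' anchors the end.
    pattern = re.compile(re.escape(suffix) + r"$")
    cell = soup.find("td", attrs={"data-reactid": pattern})
    return cell.get_text(strip=True) if cell is not None else None

print(find_by_reactid_suffix(SAMPLE_HTML, "$company-data-Revenues ($M).1"))
```

Returning None instead of raising makes it easier to notice when the page layout changes and the cell is no longer found.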
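Update in response to the comments:
The page is rendered by JavaScript, so you can use Selenium to render it before parsing:
First install Selenium: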
sudo pip3 install selenium
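import bs4 as bs
from selenium import webdriver

# Let a real browser run the page's JavaScript, then grab the rendered HTML.
browser = webdriver.Chrome()
url = "/fortune500/amazon-com/"
browser.get(url)
html_source = browser.page_source
browser.quit()

soup = bs.BeautifulSoup(html_source, "html.parser")
# Assuming the first table row is the revenue row, tds[0] is its label
# and tds[1] is the value cell next to it.
tds = soup.find_all('td')
print(tds[1].text)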
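Indexing with tds[1] depends on the revenue value being the second td on the page. A possibly more robust variant, sketched below, finds the cell by its visible label and reads the next td in the same row. The helper name value_for_label and the sample table are hypothetical; with Selenium you would pass html_source instead of SAMPLE_HTML, assuming the rendered page uses plain label and value cells like these.

```python
from bs4 import BeautifulSoup

# Hypothetical stand-in for the rendered page source (html_source above).
SAMPLE_HTML = """<table>
  <tr><th></th><th>$ millions</th><th>% change</th></tr>
  <tr><td>Revenues ($M)</td><td>$135,987</td><td>27.1%</td></tr>
  <tr><td>Profits ($M)</td><td>$2,371.0</td><td>297.8%</td></tr>
</table>"""

def value_for_label(html, label):
    """Return the text of the <td> that follows the <td> containing label."""
    soup = BeautifulSoup(html, "html.parser")
    # Match a <td> whose only content is exactly the label text.
    label_cell = soup.find("td", string=label)
    if label_cell is None:
        return None
    # The value sits in the next <td> sibling within the same row.
    value_cell = label_cell.find_next_sibling("td")
    return value_cell.get_text(strip=True) if value_cell is not None else None

print(value_for_label(SAMPLE_HTML, "Revenues ($M)"))
```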