While loop with BeautifulSoup and Python

beautifulsoup loops python

Kamikaze_goldfish · 7 years ago

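I am scraping the ratings listed at https://www.brightscope.com/ratings/a. Each letter after /ratings/ (a, b, c, ...) has multiple pages. I am trying to write a while loop that goes to each page and, when a certain condition is met, collects all the hrefs (I have not written that part yet). However, when I run the code, the while loop never stops. How can I fix it so that it visits each page, checks for the condition, and moves on to the next letter when the condition is not found? Before anyone asks: I have looked through the page source and did not see any li tags.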

https://www.brightscope.com/ratings/A/18 is the highest page that exists for A, but the loop keeps running past it.

import requests
from bs4 import BeautifulSoup

url = "https://www.brightscope.com/ratings/"
page = requests.get(url)
soup = BeautifulSoup(page.text, 'html.parser')
hrefs = []
ratings = []
ks = []
pages_scrape = []

for href in soup.findAll('a'):
    if 'href' in href.attrs:
        hrefs.append(href.attrs['href'])
for good_ratings in hrefs:
    if good_ratings.startswith('/ratings/'):
        ratings.append(url[:-9]+good_ratings)

del ratings[0]
del ratings[27:]
count = 1
# So it runs each letter a, b, c, ... 
for each_rating in ratings:
    #Pulls the page
    page = requests.get(each_rating)
    #Does its soup thing
    soup = BeautifulSoup(page.text, 'html.parser')
    #Supposed to stay in A, B, C,... until it can't find the 'li' tag
    while soup.find('li'):
        page = requests.get(each_rating+str(count))
        print(page.url)
        count = count+1
        #Keeps running this and never breaks
    else:
        count = 1
        break

2 answers | latest 7 years ago

leotrubach · 7 years ago

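BeautifulSoup's find() method returns only the first matching <li> element. If you need to go through all of the <li> elements, use the findAll() method and iterate over its result.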
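To illustrate the difference between the two methods, here is a minimal sketch. The HTML snippet is made up for the example and is not taken from the brightscope pages; find_all() is the current spelling of findAll(), and both names work in bs4.

```python
from bs4 import BeautifulSoup

# Hypothetical markup, used only to illustrate find() versus find_all()
html = """
<ul>
  <li><a href="/ratings/A/1">1</a></li>
  <li><a href="/ratings/A/2">2</a></li>
  <li>no link here</li>
</ul>
"""
soup = BeautifulSoup(html, "html.parser")

first_item = soup.find("li")     # a single Tag, or None when nothing matches
all_items = soup.find_all("li")  # a list-like ResultSet with every match

# Iterate over every <li> and keep the href of its link, if it has one
links = []
for item in all_items:
    anchor = item.find("a")
    if anchor is not None and anchor.has_attr("href"):
        links.append(anchor["href"])

print(first_item.get_text())
print(links)
```

Because find() returns None when there is no match, it works as a loop condition, but only if the soup object it is called on is rebuilt for each new page.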

Deejpake · 7 years ago

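The result of soup.find('li') never changes, because the while loop only updates the page and count variables and never rebuilds soup. Create a new soup from page on every iteration so that the condition is checked against the page you just fetched: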

while soup.find('li'):
    page = requests.get(each_rating+str(count))
    # Re-parse the newly fetched page so the loop condition sees fresh content
    soup = BeautifulSoup(page.text, 'html.parser')
    print(page.url)
    count = count+1

Hope this helps.
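The same idea can be sketched as a small function that can be tried without hitting the site. This is only an illustration under stated assumptions: page URLs follow the letter/page pattern seen in https://www.brightscope.com/ratings/A/18, and a page past the last one contains no li tag at all. The names scrape_letter, fetch, and max_pages are hypothetical and do not come from the original code.

```python
from bs4 import BeautifulSoup


def scrape_letter(base_url, fetch, max_pages=100):
    """Visit base_url/1, base_url/2, ... until a page has no <li> tag.

    fetch is any callable that takes a URL and returns the page HTML as text.
    max_pages is a hard cap so the loop cannot run forever if the assumption
    about the stop condition turns out to be wrong.
    """
    visited = []
    for page_number in range(1, max_pages + 1):
        page_url = f"{base_url.rstrip('/')}/{page_number}"
        html = fetch(page_url)
        # Re-parse on every iteration; reusing an old soup is what made
        # the original loop condition stay true forever.
        soup = BeautifulSoup(html, "html.parser")
        if soup.find("li") is None:
            break  # assumed end-of-letter signal: no <li> on the page
        visited.append(page_url)
    return visited


# Offline demonstration with made-up pages instead of real HTTP requests
fake_pages = {
    "https://example.test/ratings/A/1": "<ul><li>first</li></ul>",
    "https://example.test/ratings/A/2": "<ul><li>second</li></ul>",
}
result = scrape_letter(
    "https://example.test/ratings/A",
    lambda url: fake_pages.get(url, "<p>nothing here</p>"),
)
print(result)
```

With requests, fetch could be something like lambda url: requests.get(url).text, and the outer for loop over ratings would call scrape_letter once per letter, so no counter has to be reset by hand.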
