BeautifulSoup findAll() given multiple classes?

小开

One way to do it is to use a regular expression instead of a class name:

import re
import requests
from bs4 import BeautifulSoup

s = requests.Session()
link = 'https://leaderboards.guildwars2.com/en/na/achievements'
r = s.get(link)

soup = BeautifulSoup(r.text, 'html.parser')
# Match any tag that has a class of exactly "equal" or "up"
for item in soup.findAll(True, {"class": re.compile("^(equal|up)$")}):
    # Keep only tags that also carry both "achievements" and "number"
    if 'achievements' in item.attrs['class'] and 'number' in item.attrs['class']:
        print(item)
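
To try the regular-expression approach without a network request, here is a minimal sketch on made-up inline HTML (the markup and the `html` variable are assumptions for illustration, not the real leaderboard page). BeautifulSoup tests the pattern against each class of a tag separately, which is why an anchored pattern still matches tags that carry several classes.

```python
import re
from bs4 import BeautifulSoup

# Hypothetical markup standing in for the real page
html = (
    '<span class="equal number">1</span>'
    '<span class="up achievements number">2</span>'
    '<span class="down number">3</span>'
)
soup = BeautifulSoup(html, "html.parser")

# The pattern is checked against each individual class value
pattern = re.compile(r"^(equal|up)$")
matches = soup.find_all(True, class_=pattern)
texts = [tag.get_text() for tag in matches]
print(texts)
```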

小开

You can do this:

soup.findAll(True, {'class':['class1', 'class2']})

Example:

>>> from bs4 import BeautifulSoup
>>> soup = BeautifulSoup('<html><body><div class="class1"></div><div class="class2"></div><div class="class3"></div></body></html>')
>>> soup.findAll(True, {"class":["class1", "class2"]})
[<div class="class1"></div>, <div class="class2"></div>]

小开

I am new to Python and BeautifulSoup, but maybe my answer will help you. I ran into the same situation, where I had to find multiple classes of one tag, so I just passed the classes in a list and it worked for me. Here is the code snippet:

# Search with a single class
soup.find_all("tr", {"class": "abc"})
# Search with multiple classes (matches rows that have either class)
soup.find_all("tr", {"class": ["abc", "xyz"]})

小开

Or, with more recent versions of BeautifulSoup:

find_all('a', class_=['class1', 'class2'])

Using "class" as a keyword argument would raise a syntax error, because it is a reserved word in Python, so BeautifulSoup uses "class_" instead.
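
As a rough check that the keyword form and the dictionary form behave the same way, here is a small sketch. The markup is invented for illustration; both calls should return the same tags.

```python
from bs4 import BeautifulSoup

# Hypothetical markup for illustration
html = (
    '<a class="class1">one</a>'
    '<a class="class2">two</a>'
    '<a class="class3">three</a>'
)
soup = BeautifulSoup(html, "html.parser")

# Keyword form: class_ avoids the reserved word "class"
by_keyword = soup.find_all("a", class_=["class1", "class2"])
# Dictionary form: "class" is just a string key here, so it is allowed
by_attrs = soup.find_all("a", attrs={"class": ["class1", "class2"]})

same = by_keyword == by_attrs
texts = [a.get_text() for a in by_keyword]
print(same, texts)
```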

小开

<html>
<body>
<div class="cls1">ok</div>
<div class="cls2">hi</div>
<div class="cls1 cls2">both</div>
</body>
</html>

OR operator

from bs4 import BeautifulSoup
soup = BeautifulSoup(html, 'html.parser')  # html holds the markup above
divs = soup.find_all('div', class_=['cls1', 'cls2'])
print(divs)

output:

[<div class="cls1">ok</div>, <div class="cls2">hi</div>, <div class="cls1 cls2">both</div>]
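
The OR case can also be written as a CSS selector list, which keeps both the OR and the AND variants in select(). This is only a sketch; the extra cls3 div is added here to show that non-matching tags are skipped.

```python
from bs4 import BeautifulSoup

# Same markup as above, plus one div that should not match
html = """
<div class="cls1">ok</div>
<div class="cls2">hi</div>
<div class="cls1 cls2">both</div>
<div class="cls3">other</div>
"""
soup = BeautifulSoup(html, "html.parser")

# A comma in a CSS selector means "either of these selectors"
divs = soup.select("div.cls1, div.cls2")
texts = [d.get_text() for d in divs]
print(texts)
```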

AND operator

from bs4 import BeautifulSoup
soup = BeautifulSoup(html, 'html.parser')  # html holds the markup above
divs = soup.select('div.cls1.cls2')
print(divs)

output:

[<div class="cls1 cls2">both</div>]
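
If you would rather stay with find_all() for the AND case, one option is to pass a function that checks the tag's class list. This is a sketch, and has_all_classes is a hypothetical helper name, not part of BeautifulSoup.

```python
from bs4 import BeautifulSoup

html = """
<div class="cls1">ok</div>
<div class="cls2">hi</div>
<div class="cls1 cls2">both</div>
"""
soup = BeautifulSoup(html, "html.parser")

required = {"cls1", "cls2"}

def has_all_classes(tag):
    # tag.get("class", []) is a list of class names, or [] if there is no class
    return tag.name == "div" and required.issubset(tag.get("class", []))

divs = soup.find_all(has_all_classes)
texts = [d.get_text() for d in divs]
print(texts)
```

Unlike matching on the literal string "cls1 cls2", this does not depend on the order of the classes in the attribute.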

小开

If you're fetching the page from a URL, don't forget to pass the headers too. I struggled for about an hour to get these div elements with two classes, and it wasn't working for me until I noticed that I had forgotten to pass these headers.

import requests
from bs4 import BeautifulSoup

header = {
    "Accept-Language": "es-ES,es;q=0.9",
    "User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/94.0.4606.81 Safari/537.36"
}
url = 'https://something.com'  # placeholder; requests needs the scheme
response = requests.get(url=url, headers=header)
response.raise_for_status()
data = response.text

soup = BeautifulSoup(data, 'html.parser')

# div elements that have both classes
elements = soup.select('div.fde444d7ef._c445487e2')
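
If several pages need the same headers, one possible variation is to set them once on a requests.Session. This is a sketch only: fetch_soup is a hypothetical helper, the User-Agent is a shortened placeholder, and nothing is fetched until you call the function with a real URL.

```python
import requests
from bs4 import BeautifulSoup

session = requests.Session()
# Default headers sent with every request made through this session
session.headers.update({
    "Accept-Language": "es-ES,es;q=0.9",
    "User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64)",  # placeholder
})

def fetch_soup(url):
    """Fetch url with the session's default headers and parse the HTML."""
    response = session.get(url, timeout=10)
    response.raise_for_status()
    return BeautifulSoup(response.text, "html.parser")
```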