python - Issue extracting data using BS4


I'm trying to extract data from a website using BS4, but I can't access the exact text I need. I'm using the following code:

name = (soup('td', {'class':'mstat'})) 

which outputs:

[<td class="mstat" colspan="3"><span class="r">badges</span></td>] 

However, I only need the element with class "r". If I try to access class "r" using the following code, an empty list is returned:

name = (soup('td', {'class':'r'})) 

Output:

[] 

I want the text that reads 'badges'. That text can change in length and content, so I can't simply strip off the surrounding markup.

Desired output:

'badges' 

Would anyone know what I'm doing wrong?

The class "r" is on the <span>, not on the <td>, so you are looking for a <span> tag instead:

soup('span', class_='r') 

If you want to find only the span tags with that class that sit inside td tags with the mstat class, you'll need to loop:

spans = []
for td in soup('td', class_='mstat'):
    # calling a tag is shorthand for find_all(); extend() adds every match to the list
    spans.extend(td('span', class_='r'))
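Both calls return tag objects, not strings, so you still need to pull the text out to get 'badges'. Here is a minimal sketch of the whole thing, assuming the page contains markup like the snippet in the question. The `html` string, the surrounding `<table>` and `<tr>` tags, and the variable names are made up for illustration, and `html.parser` is just the built-in parser choice.

```python
from bs4 import BeautifulSoup

# Stand-in for the real page; only the <td> element comes from the question.
html = (
    '<table><tr>'
    '<td class="mstat" colspan="3"><span class="r">badges</span></td>'
    '</tr></table>'
)
soup = BeautifulSoup(html, "html.parser")

# Same loop as above, written with find_all(), collecting text instead of tags.
texts = [
    span.get_text(strip=True)
    for td in soup.find_all("td", class_="mstat")
    for span in td.find_all("span", class_="r")
]
print(texts)  # ['badges']

# A CSS selector expresses the same nesting in one call.
first = soup.select_one("td.mstat span.r")
name = first.get_text(strip=True) if first is not None else None
print(name)  # badges
```

`get_text()` returns whatever text is inside the tag, so it does not matter if the text changes length or content. The `None` check guards against pages where no matching span exists.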
