Cannot scrape with beautifulsoup and urllib because of javascript variable -

March 15, 2013

unfortunately newbie beautifulsoup , urllib might not ask correctly need.. there website www.example.com need extract data website displays random message.

the problem message displayed after user presses button, otherwise shows general message "press button see message".

after searching stackoverflow realised there no way change variables calling browser url this.. www.example.com/?showrandommsg='true'

in threads read maybe can bookmarlets..

is there anyway use bookmarklets beautifulsoup or urllib in order access website , make display random message?

thanks in advance! :d

i came after long time answer own question..

i found many solutions , tutorials on web , of them suggesting using selenium , xpath method more complex needed..

so ended using selenium emulating browser (firefox in case) , grabbing html after page loaded completely.

after still using beautifoulsoup parse html code (whihc include javascript data too).

Search This Blog

Three

Cannot scrape with beautifulsoup and urllib because of javascript variable -

Comments

Post a Comment

Popular posts from this blog

Socket.connect doesn't throw exception in Android -

SPSS keyboard combination alters encoding -

iphone - How do I keep MDScrollView from truncating my row headers and making my cells look bad? -