Drexel dragonThe Math ForumDonate to the Math Forum



Search All of the Math Forum:

Views expressed in these public forums are not endorsed by Drexel University or The Math Forum.


Math Forum » Discussions » Software » comp.soft-sys.math.mathematica

Topic: Problem Importing web site in Mathematica: How to by pass pages asking for login credentials
Replies: 1   Last Post: May 11, 2012 12:16 AM

Advanced Search

Back to Topic List Back to Topic List Jump to Tree View Jump to Tree View   Messages: [ Previous | Next ]
mike.honeychurch@gmail.com

Posts: 242
Registered: 2/27/09
Re: Problem Importing web site in Mathematica: How to by pass pages asking for login credentials
Posted: May 11, 2012 12:16 AM
  Click to see the message monospaced in plain text Plain Text   Click to reply to this topic Reply

On May 9, 5:53 pm, Mark Coleman <markspcole...@gmail.com> wrote:
> Hi,
>
> I'm using Mathematica v8 for some text mining/classification analysis of web
> sites. As part of this I first Import[] the hyperlinks from the web
> site 's home page into a list, and then systematically traverse this
> list and Import each URL. In some cases, I hit a page or set of pages
> that requires a user to enter login credentials. At this point my code
> pops up the site's login screen and waits for manual input before
> proceeding. This obviously makes importing a large set of URLs
> infeasible.
>
> I'm wondering if it's possible to identify these pages in advance, so
> I can filter them out of my list of URLs. allowing me to automatically
> Import the remaining pages?
>
> Thanks,
>
> Mark


I use wget in combination with Mathematica to work around logins and
cookies.

Mike




Point your RSS reader here for a feed of the latest messages in this topic.

[Privacy Policy] [Terms of Use]

© Drexel University 1994-2013. All Rights Reserved.
The Math Forum is a research and educational enterprise of the Drexel University School of Education.