
Website crawling



    I need to get some data from a web site that has a report with a fixed maximum of 20 lines per page. There are about 20k pages.

    I need a tool that will download the data into some format (CSV, MySQL, I don't care) and then automatically follow the link to the next page and repeat.

    I could write one, but has anyone used a tool that does this?

    Going straight to the database is not an option, I'm afraid. This is the only way to the data.
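
    For reference, the loop described above (save the rows, follow the "next" link, repeat) could be sketched roughly like this using only the Python standard library. Everything specific here is an assumption, since the thread never shows the site's markup: the report is assumed to be an HTML table with `<td>` cells, and the next page is assumed to be an `<a>` link whose text is "Next". The names `ReportPageParser`, `fetch` and `crawl` are made up for illustration.

    ```python
    import csv
    import time
    from html.parser import HTMLParser
    from urllib.parse import urljoin
    from urllib.request import urlopen


    class ReportPageParser(HTMLParser):
        """Collects <td> rows and the href of a link whose text is 'Next'.

        Both the table layout and the 'Next' link text are assumptions.
        """

        def __init__(self):
            super().__init__()
            self.rows = []          # completed rows, each a list of cell strings
            self.next_href = None   # href of the "Next" link, if one was found
            self._row = None        # cells of the row currently being read
            self._cell = None       # text fragments of the cell currently open
            self._link_href = None  # href of the <a> currently open
            self._link_text = []

        def handle_starttag(self, tag, attrs):
            if tag == "tr":
                self._row = []
            elif tag == "td" and self._row is not None:
                self._cell = []
            elif tag == "a":
                self._link_href = dict(attrs).get("href")
                self._link_text = []

        def handle_data(self, data):
            if self._cell is not None:
                self._cell.append(data)
            if self._link_href is not None:
                self._link_text.append(data)

        def handle_endtag(self, tag):
            if tag == "td" and self._cell is not None:
                self._row.append("".join(self._cell).strip())
                self._cell = None
            elif tag == "tr" and self._row is not None:
                if self._row:  # rows without <td> cells (e.g. headers) are skipped
                    self.rows.append(self._row)
                self._row = None
                self._cell = None
            elif tag == "a" and self._link_href is not None:
                if "".join(self._link_text).strip().lower() == "next":
                    self.next_href = self._link_href
                self._link_href = None


    def fetch(url):
        """Download one page and return it as text."""
        with urlopen(url, timeout=30) as resp:
            charset = resp.headers.get_content_charset() or "utf-8"
            return resp.read().decode(charset, errors="replace")


    def crawl(start_url, out_file, fetch_page=fetch, max_pages=25000, delay=1.0):
        """Write every table row to out_file as CSV; return the page count."""
        writer = csv.writer(out_file)
        url, seen = start_url, set()
        while url and url not in seen and len(seen) < max_pages:
            seen.add(url)
            parser = ReportPageParser()
            parser.feed(fetch_page(url))
            writer.writerows(parser.rows)
            # Resolve a relative "Next" href against the current page URL.
            url = urljoin(url, parser.next_href) if parser.next_href else None
            if url and delay:
                time.sleep(delay)  # pause between requests to go easy on the server
        return len(seen)
    ```

    Usage would be something like `with open("report.csv", "w", newline="") as f: crawl(start_url, f)`, where `start_url` is the first report page. The `seen` set and `max_pages` cap are there so a looping "Next" link cannot run forever. At one second per page, 20k pages is roughly five and a half hours.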

    #2
    wget or curl



      #3
      'A friend' used a product from Lencom Software to do this with great success. I can't remember the exact name of the tool, but looking at their offerings, they have one called Visual Web Task which looks like it does the job. If it is the same one I used, it worked very well for pulling data and pictures from a page.

      Lencom Software Inc Software Informer: Latest Lencom Software Inc software updates and reviews: Fast Email Extractor,...



        #4
        YQL?



          #5
          Download the Microsoft SEO Toolkit. It's free and will do what you need and more.



            #6
            Dapper: The Data Mapper
            It's free and does what you need.

