• Visitors can check out the Forum FAQ by clicking this link. You have to register before you can post: click the REGISTER link above to proceed. To start viewing messages, select the forum that you want to visit from the selection below. View our Forum Privacy Policy.
  • Want to receive the latest contracting news and advice straight to your inbox? Sign up to the ContractorUK newsletter here. Every sign up will also be entered into a draw to WIN £100 Amazon vouchers!

SKA news - more than 1 trillion unique URLs found - this beats Google's figure

Collapse
X
  •  
  • Filter
  • Time
  • Show
Clear All
new posts

    #31
    Congrats - whats next?
    Beer
    is proof that God loves us and wants us to be happy.
    Benjamin Franklin

    Comment


      #32
      Originally posted by Coalman View Post
      Congrats - whats next?
      according to TPD rules:
      1000001000001 being palindromic is a significant number
      Coffee's for closers

      Comment


        #33
        Just checked HAB Inc's entry and it has sheadloads of URL's dating back to when I was hosted elsewhere - I moved it something like two years ago.

        Correct links are probably < 0.01 % in my case.
        How did this happen? Who's to blame? Well certainly there are those more responsible than others, and they will be held accountable, but again truth be told, if you're looking for the guilty, you need only look into a mirror.

        Follow me on Twitter - LinkedIn Profile - The HAB blog - New Blog: Mad Cameron
        Xeno points: +5 - Asperger rating: 36 - Paranoid Schizophrenic rating: 44%

        "We hang the petty thieves and appoint the great ones to high office" - Aesop

        Comment


          #34
          Originally posted by AtW View Post
          Yes.

          http://www.majesticseo.com is our backlinks search engine that now has more unique URLs than announced by Google last year. I'd love for them to update on their current number so that we could catch up with it shortly...

          Every day SKA run by SKA Troopers crawls around 350-400 mln URLs with total size of crawled and analyzed data being over 10 TB - every day.

          ta for good wishes everyone - SKA is developing nicely - get ready for even bigger SKA news in the future.

          I just entered my no-longer-existing site and got a very nice report.
          Serious question, how do you get rid of no-longer-existing pages?
          Do you re-build your index regularly?
          "Condoms should come with a free pack of earplugs."

          Comment


            #35
            i may be being stupid but whats the use of it?

            Comment


              #36
              Originally posted by NetwkSupport View Post
              i may be being stupid but whats the use of it?
              Shsssssssss!

              You've come right out the other side of the forest of irony and ended up in the desert of wrong.

              Comment


                #37
                Originally posted by ThomasSoerensen View Post
                I just entered my no-longer-existing site and got a very nice report.
                Serious question, how do you get rid of no-longer-existing pages?
                Do you re-build your index regularly?
                Why would we want to get rid of this data? It's perfect for deciding if you want to buy a particular domain that may have expired or no longer working.

                We now rebuild index (does not mean all pages recrawled) every month, and we will mark backlinks as deleted if upon successful recrawl they are no longer present on original page.

                Comment


                  #38
                  Originally posted by HairyArsedBloke View Post
                  Just checked HAB Inc's entry and it has sheadloads of URL's dating back to when I was hosted elsewhere - I moved it something like two years ago.

                  Correct links are probably < 0.01 % in my case.
                  There is a date on when links were found. There are also flags if they were deleted (marked when we recrawl that page successfully).

                  Our links are correct - you say it yourself, only you suggest that they are not up to date: we keep complete view of the web as we find it, the same as Google, some links are broken, some expired - all that is useful information for an SEO if they want to take advatage of it.

                  It is also useful data for our future full-text search engine.

                  Comment


                    #39
                    As I have said before about the search engine, search on Manchester United and abolutely nothing comes up about football.

                    Comment


                      #40
                      Originally posted by minestrone View Post
                      As I have said before about the search engine, search on Manchester United and abolutely nothing comes up about football.
                      It isn't really a search engine as yet
                      ǝןqqıʍ

                      Comment

                      Working...
                      X