Current User: Guest Login Register
Please consider registering


Lost Your Password?

Search Forums:


 






Wildcard Usage:
*    matches any number of characters
%    matches exactly one character

robots.txt unreachable – sitemap error

Reply to Post
UserPost

3:37 am
June 16, 2008


Roy Khoh

Canning Vale, Western Australia

Admin

posts 163

Google Webmaster Tools reported sitemap errors. The details showed that robots.txt was unreachable and/or network unreachable. This was roughly from Jun 6 through to Jun 14.

Researching online revealed that it could be a DNS lookup failure – so I have since changed nameserver DNS records. I am now using “everydns”.

I'm not sure if that just took time to propogate, I did wait the 48 hours – still to no avail. Looking at packet headers, when I access the robots.txt myself – it returned 200 code (successful and working properly).

I checked with the host (3ix) whether the ip address/range for googlebot had been banned. Their response didn't answer my question, they just said "everything is fine" … "it is not our end" type of mentality.

Finally I deactivated “WP Super Cache”. A couple hours after that, the robots.txt file was reachable again by Google Webmaster Tools (googlebot crawler). Thinking back it was roughly the time I updated to the latest 0.6.4 version did this problem roughly begin. Albeit I also updated some other plugins around the same time.

7:26 am
July 1, 2008


Roy Khoh

Canning Vale, Western Australia

Admin

posts 163

It's only been half a month since, and I have had not problems with the sitemap or robots.txt unreachable error so far.

"WP Super Cache" is still deactivated, I had not switched it on – even after upgrading to WordPress 2.5.0 or 2.5.1

The nameservers used for DNS are still with "everydns" and the original nameservers from 3ix.

Ultimately, because of a mis-timing for about 24 hours between the 48 hour DNS propogation and the "fix", I am pin-pointing the solution to either "WP Super Cache" or the host finally realising and removing googlebot from their blacklist/firewall.

I am leaning towards the host firewall as the problem, because the technical support I chatted to didn't really seem too knowledgeable. Especially since they did not answer my direct question of "is googlebot blacklisted from the host?" They just kind of side-stepped around it and pointed at other things.

Reply to Post


Reply to Topic:
robots.txt unreachable – sitemap error

Guest Name (Required):

Guest Email (Required):

NOTE: New Posts are subject to administrator approval before being displayed

Smileys
Confused Cool Cry Embarassed Frown Kiss Laugh Smile Surprised Wink Yell
Post New Reply

Guest URL (required)

Math Required!
What is the sum of:
10 + 12
   



Share this article
  • Google Bookmarks
  • Digg
  • del.icio.us
  • StumbleUpon
  • MySpace
  • Facebook
  • Twitter
  • PDF
  • Print
  • email
Kim Seng: Congratulations to Tim - a massive achievement 3 times over.  Well done and The Pilbara regi...
Kim Seng: ATA Peel region grading is on Thursday 8th December from 5:30 pm (for juniors); and from 7 pm (fo...
Kim Seng: ATA Hedland grading on 3rd December 2011 at JD Hardie Centre.  for details call Tim Turner 0...
Jason: very cool that is all...
Roy Khoh: I've just added Riley's and Quinn's videos back on. Have been trying all day to get t...
Kim Seng: The photos and videos are placed in the current ATA members only pages. You have recently signed in...
chrisbin:   This...