Top "Googlebot" questions

Googlebot is Google's web crawling bot which discovers new and updated pages/documents from the web to build a searchable index for the Google search engine.

What does this this HTTP Authorization RewriteRule do?

I have an rewrite recursion error somewhere on my website that Google Bot caused, but I can't find the url …

.htaccess mod-rewrite url-rewriting apache2 googlebot
<noindex> tag for Google

I would like to tell Google not to index certain parts of the page. In Yandex (russian SE) there's a …

seo googlebot yandex noindex
Is there a way to make robots ignore certain text?

I have my blog (you can see it if you want, from my profile), and it's fresh, as well as …

html seo googlebot
Avoid crawling part of a page with "googleoff" and "googleon"

I am trying to tell Google and other search engines not to crawl some parts of my web page. What …

html seo comments googlebot google-crawlers
Google bot crawling on AngularJS site with HTML5 Mode routes

We have an AngularJS site using HTML5 routes. I just did some test "Fetch as Google" runs. The results are …

html angularjs nginx seo googlebot
modsecurity whitelist ip range

I'm trying to whitelist a range of ips (Googlebots) on modsecurity on an Ubuntu 12.04 server. For example, here's a range …

ubuntu apache2 googlebot mod-security mod-security2
How to set up a robot.txt which only allows the default page of a site

Say I have a site on http://example.com. I would really like allowing bots to see the home page, …

web-crawler bots robots.txt googlebot slurp
robots.txt: user-agent: Googlebot disallow: / Google still indexing

Look at the robots.txt of this site: fr2.dk/robots.txt The content is: User-Agent: Googlebot Disallow: / That ought …

robots.txt googlebot google-index
Does googlebot keep sessions when crawling?

When googlebot crawls pages does it have session? For example I am storing some variables on the session and using …

asp.net session googlebot google-crawlers
HTTP status code for overloaded server

Some hours my web site's server has too much load. Which HTTP status code should I send to the Googlebot …

http seo http-status-codes googlebot http-status-code-503