crawler.com
robots.txt

Robots Exclusion Standard data for crawler.com

Resource Scan

Scan Details

Site Domain crawler.com
Base Domain crawler.com
Scan Status Failed
Failure Reason Scan timed out.
Last Scan 2025-11-24T16:01:24+00:00
Next Scan 2026-02-22T16:01:24+00:00

Last Successful Scan

Scanned 2023-10-14T00:52:03+00:00
URL http://crawler.com/robots.txt
Redirect http://www.crawler.com/robots.txt
Redirect Domain www.crawler.com
Redirect Base crawler.com
Domain IPs 64.135.77.50
Redirect IPs 64.135.77.50
Response IP 64.135.77.50
Found Yes
Hash 58fbdff1ebc91c4dddd89d9a1a2edcdb14edd7ff2bd23b54756c21732be4da43
SimHash 6706b8e1efb4
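
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although this page does not name the algorithm. As a minimal sketch under that assumption, a fingerprint of a fetched robots.txt body could be computed as follows. The `fingerprint` function and the sample body are hypothetical and are not the scanner's actual method or the real file contents.

```python
import hashlib


def fingerprint(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()


# Hypothetical sample body, not the real file served by crawler.com.
sample_body = b"User-agent: *\nDisallow: /search/\n"
digest = fingerprint(sample_body)

# A SHA-256 hex digest is always 64 characters, the same length as the Hash field.
print(len(digest))
```

Comparing such a digest between two scans would show whether the file changed byte for byte; it says nothing about how similar two different versions are.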

Groups

*

Rule Path
Disallow /_portal/
Disallow /search/
Disallow /rss.aspx
Disallow /weather.aspx

Other Records

Field Value
crawl-delay 1
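
The wildcard group above can be checked against sample URLs with Python's standard `urllib.robotparser`. The sketch below rebuilds the group as robots.txt text from the listed rules; the original file's exact formatting is not shown on this page, and the `ExampleBot` agent name and the sample paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the "*" group listed above (formatting is an assumption).
WILDCARD_GROUP = """\
User-agent: *
Disallow: /_portal/
Disallow: /search/
Disallow: /rss.aspx
Disallow: /weather.aspx
Crawl-delay: 1
"""

parser = RobotFileParser()
parser.parse(WILDCARD_GROUP.splitlines())

AGENT = "ExampleBot"  # hypothetical crawler with no group of its own
SITE = "http://www.crawler.com"

# Paths under a Disallow prefix are refused; anything else is permitted.
search_ok = parser.can_fetch(AGENT, SITE + "/search/results")
news_ok = parser.can_fetch(AGENT, SITE + "/news/")
delay = parser.crawl_delay(AGENT)

print(search_ok, news_ok, delay)
```

Because `ExampleBot` matches no named group, the parser falls back to the `*` group for both the path rules and the crawl delay.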

bingbot

Rule Path
Disallow /xfb_redir.aspx
Disallow /_portal/
Disallow /search/
Disallow /rss.aspx
Disallow /weather.aspx

Other Records

Field Value
crawl-delay 1

msnbot

Rule Path
Disallow /xfb_redir.aspx
Disallow /_portal/
Disallow /search/
Disallow /rss.aspx
Disallow /weather.aspx

Other Records

Field Value
crawl-delay 1

msnbot-newsblogs

Rule Path
Disallow /xfb_redir.aspx
Disallow /_portal/
Disallow /search/
Disallow /rss.aspx
Disallow /weather.aspx

msnbot-products

Rule Path
Disallow /xfb_redir.aspx
Disallow /_portal/
Disallow /search/
Disallow /rss.aspx
Disallow /weather.aspx

Other Records

Field Value
crawl-delay 1

msnbot-media

Rule Path
Disallow /xfb_redir.aspx
Disallow /_portal/
Disallow /search/
Disallow /rss.aspx
Disallow /weather.aspx

Other Records

Field Value
crawl-delay 1
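
To make the differences between the groups easier to see, the listing above can be held as a small dictionary and compared against the wildcard group. This is only an illustrative sketch: the `groups` structure and variable names are hypothetical, and a missing crawl-delay record is represented as `None`.

```python
# Paths disallowed for every user agent, taken from the "*" group.
BASE = ["/_portal/", "/search/", "/rss.aspx", "/weather.aspx"]
# The named groups list one additional path ahead of the shared ones.
NAMED = ["/xfb_redir.aspx", *BASE]

groups = {
    "*": {"disallow": BASE, "crawl_delay": 1},
    "bingbot": {"disallow": NAMED, "crawl_delay": 1},
    "msnbot": {"disallow": NAMED, "crawl_delay": 1},
    # No "Other Records" table is listed for this group.
    "msnbot-newsblogs": {"disallow": NAMED, "crawl_delay": None},
    "msnbot-products": {"disallow": NAMED, "crawl_delay": 1},
    "msnbot-media": {"disallow": NAMED, "crawl_delay": 1},
}

# Paths each group disallows beyond the wildcard group.
extra = {
    agent: [path for path in group["disallow"] if path not in BASE]
    for agent, group in groups.items()
}
# Groups for which no crawl-delay record is listed.
no_delay = [agent for agent, group in groups.items() if group["crawl_delay"] is None]

print(extra)
print(no_delay)
```

Under this representation, every named group adds only `/xfb_redir.aspx` to the wildcard rules, and `msnbot-newsblogs` is the only group without a crawl-delay record.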