cyrusojohnson.com
robots.txt

Robots Exclusion Standard data for cyrusojohnson.com

Resource Scan

Scan Details

Site Domain cyrusojohnson.com
Base Domain cyrusojohnson.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2025-05-22T14:01:35+00:00
Next Scan 2025-08-20T14:01:35+00:00
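
The Last Scan and Next Scan timestamps above can be compared directly. The short sketch below only measures the gap between the two recorded values. That the scanner schedules rescans on a fixed interval is an assumption, not something the report states.

```python
from datetime import datetime

# Timestamps copied from the Scan Details above.
last_scan = datetime.fromisoformat("2025-05-22T14:01:35+00:00")
next_scan = datetime.fromisoformat("2025-08-20T14:01:35+00:00")

# Gap between the failed scan and the next scheduled attempt.
interval = next_scan - last_scan
print(interval.days)  # 90
```

Both values carry the same UTC offset and time of day, so the difference is a whole number of days.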

Last Successful Scan

Scanned 2024-07-04T13:58:46+00:00
URL https://cyrusojohnson.com/robots.txt
Redirect https://www.cyrusojohnson.com/robots.txt
Redirect Domain www.cyrusojohnson.com
Redirect Base cyrusojohnson.com
Domain IPs 131.153.147.42
Redirect IPs 131.153.147.42
Response IP 131.153.147.42
Found Yes
Hash 238ba766f62027ab861087d30beef4112fd7cf3e70301810edfd608337ef179f
SimHash 25537aa071c3

Groups

*

Rule Path
Disallow /images/
Disallow /styles/
Disallow /css/
Disallow /js/
Disallow /?*Query=
Disallow /?Query=
Disallow /?*query=
Disallow /?query=
Disallow /*?tab=*
Disallow /?tab=*
Disallow /?ref=*
Disallow /*?ref=*
Disallow /?review-page*
Disallow /*?review-page*
Disallow /*?gclid=*
Disallow /?gclid=*
Disallow /cgi-bin/
Disallow /en/
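
Several of the rules above use the `*` wildcard. As a rough illustration of how such patterns might be evaluated, the sketch below assumes the common wildcard semantics in which `*` matches any run of characters, a trailing `$` anchors the end of the URL, and every rule is otherwise a prefix match. The helper names `rule_to_regex` and `is_disallowed` are hypothetical, and the sketch ignores Allow rules because this group has none.

```python
import re

# Disallow paths copied from the "*" group above.
DISALLOW = [
    "/images/", "/styles/", "/css/", "/js/",
    "/?*Query=", "/?Query=", "/?*query=", "/?query=",
    "/*?tab=*", "/?tab=*", "/?ref=*", "/*?ref=*",
    "/?review-page*", "/*?review-page*",
    "/*?gclid=*", "/?gclid=*",
    "/cgi-bin/", "/en/",
]

def rule_to_regex(rule):
    """Translate one rule path into a compiled regex (hypothetical helper)."""
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    # Escape literal text and let each "*" match any run of characters.
    body = ".*".join(re.escape(part) for part in rule.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_disallowed(path, rules=DISALLOW):
    """Return True if any rule matches from the start of the path."""
    return any(rule_to_regex(rule).match(path) for rule in rules)

print(is_disallowed("/images/logo.png"))     # True
print(is_disallowed("/products?tab=specs"))  # True
print(is_disallowed("/about"))               # False
```

Under these semantics a trailing `*`, as in `/?tab=*`, does not change what a rule matches, because rules already match by prefix.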

*
rogerbot
googlebot
googlebot-image
googlebot-video
googlebot-news
bingbot
msnbot
msnbot-media
bingpreview
slurp
duckduckbot
baiduspider
baiduspider-mobile
yandexbot
facebot
teoma
aolbuild
naverbot
applebot

No rules defined. All paths allowed.
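
The two groups above overlap: `*` appears in both, while the named crawlers appear only in the second. The sketch below shows one way a crawler might pick its rules, assuming the selection behavior described in RFC 9309 (use the groups naming the crawler's token, combine rules if several match, and fall back to `*` otherwise). The `GROUPS` structure, the `rules_for` helper, and the `examplebot` token are hypothetical, token matching is simplified to exact case-insensitive equality, and both lists are abbreviated.

```python
# Abbreviated model of the two groups reported above.
GROUPS = [
    {"agents": ["*"], "disallow": ["/images/", "/styles/", "/cgi-bin/", "/en/"]},
    {"agents": ["*", "rogerbot", "googlebot", "bingbot", "applebot"], "disallow": []},
]

def rules_for(token):
    """Collect Disallow paths that apply to a crawler token (hypothetical helper)."""
    token = token.lower()
    # Prefer groups that name the crawler explicitly.
    chosen = [g for g in GROUPS if token in g["agents"]]
    if not chosen:
        # Otherwise fall back to every group that lists "*".
        chosen = [g for g in GROUPS if "*" in g["agents"]]
    rules = []
    for group in chosen:
        rules.extend(group["disallow"])
    return rules

print(rules_for("Googlebot"))   # []
print(rules_for("examplebot"))  # ['/images/', '/styles/', '/cgi-bin/', '/en/']
```

Under this reading the listed crawlers face no restrictions, while any unlisted crawler inherits the Disallow rules from the first group.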

Other Records

Field Value
sitemap https://www.cyrusojohnson.com/sitemap.xml

Comments

  • https://developers.google.com/webmasters/control-crawl-index/docs/robots_txt?hl=en
  • allow all crawlers
  • images
  • search
  • country page tabs
  • 10-23-2017 Update
  • Meeting On 6/7
  • 3-15-2016 Meeting
  • sitemap - Supported by Google, Ask, Bing, Yahoo; defined on sitemaps.org