grit.com
robots.txt

Robots Exclusion Standard data for grit.com

Resource Scan

Scan Details

Site Domain grit.com
Base Domain grit.com
Scan Status Ok
Last Scan2024-06-22T09:35:42+00:00
Next Scan 2024-06-29T09:35:42+00:00

Last Scan

Scanned2024-06-22T09:35:42+00:00
URL https://grit.com/robots.txt
Redirect https://www.grit.com/robots.txt
Redirect Domain www.grit.com
Redirect Base grit.com
Domain IPs 18.155.68.34, 18.155.68.50, 18.155.68.55, 18.155.68.94
Redirect IPs 13.226.225.101, 13.226.225.119, 13.226.225.51, 13.226.225.93
Response IP 3.160.246.37
Found Yes
Hash ef174328228db31f4dbfaedd4fa5e759dc9ec91038a04e17c1cf52cc6b50e3e6
SimHash 83a5b060cba3

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /_custom/*
Disallow /*/print/
Disallow /wp-json/*
Disallow /search/*
Disallow /email
Disallow /print
Disallow /print-article.aspx
Disallow /sso/*
Disallow /store/offer/*
Disallow /store/author/*
Disallow /watch/*
Disallow /uploadedFiles/*
Disallow /tags/*
Disallow /search
Disallow /contributors/*

rogerbot

Rule Path
Allow /*
Disallow /wp-admin/
Disallow /_custom/
Disallow /wp-json/

twitterbot

Rule Path
Allow /

dotbot

Rule Path
Allow /*
Disallow /wp-admin/
Disallow /wp-json/

gptbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.grit.com/sitemap.xml