/.well-known/

Log In Sign Up

crain.com
robots.txt

Robots Exclusion Standard data for crain.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	crain.com
Base Domain	crain.com
Scan Status	Ok
Last Scan	2024-11-11T23:11:57+00:00
Next Scan	2024-12-11T23:11:57+00:00

Last Scan

Scanned	2024-11-11T23:11:57+00:00
URL	https://crain.com/robots.txt
Redirect	https://www.crain.com/robots.txt
Redirect Domain	www.crain.com
Redirect Base	crain.com
Domain IPs	3.138.73.154
Redirect IPs	3.138.73.154
Response IP	3.138.73.154
Found	Yes
Hash	252642986107ee31c6348b84e4ac58f950db4c1a79dddaa499268a3e63e6afb0
SimHash	280c88c0e1f3

Groups

*

Rule

Path

Disallow

ccbot

Rule

Path

Disallow

/

google-extended

Rule

Path

Disallow

/

gptbot

Rule

Path

Disallow

/

Back to top

Other Records

Field

Value

sitemap

https://www.crain.com/sitemap_index.xml

Back to top

Comments

START YOAST BLOCK
---------------------------
---------------------------
END YOAST BLOCK

Back to top