thecrafttrain.com
robots.txt

Robots Exclusion Standard data for thecrafttrain.com

Resource Scan

Scan Details

Site Domain thecrafttrain.com
Base Domain thecrafttrain.com
Scan Status Ok
Last Scan2024-11-15T09:31:34+00:00
Next Scan 2024-11-22T09:31:34+00:00

Last Scan

Scanned2024-11-15T09:31:34+00:00
URL https://thecrafttrain.com/robots.txt
Redirect https://www.thecrafttrain.com/robots.txt
Redirect Domain www.thecrafttrain.com
Redirect Base thecrafttrain.com
Domain IPs 104.21.75.21, 172.67.210.61, 2606:4700:3030::6815:4b15, 2606:4700:3036::ac43:d23d
Redirect IPs 104.21.75.21, 172.67.210.61, 2606:4700:3030::6815:4b15, 2606:4700:3036::ac43:d23d
Response IP 172.67.210.61
Found Yes
Hash deff653084ee7e57425019c79ddd837020514be3a0046429434c5d6cee377a6c
SimHash 1a60dac0a193

Groups

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

piplbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

*

Rule Path
Disallow

Other Records

Field Value
sitemap https://www.thecrafttrain.com/sitemap_index.xml

Comments

  • ======Raptive Begin======
  • ======Raptive End======
  • START YOAST BLOCK
  • ---------------------------
  • ---------------------------
  • END YOAST BLOCK